[go: up one dir, main page]

US20220162544A1 - Control of nitrogen fixation in rhizobia that associate with cereals - Google Patents

Control of nitrogen fixation in rhizobia that associate with cereals Download PDF

Info

Publication number
US20220162544A1
US20220162544A1 US17/440,618 US202017440618A US2022162544A1 US 20220162544 A1 US20220162544 A1 US 20220162544A1 US 202017440618 A US202017440618 A US 202017440618A US 2022162544 A1 US2022162544 A1 US 2022162544A1
Authority
US
United States
Prior art keywords
nif
cluster
rhizobium
bacterium
inducible
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
US17/440,618
Inventor
Christopher A. Voigt
Min-Hyung Ryu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Massachusetts Institute of Technology
Original Assignee
Massachusetts Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Massachusetts Institute of Technology filed Critical Massachusetts Institute of Technology
Priority to US17/440,618 priority Critical patent/US20220162544A1/en
Assigned to MASSACHUSETTS INSTITUTE OF TECHNOLOGY reassignment MASSACHUSETTS INSTITUTE OF TECHNOLOGY ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: RYU, Min-Hyung, VOIGT, CHRISTOPHER A.
Publication of US20220162544A1 publication Critical patent/US20220162544A1/en
Assigned to NATIONAL SCIENCE FOUNDATION reassignment NATIONAL SCIENCE FOUNDATION CONFIRMATORY LICENSE (SEE DOCUMENT FOR DETAILS). Assignors: MASSACHUSETTS INSTITUTE OF TECHNOLOGY
Pending legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/24Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • C07K14/26Klebsiella (G)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N1/00Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
    • C12N1/20Bacteria; Culture media therefor
    • AHUMAN NECESSITIES
    • A01AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
    • A01NPRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
    • A01N63/00Biocides, pest repellants or attractants, or plant growth regulators containing microorganisms, viruses, microbial fungi, animals or substances produced by, or obtained from, microorganisms, viruses, microbial fungi or animals, e.g. enzymes or fermentates
    • A01N63/20Bacteria; Substances produced thereby or obtained therefrom
    • CCHEMISTRY; METALLURGY
    • C05FERTILISERS; MANUFACTURE THEREOF
    • C05FORGANIC FERTILISERS NOT COVERED BY SUBCLASSES C05B, C05C, e.g. FERTILISERS FROM WASTE OR REFUSE
    • C05F11/00Other organic fertilisers
    • C05F11/08Organic fertilisers containing added bacterial cultures, mycelia or the like
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/87Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells

Definitions

  • nitrogen is a limiting nutrient that needs to be added as fertilizer to those crops that cannot produce it on their own, including the cereals rice, corn, and wheat.
  • legumes are able to obtain nitrogen from the atmosphere using nitrogen-fixing bacteria that reside in root nodules.
  • nitrogen-fixing bacteria that reside in root nodules.
  • the majority of the world's calories are from cereals; thus, it has been a longstanding problem in genetic engineering to transfer this ability to these crops. This would reduce the need for nitrogenous fertilizer and the economic, environmental, and energy burdens that it brings.
  • the present disclosure is based, at least in part, rhizobia and methods for making rhizobia that can fix nitrogen under aerobic free-living conditions.
  • the present disclosure also provides refactored nif-clusters that confer the ability to fix nitrogen under aerobic free-living conditions.
  • one aspect of the present disclosure provides a rhizobium that can fix nitrogen under aerobic free-living conditions, comprising a symbiotic rhizobium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans .
  • the exogenous nif cluster is from a free-living diazotroph.
  • the exogenous nif cluster is from a symbiotic diazotroph.
  • the exogenous nif cluster is from a photosynthetic Alphaproteobacteria. In some embodiments, the exogenous nif cluster is from a Gammaproteobacteria. In some embodiments, the exogenous nif cluster is from a cyanobacteria. In some embodiments, the exogenous nif cluster is from a firmicutes. In some embodiments, the exogenous nif cluster is from Rhodobacter sphaeroides . In some embodiments, the exogenous nif cluster is from Rhodopseudomonas palustris . In some embodiments, the exogenous nif cluster is an inducible refactored nif cluster.
  • the inducible refactored nif cluster is an inducible refactored Klebsiella nif cluster.
  • the rhizobium is IRBG74.
  • the exogenous nif cluster comprises 6 nif genes.
  • the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM.
  • each nif gene of the exogenous nif cluster is preceded by a T7 promoter.
  • the T7 promoter is a wild-type promoter.
  • the rhizobium further comprises an endogenous nif cluster.
  • the nif cluster has a nifV gene.
  • the nifV gene is endogenous.
  • the exogenous nif cluster further comprises a terminator.
  • the T7 promoter has a terminator and the terminator is downstream from the T7 promoter.
  • the exogenous nif cluster is a refactored v3.2 nif cluster as shown in FIG. 2H .
  • a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions, comprising a bacterium having an exogenous nif cluster having at least one inducible promoter, wherein the exogenous nif cluster confers nitrogen fixation capability on the bacterium, under aerobic free-living conditions, and wherein the bacterium is not Azorhizobium caulinodans .
  • the bacterium is a symbiotic bacterium.
  • the bacterium is an endophyte.
  • the endophyte is rhizobium IRBG74.
  • the bacterium is an epiphyte.
  • the epiphyte is Pseudomonas protogens PF-5.
  • the plant growth promoting bacterium is associated with a genetically modified cereal plant.
  • the genetically modified cereal plant includes an exogenous gene encoding a chemical signal.
  • the nitrogen fixation is under the control of the chemical signal.
  • the chemical signal is an opine (e.g., octopine, nopaine, or mannopine), phlorogluconol or rhizopene.
  • the exogenous nif cluster comprises 6 nif genes.
  • the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM.
  • the inducible promoter is a T7 promoter.
  • the inducible promoter is P A1lacO1 promoter.
  • the inducible promoter is activated by an agent selected from a group that includes IPTG, sodium salicylate, octapine, nopaline, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid.
  • the exogenous nif cluster further comprises a terminator.
  • the inducible promoter has a terminator and the terminator is downstream from the inducible promoter.
  • an Azorhizobium caulinodans capable of inducible ammonium-independent nitrogen fixation in a cereal crop, comprising: (i) a modified nif cluster, wherein an endogenous nifA gene is deleted or altered; and (ii) at least one operon comprising nifA and RNA polymerase sigma factor (RpoN), wherein the operon comprises a regulatory element including an inducible promoter.
  • the inducible promoter is P A1lacO1 promotor.
  • the inducible promoter is activated by an agent selected from the group consisting of IPTG, sodium salicylate, octapine, nopaline, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid.
  • the endogenous nifA gene is altered with at least one of the following substitutions: (i) L94Q, (ii) D95Q, and (iii) both L94Q and D95Q.
  • Another aspect of the present disclosure provides a method of engineering a rhizobium that can fix nitrogen under aerobic free-living conditions, comprising transferring an exogenous nif cluster to a symbiotic rhizobium, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium, under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans .
  • the exogenous nif cluster comprises 6 nif genes.
  • the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifF and nifUSVWZM.
  • each of the nif genes is preceded by a wild-type T7 promoter.
  • the exogenous nif cluster is transferred to the rhizobium in a plasmid.
  • the exogenous nif cluster further comprises a terminator.
  • the wild-type T7 promoter has a terminator, and the terminator is downstream from the wild-type T7 promoter.
  • the endogenous NifL gene is deleted.
  • Another aspect of the present disclosure provides a method of producing nitrogen for consumption by a cereal plant, comprising providing a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions in proximity of the cereal plant, wherein the plant growth promoting bacterium is a symbiotic bacterium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic bacterium, enabling nitrogen fixation under aerobic free-living conditions.
  • the plant growth promoting bacterium is a rhizobium.
  • the plant growth bacterium is a bacterium as described in the present disclosure.
  • the cereal plant is a genetically modified cereal plant.
  • the genetically modified cereal plant includes an exogenous gene encoding a chemical signal.
  • the nitrogen fixation is under the control of the chemical signal.
  • the chemical signal is opine, phlorogluconol or rhizopene.
  • the nitrogen fixation is under the control of a chemical signal.
  • the chemical signal is a root exudate, biocontrol agent or phytohormone.
  • the root exudate is selected from the group consisting of sugars, hormones, flavonoids, and antimicrobials.
  • the chemical signal is vanillate.
  • the chemical signal is IPTG, aTc, cuminic acid, DAPG, and salicylic acid, 3,4-dihydroxybenzoic acid, 3OC6HSL or 3OC14HSL.
  • the disclosure also provides a genetically engineered plant that can produce orthogonal carbon sources, such as opines or less common sugars, and bacteria with the corresponding catabolism pathways, which can respond to these signals.
  • the present disclosure provides a method for making a nitrogen-fixing bacterium, the method comprising a) identifying a host bacterium; b) selecting a donor bacterium having a nif cluster based on evolutionary distance between the host bacterium and the donor bacterium; and c) inserting the nif cluster of the donor bacterium to the host bacterium, thereby making a nitrogen-fixing bacterium.
  • the evolutionary distance between the host bacterium and the donor bacterium is less than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.1% substitutions per site in 16S ribosomal RNA gene sequence.
  • the host bacterium and the donor bacterium are in the same genus, family, order, or class.
  • the donor bacterium is selected from Klebsiella, Pseudomonas, Azotobacter, Gluconacetobacter, Azospirillum, Azorhizobium, Rhodopseudomonas, Rhodobacter , Cyanothece, or Paenibacillus genus.
  • the host bacterium is selected from the group consisting of E. coli, Pseudomonas protegens Pf-5, and Rhizobium IRBG74.
  • the donor bacterium is selected from the group consisting of K. oxytoca, P. stutzeri, A. vinelandii, G. diazotrophicus, A.
  • the host bacterium is E. coli and the donor bacterium is K. oxytoca .
  • the host bacterium is Pseudomonas protegens Pf-5, and the donor bacterium is P. stutzeri .
  • the host bacterium is Rhizobium IRBG74, and the donor bacterium is R. sphaeroides .
  • the host bacterium is a nonsymbiotic bacterium, e.g., Azotobacter, Beijerinckia , or Clostridium bacterium.
  • the host bacterium is a symbiotic bacterium, e.g., Rhizobium, Frankia , or Azospirillum bacterium.
  • the host bacterium is symbiotic with a leguminous plant, an actinorhizal plant, or a cereal crop.
  • the inserted nif cluster is under inducible control.
  • the present disclosure provides a method of selecting a nif cluster of a donor bacterium that is compatible with a host bacterium, the method comprising a) performing a phylogenetic analysis for the donor bacterium and the host bacterium; b) determining evolutionary distance based on the phylogenetic analysis between the donor bacterium and the host bacterium is less than a reference value; and c) selecting the nif cluster of the donor bacterium for the host bacterium.
  • the phylogenetic analysis is performed by using distance-matrix, maximum parsimony, maximum likelihood, or Bayesian inference.
  • the phylogenetic analysis is performed by analyzing ribosomal RNA (e.g., 16s rRNA) substitution rate.
  • the reference value is 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.1% substitutions per site in 16S ribosomal RNA gene sequence.
  • the reference value is 500, 400, 300, 200, 100, 50, or 10 million years.
  • the method further comprises inserting the nif cluster to the host bacterium and evaluating the nitrogen fixation activity.
  • the nif cluster is under inducible control.
  • the present disclosure provides a bacterium comprising a nif cluster, where the nif cluster is under control of an exogenous control genetic element.
  • the nif cluster is an endogenous or exogenous nif cluster.
  • the exogenous control genetic element initiates promoter activities in response to an inducer (e.g., a chemical signal).
  • the promoter activities are measured by the below equation:
  • the inducer is delivered to the bacterium by chemical delivery or biocontrol delivery.
  • the inducer is a chemical signal in seeds (e.g., cuminic acid), a native root exudate (e.g., arabinose, salicylic acid, vanillic acid, or narigenin), a chemical signal from a bacterium (e.g., 3OC6HSL, 3OC14HSL, DHBA, or DAPG), or a chemical signal from a genetically modified plant (e.g., Nopaline or Octopine).
  • seeds e.g., cuminic acid
  • a native root exudate e.g., arabinose, salicylic acid, vanillic acid, or narigenin
  • a chemical signal from a bacterium e.g., 3OC6HSL, 3OC14HSL, DHBA, or DAPG
  • a genetically modified plant e.g., Nopaline or Octopine
  • FIGS. 1A-1F include diagrams showing transfer of nif clusters across species.
  • FIG. 1A Eight nif clusters from free-living nitrogen fixing bacteria are aligned based on phylogenetic relationships of 16S rRNA sequences. The genes and operons are based on K. oxytoca M5al. Dots in the DNA line indicate where multiple regions were cloned from genomic DNA and combined to form one large plasmid-borne nif cluster. A complete list of strain genotypes is provided in Table 3. Nitrogenase activity from transfer of the native nif clusters was measured in three species. The activities of the R. palustris and R. sphaeroides nif clusters were also measured in 12 Rhizobia strains.
  • FIG. 1B Transcriptomic profile of the native K. oxytoca nif cluster in K. oxytoca , compared with those obtained from its transfer to the indicated species.
  • FIG. 1C Transcription levels (FPKM) of the native K. oxytoca nif cluster across species. Transcriptional units are underlined.
  • FIG. 1D Transcription levels (FPKM) of the K. oxytoca nif genes in K. oxytoca ( ⁇ Klebsiella ) compared to that obtained when transferred to a new host.
  • FIG. 1E Same as in ( FIG.
  • FIG. 1C Same as in ( FIG. 1D ), except the ribosome densities (RD) are compared, as calculated using ribosome profiling.
  • FIGS. 2A-2M include diagrams showing the transfer of the refactored K. oxytoca nif clusters to R. sp. IRBG74.
  • FIG. 2A The genetic systems for the controller for E. coli MG1655 (left) and R. sp. IRBG74 (right) are shown.
  • a variant of T7 RNAP (R6232S, N-terminal lon tag, GTG start codon) is used for the E. coli controller.
  • Several genetic parts were substituted to build the R. sp. IRBG74 controller ( FIG. 16 ). The sequences for the genetic parts are provided in Table 5. ( FIG.
  • FIG. 2B The response functions for the controllers with the reporter plasmid pMR-79 (Table 4 and Table 5). The IPTG concentrations used to induce nitrogenase were circled.
  • FIG. 2C The genetic parts used to build the refactored v2.1 nif gene cluster are shown (Table 5).
  • FIG. 2D The activity of the refactored nif gene cluster v2.1 in different hosts is shown. Asterisks indicate ethylene production below the detection limit ( ⁇ 10 a.u.).
  • FIG. 2E The activities of the v2.1 promoters and terminators in E. coli MG1655 and R. sp. IRBG74 as calculated from RNA-seq data (see Materials and Methods).
  • FIGS. 2F The translation efficiency of the v2.1 nif genes in E. coli MG1655 and R. sp. IRBG74, as calculated using ribosome profiling and RNA-seq. Lines connect points that occur in the same operon.
  • FIG. 2G The ribosome density (RD) is compared for the refactored v2.1 nif genes in a new host ( E. coli MG155; R. sp. IRBG74) versus that measured for the nif genes from the native K. oxytoca cluster in K. oxytoca ( ⁇ Klebsiella ). The points corresponding to nifH is marked H.
  • FIGS. 2H-2L The same as ( FIGS.
  • FIG. 2M Nitrogenase activity is shown as a function of T7 promoter strength.
  • the refactored nif cluster v3.2 was expressed from three controller strains with varying strengths ( FIG. 16 ). Error bars represent s.d. from three independent experiments.
  • FIGS. 3A-3F include diagrams showing the control of nitrogen fixation in A. caulinodans ORS571.
  • FIG. 3A The controller is shown, carried on a pBBR1 origin plasmid (genetic parts are provided in Table 5). NifA and RpoN co-induce the expression of three sites in the genome (identified by consensus NifA binding sequences).
  • FIG. 3B Expression from the nifH promoter was evaluated using a fluorescent reporter (see Materials and Methods). NifA and RpoN were complemented (+) individually or in combination in the A. caulinodans ⁇ nifA strain where the genomic rpoN remains intact.
  • FIG. 3A The controller is shown, carried on a pBBR1 origin plasmid (genetic parts are provided in Table 5). NifA and RpoN co-induce the expression of three sites in the genome (identified by consensus NifA binding sequences).
  • FIG. 3B Expression from the nifH promoter was evaluated
  • FIG. 3C The response function for the induction of the nifH promoter by the controller is shown.
  • FIG. 3D The nitrogenase activity is shown for wild-type A. caulinodans ORS571 compared to the ⁇ nifA complemented with the controller plasmid (+) and the addition of 1 mM IPTG (+).
  • FIG. 3E The effect of the absence or presence of 10 mM ammonium chloride is shown.
  • the WT NifA from A. caulinodans ORS571 is compared to different combinations of amino acid substitutions with additional RpoN expression.
  • NifA/RpoN expression is induced by 1 mM IPTG (+) for the ⁇ nifA strain containing the controller plasmid pMR-121, 122, 123, and 124 (+). Asterisks indicate ethylene production below the detection limit ( ⁇ 10 au).
  • FIG. 3F The nitrogenase activity is shown as a function of the oxygen concentration in the headspace (see Materials and Methods). The native nif cluster (wild-type A. caulinodans ORS571) is compared to the inducible version including the controller plasmid and 1 mM IPTG. Error bars represent s.d. from three independent experiments.
  • FIGS. 4A-4F include diagrams showing Nitrogenase activity of the inducible nif clusters in Pseudomonas protegens Pf-5.
  • FIG. 4A The controllers, based on P. stutzeri NifA, were used for all three clusters. Plasmids and genetic parts are provided in Table 4 and Table 5.
  • FIG. 4B The nif clusters from K. oxytoca, P. stutzeri , and A. vinelandii are shown. The deleted regions corresponding the NifLA regulators are marked. The dotted lines indicate that multiple regions from the genome were cloned and combined for form the nif cluster.
  • FIG. 4C The induction of the nifH promoters from each species by the controller are shown (0.5 mM IPTG) (see Materials and Methods).
  • FIG. 4D The nitrogenase activities of the native cluster (intact nifLA) is compared to the inducible clusters in the presence and absence of 0.5 mM IPTG.
  • the dashed lines indicate the activity of the native clusters in the wild-type context (top to bottom, K. oxytoca M5al, P. stutzeri A1501 and A. vinelandii DJ).
  • FIG. 4E The sensitivity of the native and inducible (+0.5 mM IPTG) nif clusters to 17.1 mM ammonium acetate are compared. Asterisks indicate ethylene production below the detection limit ( ⁇ 10 au).
  • FIG. 4F The nitrogenase activity is shown as a function of the oxygen concentration in the headspace (see Materials and Methods). The native nif cluster is compared to the inducible version including the controller plasmid and 0.5 mM IPTG. Error bars represent s.d. from three independent experiments.
  • FIGS. 5A-5D include diagrams showing the control of nitrogenase activity with sensors that respond to diverse chemicals in the rhizosphere.
  • FIG. 5A Schematic showing the origins of the chemicals. “Introduced DNA” refers to the genetic modification of the plant to produce nopaline and octopine.
  • FIG. 5B The genetic sensors built for A. caulinodans are shown. Sequences for the genetic parts are provided in Table 5.
  • FIG. 5C The response functions for the sensors are shown. Either the sensor expresses T7 RNAP, which then activates PT7, or it expresses NifA ( P. protegens Pf-5) or NifA/RpoN ( A.
  • FIG. 5D The nitrogenase activity is measured in the presence or absence of inducer (see Materials and Methods).
  • the refactored Klebsiella nif clusters v2.1 and v3.2 were used in E. coli MG1655 and R. sp. IRBG74, respectively.
  • the inducible A. vinelandii nif cluster was used in P. protegens Pf-5.
  • the controller containing nifA/rpoN was used in A. caulinodans ⁇ nifA.
  • the inducer concentrations are: 50 ⁇ M vanillic acid, 500 ⁇ M DHBA, 50 ⁇ M cuminic acid, 25 nM 3OC6HSL, 500 nM 3OC14HSL, 33 ⁇ M arabinose, 100 ⁇ M naringenin, 100 nM DAPG, 200 ⁇ M salicylic acid, 1 mM nopaline and 1 mM octopine. Error bars represent s.d. from three independent experiments.
  • FIG. 6 includes a plot of the growth curve of R. sp. IRBG74 in UMS minimal medium with varying carbon sources.
  • Bacterial growth was spectrophotometrically monitored at OD 600 nm. Error bars represent s.d. from three independent experiments.
  • FIGS. 7A-7F include diagrams showing the nitrogenase activity when different inducible nif clusters are transferred to E. coli MG1655.
  • FIG. 7A The same controller system based on K. oxytoca NifA was used for all three clusters. The controller plasmid pMR-99 and genetic parts are provided in Table 4 and Table 5.
  • FIG. 7B The nif clusters from K. oxytoca, P. stutzeri , and A. vinelandii are shown. The deleted regions corresponding the NifLA regulators are marked. The dotted lines indicate that multiple regions from the genome were cloned and combined for form the nif cluster.
  • FIG. 7C The induction of the nifH promoters from each species by the controller is shown (50 ⁇ M IPTG) (see Materials and Methods)
  • FIG. 7D The nitrogenase activities of the native cluster (intact nifLA) is compared to the inducible clusters in the presence and absence of 50 ⁇ M IPTG.
  • the dashed lines indicate the activity of the native clusters in the wild-type context (top to bottom, K. oxytoca M5al, P. stutzeri A1501 and A. vinelandii DJ).
  • FIG. 7E Regulation of nitrogenase activity by ammonia.
  • FIG. 7F Regulation of nitrogenase activity by oxygen.
  • the native nif cluster is compared to the inducible version including the controller plasmid and 50 ⁇ M IPTG. Nitrogenase activities were measured after 3 h of incubation at constant oxygen concentrations (0 to 3%) in the headspace (see Materials and Methods). Error bars represent s.d. from three independent experiments.
  • FIGS. 8A-8B include plots showing ammonium repression of the transferred nif clusters. Nitrogenase sensitivity to ammonium was measured by nitrogenase assay in the absence ( ⁇ ) or presence (+) of 17.1 mM ammonium acetate. The sensitivity of the native and inducible nif clusters in E. coli MG1655 ( FIG. 8A ) and P. protegens Pf-5 ( FIG. 8B ). Note that the data are from FIGS. 4A-4F and FIGS. 7A-7F . The nif clusters were induced by 50 ⁇ M and 0.5 mM IPTG in E. coli MG1655 and P. protegens Pf-5, respectively. Asterisks indicate ethylene production below the detection limit ( ⁇ 10 au). Error bars represent s.d. from three independent experiments.
  • FIG. 9 includes a diagram showing the ribosome profiling data for the K. oxytoca native nif cluster in K. oxytoca M5al, E. coli MG1655, P. protegens Pf-5 and R. sp. IRBG74 (see Materials and Methods).
  • FIGS. 10A-10B include diagrams showing the effect of NifA overexpression on the nifH promoter activity in R. sp. IRBG74.
  • FIG. 10A The reporter construct used to measure nifH promoter activity is shown. The nifH promoter activity was analyzed in the R. sp. IRBG74 wild-type background using flow cytometry. Additional copies of NifA of R. sp. IRBG74 increased activity of the R. sp. IRBG74 nifH promoter but failed to complement or enhance activity of the other nifH promoters including K. oxytoca, P. stutzeri and A. caulinodans . Error bars represent s.d. from three independent experiments.
  • FIG. 10B Plasmid maps used to assess the effect of NifA overexpression in R. sp. IRBG74. WT, wild-type; Rsp, R. sp. IRBG74; Kox, K. oxytoca M5al; Pst, P. stutzeri A1501; Aca, A. caulinodans ORS571
  • FIGS. 11A-11C include diagrams showing Promoter characterization in R. sp. IRBG74 and P. protegens Pf-5.
  • FIG. 11A Constitutive promoters are rank-ordered by their strength. Plasmids used to measure promoter activity are depicted on the top.
  • FIG. 11B The strength of the T7 promoter wild-type and its variants was analyzed in the controller strains containing the IPTG-inducible T7 RNAP on the genome of R. sp. IRBG74 and P. protegens Pf-5 with 1 mM IPTG induction. A reporter plasmid used to measure T7 promoter activity is shown on the right.
  • FIG. 11C Correlation of T7 promoter strength between species. Error bars represent s.d. from three independent experiments.
  • FIGS. 12A-12B include diagrams showing RBS characterization in R. sp. IRBG74 and P. protegens Pf-5.
  • RBS library for GFP was designed using the RBS library calculator at the highest-resolution mode.
  • FIG. 12A The strengths of the synthetic RBSs in R. sp. IRBG74 were analyzed in the plasmid pMR-40 containing the IPTG-inducible system with 1 mM IPTG induction. 33 of the RBSs spanning a range of 5,684-fold expression were selected and their sequences are provided in Table 6.
  • FIG. 12B The strengths of the synthetic RBSs in P.
  • protegens Pf-5 was analyzed in the plasmid pMR-65 containing the arabinose-inducible system with 7 ⁇ M arabinose induction. 33 of the RBSs spanning a range of 1,075-fold expression were selected and their sequences are provided in Table 6.
  • FIGS. 13A-13B include diagrams showing the characterization of terminators for T7 RNAP in R. sp. IRBG74 ( FIG. 13A ) and P. protegens Pf-5 ( FIG. 13B ).
  • FIG. 13A The strength of terminators was analyzed in the controller R. sp. IRBG74 strains MR16 containing the IPTG-inducible T7 RNAP on the genome with 1 mM IPTG induction.
  • FIG. 13B Plasmids used to measure terminator strength are shown on right. Genetic parts are provided in Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 14 includes diagrams showing the response functions for the sensors in R. sp. IRBG74. Plasmids used to characterize the sensors are shown on top of each panel and provided in Table 4. Genetic parts are provided in Table 5. Error bars represent s.d. from three independent experiments. Experimental details are provided in Methods.
  • FIGS. 15A-15C include diagrams showing the response functions for the sensors in P. protegens Pf-5. The output changes as a function of input inducer concentrations. Plasmids used to characterize the sensors are shown on top of each panel.
  • FIG. 15A Inducible promoter characterization in P. protegens Pf-5.
  • FIG. 15B Optimization of the arabinose-inducible systems. Constitutive expression of a plasmid-borne AraE transporter decreased a dissociation constant of arabinose (dark gray). A mutation in the ⁇ 10 region (TACTGT to TATATT) of the P BAD promoter increased promoter strength (black).
  • FIG. 15C Optimization of IPTG-inducible systems.
  • IPTG-inducible promoters were induced by 1 mM IPTG.
  • Plasmids and genetic parts are provided in Table 4 and Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 16 includes diagrams showing the tuning controller strength in R. sp. IRBG74.
  • the controller containing the IPTG-inducible T7 RNAP is integrated into the genome of R. sp. IRBG74 (top). Controller strengths were adjusted by modulating the RBS of T7 RNAP in the plasmids pMR-81, 82, and 83. Response functions of the T7 promoter were measured with the reporter plasmid pMR-79 (right) in the R. sp. IRBG74 controller strains MR16, MR17, and MR18. Genetic parts and RBS sequences are provided in Table 5 and Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 17 includes a plot showing the nitrogenase activity of the refactored nif clusters across species. Error bars represent s.d. from three independent experiments.
  • FIG. 18 includes diagrams showing RNA-seq (top) and Ribosome profiling (bottom) data, respectively in E. coli MG1655 and R. sp. IRBG74. The nif genes were induced by 1 mM IPTG for 6 hours (see Materials and Methods).
  • FIG. 19 includes diagrams showing RNA-seq (top) and ribosome profiling (bottom) data, respectively, in E. coli MG1655 and P. protegens Pf-5 and R. sp. IRBG74.
  • the nif genes were induce by 1 mM, 0.1 mM, and 0.5 mM IPTG for 6 h in E. coli MG1655, P. protegens Pf-5 and R. sp. IRBG74, respectively (see Materials and Methods).
  • FIGS. 20A-20F include diagrams showing the transfer of the refactored nif cluster v3.2 in P. protegens Pf-5.
  • FIG. 20A Controllers whose output is T7 RNAP from the genome of P. protegens Pf-5 are described. Substituted genetic parts including a new RBS and IPTG-inducible promoter for the controller optimization compared to the controller module pKT249 in E. coli MG1655 highlighted in red. The response functions for the controllers with the reporter plasmid pMR-80 was measured in the P. protegens Pf-5 controller strain MR7. Controllers driving the expression of GFP by the T7 promoter achieved large dynamic to 96-fold activation by IPTG.
  • FIG. 20B The genetic parts used to build the refactored v3.2 nif gene cluster are shown (Table 5).
  • FIG. 20C The activity of the refactored nif cluster v3.2. Nitrogenase expression was induced by 1 mM IPTG.
  • FIG. 20D Function of the transcriptional parts of the cluster v3.2 was analyzed by RNA-seq ( FIG. 19 ). The performance of the promoters (left) and terminators (right) was calculated (see Materials and Methods).
  • FIG. 20E The translation efficiency of the nif genes v3.2 as calculated using ribosome profiling and RNA-seq.
  • FIG. 20F The ribosome density (RD) is compared for the refactored v3.2 nif genes in P. protegens Pf-5 versus that measured for the nif genes from the native K. oxytoca cluster in K. oxytoca ( ⁇ Klebsiella ).
  • FIG. 21 includes diagrams showing the response function of inducible promoters in A. caulinodans ORS571. Plasmids used to characterize inducible promoters are shown on top of each panel and provided in Table 4. Genetic parts are provided in Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 22 includes a diagram showing the multiple sequence alignment of NifA of A. caulinodans ORS571 with R. spheroides 2.4.1 was generated using MUSCLE2. The corresponding residues for ammonium tolerance in R. sphaeroides are outlined.
  • the A. caulinodans strand corresponds to SEQ ID NO: 293, and the R. sphaeroides strand corresponds to SEQ ID NO: 292.
  • FIGS. 23A-23B include diagrams showing functional testing of the NifA homologues that activate the nifH promoters.
  • FIG. 23A The ability of the various NifA to activate the nifH promoters was tested with pairwise combinations of the nifH promoters and the NifA in E. coli MG1655 and P. protegens Pf-5. Error bars represent s.d. from three independent experiments.
  • FIG. 23B Plasmids used to measure nifH promoter activity by NifA overexpression are shown and provided in Table 4. Genetic parts are provided in Table 5.
  • FIGS. 24A-24B include diagrams showing optimization of the controllers in P. protegens Pf-5 and E. coli MG1655 that induce the nifH promoters.
  • FIG. 24A The controllers with different strengths were designed by RBS replacement and tested with the reporter plasmids (pMR103-105) in which each of the three nifH promoter is fused to sfgfp (Methods). The nifH promoters were induced with 0.5 mM IPTG. Genetic parts and RBS sequences are provided in Table 5 and 6, respectively.
  • FIG. 24B Activation of the nifH promoters in the E.
  • coli MG1655 containing the controller plasmid pMR102 was tested with the reporter plasmids pMR106-108.
  • the P. protegens Pf-5 controller strain MR10 was used to drive expression of the nifH promoter of K. oxytoca and the controller strain MR9 was used to drive expression of the nifH promoters of P. stutzeri and A. vinelandii .
  • the nifH promoters were induced with 0.05 mM IPTG and 0.5 mM IPTG in E. coli MG1655 and P. protegens Pf-5, respectively. Error bars represent s.d. from three independent experiments.
  • FIG. 25 includes diagrams showing the effect of oxygen on the activity of the nifH promoters. Expression from the nifH promoters was analyzed in E. coli MG1655 containing the controller plasmid pMR102, P. protegens Pf-5 MR10 (for K. oxytoca ) and MR9 (for P. stutzeri and A. vinelandii ) at varying initial oxygen levels in the headspace. The three nifH promoters were induced with 0.05 mM IPTG and 0.5 mM IPTG in E. coli MG1655 and P. protegens Pf-5, respectively, and incubated at varying initial oxygen concentrations. Oxygen has no effects on nifH expression in both strains. Error bars represent s.d. from three independent experiments.
  • FIGS. 26A-26B include diagrams describing the nitrogenase activity assay.
  • FIG. 26A Nitrogenase activity assay at constant oxygen levels in the headspace. Experimental setup used in this study to analyze oxygen tolerance of nitrogenase. Following the expression induction of nitrogenase with preincubation under low oxygen conditions, targeted oxygen concentrations in the headspace is maintained by oxygen spiking while monitoring with oxygen monitoring system (Methods).
  • FIG. 26B Nitrogenase activity in E. Coli MG1655 and P. protegens Pf-5 over a course of three hours.
  • FIG. 27 includes diagrams showing the effect of the rnf and fix complex on nitrogenase activity.
  • the modified nif clusters of A. vinelandii on the plasmids pMR25-28 were analyzed in the controller strain P. protegens Pf-5 MR9.
  • the deleted regions from the clusters were provided in Table 4. Nitrogenase was induced with 0.5 mM IPTG. Removing the rnf complex from the cluster abrogated activity.
  • the cluster without the fixABCX complex showed identical oxygen tolerance to the cluster with the complex. Error bars represent s.d. from three independent experiments.
  • FIGS. 28A-28C include diagrams showing regulation of nitrogenase activity in E. coli MG1655 “Marionette” strain5.
  • FIG. 28A Controller plasmids used to drive expression of T7 promoters.
  • FIG. 28B Inducibility of the T7 promoter by the controller plasmids encoding T7 RNAP under the regulation of the 12 sensors was tested with a reporter plasmid pMR121 (right).
  • FIG. 28C Inducible control of nitrogenase activity in response to 12 inducers was with the plasmid pMR136 (right) carrying the refactored nif cluster v2.1 on pBBR1 origin.
  • the choline-Cl inducible system was omitted for activity assay as the system was not inducible.
  • the refactored cluster was carried on a lower copy number plasmid pMR31 (right) as transformation of the plasmid pMR29 gave rise to no colony formation.
  • the inducers concentrations are: 400 ⁇ M arabinose, 1 mM choline-Cl, 500 nM 3OC14HSL, 50 ⁇ M cuminic acid, 25 nM 3OC6HSL, 25 ⁇ M DAPG, 500 ⁇ M DHBA, 1 mM IPTG, 100 nM aTc, 250 ⁇ M naringenin, 50 ⁇ M vanillic acid, and 250 ⁇ M salicylic acid.
  • Plasmid and genetic parts are provided in Table 4 and 5. Error bars represent s.d. from three independent experiments.
  • FIG. 29 includes schematic plasmid maps used to assess the effect of inducible expression of NifA/RpoN on the activity of the nifH promoter in A. caulinodans ORS571.
  • FIG. 30A-30B include diagrams showing the phylogenetic relationships of 10 diazotrophs based on 16S ribosomal RNA sequences.
  • the scale bar indicates 2% substitutions per site.
  • the clusters based on evolutionary closeness are circled.
  • FIG. 30B shows the relative nitrogenase activity in three host strains ( E. coli, Pseudomonas protegens Pf-5, and Rhizobium sp. IRBG74) carrying each of the 10 nif clusters.
  • the result suggests that the phylogenetic closeness has a predictive power for achieving highest nitrogenase activity in a new host that lacks an endogenous nif cluster.
  • Nitrogen fixation in the root nodules of leguminous plants is a major contributor to world food production and therefore, the practical applications of this field are of major interest. Legumes obtain nitrogen from air through bacteria residing in root nodules, some species of which also associate with cereals but do not fix nitrogen under these conditions. Disabling native regulation can turn on expression, even in the presence of nitrogenous fertilizer and low O 2 , but continuous nitrogenase production confers an energetic burden.
  • the present disclosure in some aspects describes the surprising discovery that bacteria can be genetically altered in a manner that will enable the bacteria to deliver fixed nitrogen to cereal crops.
  • Several strategies to implement control over nitrogen fixation in bacteria that live on or inside the roots of cereals are described. At least two approaches can be taken. In one embodiment, the native regulation is replaced. In alternative embodiments, a nif cluster is transferred from another species and placed under inducible control.
  • the Examples section below includes a description of the achievement of these two approaches in multiple species with multiple constructs. For example, A. caulinodans , ammonium-independent control can be achieved using a sensor to drive the co-expression of a NifA mutant and RpoN in a ⁇ nifA strain. Rhizobium sp.
  • IRBG74 can be engineered to express functional nitrogenase under free living conditions either by transferring a native nif cluster from Rhodobacter or a refactored cluster from Klebsiella .
  • Multiple approaches enable P. protegens Pf-5 to express functional nitrogenase, of which the transfer of the nif cluster from Azotobacter vinelandii DJ yields the highest activity and O 2 tolerance.
  • Rhizobium strain can be engineered to fix nitrogen under free-living conditions when it does not do so naturally.
  • Some Rhizobia isolated from legume root nodules are also cereal endophytes, however most are unable to fix nitrogen under free-living conditions (outside of the nodule) (Ramachandran, V. K., East, A. K., Karunakaran, R., Downie, J. A. & Poole, P. S. Adaptation of Rhizobium leguminosarum to pea, alfalfa and sugar beet rhizospheres investigated by comparative transcriptomics. Genome biology 12, R106 (2011); Frans, J. et al.
  • Rhizobium sp. IRBG74 has been reported in Nitrogen Fixation 33-44 (Springer, 1990).
  • Rhizobium sp. IRBG74 has been reports of cereal yield improvements due to these bacteria, including a 20% increase for rice by Rhizobium sp. IRBG74, but this is likely due to other growth-promoting mechanisms, such as improved nutrient uptake or root formation (Ramachandran, V. K., East, A. K., Karunakaran, R., Downie, J. A. & Poole, P. S. Adaptation of Rhizobium leguminosarum to pea, alfalfa and sugar beet rhizospheres investigated by comparative transcriptomics. Genome biology 12, R106 (2011); Delmotte, N. et al.
  • Cereal crops are broadly defined as any grass cultivated for the edible components of its grain (also referred to as caryopsis), composed of the endosperm, germ, and bran. Cereal crops are considered staple crops in many parts of the world. They are grown in greater quantities and provide more food energy worldwide than any other type of crop.
  • Non-limiting examples of cereal crops include maize, rye, barley, wheat, sorghum, oats, millet and rice.
  • the terms “cereal crop” and “cereal plant” are used interchangeably.
  • Nitrogen fixation is the process by which atmospheric nitrogen is assimilated into organic compounds as part of the nitrogen cycle.
  • the fixation of atmospheric nitrogen associated with specific legumes is the result of a highly specific symbiotic relationship with rhizobial bacteria. These indigenous bacteria dwell in the soil and are responsible for the formation of nodules in the roots of leguminous plants as sites for the nitrogen fixation.
  • Most Rhizobium symbioses are confined to leguminous plants.
  • Rhizobium strains which fix nitrogen in association with the agriculturally-important temperate legumes are usually restricted in their host range to a single legume genus.
  • nif genes are genes encoding enzymes involved in the fixation of atmospheric nitrogen into a form of nitrogen available to living organisms.
  • the primary enzyme encoded by the nif genes is the nitrogenase complex which converts atmospheric nitrogen (N 2 ) to other nitrogen forms (e.g. ammonia) which the organism can process.
  • N 2 atmospheric nitrogen
  • refactored refers to an engineered gene cluster, i.e. its genes have reordered, deleted or altered in some way.
  • Rhizobia are diazotrophic bacteria. In general, they are gram negative, motile, non-sporulating rods. In terms of taxonomy, they fall into two classes: alphaproteobacteria and betaproteobacteria.
  • Non-limiting examples of rhizobia include Azorhizobium caulinodans, Rhizobium (R.) sp. IRBG74, R. radiobacter, R. rhizogenes, R. rubi, R. vitis , Alfalfa Rhizobia ( R.
  • the rhizobia of the present invention are Azorhizobium caulinodans . In some embodiments, the rhizobia of the present invention are not Azorhizobium caulinodans.
  • the term “free-living conditions” refers to a bacterium (e.g. rhizobium) that is not within a leguminous root nodule. It generally refers to something that has not formed a parasitic (or dependent) relationship with another organism or is not on a substrate.
  • the term “symbiotic” refers to the interaction between two organisms living in close proximity. Close proximity can be about 0.2 ⁇ m, 0.4 ⁇ m, 0.6 ⁇ m, 0.8 ⁇ m, 1 ⁇ m, 5 ⁇ m, 10 ⁇ m, 20 ⁇ m, 50 ⁇ m, 100 ⁇ m, 500 ⁇ m, 1 mm, 1 cm, 5 cm, 10 cm. Close proximity can also be less than 0.2 ⁇ m. In many cases, a symbiotic relationship refers to a mutually beneficial interaction.
  • aerobic free-living conditions refer to conditions under which a bacterium is not within a leguminous root nodule and the bacterium is in the presence of oxygen. Aerobic free-living conditions can also be referred to as nonsymbiotic or non-parasitic conditions in the presence of oxygen. The bacterium can be in close proximity to a crop, as defined above.
  • endophyte refers to a group of organisms, often fungi and bacteria, that live within living plant cells for at least part of its life cycle without having an apparent detrimental effect on the plant cell. This is contrasting with an epiphyte, which is a plant that grows on another plant, without being parasitic.
  • diazotroph refers to microorganisms that are able to grow without external sources of fixed nitrogen.
  • the group includes some bacteria and some archae.
  • An example of a free-living diazotroph is Klebsiella pneumoniae.
  • K. pneumoniae is a facultative anaerobes—these species can grow either with or without oxygen, but they only fix nitrogen anaerobically.
  • Alphaproteobacteria refers to a diverse class of bacteria falling under the phylum Proteobacteria.
  • Non-limiting examples of Alphaproteobacteria include species Rhodobacter sphaeroides and Rhodopseudomonas palustris .
  • the term “Gammaproteobacteria” refers to another class of bacteria falling under the phylum of Proteobacteria. All proteobacteria are gram negative.
  • Cyanobacteria refers to a phylum of bacteria that obtain their energy through photosynthesis. They are also referred to as Cyanophyta.
  • Firmicutes refer to a phylum of bacteria. This phylum includes the classes Bacilli, Clostridia, and Thermolithobacteria.
  • Nif genes are genes that encode the enzyme involved in nitrogen fixation. In most cases nif genes occur as an operon. Some of these genes encode the subunits for the nitrogenase complex, which is the primary enzyme imparting the ability to convert atmospheric nitrogen (N 2 ) to forms of nitrogen accessible to living organisms. In most genes, the regulation of the nif gene transcription is conducted by NifA protein, which is responsive to nitrogen levels. When there are nitrogen deficits, NtrC activates NifA expression, which in turn leads to the activation of the remaining nif genes. When nitrogen levels are adequate or in excess, NifL protein, encoded by NifL, inhibits NifA activity.
  • Nif gene pathways are generally sensitive to small changes in expression.
  • Important genes include nifHDK, which form the subunits for nitrogenase.
  • the chaperone NifY is required to achieve full activity and broadens the tolerance to changes in expression level.
  • the nifUSVWZM operon encodes proteins for early Fe—S cluster formation (NifUS) and proteins for component maturation (NifVWZ for Component I and NifM for Component II), whereas nifBQ encodes proteins for FeMo-co core synthesis (NifB) and molybdenum integration (NifQ).
  • NifEN is tolerant to varied expression levels.
  • nif genes include nifH, nifD, nifK, nifE, nifN, nifU, nifS, nifV, nifW, nifX, nifB, nifQ, nifY, nifT, nifJ, nifF, nifX, nifU, and nifS.
  • the nitrogen fixation (nif) genes are organized as genomic clusters, ranging from a 10.5 kb single operon in Paenibacillus to 64 kb divided amongst three genomic locations in A. caulinodans .
  • conserveed genes include those encoding the nitrogenase enzyme (nifHDK), FeMoCo biosynthesis, and chaperones.
  • Species that can fix nitrogen under more conditions tend to have larger gene clusters that include environment-specific paralogues, alternative electron transport routes, and oxygen protective mechanisms. Often, the functions of many genes in the larger clusters are unknown.
  • Nitrogenase is under stringent control because it is oxygen sensitive and energetically expensive: it can make up 20% of the cell mass and each NH 3 requires ⁇ 40 ATP. It is also irreversibly deactivated by oxygen. Across species, transcription of nif genes is strongly repressed by fixed nitrogen (ammonia) and oxygen with these signals converging on the NifA regulatory protein that works in concert with the sigma factor RpoN. Diverse, species-specific, and often poorly understood signals control these regulators, including plant-produced chemicals, ATP, reducing power, temperature, and carbon sources. Those bacteria that can fix nitrogen in a wider range of environmental conditions tend to be controlled by more complex regulatory networks.
  • a nif cluster When a nif cluster is transferred from one species to another, it either preserves its regulation by environmental stimuli or has an unregulated constitutive phenotype. Maintaining the native regulation, notably ammonium repression, limits their use in agriculture because such levels are likely to fluctuate according to soil types, irrigation, and fertilization.
  • Nitrogen-fixing diazotrophs have been engineered to reduce ammonia sensitivity by disrupting NifL or mutating NifA and placing the entire cluster under the control of T7 RNA polymerase (RNAP). Constitutive expression of nitrogenase is also undesirable as it imparts a fitness burden on the cells. For example, when the nif cluster from P. stutzeri A1501 was transferred to P.
  • protegens Pf-5 this was reported to result in sufficient ammonia production to support maize and wheat growth, but the bacteria quickly declined after a month when competing with other species in soil. Constitutive activity is detrimental even before the bacteria are introduced to the soil, impacting production, formulation, and long-term storage. Therefore, uncontrolled nitrogenase production could lead to more expensive production, shorter shelf life, and more in-field variability.
  • nif clusters or nif genes the present disclosure can each be under the control of a regulatory element.
  • 2 or more genes are under the control of a regulatory element.
  • all the genes are under the control of a regulatory element.
  • the regulatory elements may also be activation elements or inhibitory elements.
  • An activation element is a nucleic acid sequence that when presented in context with a nucleic acid to be expressed will cause expression of the nucleic acid in the presence of an activation signal.
  • An inhibitory signal is a nucleic acid sequence that when presented in context with a nucleic acid to be expressed will cause expression of the nucleic acid unless an inhibitory signal is present.
  • Each of the activation and inhibitory elements may be a promoter, such as a bacteriophage T7 promoter, sigma 70 promoter, sigma 54 promoter, lac promoter, etc.
  • promoter is intended to refer to those regulatory sequences which are sufficient to enable the transcription of an operably linked DNA molecule. Promoters may be constitutive or inducible. As used herein, the term “constitutive promoter” refers to a promoter that is always on (i.e. causing transcription at a constant level). Examples of constitutive promoters include, without limitation, sigma 70 promoter, bla promoter, lacI. promoter, etc. Non-limiting examples of inducible promoters are shown in Table 1. The P A1lacO1 promoter is another example of an inducible promoter that can be used in the present invention.
  • regulatory elements e.g. inducible promoters, repressors.
  • Essential regulatory Name Chemical inducer and/or repressor gene(s) ParaBAD L-arabinose (ON) & glucose (OFF) araC (“PBAD”) PrhaBAD L-rhamnose (ON) & glucose (OFF) rhaR &rhaS Plac lactose or IPTG (ON) & glucose lacI (OFF) Ptac lactose or IPTG (ON) lacI Plux acyl-homoserine lactone (ON) luxR Ptet tetracycline or aTc (ON) tetR Psal salycilate (ON) nahR Ptrp tryptophan (OFF) (NONE) Ppho phosphate (OFF) phoB & phoR
  • Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only.
  • Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art.
  • inducible promoters regulated by exogenously supplied promoters include the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl. Acad. Sci.
  • inducible promoters which may be useful in this context are those which are regulated by a specific physiological state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only.
  • terminatator is a section of nucleic acid sequence that marks the end of a gene or operon in genomic DNA during transcription. They stop transcription of a polymerase. Terminators can be classified into several groups. At the first group of termination signals the core enzyme can terminate in vitro at certain sites in the absence of any other factors (as tested in vitro). These sites of termination are called intrinsic terminators or also class I terminators. Intrinsic terminators usually share one common structural feature, the so called hairpin or stem-loop structure. On the one hand the hairpin comprises a stem structure, encoded by a dG-dC rich sequence of dyad symmetrical structure.
  • the terminator also exhibits a dA-dT rich region at the 3′-end directly following the stem structure.
  • the uridine rich region at the 3′ end is thought to facilitate transcript release when RNA polymerase pauses at hairpin structures.
  • Two or more terminators can be operatively linked if they are positioned to each other to provide concerted termination of a preceding coding sequence.
  • the terminator sequences are downstream of coding sequences, i.e. on the 3′ position of the coding sequence.
  • the terminator can e.g.
  • terminators include, but are not limited to, T7 terminator, rrnBT1, L3S2P21, tonB, rrnA, rrnB, rrnD, RNAI, crp, his, ilv lambda, M13, rpoC, and trp (see for example U.S. Pat. No. 9,745,588, incorporated herein by reference).
  • RpoN refers to a gene that encodes the sigma factor sigma-54 ( ⁇ 54, sigma N, or RpoN), a protein in Escherichia coli and other species of bacteria.
  • Sigma factors are initiation factors that promote attachment of RNA polymerase to specific initiation sites and are then released.
  • Bacteria normally only have one functional copy of the alternative sigma factor, ⁇ 54 or RpoN, which regulates a complex genetic network that extends into various facets of bacterial physiology, including metabolism, survival in strenuous environments, production of virulence factors, and formation of biofilms.
  • RpoN is one of seven RNA polymerase sigma subunits in E.
  • RpoN required for promoter-initiated transcription and RpoN plays a major role in the response of E. coli to nitrogen-limiting conditions. Under such conditions, RpoN directs the transcription of at least 14 E. coli operons/regulators in the nitrogen regulatory (Ntr) response. RpoN also plays an important role in stress resistance (e.g. resistance to osmotic stress) and virulence of bacteria. RpoN is structurally and functionally distinct from the other E. coli ⁇ factors. It is able to bind promoter DNA in the absence of core RNA polymerase and it recognizes promoter sequences with conserved GG and GC elements located ⁇ 24 to ⁇ 12 nucleotides upstream of the transcription start site. Additionally, Regulatory proteins like NtrB and NtrC can activate ⁇ 54 holoenzyme.
  • RpoN works in concert with NifA to turn on the transcription of nif clusters.
  • An exemplary sequence for RpoN is provided in Table 5.
  • a “gene cluster” or “genetic cluster” refers to a set of two or more genes that encode gene products.
  • a target, naturally occurring, or wild type genetic cluster can be used as the original model for refactoring.
  • the gene products are enzymes.
  • the gene products of a genetic cluster function in a biosynthetic pathway.
  • the gene cluster encodes proteins of the nif nitrogen fixation pathway.
  • the genetic clusters can encode proteins of a biosynthetic pathway.
  • a biosynthetic pathway refers to any pathway found in a biological system that involves more than one protein. In some instances, these pathways involve 2-1,000 proteins. In other instances the number of proteins involved in a biosynthetic pathway may be 2-500, 2-100, 5-1000, 5-500, 5-100, 5-10, 10-1,000, 10-900, 10-800, 10-700, 10-600, 10-500, 10-400, 10-300, 10-200, 10-100, 50-1,000, 50-500, 50-100, 100-1,000, or 100-500. Examples of biosynthetic pathways include but are not limited to the nitrogen fixation pathway.
  • the refactored genetic clusters have naturally occurring non-coding DNA, naturally occurring regulatory sequences, and/or non-essential genes that have been removed from at least one or in some instances all of the transcriptional units. These can be replaced by synthetic regulatory sequences, not replaced at all or replaced by spacers.
  • a spacer simply refers to a set of nucleotides or analogs thereof that don't have a function such as coding for a protein or in any way regulating the activity of the gene cluster.
  • the genetic components in the genetic cluster typically will include at least one regulatory element.
  • a synthetic regulatory element is any nucleic acid sequence which plays a role in regulating gene expression and which differs from the naturally occurring regulatory element. It may differ for instance by a single nucleotide from the naturally occurring element. In some cases, it is an exogenous regulatory element (i.e. not identical to the naturally occurring version).
  • a “regulatory element” refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation or rate, or stability and/or mobility of a transcription or translation product.
  • Regulatory regions include, without limitation, promoter sequences, ribosome binding sites, ribozymes, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5′ and 3′ untranslated regions (UTRs), transcriptional start sites, transcription terminator sequences, polyadenylation sequences, introns, and combinations thereof.
  • a genetic cluster includes a nucleotide sequence that is at least about 85% or more homologous or identical to the entire length of a naturally occurring genetic cluster sequence, e.g., at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50% or more of the full length naturally occurring genetic cluster sequence).
  • the nucleotide sequence is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homologous or identical to a naturally occurring genetic cluster sequence.
  • the nucleotide sequence is at least about 85%, e.g., is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homologous or identical to a genetic cluster sequence, in a fragment thereof or a region that is much more conserved, such as an essential, but has lower sequence identity outside that region.
  • the gene clusters are native gene clusters. In some embodiments, the gene clusters are refactored gene clusters. In some instances, the nucleic acids may include non-naturally occurring nucleotides and/or substitutions, i.e. Sugar or base substitutions or modifications.
  • One or more substituted sugar moieties include, e.g., one of the following at the 2′ position: OH, SH, SCH3, F, OCN, OCH3OCH3, OCH3O(CH2)n CH3, O(CH2)n NH2 or O(CH2)n CH3 where n is from 1 to about 10; Ci to C10 lower alkyl, alkoxyalkoxy, substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF3; OCF3; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; SOCH3; SO2 CH3; ONO2; NO2; N3; NH2; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties of a nucleic acid; or
  • Modified nucleobases include nucleobases found only infrequently or transiently in natural nucleic acids, e.g., hypoxanthine, 6-methyladenine, 5-Me pyrimidines, particularly 5-methylcytosine (also referred to as 5-methyl-2′ deoxycytosine and often referred to in the art as 5-Me-C), 5-hydroxymethylcytosine (HMC), glycosyl HMC and gentobiosyl HMC, isocytosine, pseudoisocytosine, as well as synthetic nucleobases, e.g., 2-aminoadenine, 2-(methylamino)adenine, 2-(imidazolylalkyl)adenine, 2-(aminoalklyamino)adenine or other heterosubstituted alkyladenines, 2-thiouracil, 2-thiothymine, 5-bromouracil, 5-hydroxymethyluracil, 5-propynyluracil, 8-azaguanine,
  • the present disclosure also provides methods of selecting a nif cluster of a donor bacterium that is compatible with a host bacterium.
  • the methods involve performing a phylogenetic analysis for the donor bacterium and the host bacterium.
  • a phylogenetic analysis is a method of estimating the evolutionary relationships.
  • the sequence of a common gene or protein can be used to assess the evolutionary relationship of species.
  • phylogenetic analysis is performed based on the rRNA (e.g., the full-length 16S rRNA gene) sequences. These sequence include e.g., K. oxytoca , BWI76_05380; A. vinelandii , Avin_55000; R. sphaeroides , DQL45_00005; Cyanothece ATCC51142, cce_RNA045 ; A. brasilense , AMK58_25190; R.
  • rRNA e.g., the full-length 16S rRNA gene
  • a multiple sequence alignment can be generated using MUSCLE (Edgar, R. C. J. N. a. r. MUSCLE: multiple sequence alignment with high accuracy and high throughput. 32, 1792-1797 (2004)).
  • a phylogenetic tree is then constructed using the Jukes-Cantor distance model and UPGMA as a tree build method.
  • the phylogenetic closeness has a predictive power for nitrogenase activity of transferring a nif cluster in a new host.
  • the host bacterium and the donor bacterium are in the same genus, family, order, or class.
  • the donor bacterium is selected from Klebsiella, Pseudomonas, Azotobacter, Gluconacetobacter, Azospirillum, Azorhizobium, Rhodopseudomonas, Rhodobacter , Cyanothece, or Paenibacillus genus.
  • the methods can also involve transferring the nif cluster to the host bacterium and determining the nitrogenase activity.
  • the regulatory region is a constitutive promoter. In some cases, the regulatory region is an inducible promoter. In some cases, the regulatory region is a root-active promoter that can confer transcription in root tissue, e.g., root endodermis, root epidermis, or root vascular tissues. In some embodiments, root-active promoters can include the root-specific subdomains of the CaMV 35S promoter (Lam et al., Proc. Natl. Acad. Sci. USA, 86:7890-7894 (1989)), root cell specific promoters of Conkling et al., Plant Physiol., 93:1203-1211 (1990), or the tobacco RD2 promoter.
  • Modified plants can be grown in suspension culture, or tissue or organ culture.
  • modified plant cells can be placed directly onto the medium or can be placed onto a filter that is then placed in contact with the medium.
  • modified plant cells can be placed onto a flotation device, e.g., a porous membrane that contacts the liquid medium.
  • a solid medium can be, for example, Murashige and Skoog (MS) medium containing agar and a suitable concentration of an auxin, e.g., 2,4-dichlorophenoxyacetic acid (2,4-D), and a suitable concentration of a cytokinin, e.g., kinetin.
  • a reporter sequence encoding a reporter polypeptide having a reporter activity can be included in the transformation procedure and an assay for reporter activity or expression can be performed at a suitable time after transformation.
  • a suitable time for conducting the assay typically is about 1-21 days after transformation, e.g., about 1-14 days, about 1-7 days, or about 1-3 days.
  • the use of transient assays is particularly convenient for rapid analysis in different species, or to confirm expression of a polypeptide whose expression has not previously been confirmed in particular recipient cells.
  • the disclosure provides a genetically engineered bacterium that contains a regulatory sequence or a genetic sensor that regulates the nitrogenase activity in response to a chemical signal (e.g., an environmental signal or artificial signal).
  • a chemical signal e.g., an environmental signal or artificial signal.
  • the chemical signal can be an environmental signal such as ammonia, IPTG, or oxygen.
  • the nif cluster is placed under the control of a genetic sensor that can respond to the chemical signal.
  • the genetic sensor can respond to biocontrol agents or components of added fertilizer and other treatments (e.g., DAPG).
  • the genetic sensor can respond to root exudates from a plant, including e.g., sugar such as arabinose, hormones such as salicylic acids, flavonoids such as naringenin, antimicrobials such as vanillic acid, and various chemicals that can remodel the microbial community (e.g., cuminic acid).
  • the genetic sensor can respond to chemicals released by other bacteria including e.g., 3,4-dihydroxybenzoic acid (DHBA), 3OC6HSL or 3OC14HSL.
  • DHBA 3,4-dihydroxybenzoic acid
  • 3OC6HSL 3OC14HSL
  • the arabinose and naringenin sensors are used to express NifA, which leads to the induction of the nifH promoter and nitrogenase activity in P. protegens Pf-5.
  • the DAPG sensor is used to drive T7 RNAP, which then induces nitrogenase activity.
  • the DAPG sensor is used to drive T7 RNAP, which then induces nitrogenase activity in R. sp. IRBG74.
  • the salicylic acid sensor is used to control NifA L94Q/D95Q /RpoN expression, which then activates nitrogenase activity.
  • the salicylic acid sensor is used to control NifA L94Q/D95Q /RpoN expression, which then activates nitrogenase activity in A. caulinodans.
  • a plant is engineered to release an orthogonal chemical signal that can be sensed by a corresponding engineered bacterium. This would have the benefit of only inducing nitrogenase in the presence of the engineered crop.
  • legumes and Arabidopsis are engineered to produce opines, including nopaline and octopine.
  • an engineered bacterium contains sensors for nopaline and octopine.
  • an engineered bacterium contains the LysR-type transcriptional activators OccR (octopine) and NocR (nopaline) and their corresponding promoters.
  • sensors for nopaline and octopine are used to control the expression of NifA L94Q/D95Q /RpoN, which then activates nitrogenase activity.
  • the activity of a promoter is defined as the change in RNAP flux ⁇ J around a transcription start site x tss .
  • the promoter strength or the regulatory element strength is calculated using the below equation:
  • m(i) is the number of transcripts at each position i from the FPKM-normalized transcriptomic profiles
  • n is the window length before and after x tss .
  • the window length is set to ten.
  • the Ts is defined as the fold-decrease in transcription before and after a terminator, which can be quantified from the FPKM-normalized transcriptomic profiles as:
  • ⁇ i x 0 - 1 x 0 - n ⁇ m ⁇ ( i )
  • x 0 and x 1 are the beginning and end positions of the terminator part, respectively.
  • the translation efficiency was calculated by dividing the ribosome density by the FPKM.
  • RNA expression As used herein, the equivalent terms “expression” or “gene expression” are intended to refer to the transcription of a DNA molecule into RNA, and the translation of such RNA into a polypeptide.
  • a “gene cluster” refers to a set of two or more genes that encode gene products.
  • a “nif gene cluster” refers to a set of two or more genes that encode nitrogen fixation genes.
  • exogenous indicates that the nucleic acid or gene is not in its natural (native) environment.
  • an exogenous gene can refer to a gene that is from a different species.
  • endogenous indicates that the gene is in its native environment.
  • endogenous and nonative are used interchangeably.
  • the term “delete” or “deleted” refers to the removal of a gene (e.g. endogenous gene) from a sequence or cluster.
  • the term “alter” or “altered” refers to the modification of one or more nucleotides in a gene or the deletion of one or more base pairs in a gene. This alteration may render the gene dysfunctional.
  • ⁇ nifA refers to a strain or cluster within which NifA was deleted or altered. Method of deletion and alteration, in the context of genes, are known in the art.
  • chemical signals refers to chemical compounds. Any substance consisting of two or more different types of atoms (chemical elements) in a fixed stoichiometric proportion can be termed a chemical compound. Chemical signals can be synthetic or natural chemical compounds.
  • a bacterium of the present disclosure or a sensor of the present disclosure is under the control of a chemical signal.
  • the signal is a native biological signal (e.g. root exudate, biological control agent, etc.).
  • the chemical signal is a quorum sensing signal from the bacterium.
  • Non-limiting examples of chemical signals include root exudates (as defined below), biocontrol agents (as defined below), phytohormones, vanillate, IPTG, aTc, cuminic acid, DAPG, and salicylic acid, 3,4-dihydroxybenzoic acid, 3OC6HSL and 3OC14HSL.
  • Root exudate refers to chemicals secreted or emitted by plant roots in response to their environment. These allow plant to manipulate or alter their immediate environment, specifically their rhizosphere. Root exudates are a complex mixture of soluble organic substances, which may contain sugars, amino acids, organic acids, enzymes, and other substances. Root exudates include, but are not limited to, ions, carbon-based compounds, amino acids, sterols, sugars, hormones (phytohormones), flavonoids, antimicrobials, and many other chemical compounds. The exudates can serve as either positive regulators or negative regulators.
  • phytohormone refers plant hormones and they are any of various hormones produced by plants that influence process such as germination, growth, and metabolism in the plant.
  • vanillate refers to a methoxybenzoate that is the conjugate base of vanillic acid. It is a plant metabolite.
  • Biocontrol or biocontrol is a method of controlling pests such as insects, mites, weeds and plant diseases using other organisms. Natural enemies of insect pests, also known as biological control agents, include predators, parasitoids, pathogens, and competitors. Biological control agents of plant diseases are most often referred to as antagonists. Biological control agents of weeds include seed predators, herbivores and plant pathogens.
  • the inducible clusters or promoters of the present invention may be modulated by a secretion of (or chemical otherwise associated with) a biological control agent.
  • a biological control agent that is referred to as a “biocontrol agent”.
  • inducible nitrogenase activity is engineered in two cereal endophytes ( Azorhizobium caulinodans ORS571 and Rhizobium sp. IRBG74) and the epiphyte Pseudomonas protegens Pf-5, a maize seed inoculant.
  • IRBG74 the epiphyte Pseudomonas protegens Pf-5
  • different strategies are taken to eliminate ammonium repression and place nitrogenase expression under the control of agriculturally-relevant signals, including root exudates, biocontrol agents, and phytohormones.
  • Rhizobium sp The present disclosure demonstrates that Rhizobium sp.
  • IRBG74 can be engineered to fix nitrogen under free living conditions, inter alia, by transferring either a nif cluster from Rhodobacter or Klebsiella .
  • Rhodobacter e.g., Rhodobacter
  • Klebsiella e.g., IRBG74
  • the transfer of an inducible cluster from Azotobacter vinelandii yields the highest ammonia and oxygen tolerance.
  • E. coli DH10-beta New England Biolabs, MA, Cat #C3019 was used for cloning.
  • E. coli K-12 MG1655 was used for the nitrogenase assay.
  • P. protegens Pf-5 was obtained from the ATCC (BAA-477). Strains used in this study are listed in Table 3.
  • LB medium (10 g/L tryptone, 5 g/L yeast extract, 10 g/L NaCl)
  • LB-Lennox medium (10 g/L tryptone, 5 g/L yeast extract, 5 g/L NaCl)
  • TY medium 5 g/L tryptone, 3 g/L yeast extract, 0.87 g/L CaCl 2 .2H 2 O
  • BB medium (0.25 g/L MgSO 4 .7H 2 O, 1 g/L NaCl, 0.1 g/L CaCl 2 .2H 2 O, 2.9 mg/L FeCl 3 , 0.25 mg/L Na 2 MoO 4 .2H 2 O, 1.32 g/L NH 4 CH 3 CO 2 , 25 g/L Na 2 HPO 4 , 3 g/L KH 2 PO 4 pH [7.4]), UMS medium (0.5 g/L MgSO 4 .7H 2 O, 0.2 g/L NaCl, 0.375 mg/L EDTA-Na 2 , 0.16 ZnSO 4 .7H 2 O, 0.2 mg/L Na 2 MoO 4 .2H 2 O, 0.25 mg/L H 3 BO 3 , 0.2 mg/L MnSO 4 .H 2 O, 0.02 mg/L CuSO 4 .5H 2 O, 1 mg/L CoCl 2 .6H 2 O, 75 mg
  • Antibiotics were used at the following concentrations ( ⁇ g/mL): E. coli (kanamycin, 50; spectinomycin, 100; tetracycline, 15; gentamicin, 15). P. protegens Pf-5 (kanamycin, 30; tetracycline, 50; gentamicin, 15; carbenicillin, 50). R. sp. IRBG74 (neomycin, 150; gentamicin, 150; tetracycline, 10; nitrofurantoin, 10). A. caulinodans (kanamycin, 30; gentamicin, 15; tetracycline, 10; nitrofurantoin, 10). Chemicals including inducers used in this study are listed in Table 7.
  • Two homology arms of ⁇ 500 bp flanking the hsdR gene were amplified by PCR, cloned and yielded a suicide plasmid pMR-44.
  • the suicide plasmid was mobilized into R. sp. IRBG74 by triparental mating.
  • Single-crossover recombinants were selected for resistance to gentamicin and subsequently grown and plated on LB plates supplemented with 15% sucrose to induce deletion of the vector DNA part containing the counter selective marker sacB which converts sucrose into a toxic product (levan).
  • nifHDKENX gene clusters encompassing nifHDKENX (genomic location 219.579-227, 127) and nifSW-fixABCX-nifAB-fdxN-nifTZ (genomic location 234, 635-234, 802) of R. sp. IRBG74 were sequentially deleted using pMR45-46.
  • recA gene was deleted using the plasmid pMR47.
  • the R. sp. IRBG74 ⁇ nif, hsdR, recA strain was the basis for all experiments unless indicated otherwise.
  • Plasmids with the pBBR1 origin were derived from pMQ131 and pMQ132. Plasmids with the pRO1600 origin were derived from pMQ80. Plasmids with the RK2 origin were derived from pJP2. Plasmids with the RSF1010 origin were derived from pSEVA651. Plasmids with the IncW origin were derived from pKT249. Plasmids used in this study are provided in Table 4.
  • Phylogenetic analysis was performed based on the full-length 16S rRNA gene sequences ( K. oxytoca , BWI76_05380; A. vinelandii , Avin_55000; R. sphaeroides , DQL45_00005; Cyanothece ATCC51142, cce_RNA045 ; A. brasilense , AMK58_25190; R. palustris , RNA_55; P. protegens , PST_0759; Paenibacillus sp. WLY78, JQ003557).
  • a multiple sequence alignment was generated using MUSCLE (Edgar, R. C. J. N. a. r.
  • MUSCLE multiple sequence alignment with high accuracy and high throughput. 32, 1792-1797 (2004)).
  • a phylogenetic tree was constructed using the Geneious software (R9.0.5) with the Jukes-Cantor distance model and UPGMA as a tree build method, with bootstrap values from 1,000 replicates.
  • the genomic DNAs from K. oxytoca, P. stutzeri, A. vinelandii, A. caulinodans and R. sphaeroides were purified using Wizard genomic DNA purification kit, following the isolation protocol for gram negative bacteria (Promega, Cat #A1120).
  • the genomic DNAs of Cyanothece ATCC51142 , A. brasilense ATCC29729, R. palustris ATCC BAA-98, and G. diazotrophicus ATCC49037 were obtained from ATCC.
  • nif cluster was amplified into several fragments (4-10 kb) with upstream and downstream 45 bp linkers at the 5′ and 3′ most end of the cluster by PCR with primer sets (Table 2) and assembled onto linearized E. coli -yeast shuttle vectors pMR-1 for E. coli and Rhizobia , and pMR-2 for P. protegens Pf-5 using yeast recombineering.
  • pMR-1 E. coli and Rhizobia
  • pMR-2 for P. protegens Pf-5 using yeast recombineering.
  • the DNA sequence information were gleaned from contig ALJV01 and the DNA of the nif cluster was synthesized by GeneArt gene synthesis (Thermo Fisher Scientific, MA) into four fragments that were used as templates for PCR amplification and assembly.
  • the six transcriptional units (nifHDKTY, nifENX, nifJ, nifBQ, nifF, nifUSVWZM) were amplified from the plasmid pMR-3 that harbors the native Klebsiella nif cluster. Each unit was divided onto six level-1 module plasmids where the nif genes are preceded by a terminator. T7 promoter wild-type or T7 promoter variant PT7.P2 was placed between a terminator and the first gene of the transcriptional unit. Assembly linkers ( ⁇ 45 bp) were placed at both ends of the units. The level-1 plasmids (pMR32-37) were provided in Table 4 and 5.
  • Each of the six plasmids was linearized by digestion with restriction enzymes and assembled with a linearized pMR-1 or pMR-2 vector into a single large plasmid by one-pot yeast assembly procedure, yielding pMR38 and pMR39.
  • Electroporation was used to transfer plasmids into P. protegens Pf-5.
  • a single colony was inoculated in 4 mL of LB and grown for 16 h at 30° C. with shaking at 250 rpm.
  • the cell pellets were washed twice with 2 mL of 300 mM sucrose and dissolved in 100 ⁇ l of 300 mM sucrose at RT.
  • a total of 50-100 ng DNA was electroporated and recovered in 1 mL of LB media for 1 h before plating on selective LB plates.
  • Triparental mating was used to transfer DNA from E. coli to Rhizobia .
  • a regulator protein is constitutively expressed by the PlacIq promoter, and GFP expression is driven by a cognate inducible promoter from the opposite direction, facilitating replacement of the reporter with gene of interest (e.g., T7 RNAP and nifA) and transfer of the controller unit across different plasmid backbones for diverse microbes.
  • gene of interest e.g., T7 RNAP and nifA
  • IPTG inducible LacI-A1lacO1, DAPG inducible Ph1F-PPh1, aTc inducible TetR-PTet, 3OC6HSL inducible LuxR-P Lux , salicylic acid inducible NahR-P Sal , and cuminic acid inducible CymR-P Cym systems were optimized for R. sp. IRBG74 ( FIG. 14 ).
  • Opine inducible OccR-P occ , and nopaline inducible NocR-Pnoc systems were optimized for A. caulinodans ( FIGS. 20A-20F and Tables 4 and 5).
  • RBS characterization an IPTG-inducible GFP expression plasmid pMR-40 was used and GFP was expressed to the highest levels with 1 mM IPTG ( FIGS. 12A-12B ).
  • RBS library for GFP was designed using the RBS library calculator at the highest-resolution mode, and the 3′ end of the 16S rRNA sequences were adjusted according to the species (3′-ACCTCCTTC-5′ for R. sp. IRBG74).
  • Terminators for T7 RNAP were characterized by placing a terminator between two fluorescence reporters expressed from a single T7 wild-type promoter located upstream of the first fluorescence protein GFP.
  • the expression of the two fluorescence proteins is enabled by the controller strain MR18 encoding the IPTG-inducible T7 RNAP system by 1 mM IPTG ( FIGS. 13A-13B ).
  • the terminator strength (Ts) was determined by normalizing fluorescence levels of a terminator construct by a reference construct pMR-66 where a 40 bp spacer was placed between the reporters. All genetic parts for Rhizobia were characterized as follows. Single colonies were inoculated into 0.5 ml TY supplemented with antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator (INFORS HT, MD).
  • FIGS. 15A-15C IPTG inducible LacI-P tac , DAPG inducible Ph1F-P Phl , aTc inducible TetR-P Tet , 3OC6HSL inducible LuxR-P Lux , arabinose inducible AraC-P BAD , cuminic acid inducible CymR-P Cym , and naringenin inducible FdeR-P Fde were optimized ( FIGS. 15A-15C ).
  • an arabinose-inducible GFP expression plasmid pMR-65 was used and GFP was expressed with 1 mM IPTG ( FIGS. 12A-12B ).
  • RBS library for GFP was designed using the RBS library calculator at the highest-resolution mode, and the 3′ end of the 16S rRNA sequences were adjusted according to the species (3′-ACCTCCTTA-5′ for P. protegens Pf-5).
  • Terminators for T7 RNAP were characterized by placing a terminator between two fluorescence reporters expressed from a single T7 wild-type promoter located upstream of the first fluorescence protein GFP. The expression of the two fluorescence proteins is enabled by an IPTG-inducible T7 RNAP expression system of the controller strain MR7 ( FIGS. 13A-13B ). All genetic parts for P. protegens Pf-5 were characterized as follows.
  • the mini-Tn7 insertion system was used to introduce a controller into the genome of P. protegens Pf-5.
  • the IPTG-inducible T7 RNAP expression system and a tetracycline resistant marker tetA was placed between two Tn7 ends (Tn7L and Tn7R).
  • the controller plasmid pMR-85 was introduced into P. protegens Pf-5 by double transformation with pTNS3 encoding the TnsABCD transposase.
  • a genomically-integrated controller located 25 bp downstream of the stop codon of glmS was confirmed by PCR and sequencing.
  • a markerless insertion method using homologous recombination was employed in R. sp. IRBG74.
  • a controller encoding inducible T7 RNAP system flanked by two homology fragments that enables the replacement of recA was cloned into a suicide plasmid.
  • These controller plasmids IPTG-inducible, pMR82-84; DAPG-inducible, pMR85
  • E. coli was mobilized into R. sp. IRBG74 MR18 ( ⁇ hsdR. ⁇ nif) by triparental mating, generating the controller strains (MR19, 20, 21 and 22, respectively).
  • the controller integration in the genome was confirmed by gentamicin sensitivity and diagnostic PCR. All controllers were characterized in a manner identical to that described in genetic part characterization.
  • the yfp in the 12 reporter plasmids was replaced with T7 RNAP while keeping other genetic parts (e.g., promoters and RBSs) unchanged ( FIGS. 28A-28C ).
  • the reporter plasmid pMR-120 in which gfpmut3b is fused to the PT7(P2) promoter ( FIGS. 28A-28C ) was co-transformed to analyze the response functions of each of the 12 T7 RNAP controller plasmids.
  • Cultures were initiated by inoculating a single colony into 1 mL of LB supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 ⁇ l of overnight cultures was diluted into 500 ⁇ l of BB medium with 17.1 mM NH4CH3CO2 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator.
  • Cultures were diluted to an OD600 of 0.4 into 2 mL of BB medium supplemented with appropriate antibiotics, 1.43 mM serine to facilitate nitrogenase depression, and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps (Supelco Analytical, Cat #SU860103). Headspace in the vials was replaced with 100% argon gas using a vacuum manifold. Acetylene freshly generated from CaC 2 in a Burris bottle was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm in an Innova 44 shaking incubator (New Brunswick) to prevent cell aggregations, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Cultures were initiated by inoculating a single colony into 1 mL of LB supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 ⁇ l of overnight cultures was diluted into 500 ⁇ l of BB medium with 17.1 mM NH 4 CH 3 CO 2 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator.
  • Cultures were diluted to an OD 600 of 0.4 into 2 mL of BB medium supplemented with appropriate antibiotics, 1.43 mM serine and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon and 1% oxygen gas (Airgas, MA USA) using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Cultures were initiated by inoculating a single colony into 0.5 mL of TY medium supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 ⁇ l of overnight cultures was diluted into 500 ⁇ l of UMS medium with 30 mM succinate, 10 mM sucrose, and 10 mM NH 4 C1 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator.
  • Cultures were diluted to an OD 600 of 0.4 into 2 mL of UMS medium plus 30 mM succinate and 10 mM sucrose supplemented with appropriate antibiotics, 1.43 mM serine and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon and 1% oxygen gas using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Assays A. caulinodans and P. stutzeri ).
  • Cultures were initiated by inoculating a single colony into 0.2 mL of TY medium supplemented with appropriate antibiotics in 96-deepwell plates and grown overnight at 37° C. and 30° C. for A. caulinodans and P. stutzeri , respectively, 900 rpm in a Multitron incubator. 5 ⁇ l of overnight cultures was diluted into 500 ⁇ l of UMS medium with 30 mM lactate and 10 mM NH 4 Cl and appropriate antibiotics in 96-deepwell and incubated for 24 h at 37° C. and 30° C. for A. caulinodans and P. stutzeri , respectively, 900 rpm in a Multitron incubator.
  • Cultures were diluted to an OD 600 of 0.4 into 2 mL of UMS medium plus 30 mM lactate supplemented with appropriate antibiotics and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon plus 1% oxygen gas using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Cultures were initiated by inoculating a single colony into 0.5 mL of Burk medium supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 ⁇ l of overnight cultures was diluted into 500 ⁇ l of Burk medium with 17.1 mM NH4CH3CO2 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator. Headspace in the vials was replaced with 97% argon and 3% oxygen gas (Airgas, MA USA) using a vacuum manifold.
  • Airgas Airgas, MA USA
  • Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction.
  • the acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • cultures were diluted to an OD 600 of 0.4 in 2 mL of nitrogen-free minimal medium, 1.43 mM serine (for E. coli and P. protegens Pf-5) and an inducer (for inducible systems) in 10 mL glass vials with PTFE-silicone septa screw caps.
  • Ammonium (17.1 mM NH 4 CH 3 CO 2 for E. coli and P. protegens Pf-5 and 10 mM NH4Cl for Rhizobia ) was added to a nitrogen-free minimal medium when testing ammonium tolerance of nitrogenase activity. Headspace in the vials was replaced with either 100% argon gas for E.
  • coli 99% argon plus 1% oxygen for Pseudomonas and Rhizobia using a vacuum manifold.
  • Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction.
  • the acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • cultures were diluted to an OD 600 of 0.4 in 2 mL of minimal medium, 1.43 mM serine (for E. coli and P. protegens Pf-5), and an inducer (for inducible systems) in 10 mL glass vials with PTFE-silicone septa screw caps.
  • the vial headspace was replaced with either 100% nitrogen gas for E. coli or 99% nitrogen plus 1% oxygen for P. protegens Pf-5 and A. caulinodans using a vacuum manifold. Cultures were incubated with shaking at 250 rpm at 30° C. for 6 h and 9 h for P. protegens Pf-5 and A.
  • caulinodans caulinodans , respectively, after which oxygen concentrations in the headspace were recorded with the optical oxygen meter FireStingO2 equipped with a needle-type sensor OXF500PT (Pyro Science, Germany) After the induction period, no oxygen remained in the headspace for all species as confirmed by the oxygen meter.
  • the initial oxygen levels in the headspace were adjusted by injecting pure oxygen via syringe into the headspace of the vials and stabilized with shaking at 250 rpm at 30° C. for 15 m followed by the injection of acetylene to 10% (vol/vol) into each culture vial to begin the reaction and initial oxygen concentrations in the headspace were recorded concomitantly.
  • the oxygen levels in the headspace were maintained around the setting points ( ⁇ 0.25% 02) while incubating at 250 rpm and 30° C. by injecting oxygen every hour for 3 h with oxygen monitoring before and after oxygen spiking ( FIGS. 26A-26B ).
  • the reactions were quenched after 3 h of incubation by the injection of 0.5 mL of 4 M NaOH to each vial using a syringe.
  • Ethylene production was analyzed by gas chromatography using an Agilent 7890A GC system (Agilent Technologies, Inc., CA USA) equipped with a PAL headspace autosampler and flame ionization detector as follows. An aliquot of 0.5 mL headspace preincubated to 35° C. for 30 s was injected and separated for 4 min on a GS-CarbonPLOT column (0.32 mm ⁇ 30 m, 3 microns; Agilent) at 60° C. and a He flow rate of 1.8 mL/min. Detection occurred in a FID heated to 300° C. with a gas flow of 35 mL/min H2 and 400 mL/min air. Acetylene and ethylene were detected at 3.0 min and 3.7 min after injection, respectively. Ethylene production was quantified by integrating the 3.7 min peak using Agilent GC/MSD ChemStation Software.
  • the frozen pellets were added to 650 ⁇ l of frozen droplets of lysis buffer (20 mM Tris (pH 8.0), 100 mM NH 4 Cl, 10 mM MgCl 2 , 0.4% Triton X-100, 0.1% NP-40, 1 mM chloramphenicol and 100 U/mL DNase I) in prechilled 25 mL canister (Retsch, Germany, Cat #014620213) in liquid nitrogen and pulverized using TissueLyser II (Qiagen USA) with a setting at 15 Hz for 3 min for 5 times with intermittent cooling between cycles.
  • the pellet was removed by centrifugation at 20,000 rcf at 4° C. for 10 min and the lysate was recovered in the supernatant.
  • RNA-seq and Ribosome-footprint profiling was carried out according to the method described earlier with a few modifications(Li, G.-W., Oh, E. & Weissman, J. S. J. N.
  • the anti-Shine—Dalgarno sequence drives translational pausing and codon choice in bacteria. 484, 538 (2012); Li, G.-W., Burkhardt, D., Gross, C. & Weissman, J. S. Quantifying absolute protein synthesis rates reveals principles underlying allocation of cellular resources. Cell 157, 624-635 (2014)).
  • the total RNA was isolated using the hot phenol-SDS extraction method.
  • RNA fragmentation reagents Thermo Fisher Scientific, Cat #AM8740
  • RNA fragments (10-45 bp) were isolated from a 15% TBE-Urea polyacrylamide gel (Thermo Fisher Scientific, Cat #EC6885).
  • RNA fragments were dephosphorylated using T4 polynucleotide kinase (1U/ ⁇ l, New England Biolabs, Cat #M0201S) in a 20 ⁇ l reaction volume supplemented with 1 ⁇ l of 20 U SUPERase. In at 37° C. for 1 h, after which the denatured fragments (5 pmoles) were incubated at 80° C.
  • oligo for 2 min and ligated to 1 ⁇ g of the oligo (/5rApp/CTGTAGGCACCATCAAT/3ddc/, Integrated DNA technologies) (SEQ ID NO: 1) in a 20 ⁇ l reaction volume supplemented with 8 ⁇ l of 50% PEG 8000, 2 ⁇ l of 10 ⁇ T4 RNA ligase 2 buffer, 1 ⁇ l of 200 U/ ⁇ l truncated K277Q T4 ligase 2 (New England Biolabs, Cat #M0351) and 1 ⁇ l of 20 U/ ⁇ l of SUPERase. In (Invitrogen) at 25° C. for 3 h.
  • the ligated fragments (35-65 bp) were isolated from a 10% TBE-Urea polyacrylamide gel (Invitrogen, Cat #EC6875).
  • cDNA libraries from the purified mRNA products were reverse-transcribed using Superscript III (Thermo Fisher Scientific, Cat #18080044) with oCJ485 primer (/5Phos/AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT/iSp18/CAAGCAGAAGA CGGCATACGAGATATTGATGGTGCCTACAG (SEQ ID NO: 2, SEQ ID NO: 3)) at 50° C.
  • RNA products subsequently were hydrolyzed by the addition of NaOH at a final concentration of 0.1 M, followed by incubation at 95° C. for 15 min.
  • the cDNA libraries (125-150 bp) were isolated from on a 10% TBE-Urea polyacrylamide gel (Invitrogen, Cat #EC6875).
  • the cDNA products were circularized in a 20 ⁇ l reaction volume supplemented with 2 ⁇ l of 10 ⁇ CircLigase buffer, 1 ⁇ l of 1 mM ATP, 1 ⁇ l of 50 mM MnCl2 and 1 ⁇ l of CircLigase (Epicenter, Cat #CL4115K) at 60° C. for 2 h and heat-inactivated at 80° C.
  • the purified products were analyzed by BioAnalyzer (Agilent, CA USA) and sequenced with a sequencing primer (CGACAGGTTCAGAGTTCTACAGTCCGACGATC (SEQ ID NO: 6)) using an Illumina HiSeq 2500 with a rapid run mode.
  • a sequencing primer CGACAGGTTCAGAGTTCTACAGTCCGACGATC (SEQ ID NO: 6)
  • Illumina HiSeq 2500 with a rapid run mode.
  • the raw trace profiles are multiplied by 10 7 and normalized by respective total reads from coding sequences of each species ( K. oxytoca M5al, CP020657.1; E. coli MG1655, NC_000913.3; P. protegens Pf-5, CP000076; R. sp.
  • IRBG74 HG518322, HG518323, HG518324 and an appropriate plasmid carrying a nif cluster The mRNA expression level of each gene was estimated using total sequencing reads mapped onto the gene, representing fragments per kilobase of transcript per million fragments mapped units (FPKM).
  • RNA was diluted into 195 ⁇ l of the lysis buffer including 0.5 U RNase inhibitor SUPERase.
  • In Invitrogen, Cat #AM2694
  • 5 mM CaCl2 were treated with 5 ⁇ l of 750 U of micrococcal nuclease (Sigma Aldrich, Cat #10107921001) at 25° C. for 1 h to obtain ribosome-protected monosomes.
  • the digestions were quenched by the addition of EGTA to a final concentration of 6 mM and then kept on ice before the isolation of monosomes.
  • the monosome fraction was collected by sucrose density gradient (10-55% w/v) ultracentrifugation at 35,000 rpm for 3 h, followed by a hot phenol-SDS extraction to isolate ribosome-protected mRNA fragments.
  • the mRNA fragments (15-45 bp) were isolated from a 15% TBE-Urea polyacrylamide gel. The 3′ ends of the purified fragments were dephosphorylated and ligated to the modified oligo.
  • cDNA libraries generated by Superscript III were circularized by CircLigase as described above. rRNA products were depleted by a respective biotinylated oligo mix for E. coli and P.
  • protegens Pf-5.5 ⁇ l of circularized DNA was amplified using Phusion HF DNA polymerase with o231 primer and indexing primers for 7 to 10 cycles.
  • the amplified products (125-150 bp) were recovered from an 8% TBE-Urea polyacrylamide gel.
  • the purified products were analyzed by BioAnalyzer and sequenced with a sequencing primer (CGACAGGTTCAGAGTTCTACAGTCCGACGATC (SEQ ID NO: 7)) using an Illumina HiSeq 2500 with a rapid run mode. Sequences were aligned to reference sequences using Bowtie 1.1.2 with the parameters—k1—m2—v1.
  • a center-weighting approach was used to map the aligned footprint reads ranging from 22 to 42 nucleotides in length.
  • To map P-site of ribosome from footprint reads 11 nucleotides from the both ends were trimmed, and the remaining nucleotide were given the same score, normalized by the length of the center region.
  • Aligned reads (10-45 nucleotides) were mapped to the reference with equal weight of each nucleotide.
  • a Python 3.4 script was used to perform the mapping.
  • To generate the Ribo-seq read profile for each nif cluster the raw trace profiles are multiplied by 10 8 and normalized by respective total reads from coding sequences of each species.
  • read densities were first normalized in the following ways: (i) The first and last 5 codons of the gene are excluded for the calculation to remove the effects of translation initiation and termination. (ii) A genome-wide read density profile was fitted to an exponential function and the density at each nucleotide on a given gene was corrected using this function. (iii) If the average read density on a gene is higher than 1, a 90% winsorization was applied to reduce the effect of outliers. The sum of normalized reads on a gene was normalized by the gene length and the total read densities on coding sequences to yield the ribosome density.
  • the activity of a promoter is defined as the change in RNAP flux ⁇ J around a transcription start site x tss (Gorochowski, T. E. et al. Genetic circuit characterization and debugging using RNA-seq. 13, 952 (2017)).
  • the promoter strength is calculated by
  • m(i) is the number of transcripts at each position I from FPKM-normalized transcriptomic profiles
  • n is the window length before and after x tss .
  • the window length is set to 10.
  • the terminator strength T s is defined as the fold-decrease in transcription before and after a terminator, which can be quantified from FPKM-normalized transcriptomic profiles as
  • NifA was tested using plasmid pMR-128 to 130 that contains the sfgfp fused to the nifH promoter in the A. caulinodans ⁇ nifA mutant.
  • the inducible NifA/RpoN expression was provided by the plasmid pMR-121 into which sfgfp driven by the nifH promoter was added to analyze nifH promoter activity, yielding pMR-131 ( FIG. 29 ).
  • IPTG-inducible system in the plasmid pMR-124 was substituted with other inducible systems including the salicylic acid-inducible, nopaline-inducible and octopine-inducible systems, yielding pMR-125, 126, and 127, respectively.
  • Each of the plasmids was mobilized into the A. caulinodans ⁇ nifA mutant, which was grown following the same protocol as used for nitrogenase activity (described herein).
  • the plasmids pMR-51, 53, 88, 89 and 90 were introduced into E. coli MG1655 and the plasmids pMR-91, 92, 93, 94 and 95 to P. protegens Pf-5.
  • the plasmid pMR-101 was used to provide inducible NifA expression by IPTG in E. coli .
  • the controller encoding the IPTG-inducible NifA was inserted into the genome of P. protegens Pf-5 using the plasmids pMR-96, 97 and 98.
  • the IPTG-inducible system of the NifA controller plasmid pMR-96 was replaced with the arabinose-inducible and the naringenin-inducible system, yielding pMR-99 and 100, respectively.
  • the inducibility of nifH expression was assessed by the reporter plasmids pMR-105 to 107 and pMR102 to 104 or E. coli and P. protegens Pf-5, respectively.
  • the controller plasmids were transformed into E. coli or P. protegens Pf-5 with the reporter plasmids.
  • NifA sequences of R. sphaeroides 2.4.1 (RSP_0547) and A. caulinodans ORS571 (AZC_1049) were obtained from NCBI. NifA protein sequences were aligned with MUSCLE (https://www.ebi.ac.uk/Tools/msa/muscle/) with a default settings ( FIG. 22 ).
  • FIG. 1A A set of diverse native nif clusters were cloned in order to determine their relative performance in different strains and the associated species barriers ( FIG. 1A ).
  • Previously-defined boundaries for the well-studied nif cluster from K. oxytoca (Arnold, W., Rump, A., Klipp, W., Priefer, U. B. & Paler, A. J. J. o. m. b. Nucleotide sequence of a 24,206-base-pair DNA fragment carrying the entire nitrogen fixation gene cluster of Klebsiella pneumoniae.
  • RNA-seq data shows that Rnf2 is not co-expressed with the nif genes, so only the Rnf1 and Fix complexes were included by fusing their DNA to create a single 46.9 kb construct.
  • Rhodopseudomonas palustris CGA009 Rhodopseudomonas palustris CGA009 (Oda, Y. et al. Functional genomic analysis of three nitrogenase isozymes in the photosynthetic bacterium Rhodopseudomonas palustris. 187, 7784-7794 (2005)) and Rhodobacter sphaeroides 2.4.1 (Haselkorn, R. & Kapatral, V. in Genomes and genomics of nitrogen-fixing organisms, 71-82 (Springer, 2005))) as these are members of the same alphaproteobacteria class as Rhizobia .
  • Each cluster was amplified from genomic DNA as multiple fragments by PCR and assembled with the plasmid backbone using yeast assembly (see Materials and Methods Section).
  • the P. polymyxa WLY78 cluster was de novo synthesized based on the DNA sequence on contig ALJV01 (Shanks, R. M. et al. Saccharomyces cerevisiae -based molecular tool kit for manipulation of genes from gram-negative bacteria. 72, 5027-5036 (2006)).
  • the clusters were cloned into different plasmid systems to facilitate transfer. For transfer to E. coli and R. sp.
  • IRBG74 the broad-host range plasmid based on a pBBR1 origin was used (a second compatible RK2-origin plasmid was used for the nif cluster from A. caulinodans ORS571). These plasmids contain the RK2 oriT to enable the conjugative transfer of large DNA (see Materials and Methods). For transfer to P. protegens Pf-5, this plasmid system was found to be unstable and produce a mixed population. To transfer into this strain, the Pseudomonas -specific plasmid pRO1600 with the oriT was used. After construction, all of the plasmids were verified using next-generation sequencing (see Materials and Methods Section).
  • the set of 10 nif clusters were transferred into E. coli MG1655, the cereal epiphyte P. protegens Pf-5, and the cereal endophyte R. sp. IRBG74 to create 30 strains ( FIG. 1A ).
  • E. coli was selected as a control as successful transfers to this recipient have been performed.
  • Native P. protegens Pf-5 does not fix nitrogen.
  • R. sp. IRBG74 contains two nif clusters in different genomic locations, which were left intact, but does not have nitrogenase activity under free living conditions.
  • the genomic cluster does not have the required NifV enzyme as it obtains homocitrate from the plant. All of the clusters in the set have nifV, except the one from P. polymyxa WLY78.
  • a test was run to determine whether the expression of recombinant WV from A. caulinodans ORS571 in R. sp. IRBG74 would result in active nitrogenase, but no
  • E. coli and Pseudomonas were grown at 30° C. in BB minimal media, as described previously 71 . However, no growth was observed for R. sp. IRBG74 under these conditions.
  • Different media and carbon sources were tested and it was found that UMS media with dicarboxylic acids (malate or succinate), the major carbon source from plants 147 , with 10 mM sucrose yielded the highest growth rates ( FIG. 6 ). After overnight growth, cells were transferred to stoppered test tubes in ammonium-free minimal media to a final OD 600 of 0.4.
  • E. coli and Pseudomonas were grown at 30° C. in BB minimal media, as described previously 71 . However, no growth was observed for R. sp. IRBG74 under these conditions.
  • Different media and carbon sources were tested and it was found that UMS media with dicarboxylic acids (malate or succinate), the major carbon source from plants 147 , with 10 mM sucrose yielded
  • the headspace air is completely replaced with argon gas.
  • P. protegens Pf-5 and R. sp. IRBG74 the initial headspace concentration of oxygen was maintained at 1% because these bacteria require oxygen for their metabolism.
  • the cells are incubated at 30° for 20 hours in the presence of excess acetylene and the conversion to ethylene was quantified by GC-MS (see Materials and Methods Section). There was no significant growth for any of the strains under these conditions, so the nitrogenase activities reported correspond to the same cell densities.
  • Rhizobium and Rhodobacter are alphaproteobacter and their nif clusters may contain interchangeable genes.
  • introducing the R. sphaeroides cluster alone does not yield active nitrogenase.
  • Rhodobacter and Rhodopseudomonas gene clusters were transferred to a panel of 12 species isolated from diverse legumes ( FIG. 1A ). Remarkably, the transfer of these clusters was able to produce detectable nitrogenase activity in 7 of the strains.
  • Phylogenetic analysis was performed based on the full-length 16S rRNA gene sequences ( K. oxytoca , BWI76_05380; A. vinelandii , Avin_55000; R. sphaeroides , DQL45_00005; Cyanothece ATCC51142, cce_RNA045 ; A. brasilense , AMK58_25190; R. palustris , RNA_55; P. protegens , PST_0759; Paenibacillus sp. WLY78, JQ003557).
  • a multiple sequence alignment was generated using MUSCLE (Edgar, R. C. J. N. a. r.
  • FIG. 30A A phylogenetic tree was constructed using the Geneious software (R9.0.5) with the Jukes-Cantor distance model and UPGMA as a tree build method, with bootstrap values from 1,000 replicates. This phylogenetic tree is shown in FIG. 30A .
  • the scale bar indicates 2% substitutions per site.
  • the clusters based on evolutionary closeness are circled.
  • FIG. 30B summarizes the relative nitrogenase activity in the three host strains carrying each of the 10 nif clusters. The result indicates that the phylogenetic closeness has a predictive power for achieving highest nitrogenase activity in a new host that lacks a nif cluster.
  • RNA-seq and ribosome profiling experiments were performed to evaluate the expression K. oxytoca nif cluster in K. oxytoca as well as E. coli MG1655, P. protegens Pf-5, and R. sp. IRBG74.
  • RNA-seq experiments provide mRNA levels of genes (calculated as FPKM) and can be used to measure the performance of promoters and terminators.
  • Ribosome profiling can be used to quantify protein synthesis rates, ribosome binding site (RBS) strength and ribosome pausing internal to genes.
  • the ribosome density (RD) has been shown to correlate with protein expression rates.
  • the translation efficiency is calculated by normalizing the RD by the number of transcripts (FPKM from Ribo-seq). Ribosome profiling has been applied to determine the relative levels of proteins expressed in multi-subunit complexes.
  • the RNA-seq profiles differ more significantly for P. protegens Pf-5 and R. sp. IRBG74 ( FIGS. 1B-1C ), and there was no correlation between mRNA transcripts ( FIG. 1D ).
  • the process of refactoring a gene cluster involves the complete reconstruction of the genetic system from the bottom-up, using only well-characterized genetic parts.
  • An exhaustive approach is to recode the genes (to eliminate internal regulation), reorganize into operons, control expression with synthetic ribosome binding sites (RBSs), and use T7 RNAP promoters and terminators.
  • RBSs synthetic ribosome binding sites
  • T7 RNAP promoters and terminators A separate “controller,” carried in a genetically distinct location, links synthetic sensors and circuits to the expression of T7 RNAP.
  • this approach has proven useful for transferring multi-gene systems between species, simplifies optimization through part replacement and enzyme mining, and enables the replacement of environmental signals that naturally control the cluster with the stimuli that induce the synthetic sensors (Smanski, M. J. et al.
  • T7 RNAP An advantage of using T7 RNAP is that it is functional in essentially all prokaryotes, so the refactored cluster can be transferred as-is and transcription induced by expressing T7 RNAP in the new host.
  • a new controller needs to be built for each host based on regulation and regulatory parts that work in that species.
  • a controller for E. coli was designed based on the IPTG-inducible T7 RNAP carried on a plasmid (pKT249) ( FIG. 2A ). To transfer the refactored cluster to R. sp. IRBG74, first a controller was constructed that functions in this species and produces an equivalent range of T7 RNAP expression.
  • a controller was then constructed by using the optimized IPTG-inducible system to drive the expression of a variant of T7 RNAP (R6232S, N-terminal lon tag, GTG start codon) ( FIG. 2A ).
  • RBS variants controlling T7 RNAP expression were tested and an intermediate strength was selected to maximize induction while limiting toxicity ( FIG. 16 ).
  • the controller was carried on the genome by replacing recA (see Materials and Methods).
  • the response function of the final controller is compared to that obtained for pKT249 in E. coli , showing that they sweep through the same range of expression at intermediate levels of induction ( FIG. 2B ).
  • 0.1 mM IPTG is selected for E. coli and 0.5 mM for R. sp. IRBG74 (circled points in FIG. 2B ).
  • the refactored v2.1 cluster was then transferred to R. sp. IRBG74, but no activity was observed ( FIGS. 2C-2D ). Activity was also not observed when the v2.1 cluster was transferred to P. protegens Pf-5 ( FIG. 17 ).
  • RNA-seq and ribosome profiling experiments were performed ( FIG. 18 ). From these data, the strengths of promoters/terminators and the transcription level and translation rates of genes could be calculated (see Materials and Methods).
  • the performance of the promoters in R. sp. IRBG74 was systematically lower than E. coli , particularly the first promoter controlling nifH ( FIG. 2E ).
  • the terminators were functioning the same in the two species, albeit weakly, and no termination could be detected from the three terminators in the center of the cluster ( FIG. 2E ).
  • the translation of the genes differed significantly between organisms ( FIG. 2F ).
  • FIG. 2F When the expression rates of the nif genes from the refactored cluster are compared with their levels in their native context in K. oxytoca , there is almost no correlation ( FIG. 2F ).
  • FIG. 2F Importantly, there is 9-fold less NifH expressed from the refactored cluster in R. sp. IRBG74 as compared to the same cluster in E. coli .
  • the refactored cluster produces wildly different expression levels of the component genes when transferred between organisms, even when transcription is matched between them using different controllers.
  • FIG. 2G a new refactored cluster (v3.2) ( FIG. 2G ) was designed.
  • a very strong promoter was chosen for nifH.
  • the transcription was broken up by adding promoters to divide nifENX and nifJ and selecting stronger terminators. Noting that the expression ratios between nif genes are better preserved when the native cluster is transferred to a new host ( FIG. 1D ) but not the refactored cluster ( FIG. 2F ), it was hypothesized that this could be due to the disruption of the operon structures and the associated translational coupling between genes.
  • the K. oxytoca operons were cloned intact, including native RBSs and replaced these regions of the refactored cluster ( FIG. 2G ).
  • nifT and nifX which were not included in first versions because they were either inessential(Simon, H. M., Homer, M. J. & Roberts, G. P. J. J. o. b.
  • Perturbation of nifT expression in Klebsiella pneumoniae has limited effect on nitrogen fixation. 178, 2975-2977 (1996)) or inhibitory (Gosink, M. M., Franklin, N. M. & Roberts, G. P. J. J. o. b.
  • the product of the Klebsiella pneumoniae nifX gene is a negative regulator of the nitrogen fixation (nif) regulon. 172, 1441-1447 (1990)).
  • the v3.2 cluster is less active in E. coli but is active in R. sp. IRBG74 ( FIG. 2H ) and P. protegens Pf-5 ( FIG. 17 ).
  • This experiment was performed in the double nif knockout strain in R. sp. IRBG74, thus indicating that the refactored cluster is self-contained in producing nitrogenase activity.
  • RNA-seq and ribosome profiling was applied to evaluate the performance of v3.2 in all three species ( FIG. 21 , FIG. 19 , and FIGS. 20A-20F ).
  • the promoters perform similarly in the different hosts, but there was significant diversity in terminator function.
  • the A. caulinodans nif genes are distributed across three clusters in different genomic locations.
  • the regulatory signals converge on the NifA activator that, in concert with the RpoN sigma factor, turns on transcription of the genomic nif clusters.
  • Numerous and not fully characterized environmental signals are integrated upstream of this node, including NtrBC (Kaminski, P. A. & Elmerich, C. J. M. m.
  • NtrXY Pieris, awlowski, K., Klosse, U., De Bruijn, F. J.
  • the clusters (64 kb total, containing 76 genes) were cloned into the plasmid systems described above and transferred into R. sp. IRBG74 and P. protegens Pf-5, but no activity was found in either strain.
  • Overexpression of A. caulinodans NifA and RpoN did not lead to activity and, upon further investigation, these regulators were found to be inactive in these strains.
  • the size of the clusters and the lack of genetic and gene function information would complicate fully refactoring the system. For these reasons, it was decided to modify the regulation controlling nif such that it can be placed under the control of synthetic sensors.
  • One goal herein was to eliminate ammonium repression of nitrogenase activity, which converges on the regulation of NifA.
  • the native nifA gene was knocked out of the genome using the sacB markerless deletion method (see Materials and Methods), with the intent of placing NifA under inducible control ( FIG. 3A ).
  • the promoter turns on and its activity is further enhanced by the co-expression of RpoN in an operon (note that the genomic rpoN gene is left intact for these experiments).
  • the IPTG-inducible system designed for Rhizobium was tested in A.
  • caulinodans carried on a pBBR1-ori plasmid. Using GFP, this was found to induce expression over several orders of magnitude ( FIG. 21 ). Then, the A. caulinodans nifA and rpoN gene was placed under IPTG control and the fluorescent reporter fused to the A. caulinodans nifH promoter (encompassing 281 nt upstream of the ATG), carried on the same plasmid (see Materials and Methods). The response function from the nifH promoter was analyzed at the condition used for nitrogen fixation, exhibiting a wide dynamic range to 45-fold ( FIG. 3C ).
  • the controller was designed to co-express NifA and RpoN and tested for its ability to induce nitrogenase ( FIG. 3D ). When fully induced, there was a complete recovery of activity as compared to the wild-type strain. The repression of nitrogenase activity by ammonium was then evaluated. The presence of 10 mM ammonium chloride leads to no detectible activity by the wild-type strain ( FIG. 3E ). Even when both NifA and RpoN are under inducible control, there is strong repression with only 5% of the nitrogenase activity of the wild-type. This suggests that the post-transcriptional control of NifA activity by ammonium remains intact.
  • the inducible nif clusters were tested for oxygen sensitivity, noting that A. caulinodans is an obligate aerobe and fixes nitrogen under micro-aerobic conditions.
  • the tolerance of nitrogenase to oxygen was then assessed as a function of the concentration of oxygen in the headspace, held constant by injecting oxygen while monitoring its level (Methods and FIG. 26A ).
  • the native K. oxytoca, P. stutzeri , and A. vinelandii nif clusters are all functional in P. protegens Pf-5 ( FIG. 1A ).
  • the native P. stutzeri and A. vinelandii clusters are transferred, nitrogenase is strongly repressed.
  • transferring the native K. oxytoca cluster produces uncontrolled (constitutively on) nitrogenase activity ( FIG. 4E ).
  • FIGS. 11A-11C A range of 20 constitutive promoters and seven T7 promoters that span a range of 778-fold and 24-fold expression, respectively, was characterized ( FIGS. 11A-11C ).
  • a library of 192 RBSs was screened, representing an expression range of 4,079-fold ( FIGS. 12A-12B ).
  • a set of seven terminators that share no sequence homology between each other and have a terminator strength >10 in R. sp. IRBG74 was selected and characterized together with the three well-used terminators (e.g., T7 terminator, rrnBT1, and L3S2P21). These seven terminators showed a terminator strength >50 ( FIGS. 13A-13B ).
  • the inducible systems designed for Rhizobium were transferred as-is to a Pseudomas-specific pRO1600 plasmid (see Materials and Methods).
  • the 3OC6HSL-, aTc-, cuminic acid-, and DAPG-inducible systems were all found to be functional ( FIG. 15A ).
  • a naringenin-inducible system based on the P fde promoter was constructed and found to be functional.
  • the strength of arabinose inducible system was increased by substituting the ⁇ 10 box in P BAD promoter and arabinose import was improved by constitutive expression of the arabinose transporter AraE ( FIG. 15B ).
  • the IPTG-inducible system was optimized for P.
  • protegens Pf-5 by replacing the P A1lacO1 promoter with the P tac promoter and making three amino acid substitutions to Lad (Meyer, A. J., Segall-Shapiro, T. H., Glassey, E., Zhang, J. & Voigt, C. A. J. N. c. b. Escherichia coli “Marionette” strains with 12 highly optimized small-molecule sensors. 1 (2018)). This effort resulted in seven new inducible systems that produce 41- to 554-fold induction in P. protegens Pf-5 ( FIG. 15C ).
  • the controller was constructed using the P. stutzeri NifA, placed under the control of the optimized IPTG-inducible system, described above.
  • the RBSs of NifA were synthetically designed to span a wide range of expression of nif genes ( FIG. 24A ).
  • the controller was inserted into the genome 25 bp downstream of the stop codon of glmS using the mini-Tn7 system. The ability for this controller to induce the nifH promoter from each cluster using a fluorescent reporter is shown in FIG. 4C and FIG. 24B .
  • the nitrogenase activity for each of the gene clusters in P. protegens Pf-5 was then assessed ( FIG. 4D ).
  • the three P. protegens Pf-5 strains containing the transferred clusters were modified to insert the controller and delete the native nifLA genes from each cluster ( FIG. 4B ). All three are inducible, with nitrogenase activity showing dynamic ranges of 1,200-fold, 2,300-fold, and 130-fold for the K. oxytoca, P. stutzeri , and A. vinelandii nif clusters, respectively. When induced, these systems all produce similar or even higher nitrogenase activities than can be achieved by the transfer of the unmodified native clusters ( FIG. 4D ). For reference, the nitrogenase activities produced by K.
  • the native P. stutzeri and A. vinelandii clusters are strongly repressed by ammonium: the presence of 17.1 mM eliminates activity or reduces it 7-fold, respectively ( FIG. 4E and FIGS. 8A-8B ).
  • the inducible clusters show little reduction in activity and the inducible A. vinelandii cluster exhibits almost no ammonia repression. While the native K. oxytoca cluster in P. protegens Pf-5 generates a constitutive response, there is still some repression, which is reduced by the inducible version.
  • the inducible nif clusters were tested for oxygen sensitivity. Note that wild-type A. vinelandii is able to fix nitrogen under ambient conditions due to genetic factors internal and external to the cluster.
  • the controller in P. protegens Pf-5 could induce transcription from the three nifH promoters in the presence of oxygen ( FIGS. 26A-26B ).
  • the tolerance of nitrogenase to oxygen was then assessed as a function of the concentration of oxygen in the headspace, as described for A. caulinodans (previous section).
  • the native and inducible clusters exhibited the same oxygen response ( FIG. 4F ). The nif cluster from K.
  • the A. vinelandii cluster contains two potential electron transport systems to nitrogenase and the redundant system may help maintain redox status for nitrogenase at various oxygen levels.
  • the dependence of nitrogenase activity on the oxygen concentration in various mutant backgrounds was re-measured. No effect was seen by adding the rnf2 operon or deleting the fix operon, however deleting rnf1 eliminated activity. This suggests that the rnf1 operon is the sole source of electrons in P. protegens Pf-5 under these conditions and the Fix complex cannot compensate the Rnf complex unlike the case of A. vinelandii.
  • the careful design and characterization of the controller has the benefit of simplifying the process by which different synthetic sensors can be used to induce nitrogenase expression.
  • 11 synthetic sensors were selected that respond to a variety of chemical signals of relevance to the rhizosphere and demonstrate that these can be used to create inducible nitrogenase in for example, engineered strains of E. coli (carrying the refactored v2.1 nif), R. sp. IRBG74 (carrying the refactored v3.2 nif), P. protegens Pf-5 (carrying the inducible A. vinelandii nif), and A. caulinodans (inducible nifA/rpoN) ( FIGS. 5A-5D ).
  • FIG. 5A Cuminic acid is present in plant seeds and functions as a fungicide. Natural root exudates may include sugars, amino acids, organic acids, phenolic compounds, phytohormones, and flavonoids. These represent potential signals to control nitrogenase production close to the root surface. Cereals have been shown to release arabinose, vanillic acid, and salicylic acid. In addition, salicylic acid regulates the plant innate immune response and the impact of its exogenous addition to cereals has been studied. Naringenin is a common precursor for many flavonoids and improves endophytic root colonization when applied to rice and wheat.
  • Genistein a product from naringenin catalyzed by the isoflavone synthase, is released from maize roots.
  • a quorum sensing mimic released by rice can regulate the 3OC6HSL receptor protein LuxR, which has been visualized using E. coli biosensor strains.
  • Bacteria either native to the rhizome or added as biocontrol agents introduced as a spray inoculant or seed coating produce chemical signatures. Inoculation of cereals with root colonizing Pseudomonas strains that produce DAPG elicits protection against fungal pathogens. Many bacteria produce quorum molecules, such as N-acyl homoserine lactones, as a means of communication and plants can respond to these signals. The bacterium Sinorhizobium meliloti produces 3OC14HSL, which enhances Medicago nodulation and has been shown to induce systemic resistance in cereals. DHBA can be produced by root colonizing bacteria to increase iron solubility and play a role as a chemoattractant for Agrobacterium and Rhizobium.
  • Marionette contains sensors for vanillic acid, DHBA, cuminic acid, 3OC6HSL, and 3OC14HSL.
  • the output promoter was transcriptionally fused to T7 RNAP and the response of the responsive promoter (PT7) was measured as a function of inducer concentration ( FIG. 5B and FIG. 28B ).
  • the v2.1 refactored nif cluster was introduced and nitrogenase activity was measured in the presence and absence of inducer ( FIG. 5C and FIG. 28C ).
  • the inducible systems constructed for P. protegens Pf-5 that respond to arabinose and naringenin were used to drive NifA expression for the control of the A. vinelandii nif cluster ( FIG. 4A ).
  • Plants could be engineered to release an orthogonal chemical signal that could then be sensed by a corresponding engineered bacterium. This would have the benefit of only inducing nitrogenase in the presence of the engineered crop. Further, if the molecule is metabolizable by the engineered bacterium, it could serve as a mechanism around which a synthetic symbiosis could be designed, where the plant provides the carbon and the bacterium fixed nitrogen in an engineered relationship. To this end, legumes and Arabidopsis have been engineered to produce opines, including nopaline and octopine. Sensors were constructed for these two opines for A.
  • caulinodans based on the LysR-type transcriptional activators OccR (octopine) and NocR (nopaline) and their corresponding P occ and P noc promoters ( FIG. 5D and FIG. 21 ). These sensors were connected to the expression of NifA(L94Q/D95Q)/RpoN and the response from P nifH was measured using a fluorescent reporter. Both response functions had a large dynamic range ( FIG. 5B ) and produced highly-inducible nitrogenase activity ( FIG. 5C ). The nopaline sensor yielded a 412-fold dynamic range and the octopine sensor led to 40% higher nitrogenase activity than the wild-type.
  • this work provides a side-by-side comparison of diverse species, natural nif clusters, and engineering strategies that can be used to obtain inducible nitrogenase activity in a strain that can associate with cereals as an endophyte or epiphyte.
  • ⁇ 100 strains involving the transfer of 10 natural nif clusters ranging in size from 10 kb to 64 kb to 16 diverse species of Rhizobia, Azorhizobium , Pseudomas, and E. coli were constructed.
  • Different approaches were taken to make these nif clusters inducible, from bioinformatics and protein engineering to complete genetic reconstruction from the ground-up (refactoring).
  • nitrogen fixation be robust to the addition of nitrogenous fertilizer (ammonia) and microaerobic environments.
  • an endophyte such as a variant of Azorhizobium where nifA is knocked out of the genome and a nifA mutant and rpoN are complemented on a plasmid can be used to obtain high nitrogenase activities.
  • P. protegens Pf-5 is a versatile strain based on the transfer of the A. vinelandii nif cluster and placement of nifA of P. stutzeri under inducible control. In both such cases, nitrogenase activities were obtained that are nearly identical to wild-type A.
  • caulinodans and P. stutzeri were Neither showed significant repression by ammonia and optimal activity was obtained in 1% oxygen. Based on these strains, it was demonstrated that nitrogenase can be placed under inducible control in response to cereal root exudates (arabinose, salicylic acid), phytohormones (naringenin) and putitive signaling molecules that could be released by genetically modified plants (e.g., can express or exudate nopaline or octopine).
  • R. sp. IRBG74 can fix nitrogen in a legume nodule and also associates with rice, significant effort was directed to engineering this strain to fix nitrogen when cereal-associated. The first attempt was simply complementing nifV, as this is absent in R. sp. IRBG74 and produces a metabolite provided by the plant, but this attempt was unsuccessful. Then, it was found that all of the initial nif clusters transferred, some of which have high activity in P. protegens Pf-5 and E. coli , are non-functional in R. sp. IRBG74, which led to trying clusters from alphaproteobacteria, one of which produced a very low level of activity that was dependent on the nif genes native to R. sp.
  • IRBG74 The previously-published refactored gene clusters based on Klebsiella nif were attempted in R. sp. IRBG74 but these showed no activity. It was only after the construction of a new refactored cluster (v3.2) that activity was obtained under free-living conditions that was not dependent on the native nif genes. This allowed an increase in the expression levels, and an optimum was discovered beyond which activity was lost. This is the first time that nif activity has been engineered in a Rhizobium under free-living conditions that could otherwise not perform this function.
  • the present disclosure encompasses different degrees of nif pathway re-engineering to promote heterologous transfer.
  • the most ambitious is the complete refactoring of all the nif genes and regulation, where all regulatory genetic parts are replaced, genes are recoded, operons are reorganized, and transcription is performed by the orthogonal T7 RNAP.
  • the evaluation of performance relied on the overall nitrogenase activity, rather than an understanding of the underlying parts.
  • the first refactored pathway performed poorly.
  • better part libraries and DNA assembly and automation platforms enabled the synthesis of many variants.
  • the cost of RNA-seq declined, it was used to evaluate the performance of internal parts, such as promoters and terminators.
  • Ribosome profiling a new technique that enables the measurement of translational parts (e.g., ribosome binding sites), was applied and expression levels were inferred. Further, nitrogenase activity and the function of underlying parts were assessed as the clusters were moved between species. Interestingly, the native Klebsiella nif cluster could be transferred and it performed similarly but the refactored cluster yielded widely varying expression levels in the different hosts, sometimes leading to a total loss in activity. This could be recovered by maintaining the native operon structure in the refactored cluster, implying that it was not due to the synthetic sensors, T7 RNAP, or promoters/terminators. This is one of the hypothesized functions of operons.
  • translational parts e.g., ribosome binding sites
  • the present disclosure demonstrates the deregulation of nif clusters in A. caulinodans and P. protegens Pf-5, enabling them to be placed under the control of cereal root exudates. This derepresses the pathway in the presence of exogenous nitrogenous fertilizer—critical for the use of the bacterium as part of an integrated agricultural solution. Further, these organisms retain the ability to fix nitrogen in microaerobic environments, thus avoiding the need for a root nodule that enforces strict anaerobiosis. The complete deregulation of the nif pathway makes the bacterium non-competitive in the soil and lost quickly, thus limiting its impact to particular phases of the growth cycle. Thus, it is demonstrated that nitrogenase can be placed under the control of chemical root exudates.
  • a rhizobium that can fix nitrogen under aerobic free-living conditions comprising a symbiotic rhizobium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans.
  • a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions, comprising a bacterium having an exogenous nif cluster having at least one inducible promoter, wherein the exogenous nif cluster confers nitrogen fixation capability on the bacterium, under aerobic free-living conditions, and wherein the bacterium is not Azorhizobium caulinodans.
  • An Azorhizobium caulinodans capable of inducible ammonium-independent nitrogen fixation in a cereal crop comprising:
  • At least one operon comprising nifA and RNA polymerase sigma factor (RpoN), wherein the operon comprises a regulatory element including an inducible promoter.
  • a method of engineering a rhizobium that can fix nitrogen under aerobic free-living conditions comprising transferring an exogenous nif cluster to a symbiotic rhizobium, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium, under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans.
  • a method of producing nitrogen for consumption by a cereal plant comprising providing a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions in proximity of the cereal plant, wherein the plant growth promoting bacterium is a symbiotic bacterium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic bacterium, enabling nitrogen fixation under aerobic free-living conditions.
  • root exudate is selected from the group consisting of sugars, hormones, flavonoids, and antimicrobials.
  • inventive embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, inventive embodiments may be practiced otherwise than as specifically described and claimed.
  • inventive embodiments of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein.
  • a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
  • the phrase “at least one,” in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements.
  • This definition also allows that elements may optionally be present other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to those elements specifically identified.
  • “at least one of A and B” can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
  • Nif cluster Forward primer (SEQ ID NOs: 8-64) Reverse Primer (SEQ ID NOs: 65-121) Genomic location GenBank accession No. Klebsiella oxytoca CGTAGGGCGCATTAATGCAGCTGGCACGA GTGACGCTCGCGTATCAGGTTTG 3,897,443-3,909,294 CP020657.1 M5aI CAGGTGAATTC TAGACTGCTGGATACGCTGCTTAAGGTC TACGCTGTTTGAGCTGGCAAACCT ATCAGGCGCATATTTGAATGTATTTACTGCA 3,909,255-3,920,878 CP020657.1 GCGGCCGCTTCTAG AGTGACCAAAAGCTTCCGCAACCC Pseudomonas GCCCGGAGAGCAAGCCCGTAGGGCGCATT ACTACGCATCACTAGCAGGGCACGCACCGCG 1,410,207-1,414,229 NC_009434 stutzer
  • protegens Pf-5 controller v1 P tac -nifA
  • protegens Pf-5 controller v2 P tac -nifA v2
  • protegens Pf-5 controller v3 P tac -nifA v3
  • protegens Pf-5 controller v4 P BAD.10 -nifA
  • protegens Pf-5 controller v5 P Fde -nifA
  • IRBG74 ⁇ hsdR ⁇ nif ⁇ recA::P A1lacO1 -T7RNAP This generated by pMR82 v1 study MR20 This study MR21 R . sp. IRBG74 ⁇ hsdR ⁇ nif ⁇ recA::P A1lacO1 -T7RNAP This study generated by pMR83 v2 MR22 R . sp. IRBG74 ⁇ hsdR ⁇ nif ⁇ recA::P A1lacO1 -T7RNAP This study generated by pMR84 v3 MR23 R . sp.
  • leguminosarum 8002 Poole lab MR30 Sinorhizobium meliloti WSM1022 Poole lab MR31 R .
  • leguminosarum A34 Poole lab MR32 Sinorhizobium fredii HH103 Poole lab MR33 Sinorhizobium meliloti 1021 Poole lab MR34 R . tropici CIAT899 Poole lab MR35 R .
  • leguminosarum viciae 3841 Poole lab MR36 R . etli CFN42 Poole lab MR37 Agrobacterium tumefaciens C58 Poole lab
  • vinelandii DJ with the rnf2 operon p15A pMR27 pRO1600, Gentamicin rnf1 (5,168,156-5,162,716) operon deletion in the nif cluster of A .
  • vinelandii DJ p15A pMR28 pRO1600 Gentamicin fix operon (995,860-1,000,698) deletion in the nif cluster of A .
  • P T7 -gfpmut3b-mrfp1 pMR68 pRO1600 Gentamicin Plasmid for terminator characterization.
  • P T7 -gfpmut3b-mrfp1 ColE1 pMR69 pBBR1 Kanamycin LuxR, P Lux -gfpmut3b pMR70 pBBR1 Kanamycin TetR, P Tet -gfpmut3b pMR71 pBBR1 Kanamycin CymR, P Cym -gfpmut3b pMR72 pBBR1 Kanamycin PhlF, P Phl -gfpmut3b pMR73 pBBR1 Kanamycin NahR, P Sal -gfpmut3b pMR74 pRO1600, Gentamicin PhlF, P Phl -gfpmut3b ColE1 pMR75 pRO1600, Gentamicin TetR, P Tet -gfpmut3b Col
  • IRBG74 LacI, P A1lacO1 -T7RNAP (RBSr33 for T7RNAP) pMR83 p15A Gentamicin Controller for R . sp. IRBG74, LacI, P A1lacO1 -T7RNAP (RBSr32 for T7RNAP) pMR84 p15A Gentamicin Controller for R . sp. IRBG74, LacI, P A1lacO1 -T7RNAP (RBSr3 for T7RNAP) pMR85 p15A Gentamicin Controller for R . sp.
  • protegens Pf-5 LacI(Q18M/A47V/F161Y), P tac -nifA( P . stutzeri ) (RBSp32 for NifA) pMR98 ColE1 Tetracycline NifA controller for P .
  • caulinodans LacI, P A1lacO1 -nifA-rpoN pMR123 pBBR1 Gentamicin NifA controller for A .
  • caulinodans LacI, P A1lacO1 -nifA(L94Q)-rpoN pMR124 pBBR1 Gentamicin NifA controller for A .
  • caulinodans LacI, P A1lacO1 -nifA(D95Q)-rpoN( A . caulinodans ) pMR125 pBBR1 Gentamicin NifA controller for A .
  • caulinodans LacI, P A1lacO1 -nifA(L94Q/D95Q)-rpoN pMR126 pBBR1 Gentamicin NifA controller for A .
  • caulinodans NahR, P Sal -nifA(L94Q/D95Q)-rpoN pMR127 pBBR1 Gentamicin NifA controller for A .
  • caulinodans Promoter TGTCGCGTTTGAAACACGGGGCTTTTGGAACCGTTCGATTCTGCAATGCACTGATTTTACTTGATTAATTCGACCACACGACCA CTGGCACA CCCGTTGCAAAACCCCTTGGTGCAGGCGACGGGTTGCCGGTCTGGTTCGCGGATCTCCTCGATCCCCGGCTACCGACCCGCCTC CGAAAAGTCCGGTCCCGATCCAGTTCGGCGGGGCCACAC nifA of K.
  • RBS sequences used in this study Name Strain RBS sequence a (SEQ ID NOs: 226-291) Strength (GFP, au) RBSr1 R. sp. IRBG74 ATTTCACACATCTAGAGCTAATCATCTCGTACTAAAGAGGAGAAATTAA 8242 CC ATG RBSr2 R. sp. IRBG74 ATTTCACACATCTAGAGCTAATCATCGCGTACTCAGGAGGCAAGTA ATG 7181.5 RBSr3 R. sp. IRBG74 ATTTCACACATCTAGAATTAAAGAGGAGAAATTAACC ATG 6238.5 RBSr4 R. sp.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Medicinal Chemistry (AREA)
  • Molecular Biology (AREA)
  • Virology (AREA)
  • Plant Pathology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Physics & Mathematics (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Pest Control & Pesticides (AREA)
  • Dentistry (AREA)
  • Environmental Sciences (AREA)
  • Agronomy & Crop Science (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Disclosed herein are engineered rhizobia having nif clusters that enable the fixation of nitrogen under free-living conditions, as well as ammonium and oxygen tolerant nitrogen fixation under free-living conditions. Also provided are methods for producing nitrogen for consumption by a cereal crop using these engineered rhizobia.

Description

    RELATED APPLICATIONS
  • This application is a national stage filing under 35 U.S.C. § 371 of International Patent Application Number PCT/US2020/023646, filed Mar. 19, 2020, which claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Application Ser. No. 62/820,765, filed Mar. 19, 2019 and under 35 U.S.C. § 120 of U.S. application Ser. No. 16/746,215, filed on Jan. 17, 2020, the entire contents of each of which are incorporated by reference herein.
  • GOVERNMENT SUPPORT
  • This invention was made with Government support under Grant No. IOS1331098 awarded by the National Science Foundation (NSF). The government has certain rights in this invention.
  • BACKGROUND OF THE INVENTION
  • In agriculture, nitrogen is a limiting nutrient that needs to be added as fertilizer to those crops that cannot produce it on their own, including the cereals rice, corn, and wheat. In contrast, legumes are able to obtain nitrogen from the atmosphere using nitrogen-fixing bacteria that reside in root nodules. However, the majority of the world's calories are from cereals; thus, it has been a longstanding problem in genetic engineering to transfer this ability to these crops. This would reduce the need for nitrogenous fertilizer and the economic, environmental, and energy burdens that it brings.
  • SUMMARY OF THE INVENTION
  • The present disclosure is based, at least in part, rhizobia and methods for making rhizobia that can fix nitrogen under aerobic free-living conditions. The present disclosure also provides refactored nif-clusters that confer the ability to fix nitrogen under aerobic free-living conditions.
  • Accordingly, one aspect of the present disclosure provides a rhizobium that can fix nitrogen under aerobic free-living conditions, comprising a symbiotic rhizobium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans. In some embodiments, the exogenous nif cluster is from a free-living diazotroph. In some embodiments, the exogenous nif cluster is from a symbiotic diazotroph. In some embodiments, the exogenous nif cluster is from a photosynthetic Alphaproteobacteria. In some embodiments, the exogenous nif cluster is from a Gammaproteobacteria. In some embodiments, the exogenous nif cluster is from a cyanobacteria. In some embodiments, the exogenous nif cluster is from a firmicutes. In some embodiments, the exogenous nif cluster is from Rhodobacter sphaeroides. In some embodiments, the exogenous nif cluster is from Rhodopseudomonas palustris. In some embodiments, the exogenous nif cluster is an inducible refactored nif cluster. In some embodiments, the inducible refactored nif cluster is an inducible refactored Klebsiella nif cluster. In some embodiments, the rhizobium is IRBG74. In some embodiments, the exogenous nif cluster comprises 6 nif genes. In some embodiments, the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM. In some embodiments, each nif gene of the exogenous nif cluster is preceded by a T7 promoter. In some embodiments, the T7 promoter is a wild-type promoter. In some embodiments, the rhizobium further comprises an endogenous nif cluster. In some embodiments, the nif cluster has a nifV gene. In some embodiments, the nifV gene is endogenous. In some embodiments, the exogenous nif cluster further comprises a terminator. In some embodiments, the T7 promoter has a terminator and the terminator is downstream from the T7 promoter. In some embodiments, the exogenous nif cluster is a refactored v3.2 nif cluster as shown in FIG. 2H.
  • Another aspect of the present disclosure provides a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions, comprising a bacterium having an exogenous nif cluster having at least one inducible promoter, wherein the exogenous nif cluster confers nitrogen fixation capability on the bacterium, under aerobic free-living conditions, and wherein the bacterium is not Azorhizobium caulinodans. In some embodiments, the bacterium is a symbiotic bacterium. In some embodiments, the bacterium is an endophyte. In some embodiments, the endophyte is rhizobium IRBG74. In some embodiments, the bacterium is an epiphyte. In some embodiments, the epiphyte is Pseudomonas protogens PF-5. In some embodiments, the plant growth promoting bacterium is associated with a genetically modified cereal plant. In some embodiments, the genetically modified cereal plant includes an exogenous gene encoding a chemical signal. In some embodiments, the nitrogen fixation is under the control of the chemical signal. In some embodiments, the chemical signal is an opine (e.g., octopine, nopaine, or mannopine), phlorogluconol or rhizopene. In some embodiments, the exogenous nif cluster comprises 6 nif genes. In some embodiments, the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM. In some embodiments, the inducible promoter is a T7 promoter. In some embodiments, the inducible promoter is PA1lacO1 promoter. In some embodiments, the inducible promoter is activated by an agent selected from a group that includes IPTG, sodium salicylate, octapine, nopaline, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid. In some embodiments, the exogenous nif cluster further comprises a terminator. In some embodiments, the inducible promoter has a terminator and the terminator is downstream from the inducible promoter.
  • Another aspect of the present disclosure provides an Azorhizobium caulinodans capable of inducible ammonium-independent nitrogen fixation in a cereal crop, comprising: (i) a modified nif cluster, wherein an endogenous nifA gene is deleted or altered; and (ii) at least one operon comprising nifA and RNA polymerase sigma factor (RpoN), wherein the operon comprises a regulatory element including an inducible promoter. In some embodiments, the inducible promoter is PA1lacO1 promotor. In some embodiments, the inducible promoter is activated by an agent selected from the group consisting of IPTG, sodium salicylate, octapine, nopaline, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid. In some embodiments, the endogenous nifA gene is altered with at least one of the following substitutions: (i) L94Q, (ii) D95Q, and (iii) both L94Q and D95Q.
  • Another aspect of the present disclosure provides a method of engineering a rhizobium that can fix nitrogen under aerobic free-living conditions, comprising transferring an exogenous nif cluster to a symbiotic rhizobium, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium, under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans. In some embodiments, the exogenous nif cluster comprises 6 nif genes. In some embodiments, the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifF and nifUSVWZM. In some embodiments, each of the nif genes is preceded by a wild-type T7 promoter. In some embodiments, the exogenous nif cluster is transferred to the rhizobium in a plasmid. In some embodiments, the exogenous nif cluster further comprises a terminator. In some embodiments, the wild-type T7 promoter has a terminator, and the terminator is downstream from the wild-type T7 promoter. In some embodiments, the endogenous NifL gene is deleted.
  • Another aspect of the present disclosure provides a method of producing nitrogen for consumption by a cereal plant, comprising providing a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions in proximity of the cereal plant, wherein the plant growth promoting bacterium is a symbiotic bacterium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic bacterium, enabling nitrogen fixation under aerobic free-living conditions. In some embodiments, the plant growth promoting bacterium is a rhizobium. In some embodiments, the plant growth bacterium is a bacterium as described in the present disclosure. In some embodiments, the cereal plant is a genetically modified cereal plant. In some embodiments, the genetically modified cereal plant includes an exogenous gene encoding a chemical signal. In some embodiments, the nitrogen fixation is under the control of the chemical signal. In some embodiments, the chemical signal is opine, phlorogluconol or rhizopene. In some embodiments, the nitrogen fixation is under the control of a chemical signal. In some embodiments, the chemical signal is a root exudate, biocontrol agent or phytohormone. In some embodiments, the root exudate is selected from the group consisting of sugars, hormones, flavonoids, and antimicrobials. In some embodiments, the chemical signal is vanillate. In some embodiments, the chemical signal is IPTG, aTc, cuminic acid, DAPG, and salicylic acid, 3,4-dihydroxybenzoic acid, 3OC6HSL or 3OC14HSL.
  • In one aspect, the disclosure also provides a genetically engineered plant that can produce orthogonal carbon sources, such as opines or less common sugars, and bacteria with the corresponding catabolism pathways, which can respond to these signals.
  • In one aspect, the present disclosure provides a method for making a nitrogen-fixing bacterium, the method comprising a) identifying a host bacterium; b) selecting a donor bacterium having a nif cluster based on evolutionary distance between the host bacterium and the donor bacterium; and c) inserting the nif cluster of the donor bacterium to the host bacterium, thereby making a nitrogen-fixing bacterium. In some embodiments, the evolutionary distance between the host bacterium and the donor bacterium is less than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.1% substitutions per site in 16S ribosomal RNA gene sequence. In some embodiments, the host bacterium and the donor bacterium are in the same genus, family, order, or class. In some embodiments, the donor bacterium is selected from Klebsiella, Pseudomonas, Azotobacter, Gluconacetobacter, Azospirillum, Azorhizobium, Rhodopseudomonas, Rhodobacter, Cyanothece, or Paenibacillus genus. In some embodiments, the host bacterium is selected from the group consisting of E. coli, Pseudomonas protegens Pf-5, and Rhizobium IRBG74. In some embodiments, the donor bacterium is selected from the group consisting of K. oxytoca, P. stutzeri, A. vinelandii, G. diazotrophicus, A. brasilense, A. caulinodans, R. palustris, R. sphaeroides, Cyanothece, and Paenibacillus. In some embodiments, the host bacterium is E. coli and the donor bacterium is K. oxytoca. In some embodiments, the host bacterium is Pseudomonas protegens Pf-5, and the donor bacterium is P. stutzeri. In some embodiments, the host bacterium is Rhizobium IRBG74, and the donor bacterium is R. sphaeroides. In some embodiments, the host bacterium is a nonsymbiotic bacterium, e.g., Azotobacter, Beijerinckia, or Clostridium bacterium. In some embodiments, the host bacterium is a symbiotic bacterium, e.g., Rhizobium, Frankia, or Azospirillum bacterium. In some embodiments, the host bacterium is symbiotic with a leguminous plant, an actinorhizal plant, or a cereal crop. In some embodiments, the inserted nif cluster is under inducible control.
  • In one aspect, the present disclosure provides a method of selecting a nif cluster of a donor bacterium that is compatible with a host bacterium, the method comprising a) performing a phylogenetic analysis for the donor bacterium and the host bacterium; b) determining evolutionary distance based on the phylogenetic analysis between the donor bacterium and the host bacterium is less than a reference value; and c) selecting the nif cluster of the donor bacterium for the host bacterium. In some embodiments, the phylogenetic analysis is performed by using distance-matrix, maximum parsimony, maximum likelihood, or Bayesian inference. In some embodiments, the phylogenetic analysis is performed by analyzing ribosomal RNA (e.g., 16s rRNA) substitution rate. In some embodiments, the reference value is 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.1% substitutions per site in 16S ribosomal RNA gene sequence. In some embodiments, the reference value is 500, 400, 300, 200, 100, 50, or 10 million years. In some embodiments, the method further comprises inserting the nif cluster to the host bacterium and evaluating the nitrogen fixation activity. In some embodiments, the nif cluster is under inducible control.
  • In one aspect, the present disclosure provides a bacterium comprising a nif cluster, where the nif cluster is under control of an exogenous control genetic element. In some embodiments, the nif cluster is an endogenous or exogenous nif cluster. In some embodiments, the exogenous control genetic element initiates promoter activities in response to an inducer (e.g., a chemical signal). In some embodiments, the promoter activities are measured by the below equation:
  • δ J = γ n [ i = x tss + 1 x tss + 1 + n m ( i ) - i = x tss - 1 x 0 - 1 - n m ( i ) ]
  • where m(i) is number of transcripts at each position i from the FPKM normalized transcriptomic profiles, γ=0.0067 s−1 is the degradation rate of mRNA and n is the window length before and after xtss. In some embodiments, the inducer is delivered to the bacterium by chemical delivery or biocontrol delivery. In some embodiments, the inducer is a chemical signal in seeds (e.g., cuminic acid), a native root exudate (e.g., arabinose, salicylic acid, vanillic acid, or narigenin), a chemical signal from a bacterium (e.g., 3OC6HSL, 3OC14HSL, DHBA, or DAPG), or a chemical signal from a genetically modified plant (e.g., Nopaline or Octopine).
  • The details of one or more embodiments of the invention are set forth in the description below. Other features or advantages of the present invention will be apparent from the following drawings and detailed description of several embodiments, and also from the appended claims.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure, which can be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein. For purposes of clarity, not every component may be labeled in every drawing. It is to be understood that the data illustrated in the drawings in no way limit the scope of the disclosure. In the drawings:
  • FIGS. 1A-1F include diagrams showing transfer of nif clusters across species. (FIG. 1A) Eight nif clusters from free-living nitrogen fixing bacteria are aligned based on phylogenetic relationships of 16S rRNA sequences. The genes and operons are based on K. oxytoca M5al. Dots in the DNA line indicate where multiple regions were cloned from genomic DNA and combined to form one large plasmid-borne nif cluster. A complete list of strain genotypes is provided in Table 3. Nitrogenase activity from transfer of the native nif clusters was measured in three species. The activities of the R. palustris and R. sphaeroides nif clusters were also measured in 12 Rhizobia strains. Asterisks indicate ethylene production below the detection limit (<10 a.u.). Error bars represent s.d. from three independent experiments. (FIG. 1B) Transcriptomic profile of the native K. oxytoca nif cluster in K. oxytoca, compared with those obtained from its transfer to the indicated species. (FIG. 1C) Transcription levels (FPKM) of the native K. oxytoca nif cluster across species. Transcriptional units are underlined. (FIG. 1D) Transcription levels (FPKM) of the K. oxytoca nif genes in K. oxytoca (→Klebsiella) compared to that obtained when transferred to a new host. (FIG. 1E) Same as in (FIG. 1C), except the translational efficiency is compared, as calculated using ribosome profiling. (FIG. 1F) Same as in (FIG. 1D), except the ribosome densities (RD) are compared, as calculated using ribosome profiling. R2 in log-log plots was calculated from the line (y=x+b), where b is an expression variable between hosts.
  • FIGS. 2A-2M include diagrams showing the transfer of the refactored K. oxytoca nif clusters to R. sp. IRBG74. (FIG. 2A) The genetic systems for the controller for E. coli MG1655 (left) and R. sp. IRBG74 (right) are shown. A variant of T7 RNAP (R6232S, N-terminal lon tag, GTG start codon) is used for the E. coli controller. Several genetic parts were substituted to build the R. sp. IRBG74 controller (FIG. 16). The sequences for the genetic parts are provided in Table 5. (FIG. 2B) The response functions for the controllers with the reporter plasmid pMR-79 (Table 4 and Table 5). The IPTG concentrations used to induce nitrogenase were circled. (FIG. 2C) The genetic parts used to build the refactored v2.1 nif gene cluster are shown (Table 5). (FIG. 2D) The activity of the refactored nif gene cluster v2.1 in different hosts is shown. Asterisks indicate ethylene production below the detection limit (<10 a.u.). (FIG. 2E) The activities of the v2.1 promoters and terminators in E. coli MG1655 and R. sp. IRBG74 as calculated from RNA-seq data (see Materials and Methods). (FIG. 2F) The translation efficiency of the v2.1 nif genes in E. coli MG1655 and R. sp. IRBG74, as calculated using ribosome profiling and RNA-seq. Lines connect points that occur in the same operon. (FIG. 2G) The ribosome density (RD) is compared for the refactored v2.1 nif genes in a new host (E. coli MG155; R. sp. IRBG74) versus that measured for the nif genes from the native K. oxytoca cluster in K. oxytoca (→Klebsiella). The points corresponding to nifH is marked H. (FIGS. 2H-2L) The same as (FIGS. 2C-2G) but with the refactored nif cluster v3.2. Genetic parts are provided in Table 5. (FIG. 2M) Nitrogenase activity is shown as a function of T7 promoter strength. The refactored nif cluster v3.2 was expressed from three controller strains with varying strengths (FIG. 16). Error bars represent s.d. from three independent experiments.
  • FIGS. 3A-3F include diagrams showing the control of nitrogen fixation in A. caulinodans ORS571. (FIG. 3A) The controller is shown, carried on a pBBR1 origin plasmid (genetic parts are provided in Table 5). NifA and RpoN co-induce the expression of three sites in the genome (identified by consensus NifA binding sequences). (FIG. 3B) Expression from the nifH promoter was evaluated using a fluorescent reporter (see Materials and Methods). NifA and RpoN were complemented (+) individually or in combination in the A. caulinodans ΔnifA strain where the genomic rpoN remains intact. (FIG. 3C) The response function for the induction of the nifH promoter by the controller is shown. (FIG. 3D) The nitrogenase activity is shown for wild-type A. caulinodans ORS571 compared to the ΔnifA complemented with the controller plasmid (+) and the addition of 1 mM IPTG (+). (FIG. 3E) The effect of the absence or presence of 10 mM ammonium chloride is shown. The WT NifA from A. caulinodans ORS571 is compared to different combinations of amino acid substitutions with additional RpoN expression. NifA/RpoN expression is induced by 1 mM IPTG (+) for the ΔnifA strain containing the controller plasmid pMR-121, 122, 123, and 124 (+). Asterisks indicate ethylene production below the detection limit (<10 au). (FIG. 3F) The nitrogenase activity is shown as a function of the oxygen concentration in the headspace (see Materials and Methods). The native nif cluster (wild-type A. caulinodans ORS571) is compared to the inducible version including the controller plasmid and 1 mM IPTG. Error bars represent s.d. from three independent experiments.
  • FIGS. 4A-4F include diagrams showing Nitrogenase activity of the inducible nif clusters in Pseudomonas protegens Pf-5. (FIG. 4A) The controllers, based on P. stutzeri NifA, were used for all three clusters. Plasmids and genetic parts are provided in Table 4 and Table 5. (FIG. 4B) The nif clusters from K. oxytoca, P. stutzeri, and A. vinelandii are shown. The deleted regions corresponding the NifLA regulators are marked. The dotted lines indicate that multiple regions from the genome were cloned and combined for form the nif cluster. The clusters were carried the plasmids pMR-4, 6, 8 (Table 4). (FIG. 4C) The induction of the nifH promoters from each species by the controller are shown (0.5 mM IPTG) (see Materials and Methods). (FIG. 4D) The nitrogenase activities of the native cluster (intact nifLA) is compared to the inducible clusters in the presence and absence of 0.5 mM IPTG. The dashed lines indicate the activity of the native clusters in the wild-type context (top to bottom, K. oxytoca M5al, P. stutzeri A1501 and A. vinelandii DJ). (FIG. 4E) The sensitivity of the native and inducible (+0.5 mM IPTG) nif clusters to 17.1 mM ammonium acetate are compared. Asterisks indicate ethylene production below the detection limit (<10 au). (FIG. 4F) The nitrogenase activity is shown as a function of the oxygen concentration in the headspace (see Materials and Methods). The native nif cluster is compared to the inducible version including the controller plasmid and 0.5 mM IPTG. Error bars represent s.d. from three independent experiments.
  • FIGS. 5A-5D include diagrams showing the control of nitrogenase activity with sensors that respond to diverse chemicals in the rhizosphere. (FIG. 5A) Schematic showing the origins of the chemicals. “Introduced DNA” refers to the genetic modification of the plant to produce nopaline and octopine. (FIG. 5B) The genetic sensors built for A. caulinodans are shown. Sequences for the genetic parts are provided in Table 5. (FIG. 5C) The response functions for the sensors are shown. Either the sensor expresses T7 RNAP, which then activates PT7, or it expresses NifA (P. protegens Pf-5) or NifA/RpoN (A. caulinodans) and activates the nifH promoter (species origin in parentheses). (FIG. 5D) The nitrogenase activity is measured in the presence or absence of inducer (see Materials and Methods). The refactored Klebsiella nif clusters v2.1 and v3.2 were used in E. coli MG1655 and R. sp. IRBG74, respectively. The inducible A. vinelandii nif cluster was used in P. protegens Pf-5. The controller containing nifA/rpoN was used in A. caulinodans ΔnifA. The inducer concentrations are: 50 μM vanillic acid, 500 μM DHBA, 50 μM cuminic acid, 25 nM 3OC6HSL, 500 nM 3OC14HSL, 33 μM arabinose, 100 μM naringenin, 100 nM DAPG, 200 μM salicylic acid, 1 mM nopaline and 1 mM octopine. Error bars represent s.d. from three independent experiments.
  • FIG. 6 includes a plot of the growth curve of R. sp. IRBG74 in UMS minimal medium with varying carbon sources. Cultures grown overnight in 2 mL TY medium in 15 mL culture tubes at 30° C. and 250 rpm were diluted to an OD600 of 0.02 into 1 mL of UMS minimal medium plus varying carbon sources in 96-deepwell plates and incubated for 16 hours at 30° C. and 900 rpm. Bacterial growth was spectrophotometrically monitored at OD600 nm. Error bars represent s.d. from three independent experiments.
  • FIGS. 7A-7F include diagrams showing the nitrogenase activity when different inducible nif clusters are transferred to E. coli MG1655. (FIG. 7A) The same controller system based on K. oxytoca NifA was used for all three clusters. The controller plasmid pMR-99 and genetic parts are provided in Table 4 and Table 5. (FIG. 7B) The nif clusters from K. oxytoca, P. stutzeri, and A. vinelandii are shown. The deleted regions corresponding the NifLA regulators are marked. The dotted lines indicate that multiple regions from the genome were cloned and combined for form the nif cluster. The clusters were carried the plasmids pMR-3, 5, 7 (Table 4). (FIG. 7C) The induction of the nifH promoters from each species by the controller is shown (50 μM IPTG) (see Materials and Methods) (FIG. 7D) The nitrogenase activities of the native cluster (intact nifLA) is compared to the inducible clusters in the presence and absence of 50 μM IPTG. The dashed lines indicate the activity of the native clusters in the wild-type context (top to bottom, K. oxytoca M5al, P. stutzeri A1501 and A. vinelandii DJ). (FIG. 7E) Regulation of nitrogenase activity by ammonia. Ammonium tolerance of nitrogenase from the native (black bar) and inducible (gray bar) systems was tested in the presence of 17.1 mM ammonium acetate. Asterisks indicate ethylene production below the detection limit (<10 au). (FIG. 7F) Regulation of nitrogenase activity by oxygen. The native nif cluster is compared to the inducible version including the controller plasmid and 50 μM IPTG. Nitrogenase activities were measured after 3 h of incubation at constant oxygen concentrations (0 to 3%) in the headspace (see Materials and Methods). Error bars represent s.d. from three independent experiments.
  • FIGS. 8A-8B include plots showing ammonium repression of the transferred nif clusters. Nitrogenase sensitivity to ammonium was measured by nitrogenase assay in the absence (−) or presence (+) of 17.1 mM ammonium acetate. The sensitivity of the native and inducible nif clusters in E. coli MG1655 (FIG. 8A) and P. protegens Pf-5 (FIG. 8B). Note that the data are from FIGS. 4A-4F and FIGS. 7A-7F. The nif clusters were induced by 50 μM and 0.5 mM IPTG in E. coli MG1655 and P. protegens Pf-5, respectively. Asterisks indicate ethylene production below the detection limit (<10 au). Error bars represent s.d. from three independent experiments.
  • FIG. 9 includes a diagram showing the ribosome profiling data for the K. oxytoca native nif cluster in K. oxytoca M5al, E. coli MG1655, P. protegens Pf-5 and R. sp. IRBG74 (see Materials and Methods).
  • FIGS. 10A-10B include diagrams showing the effect of NifA overexpression on the nifH promoter activity in R. sp. IRBG74. (FIG. 10A) The reporter construct used to measure nifH promoter activity is shown. The nifH promoter activity was analyzed in the R. sp. IRBG74 wild-type background using flow cytometry. Additional copies of NifA of R. sp. IRBG74 increased activity of the R. sp. IRBG74 nifH promoter but failed to complement or enhance activity of the other nifH promoters including K. oxytoca, P. stutzeri and A. caulinodans. Error bars represent s.d. from three independent experiments. (FIG. 10B) Plasmid maps used to assess the effect of NifA overexpression in R. sp. IRBG74. WT, wild-type; Rsp, R. sp. IRBG74; Kox, K. oxytoca M5al; Pst, P. stutzeri A1501; Aca, A. caulinodans ORS571
  • FIGS. 11A-11C include diagrams showing Promoter characterization in R. sp. IRBG74 and P. protegens Pf-5. (FIG. 11A) Constitutive promoters are rank-ordered by their strength. Plasmids used to measure promoter activity are depicted on the top. (FIG. 11B) The strength of the T7 promoter wild-type and its variants was analyzed in the controller strains containing the IPTG-inducible T7 RNAP on the genome of R. sp. IRBG74 and P. protegens Pf-5 with 1 mM IPTG induction. A reporter plasmid used to measure T7 promoter activity is shown on the right. (FIG. 11C) Correlation of T7 promoter strength between species. Error bars represent s.d. from three independent experiments.
  • FIGS. 12A-12B include diagrams showing RBS characterization in R. sp. IRBG74 and P. protegens Pf-5. RBS library for GFP was designed using the RBS library calculator at the highest-resolution mode. (FIG. 12A) The strengths of the synthetic RBSs in R. sp. IRBG74 were analyzed in the plasmid pMR-40 containing the IPTG-inducible system with 1 mM IPTG induction. 33 of the RBSs spanning a range of 5,684-fold expression were selected and their sequences are provided in Table 6. (FIG. 12B) The strengths of the synthetic RBSs in P. protegens Pf-5 was analyzed in the plasmid pMR-65 containing the arabinose-inducible system with 7 μM arabinose induction. 33 of the RBSs spanning a range of 1,075-fold expression were selected and their sequences are provided in Table 6.
  • FIGS. 13A-13B include diagrams showing the characterization of terminators for T7 RNAP in R. sp. IRBG74 (FIG. 13A) and P. protegens Pf-5 (FIG. 13B). (FIG. 13A) The strength of terminators was analyzed in the controller R. sp. IRBG74 strains MR16 containing the IPTG-inducible T7 RNAP on the genome with 1 mM IPTG induction. (FIG. 13B) Plasmids used to measure terminator strength are shown on right. Genetic parts are provided in Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 14 includes diagrams showing the response functions for the sensors in R. sp. IRBG74. Plasmids used to characterize the sensors are shown on top of each panel and provided in Table 4. Genetic parts are provided in Table 5. Error bars represent s.d. from three independent experiments. Experimental details are provided in Methods.
  • FIGS. 15A-15C include diagrams showing the response functions for the sensors in P. protegens Pf-5. The output changes as a function of input inducer concentrations. Plasmids used to characterize the sensors are shown on top of each panel. (FIG. 15A) Inducible promoter characterization in P. protegens Pf-5. (FIG. 15B) Optimization of the arabinose-inducible systems. Constitutive expression of a plasmid-borne AraE transporter decreased a dissociation constant of arabinose (dark gray). A mutation in the −10 region (TACTGT to TATATT) of the PBAD promoter increased promoter strength (black). (FIG. 15C) Optimization of IPTG-inducible systems. The IPTG-inducible promoters were induced by 1 mM IPTG. The combination of the Ptac promoter and the LacI (Q18M/A47V/F161Y) protein yielded an expression range of 110-fold. Plasmids and genetic parts are provided in Table 4 and Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 16 includes diagrams showing the tuning controller strength in R. sp. IRBG74. The controller containing the IPTG-inducible T7 RNAP is integrated into the genome of R. sp. IRBG74 (top). Controller strengths were adjusted by modulating the RBS of T7 RNAP in the plasmids pMR-81, 82, and 83. Response functions of the T7 promoter were measured with the reporter plasmid pMR-79 (right) in the R. sp. IRBG74 controller strains MR16, MR17, and MR18. Genetic parts and RBS sequences are provided in Table 5 and Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 17 includes a plot showing the nitrogenase activity of the refactored nif clusters across species. Error bars represent s.d. from three independent experiments.
  • FIG. 18 includes diagrams showing RNA-seq (top) and Ribosome profiling (bottom) data, respectively in E. coli MG1655 and R. sp. IRBG74. The nif genes were induced by 1 mM IPTG for 6 hours (see Materials and Methods).
  • FIG. 19 includes diagrams showing RNA-seq (top) and ribosome profiling (bottom) data, respectively, in E. coli MG1655 and P. protegens Pf-5 and R. sp. IRBG74. The nif genes were induce by 1 mM, 0.1 mM, and 0.5 mM IPTG for 6 h in E. coli MG1655, P. protegens Pf-5 and R. sp. IRBG74, respectively (see Materials and Methods).
  • FIGS. 20A-20F include diagrams showing the transfer of the refactored nif cluster v3.2 in P. protegens Pf-5. (FIG. 20A) Controllers whose output is T7 RNAP from the genome of P. protegens Pf-5 are described. Substituted genetic parts including a new RBS and IPTG-inducible promoter for the controller optimization compared to the controller module pKT249 in E. coli MG1655 highlighted in red. The response functions for the controllers with the reporter plasmid pMR-80 was measured in the P. protegens Pf-5 controller strain MR7. Controllers driving the expression of GFP by the T7 promoter achieved large dynamic to 96-fold activation by IPTG. Error bars represent s.d. from three independent experiments. (FIG. 20B) The genetic parts used to build the refactored v3.2 nif gene cluster are shown (Table 5). (FIG. 20C) The activity of the refactored nif cluster v3.2. Nitrogenase expression was induced by 1 mM IPTG. (FIG. 20D) Function of the transcriptional parts of the cluster v3.2 was analyzed by RNA-seq (FIG. 19). The performance of the promoters (left) and terminators (right) was calculated (see Materials and Methods). (FIG. 20E) The translation efficiency of the nif genes v3.2 as calculated using ribosome profiling and RNA-seq. Lines connect points that occur in the same operon. (FIG. 20F) The ribosome density (RD) is compared for the refactored v3.2 nif genes in P. protegens Pf-5 versus that measured for the nif genes from the native K. oxytoca cluster in K. oxytoca (→Klebsiella).
  • FIG. 21 includes diagrams showing the response function of inducible promoters in A. caulinodans ORS571. Plasmids used to characterize inducible promoters are shown on top of each panel and provided in Table 4. Genetic parts are provided in Table 5. Error bars represent s.d. from three independent experiments.
  • FIG. 22 includes a diagram showing the multiple sequence alignment of NifA of A. caulinodans ORS571 with R. spheroides 2.4.1 was generated using MUSCLE2. The corresponding residues for ammonium tolerance in R. sphaeroides are outlined. The A. caulinodans strand corresponds to SEQ ID NO: 293, and the R. sphaeroides strand corresponds to SEQ ID NO: 292.
  • FIGS. 23A-23B include diagrams showing functional testing of the NifA homologues that activate the nifH promoters. (FIG. 23A) The ability of the various NifA to activate the nifH promoters was tested with pairwise combinations of the nifH promoters and the NifA in E. coli MG1655 and P. protegens Pf-5. Error bars represent s.d. from three independent experiments. (FIG. 23B) Plasmids used to measure nifH promoter activity by NifA overexpression are shown and provided in Table 4. Genetic parts are provided in Table 5. Pst, P. stutzeri A1501; Avi, A. vinelandii DJ; Kox, K. oxytoca M5al
  • FIGS. 24A-24B include diagrams showing optimization of the controllers in P. protegens Pf-5 and E. coli MG1655 that induce the nifH promoters. (FIG. 24A) The controllers with different strengths were designed by RBS replacement and tested with the reporter plasmids (pMR103-105) in which each of the three nifH promoter is fused to sfgfp (Methods). The nifH promoters were induced with 0.5 mM IPTG. Genetic parts and RBS sequences are provided in Table 5 and 6, respectively. (FIG. 24B) Activation of the nifH promoters in the E. coli MG1655 containing the controller plasmid pMR102 was tested with the reporter plasmids pMR106-108. The P. protegens Pf-5 controller strain MR10 was used to drive expression of the nifH promoter of K. oxytoca and the controller strain MR9 was used to drive expression of the nifH promoters of P. stutzeri and A. vinelandii. The nifH promoters were induced with 0.05 mM IPTG and 0.5 mM IPTG in E. coli MG1655 and P. protegens Pf-5, respectively. Error bars represent s.d. from three independent experiments.
  • FIG. 25 includes diagrams showing the effect of oxygen on the activity of the nifH promoters. Expression from the nifH promoters was analyzed in E. coli MG1655 containing the controller plasmid pMR102, P. protegens Pf-5 MR10 (for K. oxytoca) and MR9 (for P. stutzeri and A. vinelandii) at varying initial oxygen levels in the headspace. The three nifH promoters were induced with 0.05 mM IPTG and 0.5 mM IPTG in E. coli MG1655 and P. protegens Pf-5, respectively, and incubated at varying initial oxygen concentrations. Oxygen has no effects on nifH expression in both strains. Error bars represent s.d. from three independent experiments.
  • FIGS. 26A-26B include diagrams describing the nitrogenase activity assay. (FIG. 26A) Nitrogenase activity assay at constant oxygen levels in the headspace. Experimental setup used in this study to analyze oxygen tolerance of nitrogenase. Following the expression induction of nitrogenase with preincubation under low oxygen conditions, targeted oxygen concentrations in the headspace is maintained by oxygen spiking while monitoring with oxygen monitoring system (Methods). (FIG. 26B) Nitrogenase activity in E. Coli MG1655 and P. protegens Pf-5 over a course of three hours.
  • FIG. 27 includes diagrams showing the effect of the rnf and fix complex on nitrogenase activity. The modified nif clusters of A. vinelandii on the plasmids pMR25-28 were analyzed in the controller strain P. protegens Pf-5 MR9. The deleted regions from the clusters were provided in Table 4. Nitrogenase was induced with 0.5 mM IPTG. Removing the rnf complex from the cluster abrogated activity. The cluster without the fixABCX complex showed identical oxygen tolerance to the cluster with the complex. Error bars represent s.d. from three independent experiments.
  • FIGS. 28A-28C include diagrams showing regulation of nitrogenase activity in E. coli MG1655 “Marionette” strain5. (FIG. 28A) Controller plasmids used to drive expression of T7 promoters. (FIG. 28B) Inducibility of the T7 promoter by the controller plasmids encoding T7 RNAP under the regulation of the 12 sensors was tested with a reporter plasmid pMR121 (right). (FIG. 28C) Inducible control of nitrogenase activity in response to 12 inducers was with the plasmid pMR136 (right) carrying the refactored nif cluster v2.1 on pBBR1 origin. The choline-Cl inducible system was omitted for activity assay as the system was not inducible. For the DAPG-, DHBA-, and vanillic acid-inducible system, the refactored cluster was carried on a lower copy number plasmid pMR31 (right) as transformation of the plasmid pMR29 gave rise to no colony formation. The inducers concentrations are: 400 μM arabinose, 1 mM choline-Cl, 500 nM 3OC14HSL, 50 μM cuminic acid, 25 nM 3OC6HSL, 25 μM DAPG, 500 μM DHBA, 1 mM IPTG, 100 nM aTc, 250 μM naringenin, 50 μM vanillic acid, and 250 μM salicylic acid. Plasmid and genetic parts are provided in Table 4 and 5. Error bars represent s.d. from three independent experiments.
  • FIG. 29 includes schematic plasmid maps used to assess the effect of inducible expression of NifA/RpoN on the activity of the nifH promoter in A. caulinodans ORS571.
  • FIG. 30A-30B include diagrams showing the phylogenetic relationships of 10 diazotrophs based on 16S ribosomal RNA sequences. The scale bar indicates 2% substitutions per site. The clusters based on evolutionary closeness are circled. FIG. 30B shows the relative nitrogenase activity in three host strains (E. coli, Pseudomonas protegens Pf-5, and Rhizobium sp. IRBG74) carrying each of the 10 nif clusters. The result suggests that the phylogenetic closeness has a predictive power for achieving highest nitrogenase activity in a new host that lacks an endogenous nif cluster.
  • DETAILED DESCRIPTION OF THE INVENTION
  • Nitrogen fixation in the root nodules of leguminous plants is a major contributor to world food production and therefore, the practical applications of this field are of major interest. Legumes obtain nitrogen from air through bacteria residing in root nodules, some species of which also associate with cereals but do not fix nitrogen under these conditions. Disabling native regulation can turn on expression, even in the presence of nitrogenous fertilizer and low O2, but continuous nitrogenase production confers an energetic burden.
  • The present disclosure in some aspects describes the surprising discovery that bacteria can be genetically altered in a manner that will enable the bacteria to deliver fixed nitrogen to cereal crops. Several strategies to implement control over nitrogen fixation in bacteria that live on or inside the roots of cereals are described. At least two approaches can be taken. In one embodiment, the native regulation is replaced. In alternative embodiments, a nif cluster is transferred from another species and placed under inducible control. The Examples section below includes a description of the achievement of these two approaches in multiple species with multiple constructs. For example, A. caulinodans, ammonium-independent control can be achieved using a sensor to drive the co-expression of a NifA mutant and RpoN in a ΔnifA strain. Rhizobium sp. IRBG74 can be engineered to express functional nitrogenase under free living conditions either by transferring a native nif cluster from Rhodobacter or a refactored cluster from Klebsiella. Multiple approaches enable P. protegens Pf-5 to express functional nitrogenase, of which the transfer of the nif cluster from Azotobacter vinelandii DJ yields the highest activity and O2 tolerance.
  • To date, it has not been shown that a Rhizobium strain can be engineered to fix nitrogen under free-living conditions when it does not do so naturally. Some Rhizobia isolated from legume root nodules are also cereal endophytes, however most are unable to fix nitrogen under free-living conditions (outside of the nodule) (Ramachandran, V. K., East, A. K., Karunakaran, R., Downie, J. A. & Poole, P. S. Adaptation of Rhizobium leguminosarum to pea, alfalfa and sugar beet rhizospheres investigated by comparative transcriptomics. Genome biology 12, R106 (2011); Frans, J. et al. in Nitrogen Fixation 33-44 (Springer, 1990)). There have been reports of cereal yield improvements due to these bacteria, including a 20% increase for rice by Rhizobium sp. IRBG74, but this is likely due to other growth-promoting mechanisms, such as improved nutrient uptake or root formation (Ramachandran, V. K., East, A. K., Karunakaran, R., Downie, J. A. & Poole, P. S. Adaptation of Rhizobium leguminosarum to pea, alfalfa and sugar beet rhizospheres investigated by comparative transcriptomics. Genome biology 12, R106 (2011); Delmotte, N. et al. An integrated proteomics and transcriptomics reference data set provides new insights into the Bradyrhizobium japonicum bacteroid metabolism in soybean root nodules. Proteomics 10, 1391-1400 (2010); Hoover, T. R., Imperial, J., Ludden, P. W. & Shah, V. K. Homocitrate is a component of the iron-molybdenum cofactor of nitrogenase. Biochemistry 28, 2768-2771 (1989)). Azorhizobium caulinodans ORS571 is exceptional because it is able to fix nitrogen in both aerobic free-living and symbiotic states, has been shown to be a rice and wheat endophyte, and does not rely on plant metabolites to produce functional nitrogenase. However, when Rhizobia or Azorhizobium species are living in cereal roots, there is low nitrogenase expression and 15N2 transfer rates suggest any reported uptake is due to bacterial death.
  • Cereal Crops, Nitrogen Fixation, and Bacteria
  • Cereal crops are broadly defined as any grass cultivated for the edible components of its grain (also referred to as caryopsis), composed of the endosperm, germ, and bran. Cereal crops are considered staple crops in many parts of the world. They are grown in greater quantities and provide more food energy worldwide than any other type of crop. Non-limiting examples of cereal crops include maize, rye, barley, wheat, sorghum, oats, millet and rice. As used herein, the terms “cereal crop” and “cereal plant” are used interchangeably.
  • Nitrogen fixation is the process by which atmospheric nitrogen is assimilated into organic compounds as part of the nitrogen cycle. The fixation of atmospheric nitrogen associated with specific legumes is the result of a highly specific symbiotic relationship with rhizobial bacteria. These indigenous bacteria dwell in the soil and are responsible for the formation of nodules in the roots of leguminous plants as sites for the nitrogen fixation. Most Rhizobium symbioses are confined to leguminous plants. Furthermore, Rhizobium strains which fix nitrogen in association with the agriculturally-important temperate legumes are usually restricted in their host range to a single legume genus.
  • The nif genes are genes encoding enzymes involved in the fixation of atmospheric nitrogen into a form of nitrogen available to living organisms. The primary enzyme encoded by the nif genes is the nitrogenase complex which converts atmospheric nitrogen (N2) to other nitrogen forms (e.g. ammonia) which the organism can process. As used herein, the term “nif cluster” refers to a gene cluster comprising nif genes. As used herein, the term “refactored” refers to an engineered gene cluster, i.e. its genes have reordered, deleted or altered in some way.
  • Rhizobia are diazotrophic bacteria. In general, they are gram negative, motile, non-sporulating rods. In terms of taxonomy, they fall into two classes: alphaproteobacteria and betaproteobacteria. Non-limiting examples of rhizobia include Azorhizobium caulinodans, Rhizobium(R.) sp. IRBG74, R. radiobacter, R. rhizogenes, R. rubi, R. vitis, Alfalfa Rhizobia (R. meliloti), Chickpea Rhizobia (Rhizobium sp.), Soybean Rhizobia (Bradyrhizobium japonicum), Leucaena Rhizobia (Rhizobium sp.), R. leguminosarum by trifolii, R. leguminosarum by phaseoli, and Rhizobium leguminosarum by viciae (see for example U.S. Pat. No. 7,888,552, herein incorporated by reference). In some embodiments, the rhizobia of the present invention are Azorhizobium caulinodans. In some embodiments, the rhizobia of the present invention are not Azorhizobium caulinodans.
  • As used herein, the term “free-living conditions” refers to a bacterium (e.g. rhizobium) that is not within a leguminous root nodule. It generally refers to something that has not formed a parasitic (or dependent) relationship with another organism or is not on a substrate. As used herein, the term “symbiotic” refers to the interaction between two organisms living in close proximity. Close proximity can be about 0.2 μm, 0.4 μm, 0.6 μm, 0.8 μm, 1 μm, 5 μm, 10 μm, 20 μm, 50 μm, 100 μm, 500 μm, 1 mm, 1 cm, 5 cm, 10 cm. Close proximity can also be less than 0.2 μm. In many cases, a symbiotic relationship refers to a mutually beneficial interaction.
  • As used herein, “aerobic free-living conditions” refer to conditions under which a bacterium is not within a leguminous root nodule and the bacterium is in the presence of oxygen. Aerobic free-living conditions can also be referred to as nonsymbiotic or non-parasitic conditions in the presence of oxygen. The bacterium can be in close proximity to a crop, as defined above.
  • As used herein, the term “endophyte” refers to a group of organisms, often fungi and bacteria, that live within living plant cells for at least part of its life cycle without having an apparent detrimental effect on the plant cell. This is contrasting with an epiphyte, which is a plant that grows on another plant, without being parasitic.
  • As used herein, the term “diazotroph” refers to microorganisms that are able to grow without external sources of fixed nitrogen. The group includes some bacteria and some archae. There are free-living and symbiotic diazotrophs. An example of a free-living diazotroph is Klebsiella pneumoniae. K. pneumoniae is a facultative anaerobes—these species can grow either with or without oxygen, but they only fix nitrogen anaerobically.
  • As used herein, the term “Alphaproteobacteria” refers to a diverse class of bacteria falling under the phylum Proteobacteria. Non-limiting examples of Alphaproteobacteria include species Rhodobacter sphaeroides and Rhodopseudomonas palustris. As used herein, the term “Gammaproteobacteria” refers to another class of bacteria falling under the phylum of Proteobacteria. All proteobacteria are gram negative. As used herein, the term “Cyanobacteria” refers to a phylum of bacteria that obtain their energy through photosynthesis. They are also referred to as Cyanophyta. They have characteristic internal membranes and thylakoids, the latter being for photosynthetic purposes. As used herein, the term “Firmicutes” refer to a phylum of bacteria. This phylum includes the classes Bacilli, Clostridia, and Thermolithobacteria.
  • Nif Genes
  • Typically, the genes necessary for nitrogen fixation occur together in a gene cluster, including the nitrogenase subunits, the biosynthesis of metalloclusters cluster and, electron transport, and regulator proteins. Nif genes are genes that encode the enzyme involved in nitrogen fixation. In most cases nif genes occur as an operon. Some of these genes encode the subunits for the nitrogenase complex, which is the primary enzyme imparting the ability to convert atmospheric nitrogen (N2) to forms of nitrogen accessible to living organisms. In most genes, the regulation of the nif gene transcription is conducted by NifA protein, which is responsive to nitrogen levels. When there are nitrogen deficits, NtrC activates NifA expression, which in turn leads to the activation of the remaining nif genes. When nitrogen levels are adequate or in excess, NifL protein, encoded by NifL, inhibits NifA activity.
  • Nif gene pathways are generally sensitive to small changes in expression. Important genes include nifHDK, which form the subunits for nitrogenase. The chaperone NifY is required to achieve full activity and broadens the tolerance to changes in expression level. NifJ and nif regulate electron transport. The nifUSVWZM operon encodes proteins for early Fe—S cluster formation (NifUS) and proteins for component maturation (NifVWZ for Component I and NifM for Component II), whereas nifBQ encodes proteins for FeMo-co core synthesis (NifB) and molybdenum integration (NifQ). NifEN is tolerant to varied expression levels.
  • Exemplary sequences for various nif genes are provided in Table 5. Non-limiting examples of nif genes include nifH, nifD, nifK, nifE, nifN, nifU, nifS, nifV, nifW, nifX, nifB, nifQ, nifY, nifT, nifJ, nifF, nifX, nifU, and nifS.
  • Nitrogen Fixation and Regulatory Elements
  • The nitrogen fixation (nif) genes are organized as genomic clusters, ranging from a 10.5 kb single operon in Paenibacillus to 64 kb divided amongst three genomic locations in A. caulinodans. Conserved genes include those encoding the nitrogenase enzyme (nifHDK), FeMoCo biosynthesis, and chaperones. Species that can fix nitrogen under more conditions tend to have larger gene clusters that include environment-specific paralogues, alternative electron transport routes, and oxygen protective mechanisms. Often, the functions of many genes in the larger clusters are unknown.
  • There is evolutionary evidence for the lateral transfer of nif clusters between species (Pascuan, C., Fox, A. R., Soto, G. & Ayub, N. D. Exploring the ancestral mechanisms of regulation of horizontally acquired nitrogenases. Journal of molecular evolution 81, 84-89 (2015); Kechris, K. J., Lin, J. C., Bickel, P. J. & Glazer, A. N. Quantitative exploration of the occurrence of lateral gene transfer by using nitrogen fixation genes as a case study. Proceedings of the National Academy of Sciences 103, 9584-9589 (2006)). However, achieving such a transfer via genetic engineering poses a challenge as many things can go awry, including differences in regulation, missing genes, and the intracellular environment (Frans, J. et al. in Nitrogen Fixation 33-44 (Springer, 1990); Poudel, S. et al. Electron transfer to nitrogenase in different genomic and metabolic backgrounds. Journal of bacteriology 200, e00757-00717 (2018); Thony, B., Anthamatten, D. & Hennecke, H. Dual control of the Bradyrhizobium japonicum symbiotic nitrogen fixation regulatory operon fixR nifA: analysis of cis- and trans-acting elements. Journal of bacteriology 171, 4162-4169 (1989); Han, Y. et al. Interspecies Transfer and Regulation of Pseudomonas stutzeri A1501 Nitrogen Fixation Island in Escherichia coli. Journal of microbiology and biotechnology 25, 1339-1348 (2015)). Nitrogenase is under stringent control because it is oxygen sensitive and energetically expensive: it can make up 20% of the cell mass and each NH3 requires ˜40 ATP. It is also irreversibly deactivated by oxygen. Across species, transcription of nif genes is strongly repressed by fixed nitrogen (ammonia) and oxygen with these signals converging on the NifA regulatory protein that works in concert with the sigma factor RpoN. Diverse, species-specific, and often poorly understood signals control these regulators, including plant-produced chemicals, ATP, reducing power, temperature, and carbon sources. Those bacteria that can fix nitrogen in a wider range of environmental conditions tend to be controlled by more complex regulatory networks.
  • When a nif cluster is transferred from one species to another, it either preserves its regulation by environmental stimuli or has an unregulated constitutive phenotype. Maintaining the native regulation, notably ammonium repression, limits their use in agriculture because such levels are likely to fluctuate according to soil types, irrigation, and fertilization. Nitrogen-fixing diazotrophs have been engineered to reduce ammonia sensitivity by disrupting NifL or mutating NifA and placing the entire cluster under the control of T7 RNA polymerase (RNAP). Constitutive expression of nitrogenase is also undesirable as it imparts a fitness burden on the cells. For example, when the nif cluster from P. stutzeri A1501 was transferred to P. protegens Pf-5, this was reported to result in sufficient ammonia production to support maize and wheat growth, but the bacteria quickly declined after a month when competing with other species in soil. Constitutive activity is detrimental even before the bacteria are introduced to the soil, impacting production, formulation, and long-term storage. Therefore, uncontrolled nitrogenase production could lead to more expensive production, shorter shelf life, and more in-field variability.
  • An important aspect of the nif clusters or nif genes the present disclosure is that they can each be under the control of a regulatory element. In some embodiments, 2 or more genes are under the control of a regulatory element. In some embodiments, all the genes are under the control of a regulatory element. The regulatory elements may also be activation elements or inhibitory elements. An activation element is a nucleic acid sequence that when presented in context with a nucleic acid to be expressed will cause expression of the nucleic acid in the presence of an activation signal. An inhibitory signal is a nucleic acid sequence that when presented in context with a nucleic acid to be expressed will cause expression of the nucleic acid unless an inhibitory signal is present. Each of the activation and inhibitory elements may be a promoter, such as a bacteriophage T7 promoter, sigma 70 promoter, sigma 54 promoter, lac promoter, etc. As used herein, the term “promoter” is intended to refer to those regulatory sequences which are sufficient to enable the transcription of an operably linked DNA molecule. Promoters may be constitutive or inducible. As used herein, the term “constitutive promoter” refers to a promoter that is always on (i.e. causing transcription at a constant level). Examples of constitutive promoters include, without limitation, sigma 70 promoter, bla promoter, lacI. promoter, etc. Non-limiting examples of inducible promoters are shown in Table 1. The PA1lacO1 promoter is another example of an inducible promoter that can be used in the present invention.
  • TABLE 1
    Examples of regulatory elements (e.g. inducible
    promoters, repressors).
    Essential
    regulatory
    Name Chemical inducer and/or repressor gene(s)
    ParaBAD L-arabinose (ON) & glucose (OFF) araC
    (“PBAD”)
    PrhaBAD L-rhamnose (ON) & glucose (OFF) rhaR &rhaS
    Plac lactose or IPTG (ON) & glucose lacI
    (OFF)
    Ptac lactose or IPTG (ON) lacI
    Plux acyl-homoserine lactone (ON) luxR
    Ptet tetracycline or aTc (ON) tetR
    Psal salycilate (ON) nahR
    Ptrp tryptophan (OFF) (NONE)
    Ppho phosphate (OFF) phoB & phoR
  • Inducible promoters allow regulation of gene expression and can be regulated by exogenously supplied compounds, environmental factors such as temperature, or the presence of a specific physiological state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells only. Inducible promoters and inducible systems are available from a variety of commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many other systems have been described and can be readily selected by one of skill in the art. Examples of inducible promoters regulated by exogenously supplied promoters include the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, Proc. Natl. Acad. Sci. USA, 93:3346-3351 (1996)], the tetracycline-repressible system [Gossen et al, Proc. Natl. Acad. Sci. USA, 89:5547-5551 (1992)], the tetracycline-inducible system [Gossen et al, Science, 268:1766-1769 (1995), see also Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)], the RU486-inducible system [Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and the rapamycin-inducible system [Magari et al, J. Clin. Invest., 100:2865-2872 (1997)]. Still other types of inducible promoters which may be useful in this context are those which are regulated by a specific physiological state, e.g., temperature, acute phase, a particular differentiation state of the cell, or in replicating cells only.
  • As used herein, the term “terminator” (as referred to as a transcription terminator) is a section of nucleic acid sequence that marks the end of a gene or operon in genomic DNA during transcription. They stop transcription of a polymerase. Terminators can be classified into several groups. At the first group of termination signals the core enzyme can terminate in vitro at certain sites in the absence of any other factors (as tested in vitro). These sites of termination are called intrinsic terminators or also class I terminators. Intrinsic terminators usually share one common structural feature, the so called hairpin or stem-loop structure. On the one hand the hairpin comprises a stem structure, encoded by a dG-dC rich sequence of dyad symmetrical structure. On the other hand the terminator also exhibits a dA-dT rich region at the 3′-end directly following the stem structure. The uridine rich region at the 3′ end is thought to facilitate transcript release when RNA polymerase pauses at hairpin structures. Two or more terminators can be operatively linked if they are positioned to each other to provide concerted termination of a preceding coding sequence. Particularly preferred, the terminator sequences are downstream of coding sequences, i.e. on the 3′ position of the coding sequence. The terminator can e.g. be at least 1, at least 10, at least 30, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400, at least 500 nucleotides downstream of the coding sequence or directly adjacent. Examples of terminators include, but are not limited to, T7 terminator, rrnBT1, L3S2P21, tonB, rrnA, rrnB, rrnD, RNAI, crp, his, ilv lambda, M13, rpoC, and trp (see for example U.S. Pat. No. 9,745,588, incorporated herein by reference).
  • RpoN
  • As used herein “RpoN” refers to a gene that encodes the sigma factor sigma-54 (σ54, sigma N, or RpoN), a protein in Escherichia coli and other species of bacteria. Sigma factors are initiation factors that promote attachment of RNA polymerase to specific initiation sites and are then released. Bacteria normally only have one functional copy of the alternative sigma factor, σ54 or RpoN, which regulates a complex genetic network that extends into various facets of bacterial physiology, including metabolism, survival in strenuous environments, production of virulence factors, and formation of biofilms. RpoN is one of seven RNA polymerase sigma subunits in E. coli required for promoter-initiated transcription and RpoN plays a major role in the response of E. coli to nitrogen-limiting conditions. Under such conditions, RpoN directs the transcription of at least 14 E. coli operons/regulators in the nitrogen regulatory (Ntr) response. RpoN also plays an important role in stress resistance (e.g. resistance to osmotic stress) and virulence of bacteria. RpoN is structurally and functionally distinct from the other E. coli σ factors. It is able to bind promoter DNA in the absence of core RNA polymerase and it recognizes promoter sequences with conserved GG and GC elements located −24 to −12 nucleotides upstream of the transcription start site. Additionally, Regulatory proteins like NtrB and NtrC can activate σ54 holoenzyme.
  • Without being bound by theory or mechanism, it is believed that RpoN works in concert with NifA to turn on the transcription of nif clusters. An exemplary sequence for RpoN is provided in Table 5.
  • Gene Cluster Nucleic Acids
  • As used herein, a “gene cluster” or “genetic cluster” refers to a set of two or more genes that encode gene products. A target, naturally occurring, or wild type genetic cluster can be used as the original model for refactoring. In some embodiments, the gene products are enzymes. In some embodiments, the gene products of a genetic cluster function in a biosynthetic pathway. In some embodiments, the gene cluster encodes proteins of the nif nitrogen fixation pathway.
  • The genetic clusters can encode proteins of a biosynthetic pathway. A biosynthetic pathway, as used herein, refers to any pathway found in a biological system that involves more than one protein. In some instances, these pathways involve 2-1,000 proteins. In other instances the number of proteins involved in a biosynthetic pathway may be 2-500, 2-100, 5-1000, 5-500, 5-100, 5-10, 10-1,000, 10-900, 10-800, 10-700, 10-600, 10-500, 10-400, 10-300, 10-200, 10-100, 50-1,000, 50-500, 50-100, 100-1,000, or 100-500. Examples of biosynthetic pathways include but are not limited to the nitrogen fixation pathway.
  • In some instances, the refactored genetic clusters have naturally occurring non-coding DNA, naturally occurring regulatory sequences, and/or non-essential genes that have been removed from at least one or in some instances all of the transcriptional units. These can be replaced by synthetic regulatory sequences, not replaced at all or replaced by spacers. A spacer simply refers to a set of nucleotides or analogs thereof that don't have a function such as coding for a protein or in any way regulating the activity of the gene cluster.
  • The genetic components in the genetic cluster typically will include at least one regulatory element. A synthetic regulatory element is any nucleic acid sequence which plays a role in regulating gene expression and which differs from the naturally occurring regulatory element. It may differ for instance by a single nucleotide from the naturally occurring element. In some cases, it is an exogenous regulatory element (i.e. not identical to the naturally occurring version). Thus, a “regulatory element” refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation or rate, or stability and/or mobility of a transcription or translation product. Regulatory regions include, without limitation, promoter sequences, ribosome binding sites, ribozymes, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5′ and 3′ untranslated regions (UTRs), transcriptional start sites, transcription terminator sequences, polyadenylation sequences, introns, and combinations thereof.
  • The genetic clusters can be expressed in vivo in an organism or in vitro in a cell. The organism or cell can be any organism or cell in which a DNA can be introduced. For example, organisms and cells can include prokaryotes and eukaryotes (i.e. yeast, plants). Prokaryotes include but are not limited to Cyanobacteria, Bacillus subtilis, E. coli, Clostridium, and Rhodococcus. Eukaryotes include, for instance, algae (Nannochloropsis), yeast such as, S. cerevisiae and P. pastoris, plant cells, mammalian cells. Thus, some aspects of this disclosure relate to engineering of a cell to express proteins from the modified genetic clusters.
  • In some embodiments of the present disclosure provides a genetic cluster includes a nucleotide sequence that is at least about 85% or more homologous or identical to the entire length of a naturally occurring genetic cluster sequence, e.g., at least 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 50% or more of the full length naturally occurring genetic cluster sequence). In some embodiments, the nucleotide sequence is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homologous or identical to a naturally occurring genetic cluster sequence. In some embodiments, the nucleotide sequence is at least about 85%, e.g., is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% homologous or identical to a genetic cluster sequence, in a fragment thereof or a region that is much more conserved, such as an essential, but has lower sequence identity outside that region. The disclosure also provides a nucleotide sequence that is at least about 85%, e.g., is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any nucleotide sequence as described herein or an amino acid sequence that is at least about 85%, e.g., is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to any amino acid sequence as described herein.
  • Calculations of homology or sequence identity between sequences (the terms are used interchangeably herein) are performed as follows. To determine the percent identity of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). The length of a reference sequence aligned for comparison purposes is at least 80% of the length of the reference sequence, and in some embodiments is at least 90% or 100%. The nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein nucleic acid “identity” is equivalent to nucleic acid “homology”). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
  • In some embodiments the gene clusters are native gene clusters. In some embodiments, the gene clusters are refactored gene clusters. In some instances, the nucleic acids may include non-naturally occurring nucleotides and/or substitutions, i.e. Sugar or base substitutions or modifications.
  • One or more substituted sugar moieties include, e.g., one of the following at the 2′ position: OH, SH, SCH3, F, OCN, OCH3OCH3, OCH3O(CH2)n CH3, O(CH2)n NH2 or O(CH2)n CH3 where n is from 1 to about 10; Ci to C10 lower alkyl, alkoxyalkoxy, substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF3; OCF3; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; SOCH3; SO2 CH3; ONO2; NO2; N3; NH2; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties of a nucleic acid; or a group for improving the pharmacodynamic properties of a nucleic acid and other substituents having similar properties. Similar modifications may also be made at other positions on the nucleic acid, particularly the 3′ position of the sugar on the 3′ terminal nucleotide and the 5′ position of 5′ terminal nucleotide. Nucleic acids may also have sugar mimetics such as cyclobutyls in place of the pentofuranosyl group.
  • Nucleic acids can also include, additionally or alternatively, nucleobase (often referred to in the art simply as “base”) modifications or substitutions. As used herein, “unmodified” or “natural” nucleobases include adenine (A), guanine (G), thymine (T), cytosine (C) and uracil (U). Modified nucleobases include nucleobases found only infrequently or transiently in natural nucleic acids, e.g., hypoxanthine, 6-methyladenine, 5-Me pyrimidines, particularly 5-methylcytosine (also referred to as 5-methyl-2′ deoxycytosine and often referred to in the art as 5-Me-C), 5-hydroxymethylcytosine (HMC), glycosyl HMC and gentobiosyl HMC, isocytosine, pseudoisocytosine, as well as synthetic nucleobases, e.g., 2-aminoadenine, 2-(methylamino)adenine, 2-(imidazolylalkyl)adenine, 2-(aminoalklyamino)adenine or other heterosubstituted alkyladenines, 2-thiouracil, 2-thiothymine, 5-bromouracil, 5-hydroxymethyluracil, 5-propynyluracil, 8-azaguanine, 7-deazaguanine, N6 (6-aminohexyl)adenine, 6-aminopurine, 2-aminopurine, 2-chloro-6-aminopurine and 2,6-diaminopurine or other diaminopurines. See, e.g., Kornberg, “DNA Replication,” W. H. Freeman & Co., San Francisco, 1980, pp 75-′ 7′ 7; and Gebeyehu, G., et al. Nucl. Acids Res., 15:4513 (1987)). A “universal” base known in the art, e.g., inosine, can also be included.
  • Methods to deliver expression vectors or expression constructs into cells, for example, into bacteria, yeast, or plant cells, are well known to those of skill in the art. Nucleic acids, including expression vectors, can be delivered to prokaryotic and eukaryotic cells by various methods well known to those of skill in the relevant biological arts. Methods for the delivery of nucleic acids to a cell, include, but are not limited to, different chemical, electrochemical and biological approaches, for example, heat shock transformation, electroporation, transfection, for example liposome-mediated transfection, DEAE-Dextran-mediated transfection or calcium phosphate transfection. In some embodiments, a nucleic acid construct, for example an expression construct comprising a fusion protein nucleic acid sequence, is introduced into the host cell using a vehicle, or vector, for transferring genetic material. Vectors for transferring genetic material to cells are well known to those of skill in the art and include, for example, plasmids, artificial chromosomes, and viral vectors. Methods for the construction of nucleic acid constructs, including expression constructs comprising constitutive or inducible heterologous promoters, knockout and knockdown constructs, as well as methods and vectors for the delivery of a nucleic acid or nucleic acid construct to a cell are well known to those of skill in the art, and are described, for example, in J. Sambrook and D. Russell, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press; 3rd edition (Jan. 15, 2001); David C. Amberg, Daniel J. Burke; and Jeffrey N. Strathern, Methods in Yeast Genetics: A Cold Spring Harbor Laboratory Course Manual, Cold Spring Harbor Laboratory Press (April 2005); John N. Abelson, Melvin I. Simon, Christine Guthrie, and Gerald R. Fink, Guide to Yeast Genetics and Molecular Biology, Part A, Volume 194 (Methods in Enzymology Series, 194), Academic Press (Mar. 11, 2004); Christine Guthrie and Gerald R. Fink, Guide to Yeast Genetics and Molecular and Cell Biology, Part B, Volume 350 (Methods in Enzymology, Vol 350), Academic Press; 1st edition (Jul. 2, 2002); Christine Guthrie and Gerald R. Fink, Guide to Yeast Genetics and Molecular and Cell Biology, Part C, Volume 351, Academic Press; 1st edition (Jul. 9, 2002); Gregory N. Stephanopoulos, Aristos A. Aristidou and Jens Nielsen, Metabolic Engineering: Principles and Methodologies, Academic Press; 1 edition (Oct. 16, 1998); and Christina Smolke, The Metabolic Pathway Engineering Handbook: Fundamentals, CRC Press; 1 edition (Jul. 28, 2009), all of which are incorporated by reference herein.
  • Phylogenetic Analysis
  • The present disclosure also provides methods of selecting a nif cluster of a donor bacterium that is compatible with a host bacterium. The methods involve performing a phylogenetic analysis for the donor bacterium and the host bacterium.
  • A phylogenetic analysis is a method of estimating the evolutionary relationships. In molecular phylogenetic analysis, the sequence of a common gene or protein can be used to assess the evolutionary relationship of species. In some embodiments, phylogenetic analysis is performed based on the rRNA (e.g., the full-length 16S rRNA gene) sequences. These sequence include e.g., K. oxytoca, BWI76_05380; A. vinelandii, Avin_55000; R. sphaeroides, DQL45_00005; Cyanothece ATCC51142, cce_RNA045; A. brasilense, AMK58_25190; R. palustris, RNA_55; P. protegens, PST_0759; Paenibacillus sp. WLY78, JQ003557. In some embodiments, a multiple sequence alignment can be generated using MUSCLE (Edgar, R. C. J. N. a. r. MUSCLE: multiple sequence alignment with high accuracy and high throughput. 32, 1792-1797 (2004)). A phylogenetic tree is then constructed using the Jukes-Cantor distance model and UPGMA as a tree build method.
  • As shown in FIGS. 30A and 30B, the phylogenetic closeness has a predictive power for nitrogenase activity of transferring a nif cluster in a new host. In some embodiments, the host bacterium and the donor bacterium are in the same genus, family, order, or class. In some embodiments, the donor bacterium is selected from Klebsiella, Pseudomonas, Azotobacter, Gluconacetobacter, Azospirillum, Azorhizobium, Rhodopseudomonas, Rhodobacter, Cyanothece, or Paenibacillus genus.
  • In some embodiments, the evolutionary distance based on the phylogenetic analysis between the donor bacterium and the host bacterium is less than a reference value. For example, the reference value is 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.1% substitutions per site in 16S ribosomal RNA gene sequence. In some embodiments, the reference value is 500, 400, 300, 200, 100, 50, or 10 million years on the phylogenetic tree.
  • The methods can also involve transferring the nif cluster to the host bacterium and determining the nitrogenase activity.
  • Genetically Modified Plants
  • In some embodiments, this disclosure features genetically-modified plant cells and plants (e.g., genetically-modified cereal plants or cells) comprising at least one recombinant nucleic acid construct described herein. In some embodiments, a nucleic acid construct can encode, for example, a chemical signal synthesis peptide, operably linked in sense orientation to one or more regulatory regions. In some embodiments, the chemical signal synthesis peptide is an opine biosynthetic polypeptide (e.g., from A. tumefaciens) such as octopine synthase or nopaline synthase. In some embodiments, the chemical signal synthesis peptide can produce a chemical signal, which the genetically engineered bacterium can respond. It will be appreciated that because of the degeneracy of the genetic code, a number of nucleic acids can encode a particular opine biosynthetic polypeptide; i.e., for many amino acids, there is more than one nucleotide triplet that serves as the codon for the amino acid. Thus, codons in the coding sequence for a given opine biosynthetic polypeptide can be modified such that expression in a particular plant species is obtained, using appropriate codon bias tables for that species.
  • In some cases, the regulatory region is a constitutive promoter. In some cases, the regulatory region is an inducible promoter. In some cases, the regulatory region is a root-active promoter that can confer transcription in root tissue, e.g., root endodermis, root epidermis, or root vascular tissues. In some embodiments, root-active promoters can include the root-specific subdomains of the CaMV 35S promoter (Lam et al., Proc. Natl. Acad. Sci. USA, 86:7890-7894 (1989)), root cell specific promoters of Conkling et al., Plant Physiol., 93:1203-1211 (1990), or the tobacco RD2 promoter.
  • As described herein, a cereal plant or plant cell can be transformed by having a nucleic acid construct integrated into its genome, i.e., can be stably transformed. Stably transformed cells typically retain the introduced nucleic acid with each cell division. A plant or plant cell can also be transiently transformed such that the construct is not integrated into its genome. Transiently transformed cells typically lose all or some portion of the introduced nucleic acid construct with each cell division such that the introduced nucleic acid cannot be detected in daughter cells after a sufficient number of cell divisions. Both transiently transformed and stably transformed transgenic plants and plant cells can be useful in the methods described herein.
  • Genetically modified plant cells used in methods described herein can constitute part or all of a whole plant. Such plants can be grown in a manner suitable for the species under consideration, either in a growth chamber, a greenhouse, or in a field. As used herein, a genetically modified plant also refers to progeny of an initial engineered plant provided the progeny inherits the construct. Seeds produced by a modified plant can be grown and then selfed (or outcrossed and selfed) to obtain seeds homozygous for the nucleic acid construct.
  • Modified plants can be grown in suspension culture, or tissue or organ culture. When using solid medium, modified plant cells can be placed directly onto the medium or can be placed onto a filter that is then placed in contact with the medium. When using liquid medium, modified plant cells can be placed onto a flotation device, e.g., a porous membrane that contacts the liquid medium. A solid medium can be, for example, Murashige and Skoog (MS) medium containing agar and a suitable concentration of an auxin, e.g., 2,4-dichlorophenoxyacetic acid (2,4-D), and a suitable concentration of a cytokinin, e.g., kinetin.
  • When transiently transformed plant cells are used, a reporter sequence encoding a reporter polypeptide having a reporter activity can be included in the transformation procedure and an assay for reporter activity or expression can be performed at a suitable time after transformation. A suitable time for conducting the assay typically is about 1-21 days after transformation, e.g., about 1-14 days, about 1-7 days, or about 1-3 days. The use of transient assays is particularly convenient for rapid analysis in different species, or to confirm expression of a polypeptide whose expression has not previously been confirmed in particular recipient cells.
  • Techniques for introducing nucleic acids into plants are known and include, without limitation, Agrobacterium-mediated transformation, viral vector-mediated transformation, electroporation and particle gun transformation, e.g., U.S. Pat. Nos. 5,538,880; 5,204,253; 6,329,571 and 6,013,863. If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.
  • A population of modified plants can be screened and/or selected for those members of the population that produce as described chemical signal (e.g., opine) at a desired location (e.g., in the roots) as conferred by expression of the transgene. For example, a population of progeny of a single transformation event can be screened for those plants having a desired level of expression of a polypeptide or nucleic acid.
  • Control of Nitrogenase Activity
  • In some embodiments, the disclosure provides a genetically engineered bacterium that contains a regulatory sequence or a genetic sensor that regulates the nitrogenase activity in response to a chemical signal (e.g., an environmental signal or artificial signal). In some embodiments, the chemical signal can be an environmental signal such as ammonia, IPTG, or oxygen. In some embodiments, the nif cluster is placed under the control of a genetic sensor that can respond to the chemical signal. In some embodiments, the genetic sensor can respond to biocontrol agents or components of added fertilizer and other treatments (e.g., DAPG). In some embodiments, the genetic sensor can respond to root exudates from a plant, including e.g., sugar such as arabinose, hormones such as salicylic acids, flavonoids such as naringenin, antimicrobials such as vanillic acid, and various chemicals that can remodel the microbial community (e.g., cuminic acid). In some embodiments, the genetic sensor can respond to chemicals released by other bacteria including e.g., 3,4-dihydroxybenzoic acid (DHBA), 3OC6HSL or 3OC14HSL.
  • In some embodiments, sensors for chemicals are used to construct controllers. In some embodiments, a “Marionette” strain of E. coli, which includes sensors for e.g., vanillic acid, DHBA, cuminic acid, 3OC6HSL and 3OC14HSL in the genome, is used to host the nif cluster. In some embodiments, the output promoter of a sensor is used to express T7 RNA polymerase. In some embodiments, the arabinose and naringenin sensors are used to express NifA, which leads to the induction of the nifH promoter and nitrogenase activity. In some embodiments, the arabinose and naringenin sensors are used to express NifA, which leads to the induction of the nifH promoter and nitrogenase activity in P. protegens Pf-5. In some embodiments, the DAPG sensor is used to drive T7 RNAP, which then induces nitrogenase activity. In some embodiments, the DAPG sensor is used to drive T7 RNAP, which then induces nitrogenase activity in R. sp. IRBG74. In some embodiments, the salicylic acid sensor is used to control NifAL94Q/D95Q/RpoN expression, which then activates nitrogenase activity. In some embodiments, the salicylic acid sensor is used to control NifAL94Q/D95Q/RpoN expression, which then activates nitrogenase activity in A. caulinodans.
  • In some embodiments, a plant is engineered to release an orthogonal chemical signal that can be sensed by a corresponding engineered bacterium. This would have the benefit of only inducing nitrogenase in the presence of the engineered crop. In some embodiments, legumes and Arabidopsis are engineered to produce opines, including nopaline and octopine. In some embodiments, an engineered bacterium contains sensors for nopaline and octopine. In some embodiments, an engineered bacterium contains the LysR-type transcriptional activators OccR (octopine) and NocR (nopaline) and their corresponding promoters. In some embodiments, sensors for nopaline and octopine are used to control the expression of NifAL94Q/D95Q/RpoN, which then activates nitrogenase activity.
  • The present disclosure also provides methods of selecting a nif cluster or a regulatory element for the nif cluster. The methods involve calculation of genetic-part strengths based on sequencing data. In some embodiments, RNA-seq and Ribosome-footprint profiling is carried out according to the method described herein. In some embodiments, to generate the RNA-seq read profile for each nif cluster, the raw trace profiles can be multiplied by e.g., at least or about 105, 106, 107, 108, or 109 and normalized by respective total reads from coding sequences of each species. In some embodiments, the mRNA expression level of each gene is estimated using total sequencing reads mapped onto the gene, representing fragments per kilobase of transcript per million fragments mapped units (FPKM).
  • In some embodiments, the activity of a promoter is defined as the change in RNAP flux δJ around a transcription start site xtss. In some embodiments, the promoter strength or the regulatory element strength is calculated using the below equation:
  • δ J = γ n [ i = x tss + 1 x tss + 1 + n m ( i ) - i = x tss - 1 x 0 - 1 - n m ( i ) ]
  • where m(i) is the number of transcripts at each position i from the FPKM-normalized transcriptomic profiles, γ=0.0067 s−1 is the degradation rate of mRNA and n is the window length before and after xtss. In some embodiments, the window length is set to ten. In some embodiments, the Ts is defined as the fold-decrease in transcription before and after a terminator, which can be quantified from the FPKM-normalized transcriptomic profiles as:
  • T s = i = x 1 + 1 x 1 + n m ( i ) i = x 0 - 1 x 0 - n m ( i )
  • where x0 and x1 are the beginning and end positions of the terminator part, respectively. The translation efficiency was calculated by dividing the ribosome density by the FPKM.
  • Definition
  • As used herein, the equivalent terms “expression” or “gene expression” are intended to refer to the transcription of a DNA molecule into RNA, and the translation of such RNA into a polypeptide.
  • As used herein, a “gene cluster” refers to a set of two or more genes that encode gene products. As used herein, a “nif gene cluster” refers to a set of two or more genes that encode nitrogen fixation genes.
  • “Exogenous” with respect to genes indicates that the nucleic acid or gene is not in its natural (native) environment. For example, an exogenous gene can refer to a gene that is from a different species. In contrast, “endogenous” with respect to genes indicates that the gene is in its native environment. As used herein, the terms “endogenous” and “native” are used interchangeably.
  • As used herein, the term “delete” or “deleted” refers to the removal of a gene (e.g. endogenous gene) from a sequence or cluster. As used herein, the term “alter” or “altered” refers to the modification of one or more nucleotides in a gene or the deletion of one or more base pairs in a gene. This alteration may render the gene dysfunctional. Herein, “ΔnifA” refers to a strain or cluster within which NifA was deleted or altered. Method of deletion and alteration, in the context of genes, are known in the art.
  • As used herein, the term “chemical signals” refers to chemical compounds. Any substance consisting of two or more different types of atoms (chemical elements) in a fixed stoichiometric proportion can be termed a chemical compound. Chemical signals can be synthetic or natural chemical compounds. In some embodiments of the present invention, a bacterium of the present disclosure or a sensor of the present disclosure is under the control of a chemical signal. In some embodiments, the signal is a native biological signal (e.g. root exudate, biological control agent, etc.). In some embodiments, the chemical signal is a quorum sensing signal from the bacterium. Non-limiting examples of chemical signals include root exudates (as defined below), biocontrol agents (as defined below), phytohormones, vanillate, IPTG, aTc, cuminic acid, DAPG, and salicylic acid, 3,4-dihydroxybenzoic acid, 3OC6HSL and 3OC14HSL.
  • As used herein, the term “root exudate” refers to chemicals secreted or emitted by plant roots in response to their environment. These allow plant to manipulate or alter their immediate environment, specifically their rhizosphere. Root exudates are a complex mixture of soluble organic substances, which may contain sugars, amino acids, organic acids, enzymes, and other substances. Root exudates include, but are not limited to, ions, carbon-based compounds, amino acids, sterols, sugars, hormones (phytohormones), flavonoids, antimicrobials, and many other chemical compounds. The exudates can serve as either positive regulators or negative regulators.
  • As used herein, the term “phytohormone” refers plant hormones and they are any of various hormones produced by plants that influence process such as germination, growth, and metabolism in the plant.
  • As used herein, the term “vanillate” refers to a methoxybenzoate that is the conjugate base of vanillic acid. It is a plant metabolite.
  • Biological control or biocontrol is a method of controlling pests such as insects, mites, weeds and plant diseases using other organisms. Natural enemies of insect pests, also known as biological control agents, include predators, parasitoids, pathogens, and competitors. Biological control agents of plant diseases are most often referred to as antagonists. Biological control agents of weeds include seed predators, herbivores and plant pathogens. The inducible clusters or promoters of the present invention may be modulated by a secretion of (or chemical otherwise associated with) a biological control agent. Herein, that is referred to as a “biocontrol agent”.
  • Without further elaboration, it is believed that one skilled in the art can, based on the above description, utilize the present invention to its fullest extent. The following specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever. All publications cited herein are incorporated by reference for the purposes or subject matter referenced herein.
  • EXAMPLES
  • Herein, inducible nitrogenase activity is engineered in two cereal endophytes (Azorhizobium caulinodans ORS571 and Rhizobium sp. IRBG74) and the epiphyte Pseudomonas protegens Pf-5, a maize seed inoculant. For each organism, different strategies are taken to eliminate ammonium repression and place nitrogenase expression under the control of agriculturally-relevant signals, including root exudates, biocontrol agents, and phytohormones. The present disclosure demonstrates that Rhizobium sp. (e.g., IRBG74) can be engineered to fix nitrogen under free living conditions, inter alia, by transferring either a nif cluster from Rhodobacter or Klebsiella. For P. protegens Pf-5, the transfer of an inducible cluster from Azotobacter vinelandii yields the highest ammonia and oxygen tolerance. Collectively, data from the transfer of 12 nif gene clusters between diverse species (including E. coli and 12 additional Rhizobia) help identify the barriers that must be overcome to engineer a bacterium to deliver a high nitrogen flux to a cereal crop and provide a solution such that Rhizobium can be engineered to fix nitrogen under free living conditions.
  • Materials and Methods
  • Bacterial Strains and Growth Media.
  • All bacterial strains and their derivatives used in this study are listed in Table 2. E. coli DH10-beta (New England Biolabs, MA, Cat #C3019) was used for cloning. E. coli K-12 MG1655 was used for the nitrogenase assay. P. protegens Pf-5 was obtained from the ATCC (BAA-477). Strains used in this study are listed in Table 3. For rich media, LB medium (10 g/L tryptone, 5 g/L yeast extract, 10 g/L NaCl), LB-Lennox medium (10 g/L tryptone, 5 g/L yeast extract, 5 g/L NaCl), and TY medium (5 g/L tryptone, 3 g/L yeast extract, 0.87 g/L CaCl2.2H2O) were used. For minimal media, BB medium (0.25 g/L MgSO4.7H2O, 1 g/L NaCl, 0.1 g/L CaCl2.2H2O, 2.9 mg/L FeCl3, 0.25 mg/L Na2MoO4.2H2O, 1.32 g/L NH4CH3CO2, 25 g/L Na2HPO4, 3 g/L KH2PO4 pH [7.4]), UMS medium (0.5 g/L MgSO4.7H2O, 0.2 g/L NaCl, 0.375 mg/L EDTA-Na2, 0.16 ZnSO4.7H2O, 0.2 mg/L Na2MoO4.2H2O, 0.25 mg/L H3BO3, 0.2 mg/L MnSO4.H2O, 0.02 mg/L CuSO4.5H2O, 1 mg/L CoCl2.6H2O, 75 mg/L CaCl2.2H2O, 12 mg/L FeSO4.7H2O, 1 mg/L thiamine hydrochloride 2 mg/L D-pantothenic acid hemicalcium salt, 0.1 mg/L biotin, 87.4 mg/L K2HPO, 4.19 g/L MOPS pH [7.0]), and Burk medium (0.2 g/L MgSO4.7H2O, 73 mg/L CaCl2.2H2O, 5.4 mg/L FeCl3.6H2O, 4.2 mg/L Na2MoO4.2H2O, 0.2 g/L KH2PO4, 0.8 g/L K2HPO4 pH [7.4]) were used. Antibiotics were used at the following concentrations (μg/mL): E. coli (kanamycin, 50; spectinomycin, 100; tetracycline, 15; gentamicin, 15). P. protegens Pf-5 (kanamycin, 30; tetracycline, 50; gentamicin, 15; carbenicillin, 50). R. sp. IRBG74 (neomycin, 150; gentamicin, 150; tetracycline, 10; nitrofurantoin, 10). A. caulinodans (kanamycin, 30; gentamicin, 15; tetracycline, 10; nitrofurantoin, 10). Chemicals including inducers used in this study are listed in Table 7.
  • Strain Construction.
  • In order to increase transformation efficiency in R. sp. IRBG74, a type-I restriction modification system was inactivated by deleting hsdR, which encodes a restriction enzyme for foreign DNA (this strain was the basis for all experiments) (Ferri, L., Gori, A., Biondi, E. G., Mengoni, A. & Bazzicalupo, M. J. P. Plasmid electroporation of Sinorhizobium strains: The role of the restriction gene hsdR in type strain Rm1021. 63, 128-135 (2010)). A sacB markerless insertion method was utilized to allow replacements of a native locus with synthetic parts by homologous recombination. Two homology arms of −500 bp flanking the hsdR gene were amplified by PCR, cloned and yielded a suicide plasmid pMR-44. The suicide plasmid was mobilized into R. sp. IRBG74 by triparental mating. Single-crossover recombinants were selected for resistance to gentamicin and subsequently grown and plated on LB plates supplemented with 15% sucrose to induce deletion of the vector DNA part containing the counter selective marker sacB which converts sucrose into a toxic product (levan). Two native nif gene clusters encompassing nifHDKENX (genomic location 219.579-227, 127) and nifSW-fixABCX-nifAB-fdxN-nifTZ (genomic location 234, 635-234, 802) of R. sp. IRBG74 were sequentially deleted using pMR45-46. To increase genetic stability recA gene was deleted using the plasmid pMR47. The R. sp. IRBG74 Δnif, hsdR, recA strain was the basis for all experiments unless indicated otherwise. Two homology arms of ˜900 bp flanking the nifA gene were amplified by PCR, cloned and yielded a suicide plasmid pMR-47 to generate nifA deletion in A. caulinodans ORS571, The suicide plasmid pMR47 in E. coli was mobilized into A. caulinodans by triparental mating. Single-crossover recombinants were selected for resistance to gentamicin and subsequently grown and plated on plain TY plates supplemented with 15% sucrose to induce deletion of the vector DNA part. All markerless deletions were confirmed by gentamicin sensitivity and diagnostic PCR. A list of the mutant strains is provided in Table 3.
  • Plasmid System.
  • Plasmids with the pBBR1 origin were derived from pMQ131 and pMQ132. Plasmids with the pRO1600 origin were derived from pMQ80. Plasmids with the RK2 origin were derived from pJP2. Plasmids with the RSF1010 origin were derived from pSEVA651. Plasmids with the IncW origin were derived from pKT249. Plasmids used in this study are provided in Table 4.
  • Phylogenetic Analysis of Nif Clusters.
  • Phylogenetic analysis was performed based on the full-length 16S rRNA gene sequences (K. oxytoca, BWI76_05380; A. vinelandii, Avin_55000; R. sphaeroides, DQL45_00005; Cyanothece ATCC51142, cce_RNA045; A. brasilense, AMK58_25190; R. palustris, RNA_55; P. protegens, PST_0759; Paenibacillus sp. WLY78, JQ003557). A multiple sequence alignment was generated using MUSCLE (Edgar, R. C. J. N. a. r. MUSCLE: multiple sequence alignment with high accuracy and high throughput. 32, 1792-1797 (2004)). A phylogenetic tree was constructed using the Geneious software (R9.0.5) with the Jukes-Cantor distance model and UPGMA as a tree build method, with bootstrap values from 1,000 replicates.
  • Nif Cluster Construction.
  • To obtain large nif clusters on mobilizable plasmids that carry origin of transfer (oriT) for conjugative transfer of the plasmids, the genomic DNAs from K. oxytoca, P. stutzeri, A. vinelandii, A. caulinodans and R. sphaeroides were purified using Wizard genomic DNA purification kit, following the isolation protocol for gram negative bacteria (Promega, Cat #A1120). The genomic DNAs of Cyanothece ATCC51142, A. brasilense ATCC29729, R. palustris ATCC BAA-98, and G. diazotrophicus ATCC49037 were obtained from ATCC. Each nif cluster was amplified into several fragments (4-10 kb) with upstream and downstream 45 bp linkers at the 5′ and 3′ most end of the cluster by PCR with primer sets (Table 2) and assembled onto linearized E. coli-yeast shuttle vectors pMR-1 for E. coli and Rhizobia, and pMR-2 for P. protegens Pf-5 using yeast recombineering. For the nif cluster of Paenibacillus sp. WLY78, the DNA sequence information were gleaned from contig ALJV01 and the DNA of the nif cluster was synthesized by GeneArt gene synthesis (Thermo Fisher Scientific, MA) into four fragments that were used as templates for PCR amplification and assembly. Amplified fragments from two to eight (Table 2) were assembled with a linearized vector into a single large plasmid by one-pot yeast assembly procedure(Shanks, R. M. et al. Saccharomyces cerevisiae-based molecular tool kit for manipulation of genes from gram-negative bacteria. 72, 5027-5036 (2006)). Once assembled, the nif cluster-plasmids were isolated from yeast using Zymoprep Yeast Miniprep kit (Zymo Research Cat #D2004) and transformed into E. coli. The purified plasmid was isolated from E. coli and sequenced to verify the correct assembly and sequence (MGH CCIB DNA Core facility, Cambridge, Mass.). E. coli containing a mutation-free plasmid were stored for further experiments. Plasmids containing nif clusters are provided in Table 4.
  • Construction of Refactored Nif v3.2.
  • The six transcriptional units (nifHDKTY, nifENX, nifJ, nifBQ, nifF, nifUSVWZM) were amplified from the plasmid pMR-3 that harbors the native Klebsiella nif cluster. Each unit was divided onto six level-1 module plasmids where the nif genes are preceded by a terminator. T7 promoter wild-type or T7 promoter variant PT7.P2 was placed between a terminator and the first gene of the transcriptional unit. Assembly linkers (˜45 bp) were placed at both ends of the units. The level-1 plasmids (pMR32-37) were provided in Table 4 and 5. Each of the six plasmids was linearized by digestion with restriction enzymes and assembled with a linearized pMR-1 or pMR-2 vector into a single large plasmid by one-pot yeast assembly procedure, yielding pMR38 and pMR39.
  • Transformation.
  • Electroporation was used to transfer plasmids into P. protegens Pf-5. A single colony was inoculated in 4 mL of LB and grown for 16 h at 30° C. with shaking at 250 rpm. The cell pellets were washed twice with 2 mL of 300 mM sucrose and dissolved in 100 μl of 300 mM sucrose at RT. A total of 50-100 ng DNA was electroporated and recovered in 1 mL of LB media for 1 h before plating on selective LB plates. Triparental mating was used to transfer DNA from E. coli to Rhizobia. An aliquot of 40 μl of late-log phase (OD600˜0.6) donor cells and 40 μl of late-log phage helper cells containing pRK7013 were mixed with 200 μl of late-log phase (OD600˜0.8) recipient Rhizobia cells and washed in 200 μl of TY medium. Mating was initiated by spotting 20 μl of the mixed cells on TY plates and incubated at 30° C. for 6 h. The mating mixtures were plated on TY medium supplemented with nitrofurantoin to isolate Rhizobia transconjugants.
  • Construction and Characterization Genetic Parts for Rhizobia.
  • Genetic part libraries were built on a pBBR1-ori plasmid pMR-1 using Gibson assembly (New England Biolabs, Cat #E2611). The fluorescence proteins, GFPmut3b and mRFP1, were used as reporters. The Anderson promoter library (Anderson, J. et al. BglBricks: A flexible standard for biological part assembly. 4, 1 (2010)) on the BioBricks Registry were utilized for the characterization of constitutive promoters (FIGS. 11A-11C). To characterize inducible promoters, a regulator protein is constitutively expressed by the PlacIq promoter, and GFP expression is driven by a cognate inducible promoter from the opposite direction, facilitating replacement of the reporter with gene of interest (e.g., T7 RNAP and nifA) and transfer of the controller unit across different plasmid backbones for diverse microbes. The following combinations of cognate regulators and inducible promoters were characterized. IPTG inducible LacI-A1lacO1, DAPG inducible Ph1F-PPh1, aTc inducible TetR-PTet, 3OC6HSL inducible LuxR-PLux, salicylic acid inducible NahR-PSal, and cuminic acid inducible CymR-PCym systems were optimized for R. sp. IRBG74 (FIG. 14). Opine inducible OccR-Pocc, and nopaline inducible NocR-Pnoc systems were optimized for A. caulinodans (FIGS. 20A-20F and Tables 4 and 5). For RBS characterization, an IPTG-inducible GFP expression plasmid pMR-40 was used and GFP was expressed to the highest levels with 1 mM IPTG (FIGS. 12A-12B). RBS library for GFP was designed using the RBS library calculator at the highest-resolution mode, and the 3′ end of the 16S rRNA sequences were adjusted according to the species (3′-ACCTCCTTC-5′ for R. sp. IRBG74). Terminators for T7 RNAP were characterized by placing a terminator between two fluorescence reporters expressed from a single T7 wild-type promoter located upstream of the first fluorescence protein GFP. The expression of the two fluorescence proteins is enabled by the controller strain MR18 encoding the IPTG-inducible T7 RNAP system by 1 mM IPTG (FIGS. 13A-13B). The terminator strength (Ts) was determined by normalizing fluorescence levels of a terminator construct by a reference construct pMR-66 where a 40 bp spacer was placed between the reporters. All genetic parts for Rhizobia were characterized as follows. Single colonies were inoculated into 0.5 ml TY supplemented with antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator (INFORS HT, MD). 1.5 μl of overnight cultures was diluted into 200 μl of TY with antibiotics and appropriate inducers in 96-well plates (Thermo Scientific, Cat #12565215) and incubated for 7 h at 30° C., 1,000 rpm in an ELMI DTS-4 shaker (ELMI, CA). After growth, 8 μl of culture sample was diluted into 150 μl PBS with 2 mg/mL kanamycin for flow cytometry analysis. Plasmids and genetic parts are listed in Table 4 and 5.
  • Construction and Characterization Genetic Parts for P. protegens.
  • Genetic part libraries were built on a pRO1600-ori plasmid pMR-2 using Gibson assembly (New England Biolabs, Cat #E2611). The fluorescence proteins, GFPmut3b and mRFP1 were used as reporters. The Anderson promoter library on the BioBricks Registry were utilized for the characterization of constitutive promoters (FIGS. 11A-11C). The following combinations of cognate regulators and inducible promoters were characterized. IPTG inducible LacI-Ptac, DAPG inducible Ph1F-PPhl, aTc inducible TetR-PTet, 3OC6HSL inducible LuxR-PLux, arabinose inducible AraC-PBAD, cuminic acid inducible CymR-PCym, and naringenin inducible FdeR-PFde were optimized (FIGS. 15A-15C). For RBS characterization, an arabinose-inducible GFP expression plasmid pMR-65 was used and GFP was expressed with 1 mM IPTG (FIGS. 12A-12B). RBS library for GFP was designed using the RBS library calculator at the highest-resolution mode, and the 3′ end of the 16S rRNA sequences were adjusted according to the species (3′-ACCTCCTTA-5′ for P. protegens Pf-5). Terminators for T7 RNAP were characterized by placing a terminator between two fluorescence reporters expressed from a single T7 wild-type promoter located upstream of the first fluorescence protein GFP. The expression of the two fluorescence proteins is enabled by an IPTG-inducible T7 RNAP expression system of the controller strain MR7 (FIGS. 13A-13B). All genetic parts for P. protegens Pf-5 were characterized as follows. Single colonies were inoculated into 1 ml LB supplemented with antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator (INFORS HT, MD). 0.5 μl of overnight cultures was diluted into 200 μl of LB with antibiotics and appropriate inducers in 96-well plates (Thermo Scientific, Cat #12565215) and incubated for 7 h at 30° C., 1,000 rpm in an ELMI DTS-4 shaker (ELMI, CA). After growth, 10 μl of culture sample was diluted into 150 μl PBS with 2 mg/mL kanamycin for flow cytometry analysis. Plasmids and genetic parts are listed in Tables 4 and 5.
  • Genomic Integration and Characterization of Controllers.
  • The mini-Tn7 insertion system was used to introduce a controller into the genome of P. protegens Pf-5. The IPTG-inducible T7 RNAP expression system and a tetracycline resistant marker tetA was placed between two Tn7 ends (Tn7L and Tn7R). The controller plasmid pMR-85 was introduced into P. protegens Pf-5 by double transformation with pTNS3 encoding the TnsABCD transposase. A genomically-integrated controller located 25 bp downstream of the stop codon of glmS was confirmed by PCR and sequencing. A markerless insertion method using homologous recombination was employed in R. sp. IRBG74. A controller encoding inducible T7 RNAP system flanked by two homology fragments that enables the replacement of recA was cloned into a suicide plasmid. These controller plasmids (IPTG-inducible, pMR82-84; DAPG-inducible, pMR85) in E. coli was mobilized into R. sp. IRBG74 MR18 (ΔhsdR. Δnif) by triparental mating, generating the controller strains (MR19, 20, 21 and 22, respectively). The controller integration in the genome was confirmed by gentamicin sensitivity and diagnostic PCR. All controllers were characterized in a manner identical to that described in genetic part characterization.
  • Construction and Characterization of Marionette-Based Controllers.
  • To regulate nitrogenase expression in the E. coli Marionette MG1655, the yfp in the 12 reporter plasmids was replaced with T7 RNAP while keeping other genetic parts (e.g., promoters and RBSs) unchanged (FIGS. 28A-28C). The reporter plasmid pMR-120 in which gfpmut3b is fused to the PT7(P2) promoter (FIGS. 28A-28C) was co-transformed to analyze the response functions of each of the 12 T7 RNAP controller plasmids. To characterize controllers, single colonies were inoculated into 1 ml LB supplemented with antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator (INFORS HT, MD). 0.5 μl of overnight cultures was diluted into 200 μl of LB with antibiotics and appropriate inducers in 96-well plates (Thermo Scientific, Cat #12565215) and incubated for 6 h at 30° C., 1,000 rpm in an ELMI DTS-4 shaker (ELMI, CA). After growth, 4 μl of culture sample was diluted into 150 μl PBS with 2 mg/mL kanamycin for flow cytometry analysis.
  • Flow Cytometry.
  • Cultures with fluorescence proteins were analyzed by flow cytometry using a BD Biosciences LSRII Forterssa analyzer with a 488 nm laser and 510/20-nm band pass filter for GFP and a 561 nm laser and 610/20 nm band pass filter for mCherry and mRFP1. Cells were diluted into 96-well plates containing phosphate buffered saline solution (PBS) supplemented with 2 mg/mL kanamycin after incubation. Cells were collected over 20,000 events which were gated using forward and side scatter to remove background events using FlowJo (TreeStar Inc., Ashland, Oreg.). The median fluorescence from cytometry histograms was calculated for all samples. The median autofluorescence was subtracted from the median fluorescence and reported as the fluorescence value in arbitrary unit (au).
  • Nitrogenase Assay (E. coli and K. oxytoca).
  • Cultures were initiated by inoculating a single colony into 1 mL of LB supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 μl of overnight cultures was diluted into 500 μl of BB medium with 17.1 mM NH4CH3CO2 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator. Cultures were diluted to an OD600 of 0.4 into 2 mL of BB medium supplemented with appropriate antibiotics, 1.43 mM serine to facilitate nitrogenase depression, and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps (Supelco Analytical, Cat #SU860103). Headspace in the vials was replaced with 100% argon gas using a vacuum manifold. Acetylene freshly generated from CaC2 in a Burris bottle was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm in an Innova 44 shaking incubator (New Brunswick) to prevent cell aggregations, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Assay (P. protegens Pf-5).
  • Cultures were initiated by inoculating a single colony into 1 mL of LB supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 μl of overnight cultures was diluted into 500 μl of BB medium with 17.1 mM NH4CH3CO2 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator. Cultures were diluted to an OD600 of 0.4 into 2 mL of BB medium supplemented with appropriate antibiotics, 1.43 mM serine and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon and 1% oxygen gas (Airgas, MA USA) using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Assays (Rhizobia Strains).
  • Cultures were initiated by inoculating a single colony into 0.5 mL of TY medium supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 μl of overnight cultures was diluted into 500 μl of UMS medium with 30 mM succinate, 10 mM sucrose, and 10 mM NH4C1 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator. Cultures were diluted to an OD600 of 0.4 into 2 mL of UMS medium plus 30 mM succinate and 10 mM sucrose supplemented with appropriate antibiotics, 1.43 mM serine and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon and 1% oxygen gas using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Assays (A. caulinodans and P. stutzeri).
  • Cultures were initiated by inoculating a single colony into 0.2 mL of TY medium supplemented with appropriate antibiotics in 96-deepwell plates and grown overnight at 37° C. and 30° C. for A. caulinodans and P. stutzeri, respectively, 900 rpm in a Multitron incubator. 5 μl of overnight cultures was diluted into 500 μl of UMS medium with 30 mM lactate and 10 mM NH4Cl and appropriate antibiotics in 96-deepwell and incubated for 24 h at 37° C. and 30° C. for A. caulinodans and P. stutzeri, respectively, 900 rpm in a Multitron incubator. Cultures were diluted to an OD600 of 0.4 into 2 mL of UMS medium plus 30 mM lactate supplemented with appropriate antibiotics and an inducer (if necessary) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon plus 1% oxygen gas using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Assays (A. vinelandii).
  • Cultures were initiated by inoculating a single colony into 0.5 mL of Burk medium supplemented with appropriate antibiotics in 96-deepwell plates (USA Scientific, Cat #18962110) and grown overnight at 30° C., 900 rpm in a Multitron incubator. 5 μl of overnight cultures was diluted into 500 μl of Burk medium with 17.1 mM NH4CH3CO2 and appropriate antibiotics in 96-deepwell and incubated for 24 h at 30° C., 900 rpm in a Multitron incubator. Headspace in the vials was replaced with 97% argon and 3% oxygen gas (Airgas, MA USA) using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm, followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Activity Assay in the Presence of Ammonium.
  • Following overnight incubation in minimal medium with a nitrogen source (described above), cultures were diluted to an OD600 of 0.4 in 2 mL of nitrogen-free minimal medium, 1.43 mM serine (for E. coli and P. protegens Pf-5) and an inducer (for inducible systems) in 10 mL glass vials with PTFE-silicone septa screw caps. Ammonium (17.1 mM NH4CH3CO2 for E. coli and P. protegens Pf-5 and 10 mM NH4Cl for Rhizobia) was added to a nitrogen-free minimal medium when testing ammonium tolerance of nitrogenase activity. Headspace in the vials was replaced with either 100% argon gas for E. coli, 99% argon plus 1% oxygen for Pseudomonas and Rhizobia using a vacuum manifold. Acetylene was injected to 10% (vol/vol) into each culture vial to begin the reaction. The acetylene reduction was carried out for 20 h at 30° C. with shaking at 250 rpm followed by quenching via the addition of 0.5 mL of 4 M NaOH to each vial.
  • Nitrogenase Activity Assay at Varying Oxygen Levels.
  • Following overnight incubation in minimal medium with a nitrogen source (described above), cultures were diluted to an OD600 of 0.4 in 2 mL of minimal medium, 1.43 mM serine (for E. coli and P. protegens Pf-5), and an inducer (for inducible systems) in 10 mL glass vials with PTFE-silicone septa screw caps. The vial headspace was replaced with either 100% nitrogen gas for E. coli or 99% nitrogen plus 1% oxygen for P. protegens Pf-5 and A. caulinodans using a vacuum manifold. Cultures were incubated with shaking at 250 rpm at 30° C. for 6 h and 9 h for P. protegens Pf-5 and A. caulinodans, respectively, after which oxygen concentrations in the headspace were recorded with the optical oxygen meter FireStingO2 equipped with a needle-type sensor OXF500PT (Pyro Science, Germany) After the induction period, no oxygen remained in the headspace for all species as confirmed by the oxygen meter. The initial oxygen levels in the headspace were adjusted by injecting pure oxygen via syringe into the headspace of the vials and stabilized with shaking at 250 rpm at 30° C. for 15 m followed by the injection of acetylene to 10% (vol/vol) into each culture vial to begin the reaction and initial oxygen concentrations in the headspace were recorded concomitantly. The oxygen levels in the headspace were maintained around the setting points (<±0.25% 02) while incubating at 250 rpm and 30° C. by injecting oxygen every hour for 3 h with oxygen monitoring before and after oxygen spiking (FIGS. 26A-26B). The reactions were quenched after 3 h of incubation by the injection of 0.5 mL of 4 M NaOH to each vial using a syringe.
  • Ethylene Quantification.
  • Ethylene production was analyzed by gas chromatography using an Agilent 7890A GC system (Agilent Technologies, Inc., CA USA) equipped with a PAL headspace autosampler and flame ionization detector as follows. An aliquot of 0.5 mL headspace preincubated to 35° C. for 30 s was injected and separated for 4 min on a GS-CarbonPLOT column (0.32 mm×30 m, 3 microns; Agilent) at 60° C. and a He flow rate of 1.8 mL/min. Detection occurred in a FID heated to 300° C. with a gas flow of 35 mL/min H2 and 400 mL/min air. Acetylene and ethylene were detected at 3.0 min and 3.7 min after injection, respectively. Ethylene production was quantified by integrating the 3.7 min peak using Agilent GC/MSD ChemStation Software.
  • Sample Preparation for RNA-Seq and Ribosome Profiling.
  • Cultures of K. oxytoca, E. coli, P. protegens Pf-5 or R. sp. IRBG74 were grown following the same protocol as used for nitrogenase activity assay (described above) with a few changes. Following overnight incubation in minimal medium with a nitrogen source, cultures were diluted to an OD600=0.4 in 25 mL of minimal medium (with an inducer, if needed) and antibiotics in 125 mL Wheaton serum vials (DWK Life Sciences, Cat #223748) with septum stoppers (Fisher Scientific, Cat #FB57873). The vial headspace was replaced with either 100% nitrogen gas for E. coli and K. oxytoca or 99% nitrogen plus 1% oxygen for P. protegens Pf-5 and R. sp. IRBG74 using a vacuum manifold. Cultures grown 6 h at 30° C., 250 rpm were filtered onto a nitrocellulose filter 0.45 μM pore size (Fisher Scientific, Cat #GVS1215305). Cell pellets were combined from three vials using a stainless-steel scoopula, followed by flash-frozen in liquid nitrogen. The frozen pellets were added to 650 μl of frozen droplets of lysis buffer (20 mM Tris (pH 8.0), 100 mM NH4Cl, 10 mM MgCl2, 0.4% Triton X-100, 0.1% NP-40, 1 mM chloramphenicol and 100 U/mL DNase I) in prechilled 25 mL canister (Retsch, Germany, Cat #014620213) in liquid nitrogen and pulverized using TissueLyser II (Qiagen USA) with a setting at 15 Hz for 3 min for 5 times with intermittent cooling between cycles. The pellet was removed by centrifugation at 20,000 rcf at 4° C. for 10 min and the lysate was recovered in the supernatant.
  • RNA-Seq Experiments.
  • RNA-seq and Ribosome-footprint profiling was carried out according to the method described earlier with a few modifications(Li, G.-W., Oh, E. & Weissman, J. S. J. N. The anti-Shine—Dalgarno sequence drives translational pausing and codon choice in bacteria. 484, 538 (2012); Li, G.-W., Burkhardt, D., Gross, C. & Weissman, J. S. Quantifying absolute protein synthesis rates reveals principles underlying allocation of cellular resources. Cell 157, 624-635 (2014)). The total RNA was isolated using the hot phenol-SDS extraction method. The rRNA fractions were determined and subtracted from the total using the MICROBExpress kit (Thermo Fisher Scientific, Cat #AM1905). The remaining mRNAs and tRNAs were fragmented by RNA fragmentation reagents (Thermo Fisher Scientific, Cat #AM8740) at 95° C. for 1 m 45 s. RNA fragments (10-45 bp) were isolated from a 15% TBE-Urea polyacrylamide gel (Thermo Fisher Scientific, Cat #EC6885). The 3′ ends of the RNA fragments were dephosphorylated using T4 polynucleotide kinase (1U/μl, New England Biolabs, Cat #M0201S) in a 20 μl reaction volume supplemented with 1 μl of 20 U SUPERase. In at 37° C. for 1 h, after which the denatured fragments (5 pmoles) were incubated at 80° C. for 2 min and ligated to 1 μg of the oligo (/5rApp/CTGTAGGCACCATCAAT/3ddc/, Integrated DNA technologies) (SEQ ID NO: 1) in a 20 μl reaction volume supplemented with 8 μl of 50% PEG 8000, 2 μl of 10×T4 RNA ligase 2 buffer, 1 μl of 200 U/μl truncated K277Q T4 ligase 2 (New England Biolabs, Cat #M0351) and 1 μl of 20 U/μl of SUPERase. In (Invitrogen) at 25° C. for 3 h. The ligated fragments (35-65 bp) were isolated from a 10% TBE-Urea polyacrylamide gel (Invitrogen, Cat #EC6875). cDNA libraries from the purified mRNA products were reverse-transcribed using Superscript III (Thermo Fisher Scientific, Cat #18080044) with oCJ485 primer (/5Phos/AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT/iSp18/CAAGCAGAAGA CGGCATACGAGATATTGATGGTGCCTACAG (SEQ ID NO: 2, SEQ ID NO: 3)) at 50° C. for 30 min and RNA products subsequently were hydrolyzed by the addition of NaOH at a final concentration of 0.1 M, followed by incubation at 95° C. for 15 min. The cDNA libraries (125-150 bp) were isolated from on a 10% TBE-Urea polyacrylamide gel (Invitrogen, Cat #EC6875). The cDNA products were circularized in a 20 μl reaction volume supplemented with 2 μl of 10× CircLigase buffer, 1 μl of 1 mM ATP, 1 μl of 50 mM MnCl2 and 1 μl of CircLigase (Epicenter, Cat #CL4115K) at 60° C. for 2 h and heat-inactivated at 80° C. for 10 min. 5 μl of circularized DNA was amplified using Phusion HF DNA polymerase (New England Biolabs, Cat #M0530) with o231 primer (CAAGCAGAAGACGGCATACGA (SEQ ID NO: 4)) and indexing primers (AATGATACGGCGACCACCGAGATCTACACGATCGGAAGAGCACACGTCTGAACT CCAGTCACNNNNNNACACTCTTTCCCTACAC (SEQ ID NO: 5)) for 7 to 10 cycles. The amplified products (125-150 bp) were recovered from an 8% TBE-Urea polyacrylamide gel (Invitrogen, Cat #EC62152). The purified products were analyzed by BioAnalyzer (Agilent, CA USA) and sequenced with a sequencing primer (CGACAGGTTCAGAGTTCTACAGTCCGACGATC (SEQ ID NO: 6)) using an Illumina HiSeq 2500 with a rapid run mode. To generate the RNA-seq read profile for each nif cluster, the raw trace profiles are multiplied by 107 and normalized by respective total reads from coding sequences of each species (K. oxytoca M5al, CP020657.1; E. coli MG1655, NC_000913.3; P. protegens Pf-5, CP000076; R. sp. IRBG74 HG518322, HG518323, HG518324 and an appropriate plasmid carrying a nif cluster). The mRNA expression level of each gene was estimated using total sequencing reads mapped onto the gene, representing fragments per kilobase of transcript per million fragments mapped units (FPKM).
  • Ribo-Seq Experiments.
  • 0.5 mg of RNA was diluted into 195 μl of the lysis buffer including 0.5 U RNase inhibitor SUPERase. In (Invitrogen, Cat #AM2694), 5 mM CaCl2 and were treated with 5 μl of 750 U of micrococcal nuclease (Sigma Aldrich, Cat #10107921001) at 25° C. for 1 h to obtain ribosome-protected monosomes. The digestions were quenched by the addition of EGTA to a final concentration of 6 mM and then kept on ice before the isolation of monosomes. Subsequently, the monosome fraction was collected by sucrose density gradient (10-55% w/v) ultracentrifugation at 35,000 rpm for 3 h, followed by a hot phenol-SDS extraction to isolate ribosome-protected mRNA fragments. The mRNA fragments (15-45 bp) were isolated from a 15% TBE-Urea polyacrylamide gel. The 3′ ends of the purified fragments were dephosphorylated and ligated to the modified oligo. cDNA libraries generated by Superscript III were circularized by CircLigase as described above. rRNA products were depleted by a respective biotinylated oligo mix for E. coli and P. protegens Pf-5.5 μl of circularized DNA was amplified using Phusion HF DNA polymerase with o231 primer and indexing primers for 7 to 10 cycles. The amplified products (125-150 bp) were recovered from an 8% TBE-Urea polyacrylamide gel. The purified products were analyzed by BioAnalyzer and sequenced with a sequencing primer (CGACAGGTTCAGAGTTCTACAGTCCGACGATC (SEQ ID NO: 7)) using an Illumina HiSeq 2500 with a rapid run mode. Sequences were aligned to reference sequences using Bowtie 1.1.2 with the parameters—k1—m2—v1. A center-weighting approach was used to map the aligned footprint reads ranging from 22 to 42 nucleotides in length. To map P-site of ribosome from footprint reads, 11 nucleotides from the both ends were trimmed, and the remaining nucleotide were given the same score, normalized by the length of the center region. Aligned reads (10-45 nucleotides) were mapped to the reference with equal weight of each nucleotide. A Python 3.4 script was used to perform the mapping. To generate the Ribo-seq read profile for each nif cluster, the raw trace profiles are multiplied by 108 and normalized by respective total reads from coding sequences of each species. To calculate the ribosome density of each gene, read densities were first normalized in the following ways: (i) The first and last 5 codons of the gene are excluded for the calculation to remove the effects of translation initiation and termination. (ii) A genome-wide read density profile was fitted to an exponential function and the density at each nucleotide on a given gene was corrected using this function. (iii) If the average read density on a gene is higher than 1, a 90% winsorization was applied to reduce the effect of outliers. The sum of normalized reads on a gene was normalized by the gene length and the total read densities on coding sequences to yield the ribosome density.
  • Calculation of Genetic Part Strengths Based On—Seq Data.
  • The activity of a promoter is defined as the change in RNAP flux δJ around a transcription start site xtss (Gorochowski, T. E. et al. Genetic circuit characterization and debugging using RNA-seq. 13, 952 (2017)). The promoter strength is calculated by
  • δ J = γ n [ i = x tss + 1 x tss + 1 + n m ( i ) - i = x tss - 1 x 0 - 1 - n m ( i ) ] ( 1 )
  • where m(i) is the number of transcripts at each position I from FPKM-normalized transcriptomic profiles, y=0.0067 s−1 is the degradation rate of mRNA, n is the window length before and after xtss. The window length is set to 10. The terminator strength Ts is defined as the fold-decrease in transcription before and after a terminator, which can be quantified from FPKM-normalized transcriptomic profiles as
  • T s = i = x 1 + 1 x 1 + n m ( i ) i = x 0 - 1 x 0 - n m ( i ) ( 2 )
  • where x0 and x1 are the beginning and end positions of the terminator part, respectively. Translation efficiency was calculated by dividing the ribosome density by the FPKM.
  • nifH Expression Analysis.
  • Complementation of NifA was tested using plasmid pMR-128 to 130 that contains the sfgfp fused to the nifH promoter in the A. caulinodans ΔnifA mutant. The inducible NifA/RpoN expression was provided by the plasmid pMR-121 into which sfgfp driven by the nifH promoter was added to analyze nifH promoter activity, yielding pMR-131 (FIG. 29). The IPTG-inducible system in the plasmid pMR-124 was substituted with other inducible systems including the salicylic acid-inducible, nopaline-inducible and octopine-inducible systems, yielding pMR-125, 126, and 127, respectively. Each of the plasmids was mobilized into the A. caulinodans ΔnifA mutant, which was grown following the same protocol as used for nitrogenase activity (described herein). Following overnight incubation in minimal medium with a nitrogen source, cultures were diluted to an OD600=0.4 in 2 mL of UMS medium plus 30 mM lactate, antibiotics and an inducer (for inducible systems) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with 99% argon plus 1% oxygen using a vacuum manifold. The vials were incubated with shaking at 250 rpm at 30° C. for 9 h, after which 10 μl of cultures was diluted into 150 μl PBS with 2 mg/mL kanamycin for flow cytometry analysis. To test activation of the nifH promoters by diverse NifA proteins, the plasmids pMR-51, 53, 88, 89 and 90 were introduced into E. coli MG1655 and the plasmids pMR-91, 92, 93, 94 and 95 to P. protegens Pf-5. The plasmid pMR-101 was used to provide inducible NifA expression by IPTG in E. coli. The controller encoding the IPTG-inducible NifA was inserted into the genome of P. protegens Pf-5 using the plasmids pMR-96, 97 and 98. The IPTG-inducible system of the NifA controller plasmid pMR-96 was replaced with the arabinose-inducible and the naringenin-inducible system, yielding pMR-99 and 100, respectively. The inducibility of nifH expression was assessed by the reporter plasmids pMR-105 to 107 and pMR102 to 104 or E. coli and P. protegens Pf-5, respectively. The controller plasmids were transformed into E. coli or P. protegens Pf-5 with the reporter plasmids. Following overnight incubation in minimal medium with a nitrogen source, cultures were diluted to an OD600=0.4 in 2 mL of BB medium, antibiotics and an inducer (for inducible systems) in 10 mL glass vials with PTFE-silicone septa screw caps. Headspace in the vials was replaced with either 100% argon for E. coli or 99% argon plus 1% oxygen for P. protegens Pf-5 using a vacuum manifold. The vials were incubated with shaking at 250 rpm at 30° C. for 9 h, after which 10 μl of cultures was diluted into 150 μl PBS with 2 mg/mL kanamycin for flow cytometry analysis.
  • Sequence Alignment.
  • NifA sequences of R. sphaeroides 2.4.1 (RSP_0547) and A. caulinodans ORS571 (AZC_1049) were obtained from NCBI. NifA protein sequences were aligned with MUSCLE (https://www.ebi.ac.uk/Tools/msa/muscle/) with a default settings (FIG. 22).
  • Results
  • Performance of Native Nif Clusters in E. coli, P. Protegens Pf-5, and Symbiotic Rhizobia
  • A set of diverse native nif clusters were cloned in order to determine their relative performance in different strains and the associated species barriers (FIG. 1A). Previously-defined boundaries for the well-studied nif cluster from K. oxytoca (Arnold, W., Rump, A., Klipp, W., Priefer, U. B. & Paler, A. J. J. o. m. b. Nucleotide sequence of a 24,206-base-pair DNA fragment carrying the entire nitrogen fixation gene cluster of Klebsiella pneumoniae.
  • 203, 715-738 (1988)) and the small (10 kb) cluster from Paenibacillus polymyxa WLY7870 were used. Similarly, the published boundaries (43.7 kb) of the P. stutzeri A1501 (Yan, Y. et al. Nitrogen fixation island and rhizosphere competence traits in the genome of root-associated Pseudomonas stutzeri A1501. Proceedings of the National Academy of Sciences (2008)) and A. vinelandii DJ clusters were used (Hamilton, T. L. et al. Transcriptional profiling of nitrogen fixation in Azotobacter vinelandii. J Bacteriol 193, 4477-4486, doi:10.1128/JB.05099-11 (2011)). A region of the P. stutzeri A1501 nif cluster (Pst1307-Pst1312) was excluded as these genes are predicted to have no effect on nitrogenase. A. vinelandii DJ contains three putative electron transport systems (the Rnf1 and Rnf2 complexes and the Fix complex) located in other regions of the genome. RNA-seq data shows that Rnf2 is not co-expressed with the nif genes, so only the Rnf1 and Fix complexes were included by fusing their DNA to create a single 46.9 kb construct. The nif cluster
  • (40.1 kb) from Azospirillum brasilense Sp7 was selected because this species is a cereal endophyte and fixes nitrogen in free-living conditions. Several less-studied gene clusters were also cloned in order to probe species barriers. As a representative of cyanobacteria, the gene cluster from Cyanothece sp. ATCC51142 was cloned following published boundaries. Its transcriptional activator PatB occurs outside of the nif cluster, which was cloned along with its native promoter and fused to nif cluster to form a single construct (31.7 kb). Several gene clusters were selected from photosynthetic purple bacteria (Rhodopseudomonas palustris CGA009 (Oda, Y. et al. Functional genomic analysis of three nitrogenase isozymes in the photosynthetic bacterium Rhodopseudomonas palustris. 187, 7784-7794 (2005)) and Rhodobacter sphaeroides 2.4.1 (Haselkorn, R. & Kapatral, V. in Genomes and genomics of nitrogen-fixing organisms, 71-82 (Springer, 2005))) as these are members of the same alphaproteobacteria class as Rhizobia. The mf cluster, encoded on a separate chromosome of
  • R. sphaeroides 2.4.1, was added to the nif cluster to provide electrons to nitrogenase. Finally, the gene clusters from the sugarcane and rice endosymbiant Gluconacetobacter diazotrophicus PA1 5 (28.9 kb) as well as the three nif clusters from A. caulinodans ORS571 (64 kb)37 were cloned together with an upstream regulator fixLJK, but these were found to be inactive in all species tested, so they are not shown in FIGS. 1A-1F. The precise genomic locations for all the nif clusters are provided in Table 2 and the plasmids containing nif clusters are provided in Table 3.
  • Each cluster was amplified from genomic DNA as multiple fragments by PCR and assembled with the plasmid backbone using yeast assembly (see Materials and Methods Section). The P. polymyxa WLY78 cluster was de novo synthesized based on the DNA sequence on contig ALJV01 (Shanks, R. M. et al. Saccharomyces cerevisiae-based molecular tool kit for manipulation of genes from gram-negative bacteria. 72, 5027-5036 (2006)). The clusters were cloned into different plasmid systems to facilitate transfer. For transfer to E. coli and R. sp. IRBG74, the broad-host range plasmid based on a pBBR1 origin was used (a second compatible RK2-origin plasmid was used for the nif cluster from A. caulinodans ORS571). These plasmids contain the RK2 oriT to enable the conjugative transfer of large DNA (see Materials and Methods). For transfer to P. protegens Pf-5, this plasmid system was found to be unstable and produce a mixed population. To transfer into this strain, the Pseudomonas-specific plasmid pRO1600 with the oriT was used. After construction, all of the plasmids were verified using next-generation sequencing (see Materials and Methods Section).
  • The set of 10 nif clusters were transferred into E. coli MG1655, the cereal epiphyte P. protegens Pf-5, and the cereal endophyte R. sp. IRBG74 to create 30 strains (FIG. 1A). E. coli was selected as a control as successful transfers to this recipient have been performed. Native P. protegens Pf-5 does not fix nitrogen. R. sp. IRBG74 contains two nif clusters in different genomic locations, which were left intact, but does not have nitrogenase activity under free living conditions. The genomic cluster does not have the required NifV enzyme as it obtains homocitrate from the plant. All of the clusters in the set have nifV, except the one from P. polymyxa WLY78. A test was run to determine whether the expression of recombinant WV from A. caulinodans ORS571 in R. sp. IRBG74 would result in active nitrogenase, but no activity was detected.
  • The bacteria were grown in appropriate media, including antibiotics, and then evaluated for nitrogenase activity using an acetylene reduction assay (see Methods and Materials Section). E. coli and Pseudomonas were grown at 30° C. in BB minimal media, as described previously71. However, no growth was observed for R. sp. IRBG74 under these conditions. Different media and carbon sources were tested and it was found that UMS media with dicarboxylic acids (malate or succinate), the major carbon source from plants147, with 10 mM sucrose yielded the highest growth rates (FIG. 6). After overnight growth, cells were transferred to stoppered test tubes in ammonium-free minimal media to a final OD600 of 0.4. For E. coli, the headspace air is completely replaced with argon gas. For P. protegens Pf-5 and R. sp. IRBG74, the initial headspace concentration of oxygen was maintained at 1% because these bacteria require oxygen for their metabolism. The cells are incubated at 30° for 20 hours in the presence of excess acetylene and the conversion to ethylene was quantified by GC-MS (see Materials and Methods Section). There was no significant growth for any of the strains under these conditions, so the nitrogenase activities reported correspond to the same cell densities.
  • A surprising 6 out of 10 clusters were functional in E. coli MG1655, with the K. oxytoca cluster producing the highest activity (FIG. 1A). The K. oxytoca cluster is also functional in P. protegens Pf-5, albeit with 60-fold less activity as compared to that in E. coli MG1655. Interestingly, the clusters from P. stutzeri and A. vinelandii—both obligate aerobes—are able to achieve high activities in P. protegens Pf-5. The resulting nitrogenase activities are 3- to 7-fold higher than that achieved from K. oxytoca, which only fixes nitrogen under strict anaerobic conditions. These clusters have common organizational features and similar electron transport chains, such as the Rnf complex.
  • A single gene cluster, from R. sphaeroides, yielded nitrogenase activity in R. sp. IRBG74 (FIG. 1A). Notably, both Rhizobium and Rhodobacter are alphaproteobacter and their nif clusters may contain interchangeable genes. When the native nif clusters are knocked out of R. sp. IRBG74, introducing the R. sphaeroides cluster alone does not yield active nitrogenase. These data point to a complex complementation between the endogenous and introduced gene clusters. To determine whether this approach could be generalized to other symbiotic Rhizobia, the Rhodobacter and Rhodopseudomonas gene clusters were transferred to a panel of 12 species isolated from diverse legumes (FIG. 1A). Remarkably, the transfer of these clusters was able to produce detectable nitrogenase activity in 7 of the strains.
  • Phylogenetic analysis was performed based on the full-length 16S rRNA gene sequences (K. oxytoca, BWI76_05380; A. vinelandii, Avin_55000; R. sphaeroides, DQL45_00005; Cyanothece ATCC51142, cce_RNA045; A. brasilense, AMK58_25190; R. palustris, RNA_55; P. protegens, PST_0759; Paenibacillus sp. WLY78, JQ003557). A multiple sequence alignment was generated using MUSCLE (Edgar, R. C. J. N. a. r. MUSCLE: multiple sequence alignment with high accuracy and high throughput. 32, 1792-1797 (2004)). A phylogenetic tree was constructed using the Geneious software (R9.0.5) with the Jukes-Cantor distance model and UPGMA as a tree build method, with bootstrap values from 1,000 replicates. This phylogenetic tree is shown in FIG. 30A. The scale bar indicates 2% substitutions per site. The clusters based on evolutionary closeness are circled. Using the same data from FIG. 1A, FIG. 30B summarizes the relative nitrogenase activity in the three host strains carrying each of the 10 nif clusters. The result indicates that the phylogenetic closeness has a predictive power for achieving highest nitrogenase activity in a new host that lacks a nif cluster.
  • Hereafter, studies were conducted to further characterize the extent to which changes in transcription and translation impacted the differences in activity observed when a native cluster is transferred between species. Differences in promoter activity, ribosome binding sites, and codon usage could change the expression levels of nif genes in detrimental ways. To quantify this effect, RNA-seq and ribosome profiling experiments were performed to evaluate the expression K. oxytoca nif cluster in K. oxytoca as well as E. coli MG1655, P. protegens Pf-5, and R. sp. IRBG74. RNA-seq experiments provide mRNA levels of genes (calculated as FPKM) and can be used to measure the performance of promoters and terminators. Ribosome profiling can be used to quantify protein synthesis rates, ribosome binding site (RBS) strength and ribosome pausing internal to genes. The ribosome density (RD) has been shown to correlate with protein expression rates. The translation efficiency is calculated by normalizing the RD by the number of transcripts (FPKM from Ribo-seq). Ribosome profiling has been applied to determine the relative levels of proteins expressed in multi-subunit complexes.
  • The RNA-seq profiles in both the sense and antisense direction are very close when compared between K. oxytoca and E. coli (FIGS. 1B-1C) and the ratios between mRNAs is preserved (R2=0.89) (FIG. 1D). This is consistent with the observation that this cluster yields a similar activity in both hosts. In contrast, the RNA-seq profiles differ more significantly for P. protegens Pf-5 and R. sp. IRBG74 (FIGS. 1B-1C), and there was no correlation between mRNA transcripts (FIG. 1D).
  • The ratios between protein expression rates were measured using ribosome profiling (FIG. 1E and FIG. 9). It is noteworthy that the ratios measured in K. oxytoca almost perfectly correlate with immunoblotting assays of A. vinelandii and the stoichiometry of H:D:K reflects the known 2:1:1 ratio. Interestingly, unlike mRNA levels, the ratios in expression rates are strongly correlated when the cluster is transferred between species: E. coli (R2=0.94), P. protegens Pf-5 (R2=0.61), and R. sp. IRBG74 (R2=0.71) (FIGS. 1E-1F). The production of NifH is significantly lower in R. sp. IRBG as compared to other strains. In an attempt to increase the induction of the cluster in this host, NifA was overexpressed, but this proved unsuccessful in producing high levels of active nitrogenase (FIGS. 10A-10B).
  • The following summarizes the results of the transfer of native nif clusters to new species. The most successful recipient was E. coli. However, this is not a viable agricultural strain and activity was eliminated in the presence of 17.1 mM ammonium (FIGS. 7A-7E, and FIGS. 8A-8B). Moderately high activity was obtained in P. protegens Pf-5, but this yielded a constitutively-on response (the K. oxytoca cluster) or was strongly repressed by ammonium (the A. vinelandii cluster). It was also found that the P. stutzeri cluster in P. protegens Pf-5 is inactive in the presence of ammonium, in disagreement with previously published results (Setten, L. et al. Engineering Pseudomonas protegens Pf-5 for nitrogen fixation and its application to improve plant growth under nitrogen-deficient conditions. PLoS One 8, e63666 (2013)). Only low levels of activity could be obtained by transferring clusters to Rhizobia. To address these issues, different approaches were applied to engineer the clusters to generate higher activity, exhibit less repression by ammonium, and be inducible.
  • Transfer of Refactored Klebsiella Nif Clusters to R. Sp. IRBG74
  • The process of refactoring a gene cluster involves the complete reconstruction of the genetic system from the bottom-up, using only well-characterized genetic parts. An exhaustive approach is to recode the genes (to eliminate internal regulation), reorganize into operons, control expression with synthetic ribosome binding sites (RBSs), and use T7 RNAP promoters and terminators. A separate “controller,” carried in a genetically distinct location, links synthetic sensors and circuits to the expression of T7 RNAP. For various applications, this approach has proven useful for transferring multi-gene systems between species, simplifies optimization through part replacement and enzyme mining, and enables the replacement of environmental signals that naturally control the cluster with the stimuli that induce the synthetic sensors (Smanski, M. J. et al. Synthetic biology to access and expand nature's chemical diversity. Nature Reviews Microbiology 14, 135 (2016); Song, M. et al. Control of type III protein secretion using a minimal genetic system. 8, 14737 (2017); Guo, C.-J. et al. Discovery of reactive microbiota-derived metabolites that inhibit host proteases. 168, 517-526. e518 (2017); Ren, H., Hu, P., Zhao, H. J. B. & bioengineering. A plug-and-play pathway refactoring workflow for natural product research in Escherichia coli and Saccharomyces cerevisiae. 114, 1847-1854 (2017)). In previous studies, the Klebsiella nif cluster was refactored, which was subsequently used as a platform to optimize activity by changing the genetic organization and the parts controlling expression. The top variant (v2.1) fully recovered activity in a K. oxytoca nif knockout and is functional in E. coli. To transfer into E. coli, a controller based on the isopropyl-β-D-thiogalactoside (IPTG)-inducible T7 RNAP carried on a plasmid was used (FIG. 2A). An interesting observation during optimization is that the genetic organization of the native cluster, including the existence of operons, was not correlated with activity.
  • An advantage of using T7 RNAP is that it is functional in essentially all prokaryotes, so the refactored cluster can be transferred as-is and transcription induced by expressing T7 RNAP in the new host. However, a new controller needs to be built for each host based on regulation and regulatory parts that work in that species. A controller for E. coli was designed based on the IPTG-inducible T7 RNAP carried on a plasmid (pKT249) (FIG. 2A). To transfer the refactored cluster to R. sp. IRBG74, first a controller was constructed that functions in this species and produces an equivalent range of T7 RNAP expression.
  • While a handful of inducible systems and sets of genetic parts have been previously described for Rhizobia, a new part collection needed to be built and characterized in order to have those needed to create a controller with sufficient dynamic range. First, a set of 20 constitutive promoters (Anderson, J. et al. BglBricks: A flexible standard for biological part assembly. 4, 1 (2010)) and seven T7 RNAP-dependent promoters (emme, K., Zhao, D. & Voigt, C. A. Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca. Proceedings of the National Academy of Sciences 109, 7085-7090 (2012)) that were found to span a range of 382-fold and 23-fold expression, respectively, were characterized (FIGS. 11A-11C). Second, a library of 285 ribosome binding sites (RBSs) were screened using the RBS Library Calculator, representing an expression range of 5,600-fold (FIGS. 12A-12B). Finally, a set of 29 terminators was characterized, of which 17 were found to have a terminator strength >10 (FIGS. 13A-13B). Using these part libraries, six inducible systems for R. sp. IRBG74 were then constructed that respond to IPTG, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid (FIG. 14). After optimization, these systems generate between 7- to 400-fold induction.
  • A controller was then constructed by using the optimized IPTG-inducible system to drive the expression of a variant of T7 RNAP (R6232S, N-terminal lon tag, GTG start codon) (FIG. 2A). RBS variants controlling T7 RNAP expression were tested and an intermediate strength was selected to maximize induction while limiting toxicity (FIG. 16). The controller was carried on the genome by replacing recA (see Materials and Methods). The response function of the final controller is compared to that obtained for pKT249 in E. coli, showing that they sweep through the same range of expression at intermediate levels of induction (FIG. 2B). To achieve the same level of induction in the two species, 0.1 mM IPTG is selected for E. coli and 0.5 mM for R. sp. IRBG74 (circled points in FIG. 2B).
  • The refactored v2.1 cluster was then transferred to R. sp. IRBG74, but no activity was observed (FIGS. 2C-2D). Activity was also not observed when the v2.1 cluster was transferred to P. protegens Pf-5 (FIG. 17). To determine if the genetic parts that make up the refactored cluster were functioning as designed, RNA-seq and ribosome profiling experiments were performed (FIG. 18). From these data, the strengths of promoters/terminators and the transcription level and translation rates of genes could be calculated (see Materials and Methods). The performance of the promoters in R. sp. IRBG74 was systematically lower than E. coli, particularly the first promoter controlling nifH (FIG. 2E). The terminators were functioning the same in the two species, albeit weakly, and no termination could be detected from the three terminators in the center of the cluster (FIG. 2E). The translation of the genes differed significantly between organisms (FIG. 2F). When the expression rates of the nif genes from the refactored cluster are compared with their levels in their native context in K. oxytoca, there is almost no correlation (FIG. 2F). Importantly, there is 9-fold less NifH expressed from the refactored cluster in R. sp. IRBG74 as compared to the same cluster in E. coli. Thus, the refactored cluster produces wildly different expression levels of the component genes when transferred between organisms, even when transcription is matched between them using different controllers.
  • Based on these results, a new refactored cluster (v3.2) (FIG. 2G) was designed. A very strong promoter was chosen for nifH. The transcription was broken up by adding promoters to divide nifENX and nifJ and selecting stronger terminators. Noting that the expression ratios between nif genes are better preserved when the native cluster is transferred to a new host (FIG. 1D) but not the refactored cluster (FIG. 2F), it was hypothesized that this could be due to the disruption of the operon structures and the associated translational coupling between genes. The K. oxytoca operons were cloned intact, including native RBSs and replaced these regions of the refactored cluster (FIG. 2G). Note that this also preserves nifT and nifX, which were not included in first versions because they were either inessential(Simon, H. M., Homer, M. J. & Roberts, G. P. J. J. o. b. Perturbation of nifT expression in Klebsiella pneumoniae has limited effect on nitrogen fixation. 178, 2975-2977 (1996)) or inhibitory (Gosink, M. M., Franklin, N. M. & Roberts, G. P. J. J. o. b. The product of the Klebsiella pneumoniae nifX gene is a negative regulator of the nitrogen fixation (nif) regulon. 172, 1441-1447 (1990)).
  • Compared to v2.1, the v3.2 cluster is less active in E. coli but is active in R. sp. IRBG74 (FIG. 2H) and P. protegens Pf-5 (FIG. 17). This experiment was performed in the double nif knockout strain in R. sp. IRBG74, thus indicating that the refactored cluster is self-contained in producing nitrogenase activity. RNA-seq and ribosome profiling was applied to evaluate the performance of v3.2 in all three species (FIG. 21, FIG. 19, and FIGS. 20A-20F). The promoters perform similarly in the different hosts, but there was significant diversity in terminator function. Despite this, the translation rates (RD) of the genes were remarkably consistent and NifH expression is nearly identical (FIG. 2J). The higher expression of NifH and the preserved ratios between proteins is the likely reason that the refactored cluster is functional in R. sp. IRBG74. The next attempt was to increase expression level of the nif genes in R. sp. IRBG74 by increasing the concentration of inducer used, but a clear optimum beyond which increased expression caused a rapid decline in activity was found (FIG. 2M). This indicates a potential upper limit in obtaining activity in R. sp. IRBG74 under free living conditions using only the genes from K. oxytoca.
  • Replacement of A. caulinodans Nif Regulation with Synthetic Control
  • The A. caulinodans nif genes are distributed across three clusters in different genomic locations. The regulatory signals converge on the NifA activator that, in concert with the RpoN sigma factor, turns on transcription of the genomic nif clusters. Numerous and not fully characterized environmental signals are integrated upstream of this node, including NtrBC (Kaminski, P. A. & Elmerich, C. J. M. m. The control of Azorhizobium caulinodans nifA expression by oxygen, ammonia and by the HF-I-like protein, NrfA. 28, 603-613 (1998)), NtrXY (Pawlowski, K., Klosse, U., De Bruijn, F. J. M. & MGG, G. G. Characterization of a novel Azorhizobium caulinodans ORS571 two-component regulatory system, NtrY/NtrX, involved in nitrogen fixation and metabolism. 231, 124-138 (1991)), FixLJK(Kaminski, P. & Elmerich, C. J. M. m. Involvement of fixLJ in the regulation of nitrogen fixation in Azorhizobium caulinodans. 5, 665-673 (1991); Kaminski, P., Mandon, K., Arigoni, F., Desnoues, N. & Elmerich, C. J. M. m. Regulation of nitrogen fixation in Azorhizobium caulinodans: identification of a fixK-like gene, a positive regulator of nifA. 5, 1983-1991 (1991)), NrfA (Kaminski, P. A. & Elmerich, C. J. M. m. The control of Azorhizobium caulinodans nifA expression by oxygen, ammonia and by the HF-I-like protein, NrfA. 28, 603-613 (1998)), and PII proteins (e.g., GlnB and GlnK (Michel-Reydellet, N. & Kaminski, P. A. J. J. o. b. Azorhizobium caulinodans Plland GlnK proteins control nitrogen fixation and ammonia assimilation. 181, 2655-2658 (1999))). The clusters (64 kb total, containing 76 genes) were cloned into the plasmid systems described above and transferred into R. sp. IRBG74 and P. protegens Pf-5, but no activity was found in either strain. Overexpression of A. caulinodans NifA and RpoN did not lead to activity and, upon further investigation, these regulators were found to be inactive in these strains. The size of the clusters and the lack of genetic and gene function information would complicate fully refactoring the system. For these reasons, it was decided to modify the regulation controlling nif such that it can be placed under the control of synthetic sensors.
  • One goal herein was to eliminate ammonium repression of nitrogenase activity, which converges on the regulation of NifA. The native nifA gene was knocked out of the genome using the sacB markerless deletion method (see Materials and Methods), with the intent of placing NifA under inducible control (FIG. 3A). There is only basal activity from the nifH promoter in the ΔnifA strain (FIG. 3B). When NifA is overexpressed, the promoter turns on and its activity is further enhanced by the co-expression of RpoN in an operon (note that the genomic rpoN gene is left intact for these experiments). The IPTG-inducible system designed for Rhizobium (previous section) was tested in A. caulinodans carried on a pBBR1-ori plasmid. Using GFP, this was found to induce expression over several orders of magnitude (FIG. 21). Then, the A. caulinodans nifA and rpoN gene was placed under IPTG control and the fluorescent reporter fused to the A. caulinodans nifH promoter (encompassing 281 nt upstream of the ATG), carried on the same plasmid (see Materials and Methods). The response function from the nifH promoter was analyzed at the condition used for nitrogen fixation, exhibiting a wide dynamic range to 45-fold (FIG. 3C).
  • The controller was designed to co-express NifA and RpoN and tested for its ability to induce nitrogenase (FIG. 3D). When fully induced, there was a complete recovery of activity as compared to the wild-type strain. The repression of nitrogenase activity by ammonium was then evaluated. The presence of 10 mM ammonium chloride leads to no detectible activity by the wild-type strain (FIG. 3E). Even when both NifA and RpoN are under inducible control, there is strong repression with only 5% of the nitrogenase activity of the wild-type. This suggests that the post-transcriptional control of NifA activity by ammonium remains intact.
  • In related alphaproteobacteria, mutations have been identified in NifA that abrogate ammonium repression (Paschen, A., Drepper, T., Masepohl, B. & Klipp, W. Rhodobacter capsulatus nifA mutants mediating nif gene expression in the presence of ammonium. FEMS microbiology letters 200, 207-213 (2001); Rey, F. E., Heiniger, E. K. & Harwood, C. S. Redirection of metabolism for biological hydrogen production. Applied and environmental microbiology 73, 1665-1671 (2007)). These mutations occur in the N-terminal GAF domain. Using a multiple sequence alignment, two equivalent residues were identified to mutate in A. caulinodans (L94Q and D95Q) (FIG. 22). These mutations were made and then tested individually and in combination (FIG. 3D). When the double mutant of NifA is co-expressed with RpoN, the presence of ammonium only results in a slight decrease in activity.
  • Oxygen irreversibly inhibits nitrogenase and represses nif clusters. The inducible nif clusters were tested for oxygen sensitivity, noting that A. caulinodans is an obligate aerobe and fixes nitrogen under micro-aerobic conditions. The tolerance of nitrogenase to oxygen was then assessed as a function of the concentration of oxygen in the headspace, held constant by injecting oxygen while monitoring its level (Methods and FIG. 26A). The native and inducible gene clusters responded nearly identically to oxygen (FIG. 3F). The optimum activity occurs between 0.5% to 1% with a wide tolerance (30% activity at 3% oxygen).
  • Introduction of Controllable Nif Activity in P. protegens Pf-5
  • The native K. oxytoca, P. stutzeri, and A. vinelandii nif clusters are all functional in P. protegens Pf-5 (FIG. 1A). However, when the native P. stutzeri and A. vinelandii clusters are transferred, nitrogenase is strongly repressed. In contrast, transferring the native K. oxytoca cluster produces uncontrolled (constitutively on) nitrogenase activity (FIG. 4E). For these three clusters in P. protegens Pf-5, it was sought to gain regulatory control by removing the nifA master regulators from the clusters and expressing them from a controller (FIG. 4A).
  • As with Rhizobia, it was found that first, part libraries for P. protegens Pf-5 had to be built before building controllers with sufficient dynamic range. A range of 20 constitutive promoters and seven T7 promoters that span a range of 778-fold and 24-fold expression, respectively, was characterized (FIGS. 11A-11C). A library of 192 RBSs was screened, representing an expression range of 4,079-fold (FIGS. 12A-12B). A set of seven terminators that share no sequence homology between each other and have a terminator strength >10 in R. sp. IRBG74 was selected and characterized together with the three well-used terminators (e.g., T7 terminator, rrnBT1, and L3S2P21). These seven terminators showed a terminator strength >50 (FIGS. 13A-13B).
  • The inducible systems designed for Rhizobium were transferred as-is to a Pseudomas-specific pRO1600 plasmid (see Materials and Methods). The 3OC6HSL-, aTc-, cuminic acid-, and DAPG-inducible systems were all found to be functional (FIG. 15A). In addition, a naringenin-inducible system based on the Pfde promoter was constructed and found to be functional. The strength of arabinose inducible system was increased by substituting the −10 box in PBAD promoter and arabinose import was improved by constitutive expression of the arabinose transporter AraE (FIG. 15B). Finally, the IPTG-inducible system was optimized for P. protegens Pf-5 by replacing the PA1lacO1 promoter with the Ptac promoter and making three amino acid substitutions to Lad (Meyer, A. J., Segall-Shapiro, T. H., Glassey, E., Zhang, J. & Voigt, C. A. J. N. c. b. Escherichia coli “Marionette” strains with 12 highly optimized small-molecule sensors. 1 (2018)). This effort resulted in seven new inducible systems that produce 41- to 554-fold induction in P. protegens Pf-5 (FIG. 15C).
  • To simplify the comparison between clusters, it was sought to build a single, universal controller that could induce all three. Each has a different NifA sequence, so the ability to cross induce the gene clusters was tested. To do this, the nifH promoters from each nif cluster were cloned and fused to gfp to build plasmid-based reporters (see Materials and Methods). The ability of the various NifA homologues to activate the nifH promoters was evaluated in E. coli and P. protegens Pf-5 (FIG. 23A-23B). The results suggest that it is more important to express a NifA variant from a similar species as the host, as opposed to expressing the NifA variant that is cognate to the transferred cluster. This may be due to the need for NifA to recruit host transcriptional machinery, whereas the NifA binding sites in the promoters are well conserved across species. Based on these data, the controller was constructed using the P. stutzeri NifA, placed under the control of the optimized IPTG-inducible system, described above. The RBSs of NifA were synthetically designed to span a wide range of expression of nif genes (FIG. 24A). The controller was inserted into the genome 25 bp downstream of the stop codon of glmS using the mini-Tn7 system. The ability for this controller to induce the nifH promoter from each cluster using a fluorescent reporter is shown in FIG. 4C and FIG. 24B.
  • The nitrogenase activity for each of the gene clusters in P. protegens Pf-5 was then assessed (FIG. 4D). The three P. protegens Pf-5 strains containing the transferred clusters were modified to insert the controller and delete the native nifLA genes from each cluster (FIG. 4B). All three are inducible, with nitrogenase activity showing dynamic ranges of 1,200-fold, 2,300-fold, and 130-fold for the K. oxytoca, P. stutzeri, and A. vinelandii nif clusters, respectively. When induced, these systems all produce similar or even higher nitrogenase activities than can be achieved by the transfer of the unmodified native clusters (FIG. 4D). For reference, the nitrogenase activities produced by K. oxytoca, P. stutzeri, and A. vinelandii are shown as dashed lines in FIG. 4D (top to bottom) (see Methods and Materials). All three inducible clusters produce similar levels of activity that approach those measured from wild-type P. stutzeri and A. vinelandii.
  • The native P. stutzeri and A. vinelandii clusters are strongly repressed by ammonium: the presence of 17.1 mM eliminates activity or reduces it 7-fold, respectively (FIG. 4E and FIGS. 8A-8B). The inducible clusters show little reduction in activity and the inducible A. vinelandii cluster exhibits almost no ammonia repression. While the native K. oxytoca cluster in P. protegens Pf-5 generates a constitutive response, there is still some repression, which is reduced by the inducible version.
  • The inducible nif clusters were tested for oxygen sensitivity. Note that wild-type A. vinelandii is able to fix nitrogen under ambient conditions due to genetic factors internal and external to the cluster. First, it was established that the controller in P. protegens Pf-5 could induce transcription from the three nifH promoters in the presence of oxygen (FIGS. 26A-26B). The tolerance of nitrogenase to oxygen was then assessed as a function of the concentration of oxygen in the headspace, as described for A. caulinodans (previous section). The native and inducible clusters exhibited the same oxygen response (FIG. 4F). The nif cluster from K. oxytoca was the most sensitive, generating the highest activity under anaerobic conditions, but this is quickly abolished in the presence of 02. In contrast, the nif clusters from P. stutzeri and A. vinelandii showed wider tolerance with optima at 1% and 0.5%, respectively. However, both clusters lose activity at lower oxygen concentrations than A. caulinodans.
  • To explore the impact of the electron transport chains, several mutants to the A. vinelandii cluster were made (FIG. 27). The A. vinelandii cluster contains two potential electron transport systems to nitrogenase and the redundant system may help maintain redox status for nitrogenase at various oxygen levels. The dependence of nitrogenase activity on the oxygen concentration in various mutant backgrounds was re-measured. No effect was seen by adding the rnf2 operon or deleting the fix operon, however deleting rnf1 eliminated activity. This suggests that the rnf1 operon is the sole source of electrons in P. protegens Pf-5 under these conditions and the Fix complex cannot compensate the Rnf complex unlike the case of A. vinelandii.
  • Control of Nitrogen Fixation with Agriculturally-Relevant Sensors
  • The careful design and characterization of the controller has the benefit of simplifying the process by which different synthetic sensors can be used to induce nitrogenase expression. By knowing the dynamic range required to go from inactive to active nitrogenase, one can quantitatively select sensors that have the produce a compatible response. This allows different environmental signals—or combinations of signals using genetic logic circuits—to be used to control expression. To demonstrate this, 11 synthetic sensors were selected that respond to a variety of chemical signals of relevance to the rhizosphere and demonstrate that these can be used to create inducible nitrogenase in for example, engineered strains of E. coli (carrying the refactored v2.1 nif), R. sp. IRBG74 (carrying the refactored v3.2 nif), P. protegens Pf-5 (carrying the inducible A. vinelandii nif), and A. caulinodans (inducible nifA/rpoN) (FIGS. 5A-5D).
  • The roles of the chemical signals in the rhizosphere are shown in FIG. 5A. Cuminic acid is present in plant seeds and functions as a fungicide. Natural root exudates may include sugars, amino acids, organic acids, phenolic compounds, phytohormones, and flavonoids. These represent potential signals to control nitrogenase production close to the root surface. Cereals have been shown to release arabinose, vanillic acid, and salicylic acid. In addition, salicylic acid regulates the plant innate immune response and the impact of its exogenous addition to cereals has been studied. Naringenin is a common precursor for many flavonoids and improves endophytic root colonization when applied to rice and wheat. Genistein, a product from naringenin catalyzed by the isoflavone synthase, is released from maize roots. A quorum sensing mimic released by rice can regulate the 3OC6HSL receptor protein LuxR, which has been visualized using E. coli biosensor strains.
  • Bacteria either native to the rhizome or added as biocontrol agents introduced as a spray inoculant or seed coating produce chemical signatures. Inoculation of cereals with root colonizing Pseudomonas strains that produce DAPG elicits protection against fungal pathogens. Many bacteria produce quorum molecules, such as N-acyl homoserine lactones, as a means of communication and plants can respond to these signals. The bacterium Sinorhizobium meliloti produces 3OC14HSL, which enhances Medicago nodulation and has been shown to induce systemic resistance in cereals. DHBA can be produced by root colonizing bacteria to increase iron solubility and play a role as a chemoattractant for Agrobacterium and Rhizobium.
  • Sensors for these chemicals were constructed based on the controllers for each species. For E. coli MG1655, a strain that contains 12 optimized sensors, carried in the genome, that respond to various small molecules (“Marionette”) had been previously constructed (Meyer, A. J., Segall-Shapiro, T. H., Glassey, E., Zhang, J. & Voigt, C. A. J. N. b. Escherichia coli “Marionette” strains with 12 highly optimized small-molecule sensors. 1 (2018).). The response functions of these sensors were characterized in standard units, making it simple to identify those that can be connected to nitrogenase expression without further tuning. Marionette contains sensors for vanillic acid, DHBA, cuminic acid, 3OC6HSL, and 3OC14HSL. For each sensor, the output promoter was transcriptionally fused to T7 RNAP and the response of the responsive promoter (PT7) was measured as a function of inducer concentration (FIG. 5B and FIG. 28B). Then, the v2.1 refactored nif cluster was introduced and nitrogenase activity was measured in the presence and absence of inducer (FIG. 5C and FIG. 28C). The inducible systems constructed for P. protegens Pf-5 that respond to arabinose and naringenin were used to drive NifA expression for the control of the A. vinelandii nif cluster (FIG. 4A). The induction of the nifH promoter by these sensors was first confirmed using a reporter (FIG. 5B). When this is replaced with the nif gene cluster, it results in an inducible response of nitrogenase activity (FIG. 5C). The best nitrogenase activity in R. sp. IRBG74 is low; however, herein it was demonstrated that it could be placed under inducible control. The DAPG-inducible system developed for R. sp. IRBG74 was connected to the control of T7 RNAP and this produces a strong response from PT7 (FIG. 5B). However, when used to drive the expression of the v3.2 refactored pathway, only a 9-fold induction is observed, consistent with the low nitrogenase activity observed in this strain (FIG. 5C). Finally, the salicylic acid sensor designed for Rhizobium was used to control NifA (L94Q/D95Q)/RpoN expression in A. caulinodans (FIG. 3A and FIG. 5B). This yielded a 1000-fold dynamic range of nitrogenase activity (FIG. 5C).
  • Plants could be engineered to release an orthogonal chemical signal that could then be sensed by a corresponding engineered bacterium. This would have the benefit of only inducing nitrogenase in the presence of the engineered crop. Further, if the molecule is metabolizable by the engineered bacterium, it could serve as a mechanism around which a synthetic symbiosis could be designed, where the plant provides the carbon and the bacterium fixed nitrogen in an engineered relationship. To this end, legumes and Arabidopsis have been engineered to produce opines, including nopaline and octopine. Sensors were constructed for these two opines for A. caulinodans based on the LysR-type transcriptional activators OccR (octopine) and NocR (nopaline) and their corresponding Pocc and Pnoc promoters (FIG. 5D and FIG. 21). These sensors were connected to the expression of NifA(L94Q/D95Q)/RpoN and the response from PnifH was measured using a fluorescent reporter. Both response functions had a large dynamic range (FIG. 5B) and produced highly-inducible nitrogenase activity (FIG. 5C). The nopaline sensor yielded a 412-fold dynamic range and the octopine sensor led to 40% higher nitrogenase activity than the wild-type.
  • Discussion
  • Towards designing a bacterium that can deliver fixed nitrogen to a cereal crop, this work provides a side-by-side comparison of diverse species, natural nif clusters, and engineering strategies that can be used to obtain inducible nitrogenase activity in a strain that can associate with cereals as an endophyte or epiphyte. To this end, ˜100 strains involving the transfer of 10 natural nif clusters ranging in size from 10 kb to 64 kb to 16 diverse species of Rhizobia, Azorhizobium, Pseudomas, and E. coli were constructed. Different approaches were taken to make these nif clusters inducible, from bioinformatics and protein engineering to complete genetic reconstruction from the ground-up (refactoring). In addition to the highest activity, it is important that nitrogen fixation be robust to the addition of nitrogenous fertilizer (ammonia) and microaerobic environments. For example, an endophyte such as a variant of Azorhizobium where nifA is knocked out of the genome and a nifA mutant and rpoN are complemented on a plasmid can be used to obtain high nitrogenase activities. For an epiphyte, P. protegens Pf-5, is a versatile strain based on the transfer of the A. vinelandii nif cluster and placement of nifA of P. stutzeri under inducible control. In both such cases, nitrogenase activities were obtained that are nearly identical to wild-type A. caulinodans and P. stutzeri, respectively. Neither showed significant repression by ammonia and optimal activity was obtained in 1% oxygen. Based on these strains, it was demonstrated that nitrogenase can be placed under inducible control in response to cereal root exudates (arabinose, salicylic acid), phytohormones (naringenin) and putitive signaling molecules that could be released by genetically modified plants (e.g., can express or exudate nopaline or octopine).
  • Because R. sp. IRBG74 can fix nitrogen in a legume nodule and also associates with rice, significant effort was directed to engineering this strain to fix nitrogen when cereal-associated. The first attempt was simply complementing nifV, as this is absent in R. sp. IRBG74 and produces a metabolite provided by the plant, but this attempt was unsuccessful. Then, it was found that all of the initial nif clusters transferred, some of which have high activity in P. protegens Pf-5 and E. coli, are non-functional in R. sp. IRBG74, which led to trying clusters from alphaproteobacteria, one of which produced a very low level of activity that was dependent on the nif genes native to R. sp. IRBG74. The previously-published refactored gene clusters based on Klebsiella nif were attempted in R. sp. IRBG74 but these showed no activity. It was only after the construction of a new refactored cluster (v3.2) that activity was obtained under free-living conditions that was not dependent on the native nif genes. This allowed an increase in the expression levels, and an optimum was discovered beyond which activity was lost. This is the first time that nif activity has been engineered in a Rhizobium under free-living conditions that could otherwise not perform this function.
  • The present disclosure encompasses different degrees of nif pathway re-engineering to promote heterologous transfer. The most ambitious is the complete refactoring of all the nif genes and regulation, where all regulatory genetic parts are replaced, genes are recoded, operons are reorganized, and transcription is performed by the orthogonal T7 RNAP. Initially, the evaluation of performance relied on the overall nitrogenase activity, rather than an understanding of the underlying parts. As such, the first refactored pathway performed poorly. In subsequent studies, better part libraries and DNA assembly and automation platforms enabled the synthesis of many variants. Further, as the cost of RNA-seq declined, it was used to evaluate the performance of internal parts, such as promoters and terminators. This revealed that the first designs were effectively large single operons with little differential control over the transcription levels of individual genes. With these techniques allowed the tailoring of the function of the refactored nif pathway and the discovery that many of the underlying genetic structure were not needed to achieve high activities.
  • Ribosome profiling, a new technique that enables the measurement of translational parts (e.g., ribosome binding sites), was applied and expression levels were inferred. Further, nitrogenase activity and the function of underlying parts were assessed as the clusters were moved between species. Interestingly, the native Klebsiella nif cluster could be transferred and it performed similarly but the refactored cluster yielded widely varying expression levels in the different hosts, sometimes leading to a total loss in activity. This could be recovered by maintaining the native operon structure in the refactored cluster, implying that it was not due to the synthetic sensors, T7 RNAP, or promoters/terminators. This is one of the hypothesized functions of operons. Achieving this required maintenance of the codon usage and translational coupling of the native cluster. However, this does not mean that it will not be possible to also encode this function synthetically. There have been computational advances that enable the calculation of RBSs internal to upstream genes when encoded on an operon. If coupled with codon optimization algorithms, this would allow the design of de novo genetic parts that achieve a desired degree of translational coupling and expression level.
  • The present disclosure demonstrates the deregulation of nif clusters in A. caulinodans and P. protegens Pf-5, enabling them to be placed under the control of cereal root exudates. This derepresses the pathway in the presence of exogenous nitrogenous fertilizer—critical for the use of the bacterium as part of an integrated agricultural solution. Further, these organisms retain the ability to fix nitrogen in microaerobic environments, thus avoiding the need for a root nodule that enforces strict anaerobiosis. The complete deregulation of the nif pathway makes the bacterium non-competitive in the soil and lost quickly, thus limiting its impact to particular phases of the growth cycle. Thus, it is demonstrated that nitrogenase can be placed under the control of chemical root exudates.
  • EMBODIMENTS
  • 1. A rhizobium that can fix nitrogen under aerobic free-living conditions, comprising a symbiotic rhizobium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans.
  • 2. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from a free-living diazotroph.
  • 3. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from a symbiotic diazotroph.
  • 4. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from a photosynthetic Alphaproteobacteria.
  • 5. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from a Gammaproteobacteria.
  • 6. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from a cyanobacteria.
  • 7. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from a firmicutes.
  • 8. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from Rhodobacter sphaeroides.
  • 9. The rhizobium of paragraph 1, wherein the exogenous nif cluster is from Rhodopseudomonas palustris.
  • 10. The rhizobium of paragraph 1, wherein the exogenous nif cluster is an inducible refactored nif cluster.
  • 11. The rhizobium of paragraph 10, wherein the inducible refactored nif cluster is an inducible refactored Klebsiella nif cluster.
  • 12 The rhizobium of any one of the preceding paragraphs, wherein the rhizobium is IRBG74.
  • 13. The rhizobium of any one of the preceding paragraphs, wherein the exogenous nif cluster comprises 6 nif genes.
  • 14. The rhizobium of paragraph 13, wherein the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM.
  • 15. The rhizobium of paragraphs 13 or 14, wherein each nif gene of the exogenous nif cluster is preceded by a T7 promoter.
  • 16. The rhizobium of paragraph 15, wherein the T7 promoter is a wild-type promoter.
  • 17. The rhizobium of any one of the preceding paragraphs, further comprising an endogenous nif cluster.
  • 18. The rhizobium of any one of the preceding paragraphs, wherein the nif cluster has a nifV gene.
  • 19. The rhizobium of paragraph 18, wherein the nifV gene is endogenous.
  • 20. The rhizobium of any one of the preceding paragraphs, wherein the exogenous nif cluster further comprises a terminator.
  • 21. The rhizobium of any one of paragraphs 15-20, wherein the T7 promoter has a terminator and wherein the terminator is downstream from the T7 promoter.
  • 22. The rhizobium of paragraph 12, wherein the exogenous nif cluster is a refactored rhizobium IRBG74 nif cluster.
  • 23. A plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions, comprising a bacterium having an exogenous nif cluster having at least one inducible promoter, wherein the exogenous nif cluster confers nitrogen fixation capability on the bacterium, under aerobic free-living conditions, and wherein the bacterium is not Azorhizobium caulinodans.
  • 24. The plant growth promoting bacterium of paragraph 23, wherein the bacterium is a symbiotic bacterium.
  • 25. The plant growth promoting bacterium of paragraph 23, wherein the bacterium is an endophyte.
  • 26. The plant growth promoting bacterium of paragraph 25, wherein the endophyte is rhizobium IRBG74.
  • 27. The plant growth promoting bacterium of paragraph 23, wherein the bacterium is an epiphyte.
  • 28. The plant growth promoting bacterium of paragraph 27, wherein the epiphyte is pseudomonas protogens PF-5.
  • 29. The plant growth promoting bacterium of any one of paragraphs 23-28, wherein the plant growth promoting bacterium is associated with a genetically modified cereal plant.
  • 30. The plant growth promoting bacterium of paragraph 29, wherein the genetically modified cereal plant includes an exogenous gene encoding a chemical signal.
  • 31. The plant growth promoting bacterium of paragraph 29, wherein the nitrogen fixation is under the control of the chemical signal.
  • 32. The plant growth promoting bacterium of paragraphs 30 or 31, wherein the chemical signal is opine, phlorogluconol or rhizopene.
  • 33. The rhizobium of any one of paragraphs 23-32, wherein the exogenous nif cluster comprises 6 nif genes.
  • 34. The rhizobium of paragraph 33, wherein the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM.
  • 35. The rhizobium of any one of paragraphs 23-34, wherein the inducible promoter is a T7 promoter.
  • 36. The rhizobium of any one of paragraphs 23-34, wherein the inducible promoter is PA1lacO1 promoter.
  • 37. The rhizobium of any one of paragraphs 23-36, wherein the inducible promoter is activated by an agent selected from a group that includes IPTG, sodium salicylate, octapine, nopaline, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid.
  • 38. The rhizobium of any one of paragraphs 23-37, wherein the exogenous nif cluster further comprises a terminator.
  • 39. The rhizobium of any one of paragraphs 23-37, wherein the inducible promoter has a terminator and wherein the terminator is downstream from the inducible promoter.
  • 40. An Azorhizobium caulinodans capable of inducible ammonium-independent nitrogen fixation in a cereal crop, comprising:
  • (i) a modified nif cluster, wherein an endogenous nifA gene is deleted or altered; and
  • (ii) at least one operon comprising nifA and RNA polymerase sigma factor (RpoN), wherein the operon comprises a regulatory element including an inducible promoter.
  • 41. The Azorhizobium caulinodans of claim 40, wherein the inducible promoter is PA1lacO1 promotor.
  • 42. The Azorhizobium caulinodans of paragraphs 40 or 41, wherein the inducible promoter is activated by an agent selected from IPTG, sodium salicylate, octapine, nopaline, the quorum signal 3OC6HSL, aTc, cuminic acid, DAPG, and salicylic acid.
  • 43. The Azorhizobium caulinodans of any one of paragraphs 40-42, wherein the endogenous nifA gene is altered with at least one of the following substitutions:
  • (i) L94Q;
  • (ii) D95Q; and
  • (iii) both L94Q and D95Q.
  • 44. A method of engineering a rhizobium that can fix nitrogen under aerobic free-living conditions, comprising transferring an exogenous nif cluster to a symbiotic rhizobium, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium, under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans.
  • 45. The method of paragraph 44, wherein the exogenous nif cluster comprises 6 nif genes.
  • 46. The method of paragraph 45, wherein the 6 nif genes are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nijF and nifUSVWZM.
  • 47. The method of paragraph 45 or 46, wherein each of the nif genes is preceded by a wild-type T7 promoter.
  • 48. The method of any one of paragraphs 44-47, wherein the exogenous nif cluster is transferred to the rhizobium in a plasmid.
  • 49. The method of any one of paragraphs 44-48, wherein the exogenous nif cluster further comprises a terminator.
  • 50. The method of any one of paragraphs 47-49, wherein the wild-type T7 promoter has a terminator, and wherein the terminator is downstream from the wild-type T7 promoter.
  • 51. The method of any one of paragraphs 44-50, wherein the endogenous NifL gene is deleted.
  • 52. A method of producing nitrogen for consumption by a cereal plant, comprising providing a plant growth promoting bacterium that can fix nitrogen under aerobic free-living conditions in proximity of the cereal plant, wherein the plant growth promoting bacterium is a symbiotic bacterium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic bacterium, enabling nitrogen fixation under aerobic free-living conditions.
  • 53. The method of paragraph 52, wherein the plant growth promoting bacterium is a rhizobium.
  • 54. The method of paragraph 52, wherein the plant growth bacterium is the bacterium of any one of paragraphs 1-22 and 23-39.
  • 55. The method of any one of paragraphs 52-54, wherein the cereal plant is a genetically modified cereal plant.
  • 56. The method of paragraph 55, wherein the genetically modified cereal plant includes an exogenous gene encoding a chemical signal.
  • 57. The method of paragraph 56, wherein the nitrogen fixation is under the control of the chemical signal.
  • 58. The method of paragraph 56 or 57, wherein the chemical signal is opine, phlorogluconol or rhizopene.
  • 59. The method of any one of paragraphs 52-55, wherein the nitrogen fixation is under the control of a chemical signal.
  • 60. The method of paragraph 57 or 59, wherein the chemical signal is a root exudate, biocontrol agent or phytohormone.
  • 61. The method of paragraph 60, wherein the root exudate is selected from the group consisting of sugars, hormones, flavonoids, and antimicrobials.
  • 62. The method of paragraph 57 or 59, wherein the chemical signal is vanillate.
  • 63. The method of paragraph 57 or 59, wherein the chemical signal is IPTG, aTc, cuminic acid, DAPG, and salicylic acid, 3,4-dihydroxybenzoic acid, 3OC6HSL or 3OC14HSL.
  • All of the features disclosed in this specification may be combined in any combination. Each feature disclosed in this specification may be replaced by an alternative feature serving the same, equivalent, or similar purpose. Thus, unless expressly stated otherwise, each feature disclosed is only an example of a generic series of equivalent or similar features. From the above description, one skilled in the art can easily ascertain the essential characteristics of the present invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Thus, other embodiments are also within the claims.
  • EQUIVALENTS
  • While several inventive embodiments have been described and illustrated herein, those of ordinary skill in the art will readily envision a variety of other means and/or structures for performing the function and/or obtaining the results and/or one or more of the advantages described herein, and each of such variations and/or modifications is deemed to be within the scope of the inventive embodiments described herein. More generally, those skilled in the art will readily appreciate that all parameters, dimensions, materials, and configurations described herein are meant to be exemplary and that the actual parameters, dimensions, materials, and/or configurations will depend upon the specific application or applications for which the inventive teachings is/are used. Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific inventive embodiments described herein. It is, therefore, to be understood that the foregoing embodiments are presented by way of example only and that, within the scope of the appended claims and equivalents thereto, inventive embodiments may be practiced otherwise than as specifically described and claimed. Inventive embodiments of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the inventive scope of the present disclosure.
  • All definitions, as defined and used herein, should be understood to control over dictionary definitions, definitions in documents incorporated by reference, and/or ordinary meanings of the defined terms.
  • All references, patents and patent applications disclosed herein are incorporated by reference with respect to the subject matter for which each is cited, which in some cases may encompass the entirety of the document.
  • The indefinite articles “a” and “an,” as used herein in the specification and in the claims, unless clearly indicated to the contrary, should be understood to mean “at least one.”
  • The phrase “and/or,” as used herein in the specification and in the claims, should be understood to mean “either or both” of the elements so conjoined, i.e., elements that are conjunctively present in some cases and disjunctively present in other cases. Multiple elements listed with “and/or” should be construed in the same fashion, i.e., “one or more” of the elements so conjoined. Other elements may optionally be present other than the elements specifically identified by the “and/or” clause, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, a reference to “A and/or B”, when used in conjunction with open-ended language such as “comprising” can refer, in one embodiment, to A only (optionally including elements other than B); in another embodiment, to B only (optionally including elements other than A); in yet another embodiment, to both A and B (optionally including other elements); etc.
  • As used herein in the specification and in the claims, “or” should be understood to have the same meaning as “and/or” as defined above. For example, when separating items in a list, “or” or “and/or” shall be interpreted as being inclusive, i.e., the inclusion of at least one, but also including more than one, of a number or list of elements, and, optionally, additional unlisted items. Only terms clearly indicated to the contrary, such as “only one of” or “exactly one of,” or, when used in the claims, “consisting of,” will refer to the inclusion of exactly one element of a number or list of elements. In general, the term “or” as used herein shall only be interpreted as indicating exclusive alternatives (i.e. “one or the other but not both”) when preceded by terms of exclusivity, such as “either,” “one of,” “only one of,” or “exactly one of.” “Consisting essentially of,” when used in the claims, shall have its ordinary meaning as used in the field of patent law.
  • As used herein in the specification and in the claims, the phrase “at least one,” in reference to a list of one or more elements, should be understood to mean at least one element selected from any one or more of the elements in the list of elements, but not necessarily including at least one of each and every element specifically listed within the list of elements and not excluding any combinations of elements in the list of elements. This definition also allows that elements may optionally be present other than the elements specifically identified within the list of elements to which the phrase “at least one” refers, whether related or unrelated to those elements specifically identified. Thus, as a non-limiting example, “at least one of A and B” (or, equivalently, “at least one of A or B,” or, equivalently “at least one of A and/or B”) can refer, in one embodiment, to at least one, optionally including more than one, A, with no B present (and optionally including elements other than B); in another embodiment, to at least one, optionally including more than one, B, with no A present (and optionally including elements other than A); in yet another embodiment, to at least one, optionally including more than one, A, and at least one, optionally including more than one, B (and optionally including other elements); etc.
  • It should also be understood that, unless clearly indicated to the contrary, in any methods claimed herein that include more than one step or act, the order of the steps or acts of the method is not necessarily limited to the order in which the steps or acts of the method are recited.
  • ADDITIONAL TABLES
  • TABLE 2
    Primers used for nif cluster cloning.
    Nif cluster Forward primer (SEQ ID NOs: 8-64) Reverse Primer (SEQ ID NOs: 65-121) Genomic location GenBank accession No.
    Klebsiella oxytoca CGTAGGGCGCATTAATGCAGCTGGCACGA GTGACGCTCGCGTATCAGGTTTG 3,897,443-3,909,294 CP020657.1
    M5aI CAGGTGAATTC
    TAGACTGCTGGATACGCTGCTTAAGGTC
    TACGCTGTTTGAGCTGGCAAACCT ATCAGGCGCATATTTGAATGTATTTACTGCA 3,909,255-3,920,878 CP020657.1
    GCGGCCGCTTCTAG
    AGTGACCAAAAGCTTCCGCAACCC
    Pseudomonas GCCCGGAGAGCAAGCCCGTAGGGCGCATT ACTACGCATCACTAGCAGGGCACGCACCGCG 1,410,207-1,414,229 NC_009434
    stutzeri AATGCAGCTGG GACGAAATCGAAGT
    A1501 CACGACAGGTGTTAGGTTGGCCTGAATTC GAG
    GGTGT
    GGCTCACTTCGATTTCGTCCGCGGTGCGT TTGTCGACTCCCGGGGTCTGAC 1,419,757-1,424,637 NC_009434
    GCCCTGCTAGT
    GATGCGTA
    CGCCTGATTTCGCCTGATGAACAGG GGCTTTAACGGCATGTTCCGGGT 1,424,588-1,429,971 NC_009434
    TGACGCTGTTGACCACCGCC GTAGTCGTCGTTGTGGCCGAACTC 1,429,922-1,434,417 NC_009434
    ATGGAAGTGGTCGGCACCGGCTA AAAGCATCATCTCGGGTCGGGC 1,434,370-1,438,503 NC_009434
    CGCAACGGTTGGGGTAGGTTGG CGTCGAGCGACAACGCCTCGA 1,438,454-1,442,613 NC_009434
    GACGTCCATCGCTTCGGCTTCGA CTATGAGCTGGACTGAACCGCGATG 1,442,565-1,448,340 NC_009434
    CTGCGAAATCGACGCTGTCGAGCATCATC GAAAATACCGCATCAGGCGCATATTTGAATG 1,448,291-1,459,252 NC_009434
    GCGGTTCA TATTTACTGCAGCG
    GCCGCTGGCGAATCTCCTTCCTCGGTTCG
    Azotobacter ATCCATTCTCAGGCTGTCTCGTCTCGTCT GCCTTCGAACATGTTGTCCCAG 134,732-144,115 NC_012560
    vinelandii CTACGTACGCG
    DJ GATCCCAGGCAACGTCTTCGTACTGCGGT
    ACCGGGTTGCG
    GGGGCAGCCAGTGGAAAAAGG
    CTACGGCACGCCCTGGTTCGA TCGAGTTCGAGCAGTTTCTCCAGC 144.076-148,534 NC_012560
    GCTCGGAAAGTGCTGGAGAAAC AGCGAACAATACCTGTGGCC 148,500-152,895 NC_012560
    AAATCAGACATTCATGGCCACAGG TGGCGCTTGCCCTTGTTCCAA 152,861-157,152 NC_012560
    TCTACCATGGCGTGACTCTCGG GCGCGGTGGTAGAGTTCCGGGAGTTTAAACG 157,101-162,181 NC_012560
    GACAGAAGACGAGT
    CGTGCGGGC
    ACTCGTCTTCTGTCCGTTTAAACTCCCGG TTGCTCAGGGTCGGGTTGGC 5,161,399-5,168,611 NC_012560
    AACTCTACCAC
    CGC
    CTTGGATAGACGAGGCACAGC CATCATCCTCGGCCCCTTCAGGTTGCAGGAG 5,168,561-5,175,635 NC_012560
    CCGGCTTG
    GCCGGCTCCTGCAACCTGAAGGGGCCGAG GCAAGCCACTCCACTGACGAA   995,860-1,000,698 NC_012560
    GATGATG
    Paenibacillus GAATTGAGGATAAATGTCAGGGATTTCAT ACAGGTTCCGCAGTTCACAAGC 23,686-26,413 ALJV01.1
    polymyxa G contig00089
    WLY78 CCAAGCATTTTGAGATCGCGGATG GCTGATTGTGATCGACAATATTCGG 26,364-27,763 ALJV01.1
    contig00089
    CGGAGGTGCCGGTATGAGCGA GAAAGCCTACACGAAGCAAAGG 27,714-29,113 ALJV01.1
    contig00089
    GAAGTTTGCAGCGAAAGAGGCG CTTGAGAATCTGCCGGGCGCCT 29.064-30,463 ALJV01.1
    contig00089
    GGGATGATGCAGAATACATCCCG ATCCACAAATCAACACCCTGCG 30,414-31,813 ALJV01.1
    contig00089
    GGTGACCTGGATGATGCAGAGGAGAG AAAGCGTTCCAGTCACGGTCAC 31,764-34,402 ALJV01.1
    contig00089
    Cyanothece GGCCCGCGTTAGGTTGGCCTGAATTCGGT GAGACTTTCCCCACCTTATTATGCATGCAGA 1,931,343-1,929,132 NC_010546.1
    ATCC51142 GTGTATCCCCC TGTTATGGGAATTA ACG
    GGAGATACGTAAAAAAAAAAACCCCGCCC
    TGTCAGGGGCG
    GGGTTTTTTTTTGATAAGTCAAGCTATCA
    GAACCGATC
    TAATTCCCATAACATCTGCATGCATAATA ACCTTGACAATCATTACACAGCG 555,364-562,941 NC_010546.1
    AGGTGGGGAAA
    GTCTCAGC
    AATGTATTTCTGATCGATGCGACG CAAATATAATGATCGACATTTTCACCAC 562,897-570,603 NC_010546.1
    GTTATCTGGCTGATGTTTGTGGTG CGTTAACTTTGTCGCAAAACTTCG 570,558-577,494 NC_010546.1
    GTCAAACTGTCTTGTTTAAAGCCG ACCAAGGCGAATCTCCTTCCTCGGTTCGCGA 577,449-584,687 NC_010546.1
    TCACGCTACTCCGC
    CAATAAAAAAGCCCCCGGAATGATCTTCCGG
    GGGCCAGATTCAGG
    TAACTGCTCAAG
    Azospirillum TTAAGGTCATGCAGCAGGAGAACTAAAGG TGCGTCTTCTTCGGGCATCGTCA 1.043,795-1,035,568 CP012914
    brasilense CCCGCGTTAGG
    Sp7 TTGGTAATAAAAAAGCCCCCGGAATGATC
    TTCCGGGGGCC
    CTGCGCAAATACAACATCGAGATC
    GACGACTGAATAAGGATCGCGGAATG AGAAAATTGATTGCGGACGAGCG 1,035,614-1,027,483 CP012914
    TATGTCACAGGCCCGACAAAGCG TTCAATAAGTTAAGCAGATCGGCCTCG 1,027,533-1,019,166 CP012914
    GATTGTCGGGTATCGCACACGAG CGGTGTTACGAATAAATATTTCTACGAATAG 1,019,211-1,010,628 CP012914
    AC
    CGAAGGAGTTCGCCCCAGTCTATTC GCTCCAAAAGGAGCCTTTAATTGTATCGGTT 1,010,677-1,003,838 CP012914
    TATCAGCTTGCTTT
    GTTCCGCGGGTCTCGATACAACG
    Rhodopseudomonas AATACGATCGCATGTCCTAGGTAATACGA GGTCTTGCGGATCATCACTTTC 5,215,514-5,207,699 NC_005296.1
    palustris CTCACTATAGG
    CGA009 GAGAGGTAATCAGTGGTGGATTTGATGT
    CCAAGCAAAGGACCACCCTC GACGGTCAGGTGGTCCGAAC 5,207,743-5,201,639 NC_005296.1
    AGCTTCGATATCATCCGCTGAT GGTGAGAATGATCATGATCGGCC 5,201,687-5,196,113 NC_005296.1
    TTGTTCATGTCGGACCTAACCGA CTCCAAAAGGAGCCTTTAATTGTATCGGTTT 5,196,162-5,187,847 NC_005296.1
    ATCAGCTTGCTTTG
    ACGACAAGTGGAGAAGGGATAG
    Rhodobacter CAATACGATCGCATGTCCTAGGTAATACG TCCCATGGTCATGTCCTTTGCG 2,285,634-2,279,216 NC_007493
    sphaeroides ACTCACTATAG
    2.4.1. GGAGATGCATTTCACGCTTCGCGATTC
    CCGCCTTCACCAGAGACACC GTGCGCTTTTCCACGAGGAGC 2,279,260-2,271,404 NC_007493
    ATCGAGAAGTTCTACGATGCCGT AATTGAAAAAAAAAACCCCGCCCTGTCAGGG 2,271,450-2,264,419 NC_007493
    GCGGGGTTTTTTTT
    TGCAGCGCCCATTCCGTCTTC
    GCAAAAAAAAACCCCGCCCCTGACAGGGC GCTCCAAAAGGAGCCTTTAATTGTATCGGTT 245,956-252,936 NC_007494
    GGGGTTTTTTT TATCAGCTTGCTTT
    TTTCAATTGGACCTGGATGGGCAGCAAG GGAGAAAGCCTGCGCGGCTAG
    Azorhizobium CTCGCATCCATTCTCAGGCTGTCTCGTCT GCCCCCGGAAGGTGATCTTCCGGGGGCTTTC 5,290,244-5,293,483 NC_009937
    caulinodans CGTCTCTCTAG TCATGCGTTGA
    ORS571 AGTCGGAGCTCTTGGGGCCTCTAAACGGG CAGCCTTGAGATAGATCAAGTGC
    TCTTGAGGGGT
    TTTTTGTTGTCTTCGACGCGAAGCTC
    ATAGGCAATACGATCGCATGTCCGTTTAA CTGATCCAGGCCTTCATCGG 1,183,854-1,175,614 NC_009937
    ACTGATAAGGA
    CGGCACTGGCTGG
    CGATGCCGTCCAGCACCTC GACATGTCTGGTCTCCTTGGAAC 1,175,653-1,170,712 NC_009937
    CTGCCACGGTTCCCAAGGTTC TTCTGGAATTTGGTACCGAGTCAGTAACGTG 1,179,751-1,162,529 NC_009937
    CCACAGCCTCG
    TAAAAAAGCGGCTAACCACGCCGCTTTTT ATCAGGCGCATATTTGAATGTATTTACTGCA 3,922,323-3,919,341 NC_009937
    TTACGTCTGCA GCGGCCGCTAC
    GTGTTGTCGAAGCTTGATGCGC GTACTTGTGGGGTCAGTTCCGGCTGGGGGTT
    CAGCAGCCACC
    TGCAGTTAATTAAGGCGCTCCTTTCCTGATT
    CG
    CGCTGCTTAAGGTCATGCAGCAGGAGAAC GCTGCTGTGTGGAGAGATCG 3,930,607-3,934,260 NC_009937
    TAAAGGCCCGC
    TCTGCGAAAGGAATAGCGTC
    CTATCGCCGCCACCTGACC GTCGGTGAGATTGATCATGGCC 3,934,220-3,937,923 NC_009937
    CGTCAGAACGGCTCTGACGCATCAGGGAG TGCATGTCCGTTCCTCGCTG 3,937,871-3,941,205 NC_009937
    A
    AGTAATATTGCGGATCGGCCAGCAGCGAG ACATGTCTTGAATTCCTTCGAACC 3,941,164-3,959,444 NC_009937
    GAA
    GGTGGTCATTGGCAACGGTTCGAAG TGCATTGCGTTCGCTCCC 3,959,405-3,962,598 NC_009937
    TCCCCAAGAGCCCAACCGTTCCGGGAGCG TGTCAGGGCAGGCAGGGCC 3,962,559-3,966,562 NC_009937
    AA
    Gluconacetobacter TTAAGGTCATGCAGCAGGAGAACTAAAGG TCACCAGCCGTATCCGGAATATGTCAGGATC 1,759,465-1,754,718 CP001189
    diazotrophicus CCCGCGTTAGG ATGACATCCC
    PA1
     5 TTGGTAATAAAAAAGCCCCCGGAATGATC
    TTCCGGGGGCC
    GATCGAGGAAATCGACGTG
    ATATTCCGGATACGGCTGGTGAGGTGGA ACGATTTCCATGCCCAGGTC 1,754,739-1,746,565 CP001189
    CGCCACGTCGTCAATGCCTATAAC CCTCCAGCACCTCTTCGATG 1,746,608-1,738,322 CP001189
    TGACCACCGTGCAGAAGATCC GCTCCAAAAGGAGCCTTTAATTGTATCGGTT 1,738,366-1,730,601 CP001189
    TATCAGCTTGC
    TTTGGGCAATACCTGAGACGTTTCA
  • TABLE 3
    Strains used in this study
    Name Strain Source Description
    MR1 E. coli DH10-beta NEB Cat# C3019
    MR2 E. coli K-12 MG1655 Voigt lab
    MR3 Klebsiella oxytoca M5al Voigt lab
    MR4 Pseudomonas stutzeri A1501 Poole lab
    MR5 Azotobacter vinelandii DJ Peters lab
    MR6 Pseudomonas protegens Pf-5 ATCC BAA-477
    MR7 P. protegens Pf-5 controller (Ptac-T7RNAP) This study generated by pMR86
    MR8 P. protegens Pf-5 controller v1 (Ptac-nifA) This study generated by pMR97
    MR9 P. protegens Pf-5 controller v2 (Ptac-nifA v2) This study generated by pMR98
    MR10 P. protegens Pf-5 controller v3 (Ptac-nifA v3) This study generated by pMR99
    MR11 P. protegens Pf-5 controller v4 (PBAD.10-nifA) This study generated by pMR100
    MR12 P. protegens Pf-5 controller v5 (PFde-nifA) This study generated by pMR101
    MR13 Rhizobium sp. IRBG74 Ané lab
    MR14 R. sp. IRBG74 ΔhsdR This study generated by pMR44
    MR15 R. sp. IRBG74 ΔrecA This study generated by pMR47
    MR16 R. sp. IRBG74 Δnif This study generated by pMR45-46. Two nif clusters
    (227,127-
    219,579 and 234,635-234,802) were removed.
    MR17 R. sp. IRBG74 ΔhsdR, recA This study
    MR18 R. sp. IRBG74 ΔhsdR, Δnif This
    R. sp. IRBG74 ΔhsdR, recA Δnif study
    MR19 R. sp. IRBG74 ΔhsdR Δnif ΔrecA::PA1lacO1-T7RNAP This generated by pMR82
    v1 study
    MR20 This study
    MR21 R. sp. IRBG74 ΔhsdR Δnif ΔrecA::PA1lacO1-T7RNAP This study generated by pMR83
    v2
    MR22 R. sp. IRBG74 ΔhsdR Δnif ΔrecA::PA1lacO1-T7RNAP This study generated by pMR84
    v3
    MR23 R. sp. IRBG74 ΔhsdR Δnif ΔrecA::PPhl-T7RNAP This study generated by pMR85
    MR24 Azorhizobium coulinodans ORS571 Poole lab
    MR25 Azorhizobium coulinodans ORS571 ΔnifA This study generated by pMR48
    MR26 R. spp NGR234 Poole lab
    MR27 R. leguminosarum bv. Trifolii WSM1325 Poole lab
    MR28 Sinorhizobium medicae WSM419 Poole lab
    MR29 R. leguminosarum 8002 Poole lab
    MR30 Sinorhizobium meliloti WSM1022 Poole lab
    MR31 R. leguminosarum A34 Poole lab
    MR32 Sinorhizobium fredii HH103 Poole lab
    MR33 Sinorhizobium meliloti 1021 Poole lab
    MR34 R. tropici CIAT899 Poole lab
    MR35 R. leguminosarum viciae 3841 Poole lab
    MR36 R. etli CFN42 Poole lab
    MR37 Agrobacterium tumefaciens C58 Poole lab
  • TABLE 4
    Plasmids used in this study
    Origin of
    Name replication Marker Description
    pMR1 pBBR1 Kanamycin Plasmid for nif cluster cloning
    pMR2 pRO1600, Gentamicin Plasmid for nif cluster cloning
    p15A
    pMR3 pBBR1 Kanamycin Native nif cluster of K. oxytoca M5al
    pMR4 pRO1600, Gentamicin Native nif cluster of K. oxytoca M5al
    p15A
    pMR5 pBBR1 Kanamycin Native nif cluster of P. stutzeri A1501
    pMR6 pRO1600, Gentamicin Native nif cluster of P. stutzeri A1501
    p15A
    pMR7 pBBR1 Kanamycin Native nif cluster of A. vinelandii DJ
    pMR8 pRO1600, Gentamicin Native nif cluster of A. vinelandii DJ
    p15A
    pMR9 pBBR1 Gentamicin Native nif cluster of Cyanothece ATCC51142
    pMR10 pRO1600, Gentamicin Native nif cluster of Cyanothece ATCC51142
    p15A
    pMR11 pBBR1 Kanamycin Native nif cluster of P. polymyxa WLY78
    pMR12 pRO1600, Gentamicin Native nif cluster of P. polymyxa WLY78
    ColE1
    pMR13 pBBR1 Kanamycin Native nif cluster of A. brasilense Sp7
    pMR14 pRO1600, Gentamicin Native nif cluster of A. brasilense Sp7
    ColE1
    pMR15 pBBR1 Kanamycin Native nif cluster of R. sphaeroides 2.4.1
    pMR16 pRO1600, Gentamicin Native nif cluster of R. sphaeroides 2.4.1
    ColE1
    pMR17 pBBR1 Kanamycin Native nif cluster of R. palustris CGA009
    pMR18 pRO1600, Gentamicin Native nif cluster of R. palustris CGA009
    ColE1
    pMR19 pBBR1 Kanamycin Native nif cluster of A. caulinodans ORS571 (Part1 of 2)
    pMR20 RK2 Tetracycline Native nif cluster of A. caulinodans ORS571 (Part2 of 2)
    pMR21 pBBR1 Kanamycin Native nif cluster of G. diazotrophicus PA1 5
    pMR22 pRO1600, Gentamicin Native nif cluster of G. diazotrophicus PA1 5
    ColE1
    pMR23 pRO1600, Gentamicin nifLA (3,915,521-3,918,529) deletion in the nif cluster of K. oxytoca M5al
    p15A
    pMR24 pRO1600, Gentamicin nifLA (1,420,874-1,423,084) deletion in the nif cluster of P. stutzeri A1501
    p15A
    pMR25 pRO1600, Gentamicin nifLA (5,168,709-5,171,731) deletion in the nif cluster of A. vinelandii DJ
    p15A
    pMR26 pRO1600, Gentamicin Native nif cluster of A. vinelandii DJ with the rnf2 operon
    p15A
    pMR27 pRO1600, Gentamicin rnf1 (5,168,156-5,162,716) operon deletion in the nif cluster of A. vinelandii DJ
    p15A
    pMR28 pRO1600, Gentamicin fix operon (995,860-1,000,698) deletion in the nif cluster of A. vinelandii DJ
    p15A
    pMR29 pBBR1 Kanamycin Refactored nif cluster v2.1
    pMR30 pRO1600, Gentamicin Refactored nif cluster v2.1
    p15A
    pMR31 RK2 Tetracycline Refactored nif cluster v2.1
    pMR32 ColE1 Gentamicin PWT-nifHDKTY
    pMR33 ColE1 Gentamicin P2-nifENX
    pMR34 ColE1 Gentamicin P2-nifJ
    pMR35 ColE1 Gentamicin P2-nifBQ
    pMR36 ColE1 Gentamicin P2-nifF
    pMR37 ColE1 Gentamicin P2-nifUSVWZM
    pMR38 pBBR1 Kanamycin Refactored nif cluster v3.2
    pMR39 pRO1600, Gentamicin Refactored nif cluster v3.2
    p15A
    pMR40 pBBR1 Kanamycin LacI, PA1lacO1-gfpmut3b
    pMR41 RSF1010 Gentamicin LacI, PA1lacO1-gfpmut3b
    pMR42 RK2 Tetracycline LacI, Ptac-gfpmut3b
    pMR43 pRO1600, Gentamicin LacI, PA1lacO1-gfpmut3b
    ColE1
    pMR44 p15A Gentamicin Suicide plasmid for hsdR deletion in R. sp. IRBG74
    pMR45 p15A Gentamicin Suicide plasmid for the nif cluster I (219,579-227,127) deletion in R. sp. IRBG74
    pMR46 p15A Gentamicin Suicide plasmid for the nif cluster II (234,635-234,802) deletion in R. sp. IRBG74
    pMR47 p15A Gentamicin Suicide plasmid for recA deletion in R. sp. IRBG74
    pMR48 p15A Gentamicin Suicide plasmid for nifA deletion in A. coulinodans ORS571
    pMR49 pBBR1 Gentamicin LacI, PA1lacO1-nifV (A. coulinodans ORS571)
    pMR50 pBBR1 Gentamicin PnifH(R. sp. IRBG74)-sfgfp
    pMR51 pBBR1 Gentamicin NifA(R. sp. IRBG74), PnifH(R. sp. IRBG74)-sfgfp
    pMR52 pBBR1 Gentamicin NifA(K. oxytoca), PnifH(K. oxytoca)-sfgfp
    pMR53 pBBR1 Gentamicin NifA(R. sp. IRBG74), PnifH(K. oxytoca)-sfgfp
    pMR54 pBBR1 Gentamicin NifA(P. stutzeri), PnifH(P. stutzeri)-sfgfp
    pMR55 pBBR1 Gentamicin NifA(R. sp. IRBG74), PnifH(P. stutzeri)-sfgfp
    pMR56 pBBR1 Gentamicin NifA(A. coulinodans), PnifH(A. coulinodans)-sfgfp
    pMR57 pBBR1 Gentamicin NifA(R. sp. IRBG74), PnifH(A. coulinodans)-sfgfp
    pMR58 pBBR1 Kanamycin Plasmid for consituitive promoter characterization. Pconstitutive-gfpmut3b
    pMR59 pRO1600, Gentamicin Plasmid for consituitive promoter characterization. Pconstitutive-gfpmut3b
    p15A
    pMR60 pBBR1 Kanamycin PT7(WT)-mCherry
    pMR61 pBBR1 Kanamycin PT7(P1)-mCherry
    pMR62 pBBR1 Kanamycin PT7(P2)-mCherry
    pMR63 pBBR1 Kanamycin PT7(P3)-mCherry
    pMR64 pBBR1 Kanamycin PT7(P4)-mCherry
    pMR65 pBBR1 Kanamycin PT7(P5)-mCherry
    pMR66 pRO1600, Gentamicin AraE, AraC, PBAD.10-gfpmut3b
    ColE1
    pMR67 pBBR1 Kanamycin Plasmid for terminator characterization. PT7-gfpmut3b-mrfp1
    pMR68 pRO1600, Gentamicin Plasmid for terminator characterization. PT7-gfpmut3b-mrfp1
    ColE1
    pMR69 pBBR1 Kanamycin LuxR, PLux-gfpmut3b
    pMR70 pBBR1 Kanamycin TetR, PTet-gfpmut3b
    pMR71 pBBR1 Kanamycin CymR, PCym-gfpmut3b
    pMR72 pBBR1 Kanamycin PhlF, PPhl-gfpmut3b
    pMR73 pBBR1 Kanamycin NahR, PSal-gfpmut3b
    pMR74 pRO1600, Gentamicin PhlF, PPhl-gfpmut3b
    ColE1
    pMR75 pRO1600, Gentamicin TetR, PTet-gfpmut3b
    ColE1
    pMR76 pRO1600, Gentamicin LuxR, PLux-gfpmut3b
    ColE1
    pMR77 pRO1600, Gentamicin CymR, PCym-gfpmut3b
    ColE1
    pMR78 pRO1600, Gentamicin FdeR, PFde-gfpmut3b
    ColE1
    pMR79 pRO1600, Gentamicin LacI(Q18M/A47V/F161Y), Ptac-gfpmut3b
    ColE1
    pMR80 pBBR1 Kanamycin PT7-gfpmut3b
    pMR81 pRO1600, Gentamicin PT7-gfpmut3b
    p15A
    pMR82 p15A Gentamicin Controller for R. sp. IRBG74, LacI, PA1lacO1-T7RNAP (RBSr33 for T7RNAP)
    pMR83 p15A Gentamicin Controller for R. sp. IRBG74, LacI, PA1lacO1-T7RNAP (RBSr32 for T7RNAP)
    pMR84 p15A Gentamicin Controller for R. sp. IRBG74, LacI, PA1lacO1-T7RNAP (RBSr3 for T7RNAP)
    pMR85 p15A Gentamicin Controller for R. sp. IRBG74, PhlF, PPhlF-T7RNAP (RBSr33 for T7RNAP)
    pMR86 ColE1 Tetracycline Controller for P. protegens Pf-5, LacI(Q18M/A47V/F161Y), Ptac-T7RNAP
    pMR87 pBBR1 Kanamycin NocR, Pnoc-gfpmut3b
    pMR88 pBBR1 Kanamycin OccR, Pooc-gfpmut3b
    pMR89 pBBR1 Gentamicin NifA(A. vinelandii), PnifH(A. vinelandii)-sfgfp
    pMR90 pBBR1 Gentamicin NifA(K. oxytoca),PnifH(P. stutzeri)-sfgfp
    pMR91 pBBR1 Gentamicin NifA(K. oxytoca), PnifH(A. vinelandii)-sfgfp
    pMR92 pRO1600, Gentamicin NifA(K. oxytoca), PnifH(K. oxytoca)-sfgfp
    p15A
    pMR93 pRO1600, Gentamicin NifA(A. vinelandii), PnifH(A. vinelandii)-sfgfp
    p15A
    pMR94 pRO1600, Gentamicin NifA(P. stutzeri), PnifH(P. stutzeri)-sfgfp
    p15A
    pMR95 pRO1600, Gentamicin NifA(P. stutzeri), PnifH(K. oxytoca)-sfgfp
    p15A
    pMR96 pRO1600, Gentamicin NifA(P. stutzeri), PnifH(A. vinelandii)-sfgfp
    p15A
    pMR97 ColE1 Tetracycline NifA controller for P. protegens Pf-5, LacI(Q18M/A47V/F161Y),
    Ptac-nifA(P. stutzeri) (RBSp32 for NifA)
    pMR98 ColE1 Tetracycline NifA controller for P. protegens Pf-5, LacI(Q18M/A47V/F161Y),
    Ptac-nifA(P. stutzeri) (RBSp27 RBS for NifA)
    pMR99 ColE1 Tetracycline NifA controller for P. protegens Pf-5, LacI(Q18M/A47V/F161Y),
    Ptac-nifA(P. stutzeri) (RBSp33 for NifA)
    pMR100 ColE1 Tetracycline NifA controller for P. protegens Pf-5, AraE, AraC, PBAD.10-nifA
    pMR101 ColE1 Tetracycline NifA controller for P. protegens Pf-5, FdeR, PFde-nifA
    pMR102 IncW Spectinomycin NifA controller plasmid for E. coli, LacI, PA1lacO1-nifA(K. oxytoca)
    pMR103 pRO1600, Gentamicin PnifH(K. oxytoca)-sfgfp
    p15A
    pMR104 pRO1600, Gentamicin PnifH(P. stutzeri)-sfgfp
    p15A
    pMR105 pRO1600, Gentamicin PnifH(A. vinelandii)-sfgfp
    p15A
    pMR106 pBBR1 Gentamicin PnifH(K. oxytoca)-sfgfp
    pMR107 pBBR1 Gentamicin PnifH(P. stutzeri)-sfgfp
    pMR108 pBBR1 Gentamicin PnifH(A. vinelandii)-sfgfp
    pMR109 p15A Kanamycin PBAD-T7RNAP
    pMR110 p15A Kanamycin PBet-T7RNAP
    pMR111 p15A Kanamycin PCin-T7RNAP
    pMR112 p15A Kanamycin PCym-T7RNAP
    pMR113 p15A Kanamycin PLux-T7RNAP
    pMR114 p15A Kanamycin PPhl-T7RNAP
    pMR115 p15A Kanamycin P3B5B-T7RNAP
    pMR116 p15A Kanamycin Ptac-T7RNAP
    pMR117 p15A Kanamycin PTet-T7RNAP
    pMR118 p15A Kanamycin PTfg-T7RNAP
    pMR119 p15A Kanamycin PVan-T7RNAP
    pMR120 p15A Kanamycin PSal-T7RNAP
    pMR121 pBBR1 Gentamicin PT7(P2)-gfpmut3b
    pMR122 pBBR1 Gentamicin NifA controller for A. caulinodans, LacI, PA1lacO1-nifA-rpoN
    pMR123 pBBR1 Gentamicin NifA controller for A. caulinodans, LacI, PA1lacO1-nifA(L94Q)-rpoN
    pMR124 pBBR1 Gentamicin NifA controller for A. caulinodans, LacI, PA1lacO1-nifA(D95Q)-rpoN(A. caulinodans)
    pMR125 pBBR1 Gentamicin NifA controller for A. caulinodans, LacI, PA1lacO1-nifA(L94Q/D95Q)-rpoN
    pMR126 pBBR1 Gentamicin NifA controller for A. caulinodans, NahR, PSal-nifA(L94Q/D95Q)-rpoN
    pMR127 pBBR1 Gentamicin NifA controller for A. caulinodans, NocR, Pnoc-nifA(L94Q/D95Q)-rpoN
    pMR128 pBBR1 Gentamicin NifA controller for A. caulinodans, OccR, Pocc-nifA(L94Q/D95Q)-rpoN
    pMR129 pBBR1 Gentamicin PnifH(A. caulinodans)-sfgfp
    pMR130 pBBR1 Gentamicin NifA, PnifH(A. caulinodans)-sfgfp
    pMR131 pBBR1 Gentamicin NifA, RpoN, PnifH(A. caulinodans)-sfgfp
    pMR132 pBBR1 Gentamicin LacI, PA1lacO1-nifA(L94Q/D95Q)-rpoN, PnifH-sfgfp
    pMR133 pBBR1 Gentamicin NahR, PSal-nifA(L940/D95Q)-rpoN, PnifH-sfgfp
    pMR134 pBBR1 Gentamicin NocR, Pnoc-nifA(L940/D95Q)-rpoN, PnifH-sfgfp
    pMR135 pBBR1 Gentamicin OccR, Pocc-nifA(L94Q/D95Q)-rpoN, PnifH-sfgfp
    pMR136 pBBR1 Gentamicin Refactored nif clusterv2.1
  • TABLE 5
    Genetic part sequences used in this study
    Name Genetic part DNA sequence (SEQ ID NOs: 122-225)
    PA1lacO1 Promoter6 AGAGTGTTGACTTGTGAGCGGATAACAATGATACTTAGATTCAATTGTGAGCGGATAACAATTTCAC
    ACA
    T7 RNAP Gene ATGAACACGATTAACATCGCTAAGAACGACTTCTCTGACATCGAACTGGCTGCTATCCCGTTCAACACTC
    TGGCTGACCATTACGGTGAGCG
    TTTAGCTCGCGAACAGTTGGCCCTTGAGCATGAGTCTTACGAGATGGGTGAAGCACGCTTCCGCAAGATG
    TTTGAGCGTCAACTTAAAGCTG
    GTGAGGTTGCGGATAACGCTGCCGCCAAGCCTCTCATCACTACCCTACTCCCTAAGATGATTGCACGCAT
    CAACGACTGGTTTGAGGAAGTG
    AAAGCTAAGCGCGGCAAGCGCCCGACAGCCTTCCAGTTCCTGCAAGAAATCAAGCCGGAAGCCGTAGCGT
    ACATCACCATTAAGACCACTCT
    GGCTTGCCTAACCAGTGCTGACAATACAACCGTTCAGGCTGTAGCAAGCGCAATCGGTCGGGCCATTGAG
    GACGAGGCTCGCTTCGGTCGTA
    TCCGTGACCTTGAAGCTAAGCACTTCAAGAAAAACGTTGAGGAACAACTCAACAAGCGCGTAGGGCACGT
    CTACAAGAAAGCATTTATGCAA
    GTTGTCGAGGCTGACATGCTCTCTAAGGGTCTACTTGGTGGCGAGGCGTGGTCTTCGTGGCATAAGGAAG
    ACTCTATTCATGTAGGAGTACG
    CTGCATCGAGATGCTCATTGAGTCAACCGGAATGGTTAGCTTACACCGCCAAAATGCTGGCGTAGTAGGT
    CAAGACTCTGAGACTATCGAAC
    TCGCACCTGAATACGCTGAGGCTATCGCAACCCGTGCAGGTGCGCTGGCTGGCATCTCTCCGATGTTCCA
    ACCTTGCGTAGTTCCTCCTAAG
    CCGTGGACTGGCATTACTGGTGGTGGCTATTGGGCTAACGGTCGTCGTCCTCTGGCGCTGGTGCGTACTC
    ACAGTAAGAAAGCACTGATGCG
    CTACGAAGACGTTTACATGCCTGAGGTGTACAAAGCGATTAACATTGCGCAAAACACCGCATGGAAAATC
    AACAAGAAAGTCCTAGCGGTCG
    CCAACGTAATCACCAAGTGGAAGCATTGTCCGGTCGAGGACATCCCTGCGATTGAGCGTGAAGAACTCCC
    GATGAAACCGGAAGACATCGAC
    ATGAATCCTGAGGCTCTCACCGCGTGGAAACGTGCTGCCGCTGCTGTGTACCGCAAGGACAAGGCTCGCA
    AGTCTCGCCGTATCAGCCTTGA
    GTTCATGCTTGAGCAAGCCAATAAGTTTGCTAACCATAAGGCCATCTGGTTCCCTTACAACATGGACTGG
    CGCGGTCGTGTTTACGCTGTGT
    CAATGTTCAACCCGCAAGGTAACGATATGACCAAAGGACTGCTTACGCTGGCGAAAGGTAAACCAATCGG
    TAAGGAAGGTTACTACTGGCTG
    AAAATCCACGGTGCAAACTGTGCGGGTGTCGACAAGGTTCCGTTCCCTGAGCGCATCAAGTTCATTGAGG
    AAAACCACGAGAACATCATGGC
    TTGCGCTAAGTCTCCACTGGAGAACACTTGGTGGGCTGAGCAAGATTCTCCGTTCTGCTTCCTTGCGTTC
    TGCTTTGAGTACGCTGGGGTAC
    AGCACCACGGCCTGAGCTATAACTGCTCCCTTCCGCTGGCGTTTGACGGGTCTTGCTCTGGCATCCAGCA
    CTTCTCCGCGATGCTCCGAGAT
    GAGGTAGGTGGTCGCGCGGTTAACTTGCTTCCTAGTGAAACCGTTCAGGACATCTACGGGATTGTTGCTA
    AGAAAGTCAACGAGATTCTACA
    AGCAGACGCAATCAATGGGACCGATAACGAAGTAGTTACCGTGACCGATGAGAACACTGGTGAAATCTCT
    GAGAAAGTCAAGCTGGGCACTA
    AGGCACTGGCTGGTCAATGGCTGGCTTACGGTGTTACTCGCAGTGTGACTAAGCGTTCAGTCATGACGCT
    GGCTTACGGGTCCAAAGAGTTC
    GGCTTCCGTCAACAAGTGCTGGAAGATACCATTCAGCCAGCTATTGATTCCGGCAAGGGTCTGATGTTCA
    CTCAGCCGAATCAGGCTGCTGG
    ATACATGGCTAAGCTGATTTGGGAATCTGTGAGCGTGACGGTGGTAGCTGCGGTTGAAGCAATGAACTGG
    CTTAAGTCTGCTGCTAAGCTGC
    TGGCTGCTGAGGTCAAAGATAAGAAGACTGGAGAGATTCTTCGCAAGCGTTGCGCTGTGCATTGGGTAAC
    TCCTGATGGTTTCCCTGTGTGG
    CAGGAATACAAGAAGCCTATTCAGACGCGCTTGAACCTGATGTTCCTCGGTCAGTTCCGCTTACAGCCTA
    CCATTAACACCAACAAAGATAG
    CGAGATTGATGCACACAAACAGGAGTCTGGTATCGCTCCTAACTTTGTACACAGCCAAGACGGTAGCCAC
    CTTCGTAAGACTGTAGTGTGGG
    CACACGAGAAGTACGGAATCGAATCTTTTGCACTGATTCACGACTCCTTCGGTACGATTCCGGCTGACGC
    TGCGAACCTGTTCAAAGCAGTG
    CGCGAAACTATGGTTGACACATATGAGTCTTGTGATGTACTGGCTGATTTCTACGACCAGTTCGCTGACC
    AGTTGCACGAGTCTCAATTGGA
    CAAAATGCCAGCACTTCCGGCTAAAGGTAACTTGAACCTCCGTGACATCTTAGAGTCGGACTTCGC
    GTTCGCGTAA
    Placlq Promoter7 CGAATGGTGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAGAG
    rpoN of A.caulinodans Gene ATGGCGATGAGCCCAAAGATGGAGTTCCGCCAGAGCCAGTCTCTGGTGATGACGCCGCAGCTGATGCAGG
    CCATCAAGCTGCTGCAGCTCTC
    CAATCTCGAACTGGTCGCCTATGTGGAGGCCGAGCTCGAACGCAATCCGCTGCTGGAGCGGGCGAGCGAG
    CCGGAAAGCCCCGAGCACGATC
    CGCCGAACCCGCAGGAAGAGGCACCCACCCCGCCTGACAGTGGCGCGCCGGTGTCCGGCGACTGGATGGA
    AAGCGACATGGGCTCGAGCCGC
    GAGGCCATCGAGACCCGGCTGGACACCGACCTCGGCAATGTCTTTCCCGATGATGCGCCGGCCGAGCGCA
    TCGGCGCGGGCAGCGGCAGCGG
    CTCGTCCATCGAATGGGGCTCGGGCGGCGACCGGGGCGAGGACTACAATCCGGAAGCCTTCCTCGCTGCC
    GAGACGACGCTGGCCGACCATC
    TGGAAGCCCAGCTCTCCGTGGCGGAGCCCGATCCGGCGCGCCGCCTCATCGGCCTCAACCTCATCGGCCT
    CATCGACGAGACGGGTTATTTC
    TCCGGCGACCTCGATGCGGTGGCCGAGCAACTGGGCGCCACCCACGATCAGGTGGCCGACGTGCTGCGCG
    TCATCCAGAGCTTCGAGCCGTC
    CGGCGTCGGCGCACGGTCGCTCAGCGAATGCCTGGCCCTGCAATTGCGCGACAAGGATCGCTGCGATCCC
    GCCATGCAGGCGCTGCTCGACA
    ATCTGGAACTCCTCGCCCGCCACGACCGCAACGCGCTGAAGCGCATCTGCGGGGTGGACGCGGAAGACCT
    CGCGGACATGATCGGCGAGATC
    CGCCGCCTCGATCCGAAGCCCGGCCTCGCCTATGGCGGCGGCGTCGTCCACCCGCTGGTGCCGGACGTGT
    TCGTGCGCGAGGGCTCCGACGG
    CAGCTGGATCGTGGAACTGAATTCCGAGACGCTGCCGCGCGTGCTGGTGAACCAGACCTATCACGCGACG
    GTGGCCAAGGCGGCGCGCTCGG
    CCGAGGAAAAGACCTTCCTCGCCGACTGCCTCCAGAGCGCCTCCTGGCTTACCCGCTCGCTCGACCAGCG
    GGCTCGCACCATCCTCAAGGTG
    GCGAGCGAGATCGTGCGCCAGCAGGACGCCTTCCTCGTGCACGGCGTGCGGCACCTGCGCCCCCTGAACC
    TGCGCACGGTGGCGGATGCCAT
    CGGCATGCACGAATCCACCGTCTCGCGGGTGACCTCGAACAAGTACATCTCCACCCCGCGCGGGGTGCTG
    GAGATGAAGTTCTTCTTCTCCT
    CCTCCATCGCTTCCTCGGGTGGTGGCGAGGCCCATGCGGCGGAGGCGGTGCGCCACCGCATCAAGAGCCT
    CATCGAGGCCGAGAGTGCGGAC
    GACGTGCTGTCCGACGACACGCTGGTGCAGAAGCTGAAGGACGACGGCATCGATATCGCCCGCCGAACGG
    TCGCGAAATATCGCGAGAGCAT
    GAACATCCCGTCCTCGGTCCAGCGCCGCCGCGAAAAGCAGGCCCTGCGCAGCGACGCCGCCGCCGC
    CGGCTGA
    lacl Gene GTGAAACCAGTAACGTTATACGATGTCGCAGAGTATGCCGGTGTCTCTTATCAGACCGTTTCCCGCGTGG
    TGAACCAGGCCAGCCACGTTTC
    TGCGAAAACGCGGGAAAAAGTGGAAGCGGCGATGGCGGAGCTGAATTACATTCCCAACCGCGTGGCACAA
    CAACTGGCGGGCAAACAGTCGT
    TGCTGATTGGCGTTGCCACCTCCAGTCTGGCCCTGCACGCGCCGTCGCAAATTGTCGCGGCGATTAAATC
    TCGCGCCGATCAACTGGGTGCC
    AGCGTGGTGGTGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCG
    CGCAACGCGTCAGTGGGCTGAT
    CATTAACTATCCGCTGGATGACCAGGATGCCATTGCTGTGGAAGCTGCCTGCACTAATGTTCCGGCGTTA
    TTTCTTGATGTCTCTGACCAGA
    CACCCATCAACAGTATTATTTTCTCCCATGAAGACGGTACGCGACTGGGCGTGGAGCATCTGGTCGCATT
    GGGTCACCAGCAAATCGCGCTG
    TTAGCGGGCCCATTAAGTTCTGTCTCGGCGCGTCTGCGTCTGGCTGGCTGGCATAAATATCTCACTCGCA
    ATCAAATTCAGCCGATAGCGGA
    ACGGGAAGGCGACTGGAGTGCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGCATCGTT
    CCCACTGCGATGCTGGTTGCCA
    ACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGCGCGTTGGTGCGGATATCTC
    GGTAGTGGGATACGACGATACC
    GAAGACAGCTCATGTTATATCCCGCCGTTAACCACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCA
    GCGTGGACCGCTTGCTGCAACT
    CTCTCAGGGCCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCTG
    GCGCCCAATACGCAAACCGCCT
    CTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGC
    AGTGA
    gfpmut3b Gene ATGAGTAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATG
    GGCACAAATTTTCTGTTAGTGG
    AGAGGGTGAAGGTGATGCAACATACGGAAAACTTACCCTTAAATTTATTTGCACTACTGGAAAACTACCT
    GTTCCATGGCCAACACTTGTCA
    CTACTTTCGGTTATGGTGTTCAATGCTTTGCGAGATACCCAGATCATATGAAACAGCATGACTTTTTCAA
    GAGTGCCATGCCCGAAGGTTAT
    GTACAGGAAAGAACTATATTTTTCAAAGATGACGGGAACTACAAGACACGTGCTGAAGTCAAGTTTGAAG
    GTGATACCCTTGTTAATAGAAT
    CGAGTTAAAAGGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAATTGGAATACAACTATAAC
    TCACACAATGTATACATCATGG
    CAGACAAACAAAAGAATGGAATCAAAGTTAACTTCAAAATTAGACACAACATTGAAGATGGAAGCGTTCA
    ACTAGCAGACCATTATCAACAA
    AATACTCCAATTGGCGATGGCCCTGTCCTTTTACCAGACAACCATTACCTGTCCACACAATCTGCCCTTT
    CGAAAGATCCCAACGAAAAGAG
    AGACCACATGGTCCTTCTTGAGTTTGTAACAGCTGCTGGGATTACACATGGCATGGATGAACTATA
    CAAATAG
    sigfg Gene ATGCGTAAAGGCGAAGAGCTGTTCACTGGTGTCGTCCCTATTCTGGTGGAACTGGATGGTGATGTCAACGGTCATAAGTTTTCC
    GTGCGTGG
    CGAGGGTGAAGGTGACGCAACTAATGGTAAACTGACGCTGAAGTTCATCTGTACTACTGGTAAACTGCCGGTACCTTGGCCGAC
    TCTGGTAA
    CGACGCTGACTTATGGTGTTCAGTGCTTTGCTCGTTATCCGGACCATATGAAGCAGCATGACTTCTTCAAGTCCGCCATGCCGG
    AAGGCTAT
    GTGCAGGAACGCACGATTTCCTTTAAGGATGACGGCACGTACAAAACGCGTGCGGAAGTGAAATTTGAAGGCGATACCCTGGTA
    AACCGCAT
    TGAGCTGAAAGGCATTGACTTTAAAGAAGACGGCAATATCCTGGGCCATAAGCTGGAATACAATTTTAACAGCCACAATGTTTA
    CATCACCG
    CCGATAAACAAAAAAATGGCATTAAAGCGAATTTTAAAATTCGCCACAACGTGGAGGATGGCAGCGTGCAGCTGGCTGATCACT
    ACCAGCAA
    AACACTCCAATCGGTGATGGTCCTGTTCTGCTGCCAGACAATCACTATCTGAGCACGCAAAGCGTTCTGTCTAAAGATCCGAAC
    GAGAAACG
    CGATCATATGGTTCTGCTGGAGTTCGTAACCGCAGCGGGCATCACGCATGGTATGGATGAACTGTACAAATGA
    mrfp1 Gene ATGGCTTCCTCCGAAGACGTTATCAAAGAGTTCATGCGTTTCAAAGTTCGTATGGAAGGTTCCGTTAACGGTCACGAGTTCGAA
    ATCGAAGG
    TGAAGGTGAAGGTCGTCCGTACGAAGGTACCCAGACCGCTAAACTGAAAGTTACCAAAGGTGGTCCGCTGCCGTTCGCTTGGGA
    CATCCTGT
    CCCCGCAGTTCCAGTACGGTTCCAAAGCTTACGTTAAACACCCGGCTGACATCCCGGACTACCTGAAACTGTCCTTCCCGGAAG
    GTTTCAAA
    TGGGAACGTGTTATGAACTTCGAAGACGGTGGTGTTGTTACCGTTACCCAGGACTCCTCCCTGCAAGACGGTGAGTTCATCTAC
    AAAGTTAA
    ACTGCGTGGTACCAACTTCCCGTCCGACGGTCCGGTTATGCAGAAAAAAACCATGGGTTGGGAAGCTTCCACCGAACGTATGTA
    CCCGGAAG
    ACGGTGCTCTGAAAGGTGAAATCAAAATGCGTCTGAAACTGAAAGACGGTGGTCACTACGACGCTGAAGTTAAAACCACCTACA
    TGGCTAAA
    AAACCGGTTCAGCTGCCGGGTGCTTACAAAACCGACATCAAACTGGACATCACCTCCCACAACGAAGACTACACCATCGTTGAA
    CAGTACGAACGTGCTGAAGGTCGTCACTCCACCGGTGCTTAA
    mCherry Gene ATGGTTTCGAAGGGCGAGGAGGATAACATGGCCATCATCAAGGAGTTCATGCGCTTCAAGGTGCACATGGAGGGCTCCGTGAAC
    GGCCACGA
    GTTCGAGATCGAGGGCGAGGGCGAGGGCCGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCT
    GCCCTTCG
    CCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAAGGCCTACGTGAAGCACCCCGCCGACATCCCCGACTACTTGAAGC
    TGTCCTTC
    CCCGAGGGCTTCAAGTGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGTGGTGACCGTGACCCAGGACTCCTCCTTGCAGGAC
    GGCGAGTT
    CATCTACAAGGTGAAGCTGCGCGGCACCAACTTCCCCTCCGACGGCCCCGTAATGCAGAAGAAGACGATGGGCTGGGAGGCCTC
    CTCCGAGC
    GGATGTACCCCGAGGACGGCGCCCTGAAGGGCGAGATCAAGCAGAGGCTGAAGCTGAAGGACGGCGGCCACTACGACGCTGAGG
    TCAAGACC
    ACCTACAAGGCCAAGAAGCCCGTGCAGCTGCCCGGCGCCTACAACGTCAACATCAAGTTGGACATCACCTCCCACAACGAGGAC
    TACACCATCGTGGAACAGTACGAACGCGCCGAGGGCCGCCACTCCACCGGCGGCATGGACGAGCTGTACAAGTAA
    PWT Promoter8 TAATACGACTCACTATAGGGAGA
    P1 Promoter8 TAATACGACTCACTACAGGCAGA
    P2 Promoter8 TAATACGACTCACTAGAGAGAGA
    P3 Promoter8 TAATACGACTCACTAATGGGAGA
    P4 Promoter8 TAATACGACTCACTAAAGGGAGA
    P5 Promoter8 TAATACGACTCACTATAGGTAGA
    P6 Promoter8 TAATACGACTCACTATTGGGAGA
    nifV of A.caulinodans Gene GTGTTCCGTGGGAGGCCTGCCATGCTCGCCAAGACACCCGCAAACCCCGCGCCGCTTCAGCGGACGGCGTTCCTGAACGACACC
    ACGCTGCG
    CGACGGCGAGCAGGCGCCGGGTGTCGCCTTCACCCGCAAGGAGAAGATCGAGATCGCCGCCGCCCTTGCCGCCGCCGGTGTCCC
    GGAGATCG
    AGGCGGGAACGCCCGCCATGGGCGACGAAGAGGTGGAAACCATCCGCTCCATCGTCTCGCTGAACCTCCCGACGCGCGTCATGG
    CCTGGTGC
    CGCATGAGCGAGGACGACCTGATGGCCGCCGTCGCGGCGGGCGTGAAGATCGTCAATGTCTCCATTCCCACCTCCGACCGGCAA
    CTGGCCGG
    CAAGCTCGGCAAGGATCGCGCCTGGGCGCTCGGCCGTGTGGCGGAGGTGGTGACACTGGCGCGTCGGCTCGGCTTTGAGGTGGC
    GGTAGGGG
    GCGAGGATTCCTCGCGGGCCGATCCCGATTTTCTCTGCCGTCTCGCGGAGACGGCGAAGGCGGCGGGCGCCTTTCGCCTGCGGC
    TGGCCGAC
    ACGCTTGGCGTGCTTGACCCCTTCGGCACCTATGCATTGGTGCGCCGGGTGGCCGCCACCACCGACATCGAGCTTGAGTTCCAC
    GCCCATGA
    CGATCTCGGCCTTGCCACCGCCAATACGCTGGCGGCGGTGATGGGCGGAGCGCGTCACGCCAGCGTCACCGTCGCCGGGCTCGG
    CGAGCGCG
    CGGGCAATGCCGCGCTGGAGGAAGTGGCCATCGCCCTGCGCCAGACGGCGCGGGCGGAGACCGGCATCGCTCCGGCCGCGCTGA
    AGCCGCTG
    GCCGAACTAGTGTGCGGCGCCGCCGCCCGTCCGGTGCCGCGCGGCAAGGCCATCGTCGGCGCGGATGTGTTCACCCACGAGTCG
    GGCATCCA
    TGTCTCCGGCCTGCTCAAGGACCGGGCCACCTATGAAGCTCTGAATCCGGAACTGTTCGGGCGTGGCCACACGGTGGTGCTCGG
    AAAGCATT
    CCGGTCTTGCGGCGGTGGAGAAGGCGCTGGCCGACGAGGGCATCACCGTGGATGCGGTGCGCGGGCGCGCCATTCTCGACCGGG
    TGCGGGCT
    TTTGCTGTCCGCACCAAGGAGAATGTTTCCCGCGAGACGCTGCTGCGCTTCTATCAGGACAGCTTCACCGAGTCCGCGCTGCGT
    CTGCGGCGGGCCGCCGTGGAAGGCGCAATCTGA
    Pnifh of K.oxytoca Promoter TGTTGCCTCAAGCACAGCCTGTGCCAGCTCGCGGATGACAGAAGAGTTAGCGCGAATTCAACGCGTTATGAAGAGAGTCGCCGC
    GCAGCGCG
    CCAAGAGATTGCGTGGAATAAGACACAGGGGGCGACAAGCTGTTGAACAGGCGACAAAGCGCCACCATGGCCCCGGCAGGCGCA
    ATTGTTCT
    GTTTCCCACATTTGGTCGCCTTATTGTGCCGTTTTGTTTTACGTCCTGCGCGGCGACAAATAACTAACTTCATAAAAATCATAA
    GAATAC ATAAACAGGCACGGCTGGTATGTTCCCTGCACTTCTCTGCTGGCAAACA
    Pnifh of P.stutzeri Promoter TGTCATGTTCGCAACAGTTGCCGAAAGTGTGGAAAACCGGCGCTTGGCCCGGCCGATCTTTTTGTCGCCATTGCAACAGTCAGG
    CCTGTCGG
    TTGTTAACTATCGAACCGCCGAAGGATGTTGCTAGTAATTAAATTATTCTAATTAAAACAAGTGCTTAGATTATTTTAGAAACG
    CTGGCACAAAGGCTGCTATTGCCCTGTTGCGCAGGCTTGTTCGTGCCTATAGCCCAC
    Pnifh of A. vinelandii Promoter TGTCAGTTTTGTCACAGGGGGCCGGACCAGGATGGTGGACGCTCGATGGGGATGTCGGGCCATTGTTCGGTTGTAGCAATTACA
    ACAGTCGG
    AGTAGGGGGATTGTAGGGGGATTGTTGTGTATCAGACCGCCCTGCAGCTCCCGTCGATGGATAATTAATCATTTAAAATCAATG
    GTTTATTT
    ATGTGTTGCGGGTGCTGGCACAGACGCTGCATTACCTTTGGTGCGCGGAGTTGTTCGGGCTTACGGCCGAAC
    Pnifh of R. sp. Promoter TTGACAAAGCCTCCGAGAAGAGCGCCCCCTAACCCCTCCTCAGCCCTGATCGGCAGTATCATCTTGTCGAATCCTAACGTCTGA
    IRBG74 TAGGCAAC
    GCTATACGACAAACGCTGGTTACAATTGTCGGTTCCGCGACAAGAATTTGCTTTGTCTGGCGGGTGGTCTATTTTGAGCTAAGT
    AGCTGAGA
    AATCAGGAAAACAAAACTCTATTCGGTCTACCCGACGAGTTGGCACGGGTCTTGTAACCATCCTTGCGCAGGCGGCGAAAGCCA
    CCGGCGATATTCATGTTGCGGGCAAC
    Pnifh of Of A.caulinodans Promoter TGTCGCGTTTGAAACACGGGGCTTTTGGAACCGTTCGATTCTGCAATGCACTGATTTTACTTGATTAATTCGACCACACGACCA
    CTGGCACA
    CCCGTTGCAAAACCCCTTGGTGCAGGCGACGGGTTGCCGGTCTGGTTCGCGGATCTCCTCGATCCCCGGCTACCGACCCGCCTC
    CGAAAAGTCCGGTCCCGATCCAGTTCGGCGGGGCCACAC
    nifA of K. oxytoca Gene ATGATCCATAAATCCGATTCGGACACCACCGTCAGACGTTTCGATCTCTCCCAGCAGTTTACCGCCATGCAGCGGATAAGCGTG
    GTCCTGAG
    TCGCGCCACCGAAGCGAGCAAAACCCTGCAGGAGGTTCTGAGCGTGCTACATAACGATGCCTTTATGCAGCACGGGATGATTTG
    CCTGTACG
    ACAGCCAGCAGGAGATCCTGAGCATCGAAGCGCTGCAGCAAACGGAAGATCAGACGCTGCCCGGCAGTACGCAAATTCGCTACC
    GGCCGGGG
    GAAGGATTAGTCGGTACCGTGCTGGCGCAGGGCCAGTCGCTGGTGCTGCCGCGCGTCGCCGACGACCAGCGTTTTCTCGATCGT
    CTGAGCCT
    GTACGACTATGACCTGCCGTTTATCGCCGTTCCGCTGATGGGCCCCCACTCCCGGCCCATCGGCGTACTGGCGGCGCAGCCGAT
    GGCGCGTC
    AGGAAGAGCGGCTGCCCGCCTGCACGCGCTTTCTCGAAACCGTCGCCAATCTGATCGCCCAGACGATTCGCCTGATGATCCTGC
    CAACCTCC
    GCCGCGCAGGCGCCGCAGCAGAGCCCCAGAATAGAGCGCCCGCGCGCCTGTACCCCTTCGCGCGGTTTCGGCCTGGAAAATATG
    GTCGGTAA
    AAGCCCGGCGATGCGGCAGATTATGGATATTATTCGTCAGGTTTCCCGCTGGGATACCACGGTGCTGGTACGCGGCGAGAGCGG
    CACCGGGA
    AAGAGCTCATCGCCAACGCCATCCACCATAATTCTCCGCGCGCCGCCGCGGCGTTCGTCAAATTTAACTGCGCGGCGCTGCCGG
    ACAACCTG
    CTGGAGAGCGAGCTGTTTGGTCATGAGAAAGGCGCGTTTACCGGCGCGGTGCGCCAGCGGAAAGGCCGCTTTGAGCTGGCGGAC
    GGCGGCAC
    CTTATTCCTCGATGAGATCGGCGAAAGCAGCGCCTCGTTTCAGGCTAAGCTACTGCGTATTCTGCAAGAGGGGGAGATGGAGCG
    CGTCGGCG
    GCGACGAAACCCTGCGGGTCAACGTGCGCATTATCGCGGCGACCAACCGCCATCTGGAAGAGGAGGTGCGGCTGGGTCATTTCC
    GCGAGGAT
    CTATACTACCGCCTGAACGTAATGCCTATCGCGCTGCCGCCGCTGCGCGAGCGCCAGGAGGATATCGCCGAGCTGGCGCACTTT
    CTGGTGCG
    AAAAATCGCCCACAGCCAGGGGCGAACGCTGCGCATCAGCGATGGGGCGATTCGCCTGCTGATGGAGTACAGCTGGCCGGGAAA
    CGTGCGCG
    AACTGGAAAACTGTCTCGAACGTTCGGCGGTGCTGTCGGAAAGCGGCCTGATAGACCGGGACGTGATTCTGTTCAACCATCGCG
    ATAACCCG
    CCGAAAGCGCTCGCCAGCAGCGGCCCGGCGGAGGACGGCTGGCTCGATAACAGCCTCGACGAGCGCCAGCGGCTGATCGCCGCC
    CTGGAAAA
    AGCGGGCTGGGTGCAGGCCAAAGCGGCGCGGCTGCTCGGCATGACCCCGCGCCAGGTGGCGTATCGCATTCAGATTATGGATAT
    CACCATGC CGCGACTGTGA
    nifA of P. stutzeri Gene ATGAACGCCACATTCGCCGAACGCCCCAGCGCGCCAACCCGCAACGAACTGCTGGATGCCCAACTGCAGGCGCTGGCGCAGATC
    GCCCGCAT
    CCTTAACCGCGGCCGGCCCATCGAGGAACTGCTGGCCGAGATCCTCGCCGTGCTGCACGAAGACCTCGGCCTGCTGCACGGGCT
    GGTCTCCA
    TCTGCAACCCGAAGGACGGCAGCCTGCAGGTGGGCGCCGTGCACAGCGACTCCGAAACCGTGGTACGGGCCTGCGAAAGCACCC
    GCTACCGC
    ATCGGCGAAGGCGTGTTCGGCAACATCCTCAAGCATGGCAACAGCGTGGTGCTCGGGCGTATCGACGCCGAACCGCGCTTTCTC
    GACCGACT
    GGCGCTGTACGACATGGACCTGCCCTTCATCGCCGTGCCGATCAAGGCCGTCGACGGCACCACCATCGGCGTGCTGGCTGCCCA
    GCCCGACC
    GCCGCGCCGACGAGCTGATGCCCGAACGCACCCGTTTGATGGAAATCGTCGCCCGCCTACTGGCGCAGACCGTGCGCCTGGTGG
    TGAACCTC
    GAGGACGGCCAGGAAGTGGTCGACGAGCGCGACGAGCTACGCCGCGAAGTCCGCGCCAAGTACGGCTTCGAGAACATGGTGGTG
    GGCCACAC
    CGCCTCCATGCGCCGGGTTTTCGACCAGGTTCGACGGGTCGCCAAGTGGAACAGCACCGTGCTGATCCTCGGCGAATCCGGCAC
    CGGCAAGG
    AGCTGATCGCCAGCGCCATCCACTACAACTCACCGCGCGCTCACCAGCCGCTGGTACGCCTGAACTGCGCCGCGCTACCGGAAA
    CCCTGCTC
    GAATCGGAACTGTTCGGTCACGAGAAAGGCGCCTTCACCGGCGCCGTGAAGCAGCGCAAGGGACGTTTCGAACAGGCCGACGGC
    GGCACCCT
    GTTCCTCGACGAGATCGGCGAGATCTCGCCGATGTTCCAGGCCAAGCTGCTGCGCGTGCTGCAGGAAGGCGAGCTGGAGCGCGT
    CGGCGGCA
    GCCAGACGGTGAAGGTCAACGTGCGCATCGTCGCCGCCACCAACCGCGACCTGGAGCACGAGGTGGAGCAAGGCAAGTTCCGCG
    AAGACCTC
    TACTACCGCCTCAACGTCATGGCCATCCGCGTCCCGCCGCTGCGCGAGCGCAGCGCCGACATCCCGGAACTGGCCGAATTCCTC
    CTCGACAA
    GATCGCCCGCCAGCAGGGTCGCAAACTCAAGCTGACCGACAGCGCCCTGCGTCTGCTGATGAGCCACCGCTGGCCGGGCAACGT
    GCGCGAAC
    TGGAAAACTGCCTGGAACGCTCGGCCATCATGAGCGAGGATGGCACCATCAGCCGCGACGTGGTCTCCCTCACCGGCCTCGACC
    ACGACGCC
    ACGCCGCTGGCGCCGGTCCCCGAAGTCGACCTCGCCGACGACAGCCTCGACGACCGCGAGCGCGTCATCGCCGCGCTGGAACAG
    GCCGGCTG
    GGTCCAGGCCAAGGCCGCCCGCCTGCTCGGCATGACGCCCCGGCAGATCGCCTACCGAGTGCAGACGCTGAACATTCATATGCG
    CAAGATCT GA
    nifA of A. vinelandii Gene ATGAATGCAACCATCCCTCAGCGCTCGGCCAAACAGAACCCGGTCGAACTCTATGACCTGCAATTGCAGGCCCTGGCGAGCATC
    GCCCGCAC
    GCTCAGCCGCGAACAACAGATCGACGAACTGCTCGAACAGGTCCTGGCCGTACTGCACAATGACCTCGGCCTGCTGCATGGCCT
    GGTGACCA
    TTTCCGACCCGGAACACGGCGCCCTGCAGATCGGCGCCATCCACACCGACTCGGAAGCGGTGGCCCAGGCCTGCGAAGGCGTGC
    GCTACAGA
    AGCGGCGAAGGCGTGATCGGCAACGTGCTCAAGCACGGCAACAGCGTGGTGCTCGGGCGCATCTCCGCCGACCCGCGCTTTCTC
    GACCGCCT
    GGCGCTGTACGACCTGGAAATGCCGTTCATCGCCGTGCCGATCAAGAACCCCGAGGGCAACACCATCGGCGTGCTGGCGGCCCA
    GCCGGACT
    GCCGCGCCGACGAGCACATGCCCGCGCGCACGCGCCTTCTGGAGATCGTCGCCAACCTGCTGGCGCAGACCGTGCGCCTGGTGG
    TGAACATC
    GAGGACGGCCGCGAGGCGGCCGACGAGCGCGACGAACTGCGTCGCGAGGTGCGCGGCAAGTACGGCTTCGAGAACATGGTGGTG
    GGCCACAC
    CCCCACCATGCGCCGGGTGTTCGATCAGATCCGCCGGGTCGCCAAGTGGAACAGCACCGTACTGGTCCTCGGCGAGTCCGGTAC
    CGGCAAGG
    AACTGATCGCCAGCGCCATCCACTACAACTCGCCGCGCGCGCACCGCCCCTTCGTGCGCCTGAACTGCGCCGCGCTGCCGGAAA
    CCCTGCTC
    GAGTCCGAACTCTTCGGCCACGAGAAGGGCGCCTTCACCGGCGCGGTGAAGCAGCGCAAGGGGCGTTTCGAGCAGGCCGACGGC
    GGCACCCT
    GTTCCTCGACGAGATCGGCGAGATCTCGCCGATGTTCCAGGCCAAGCTGCTGCGCGTGCTGCAGGAAGGCGAGTTCGAGCGGGT
    CGGCGGCA
    ACCAGACGGTGCGGGTCAACGTGCGCATCGTCGCCGCCACCAACCGCGACCTGGAAAGCGAGGTGGAAAAGGGCAAGTTCCGCG
    AGGACCTC
    TACTACCGCCTGAACGTCATGGCCATCCGCATTCCGCCGCTGCGCGAGCGTACCGCCGACATTCCCGAACTGGCGGAATTCCTG
    CTCGGCAA
    GATCGGCCGCCAGCAGGGCCGCCCGCTGACCGTCACCGACAGCGCCATCCGCCTGCTGATGAGCCACCGCTGGCCGGGCAACGT
    GCGCGAAC
    TGGAGAACTGCCTGGAGCGCTCGGCGATCATGAGCGAGGACGGCACCATCACCCGCGACGTGGTCTCGCTGACCGGGGTCGACA
    ACGAGAGC
    CCGCCGCTCGCCGCGCCGCTGCCCGAGGTCAACCTGGCCGACGAGACCCTGGACGACCGCGAACGGGTGATCGCCGCCCTCGAA
    CAGGCCGG
    CTGGGTGCAGGCCAAGGCCGCGCGGCTGCTGGGCATGACGCCGCGGCAGATCGCCTACCGCATCCAGACCCTCAACATCCACAT
    GCGCAAGA TCTGA
    nifA of R. sp. Gene ATGCTGCACAATGGGCTCAATGAGGGTATGACTGAACGATCCGCTCAAACCATCCACAAACCGGATTTCTGGGGCAGCGGTATC
    IRBG74 TATCGGAT
    ATCGAAAGTTTTGATTGGTCCAGACAGTCTCGAGACGAAGCTTGCCAATGTCATTAACGCCCTCTCAGTAATTCTCCCAATGCG
    GCGCGGCG
    CAATCGTCGTTCTAAATGTTAAAGGAGAGCCCGAGATGGTTGCAATGCTGGGCCTAGAGCAAGCATCTCAAGGCGCCCGCTCCA
    TTCCGGCG
    GAGGCTGCGATAGATAGAATCGTCGCCAAAGGCGCGCCGCTGGTCGTACCGGACATTTGCAAGTCGGACCTGTTCCAGGCGGAG
    CTCCAAAC
    CAACTCGAACGCCACAGGCCCAGCCACGTTCGTTGGCGTCCCGATGAAGGTCGAAAAAGAAACGCTTGGAACACTATGGATCGA
    CCGCGCCA
    AAGATGGCAGCACTAGGATCCAATTTGAGGAAGAGGTGCGCTTCCTCTCCATGGTCGCCAACCTTTCGGCCCGGGCCATTTGGC
    TGGATCGC
    CACCAGAGCCGCGATGGTCAGCCAATCGTGGGCGAGGAAGGAACTCGCAAGACTAGTTCAGGCGACAAGGAACTGCCCGAATCT
    GCCCGACA
    AAGGCCCACAAAAATCGATTGGATTGTCGGGGAAAGCCCTGCCCTCAAGCAGGTGGTTGAAAGCGTCAAAGTCGTTGCAACAAC
    CAATTCTG
    CGGTGCTTCTCAGGGGCGAAAGCGGCACGGGCAAGGAGTTCTTTGCAAAGGCCATCCACGAGCTTTCATACCGGAAAAAGAAGC
    CCTTCGTG
    AAGTTGAACTGCGCCGCGCTGTCTGCAGGCGTTTTGGAATCGGAATTGTTTGGACATGAAAAGGGCGCCTTCACGGGGGCCATC
    TCTCAGCG
    CGCAGGCCGCTTCGAACTCGCAGACGGCGGAACGCTGCTGCTCGATGAGATCGGCGACATTTCGCCGGGCTTCCAAGCGAAACT
    GTTGCGCG
    TCTTGCAGGAAGGTGAGCTTGAGCGAGTCGGCGGCACAAAAACACTCAAAGTGGACGTTCGACTCATATGCGCCACGAACAAAG
    ACCTAGAA
    GCGGCAGTCGCGGATGGGGAGTTCAGGGCCGACCTTTATTACCGGATCAATGTGGTGCCCCTATTTCTGCCGCCTCTCCGGGAG
    CGAAATGG
    GGATATTCCACGCCTTGCGAGAGTTTTCCTCGGCCGATTCAACAGGGAAAACAATCGCGATCTCGCGTTCGCGCCGGCTGCGCT
    CGAGCTCT
    TGTCAAAATGCAACTTTCCCGGCAACGTCCGAGAGCTTGAAAACTGCGTCCGCAGGACCGCCACTCTCGCGCGTTCGGAGACGA
    TCGTTCCA
    TCAGATTTCTCCTGCCTGAAGAACCAGTGCTTTTCTTCAATGCTCTGGAAAACCGGTGACCGTCCACTTGGGGATACGCTCAAT
    GGGTTGGC
    CATGCGTAAGAGTTTGTCGGTCGAATCGCCGATCAGCCTCGGTTACTCCAATGGACCGGCCGGCTTAACGGTGGCACCACATCT
    AACGGACC
    GCGAGCTGCTAATCAGTGCGATGGAGAAGGCCGGTTGGGTTCAGGCAAAGGCAGCTCGGATCCTCGGCCTCACACCGCGACAGG
    TCGGCTAT GCTTTACGTAGGCATCGTATACAGGTGAAGAAAATCTAA
    nifA of A.caulinodans Gene ATGCCAATGACCGACGCCTTCCAGGTCCGCGTACCTCGGGTTTCGTCGAGCACCGCCGGAGACATCGCCGCGTCATCCATCACC
    ACGCGGGG
    CGCGCTGCCGCGCCCGGGAGGGATGCCTGTGTCCATGTCGCGGGGGACCTCGCCCGAGGTGGCACTCATCGGGGTCTATGAGAT
    ATCGAAGA
    TCCTGACGGCGCCCCGGCGCCTCGAAGTCACGCTCGCCAATGTGGTGAACGTGCTCTCCTCCATGCTGCAGATGCGGCATGGCA
    TGATCTGC
    ATCCTCGACAGCGAGGGCGATCCCGACATGGTGGCCACCACCGGCTGGACGCCTGAGATGGCGGGCCAGATCCGCGCGCATGTG
    CCCCAGAA
    GGCCATCGACCAGATCGTCGCCACGCAGATGCCGCTGGTGGTGCAGGACGTGACGGCCGATCCGCTCTTCGCCGGTCACGAGGA
    TCTGTTCG
    GCCCGCCTGAGGAGGCCACCGTCTCCTTCATCGGCGTGCCGATCAAGGCCGACCACCATGTGATGGGCACCCTCTCCATCGACC
    GCATCTGG
    GACGGCACCGCCCGTTTCCGCTTCGACGAGGACGTGCGCTTCCTCACCATGGTGGCCAATCTCGTCGGCCAGACCGTGCGCCTG
    CACAAGCT
    GGTGGCGAGCGACCGCGACCGGCTGATCGCCCAGACGCACCGCCTCGAAAAGGCGCTGCGGGAAGAAAAATCCGGGGCCGAGCC
    GGAGGTGG
    CCGAGGCCGCCAACGGATCCGCCATGGGCATCGTGGGCGATAGCCCGCTGGTGAAACGCCTGATCGCGACCGCGCAAGTGGTCG
    CCCGCTCA
    AACTCCACCGTGCTGCTGCGCGGGGAGAGCGGCACCGGCAAGGAGTTGTTCGCCCGTGCCATCCACGAACTGTCGCCCCGCAAG
    GGCAAGCC
    CTTCGTGAAGGTGAACTGCGCCGCCCTCCCGGAATCGGTGCTGGAATCGGAACTGTTCGGCCATGAGAAGGGCGCCTTCACCGG
    TGCGCTGA
    ACATGCGCCAGGGCCGCTTCGAGCTGGCGCACGGCGGCACGCTCTTCCTTGACGAGATCGGCGAGATCACCCCCGCTTTCCAGG
    CCAAGCTG
    CTGCGCGTGCTGCAGGAAGGCGAGTTCGAGCGGGTCGGCGGCAATCGCACGCTGAAGGTGGATGTGCGGCTCGTGTGCGCCACC
    AACAAGAA
    TCTGGAAGAGGCGGTCTCCAAGGGCGAGTTCCGGGCCGATCTCTACTACCGCATCCATGTGGTGCCGCTGATCCTGCCGCCGCT
    GCGCGAAC
    GGCCGGGCGACATTCCCAAGCTCGCGAAGAACTTCCTCGACCGCTTCAACAAGGAAAACAAGCTCCACATGATGCTCTCGGCGC
    CGGCCATC
    GACGTGCTGCGGCGCTGCTATTTCCCGGGCAACGTGCGCGAGCTGGAGAACTGTATCCGGCGGACGGCAACGCTCGCCCACGAT
    GCCGTCAT
    CACCCCCCATGACTTCGCCTGCGACAGCGGCCAGTGCCTCTCGGCCATGCTCTGGAAGGGCTCGGCCCCGAAGCCTGTGATGCC
    GCACGTGC
    CGCCGGCGCCCACGCCGCTGACTCCGCTCTCCCCTGCTCCGCTCGCGACCGCAGCGCCCGCTGCGGCGAGCCCGGCGCCGGCGG
    CCGACAGC
    CTGCCGGTCACTTGCCCCGGCACCGAGGCCTGTCCCGCGGTGCCCCCCCGCCAGAGCGAAAAGGAGCAGTTGCTCCAGGCCATG
    GAGCGCTC
    CGGCTGGGTGCAGGCGAAGGCCGCGCGCCTCCTCAACCTCACGCCGCGCCAGGTGGGTTATGCGCTGCGCAAATATGACATCGA
    CATCAAGC GCTTCTGA
    PJ3123100 Promoter9 TAGGTGTTGACGGCTAGCTCAGTCCTAGGTACAGTGCTAGCTCTAGA
    PJ3123101 Promoter9 TAGGTGTTTACAGCTAGCTCAGTCCTAGGTATTATGCTAGCTCTAGA
    PJ3123102 Promoter9 TAGGTGTTGACAGCTAGCTCAGTCCTAGGTACTGTGCTAGCTCTAGA
    PJ3123103 Promoter9 TAGGTGCTGATAGCTAGCTCAGTCCTAGGGATTATGCTAGCTCTAGA
    PJ23104 Promoter9 TAGGTGTTGACAGCTAGCTCAGTCCTAGGTATTGTGCTAGCTCTAGA
    PJ23105 Promoter9 TAGGTGTTTACGGCTAGCTCAGTCCTAGGTACTATGCTAGCTCTAGA
    PJ23106 Promoter9 TAGGTGTTTACGGCTAGCTCAGTCCTAGGTATAGTGCTAGCTCTAGA
    PJ23107 Promoter9 TAGGTGTTTACGGCTAGCTCAGCCCTAGGTATTATGCTAGCTCTAGA
    PJ23108 Promoter9 TAGGTGCTGACAGCTAGCTCAGTCCTAGGTATAATGCTAGCTCTAGA
    PJ23109 Promoter9 TAGGTGTTTACAGCTAGCTCAGTCCTAGGGACTGTGCTAGCTCTAGA
    PJ23110 Promoter9 TAGGTGTTTACGGCTAGCTCAGTCCTAGGTACAATGCTAGCTCTAGA
    PJ23111 Promoter9 TAGGTGTTGACGGCTAGCTCAGTCCTAGGTATAGTGCTAGCTCTAGA
    PJ23112 Promoter9 TAGGTGCTGATAGCTAGCTCAGTCCTAGGGATTATGCTAGCTCTAGA
    PJ23113 Promoter9 TAGGTGCTGATGGCTAGCTCAGTCCTAGGGATTATGCTAGCTCTAGA
    PJ23114 Promoter9 TAGGTGTTTATGGCTAGCTCAGTCCTAGGTACAATGCTAGCTCTAGA
    PJ23115 Promoter9 TAGGTGTTTATAGCTAGCTCAGCCCTTGGTACAATGCTAGCTCTAGA
    PJ23116 Promoter9 TAGGTGTTGACAGCTAGCTCAGTCCTAGGGACTATGCTAGCTCTAGA
    PJ23117 Promoter9 TAGGTGTTGACAGCTAGCTCAGTCCTAGGGATTGTGCTAGCTCTAGA
    PJ23118 Promoter9 TAGGTGTTGACGGCTAGCTCAGTCCTAGGTATTGTGCTAGCTCTAGA
    PJ23119 Promoter9 TAGGTGTTGACAGCTAGCTCAGTCCTAGGTATAATGCTAGCTCTAGA
    Ptrp Promoter10 TAGGTGTTGACATTATTCCATCGAACTAGTTAACTAGTACGAAAGTT
    TT7 Terminator11 TAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGT
    TT2.2 Terminator11 TACTCGAACCCCTAGCCCGCTCTTATCGGGCGGCTAGGGGTTTTTTGT
    TT7.3 Terminator11 TACATATCGGGGGGGTAGGGGTTTTTTGT
    TrrnBT1 Terminator12 CCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTC
    TL3S2P21 Terminator13 CTCGGTACCAAATTCCAGAAAAGAGGCCTCCCGAAAGGGGGGCCTTTTTTCGTTTTGGTCC
    T1 Terminator13 CTCGGTACCAAATTCCAGAAAAGACACCCGAAAGGGTGTTTTTTCGTTTTGGTCCTCCTTGGCCCTCCATCCTTAGATAGCAGA
    TAAAAAAA ATCCTTAGCTTTCGCTAAGGATGATTTCTTCATAGGCAATACGATCGCATGTCC
    T2 Terminator14 CCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCTACT
    AGAGTCAC ACTGGCTCACCTTCGGGTGGGCCTTTCTGCGTTTATA
    T3 Terminator13 CCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCTACT
    AGAGTCAC ACTGGCTCACCTTCGGGTGGGCCTTTCTGCGTTTATA
    T4 Terminator13 GGTCTTGTCCACTACCTTGCAGTAATGCGGTGGACAGGATCGGCGGTTTTCTTTTCTCTTCTCAATGACTGAATAGAAAAGACG
    AACATTAA
    CGCATGAGAAAGCCCCCGGAAGATCACCTTCCGGGGGCTTTTTTATTGCGCTACAAATGAAAGTACATAGAAATTA
    T5 Terminator13 CAGATAAAAAAAATCCTTAGCTTTCGCTAAGGATGATTTCTTCCTTGGCCCTCCATCCTTAGATAGCTCGGTACCAAATTCCAG
    AAAAGACA CCCGAAAGGGTGTTTTTTCGTTTTGGTCCTCATAGGCAATACGATCGCATGTCC
    T6 Terminator13 CCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTAG
    CATAACCC CTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTG
    T7 Terminator13 CTCGGTACCAAATTCCAGAAAAGAGACGCTTTCGAGCGTCTTTTTTCGTTTTGGTCCTCCTTGGCCCTCCATCCTTAGATAGAG
    TTAACCAA AAAGGGGGGATTTTATCTCCCCTTTAATTTTTCCTTCATAGGCAATACGATCGCATGTCC
    T8 Terminator13 CGCAGATAGCAAAAAAGCGCCTTTAGGGCGCTTTTTTACATTGGTGGTCCTTGGCCCTCCATCCTTAGATAGAGGCGACTGACG
    AAACCTCG CTCCGGCGGGGTTTTTTGTTATCTGCATCATAGGCAATACGATCGCATGTCC
    T9 Terminator14 TCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTG
    TC
    T10 Terminator14 GCCCCCGGAAGATCACCTTCCGGGGGCTTTTTTATTGGCGGCCGGCTGATTGATCAGGCGGCCGGCTGATTGGCGCGTTACCTG
    GTAGCGCG CCATTTTGTTT
    T11 Terminator14 GTAATCGTTAATCCGCAAATAACGTAAAAACCCGCTTCGGCGGGTTTTTTTATGGGGGGAGTTTAGGGAAAGAGCATTTG
    TCA
    T12 Terminator14 AAAAAAAAACCCCGCCCCTGACAGGGCGGGGTTTTTTTTTT
    T13 Terminator13 TCCGGCAATTAAAAAAGCGGCTAACCACGCCGCTTTTTTTACGTCTGCATGACTGAATAGAAAAGACGAACATTAACGCATGAG
    AAAGCCCC CGGAAGATCACCTTCCGGGGGCTTTTTTATTGCGCTCCTTGGCCCTCCATCCTTAGATAG
    T14 Terminator13 GGAAGACCATACTGGAAACACAGAAAAAAGCCCGCACCTGACAGTGCGGGCTTTTTTTTTCGACCAAAGGTGACTGAATAGAAA
    AGACGAAC
    ATTCGCAGATAGCAAAAAAGCGCCTTTAGGGCGCTTTTTTACATTGGTGGTCATAGGCAATACGATCGCATGTCC
    T15 Terminator13 TCCGGCAATTAAAAAAGCGGCTAACCACGCCGCTTTTTTTACGTCTGCATCCTTGGCCCTCCATCCTTAGATAGCTCGGTACCA
    AATTCCAG AAAAGAGGCCTCCCGAAAGGGGGGCCTTTTTTCGTTTTGGTCCTCATAGGCAATACGATCGCATGTCC
    T16 Terminator13 TTCAGCCAAAAAACTTAAGACCGCCGGTCTTGTCCACTACCTTGCAGTAATGCGGTGGACAGGATCGGCGGTTTTCTTTTCTCT
    TCTCAATA
    CATGAAAGTACATAGAAATTACTCGGTACCAAATTCCAGAAAAGAGGCCTCCCGAAAGGGGGGCCTTTTTTCGTTTTGGTCCTC
    ATAGGCAA TACGATCGCATGTCC
    T17 Terminator13 TTCAGCCAAAAAACTTAAGACCGCCGGTCTTGTCCACTACCTTGCAGTAATGCGGTGGACAGGATCGGCGGTTTTCTTTTCTCT
    TCTCAATC
    CTTGGCCCTCCATCCTTAGATAGTCCGGCAATTAAAAAAGCGGCTAACCACGCCGCTTTTTTTACGTCTGCATCATAGGCAATA
    CGATCGCA TGTCC
    T18 Terminator13 CTCGGTACCAAATTCCAGAAAAGAGGCCTCCCGAAAGGGGGGCCTTTTTTCGTTTTGGTCCTGACTGAATAGAAAAGACGAACA
    TTAACGCA
    TGAGAAAGCCCCCGGAAGATCACCTTCCGGGGGCTTTTTTATTGCGCTCCTTGGCCCTCCATCCTTAGATAG
    T19 Terminator13 CTCGGTACCAAATTCCAGAAAAGAGGCCTCCCGAAAGGGGGGCCTTTTTTCGTTTTGGTCCTCCTTGGCCCTCCATCCTTAGAT
    GTCCGGCA ATTAAAAAAGCGGCTAACCACGCCGCTTTTTTTACGTCTGCATCATAGGCAATACGATCGCATGTCC
    T20 Terminator13 CTCGGTACCAAAGACGAACAATAAGACGCTGAAAAGCGTCTTTTTTCGTTTTGGTCCTACAAATGAAAGTACATAGAAATTATT
    CAGCCAAA
    AAACTTAAGACCGCCGGTCTTGTCCACTACCTTGCAGTAATGCGGTGGACAGGATCGGCGGTTTTCTTTTCTCTTCTCAATCCT
    TGGCCCTC CATCCTTAGATAG
    T21 Terminator14 GGGAACTGCCAGACATCAAATAAAACAAAAGGCTCAGTCGGAAGACTGGGCCTTTTGTTTTATCTGTTGTTTGTCGGTGAACAC
    TCTCCCGA CTAGTAGCGGCCGCTGCAGAAAGAGGAGA
    T22 Terminator13 AACGCATGAGAAAGCCCCCGGAAGATCACCTTCCGGGGGCTTTTTTATTGCGCTCATAGGCAATACGATCGCATGTCCTCCGGC
    AATTAAAA AAGCGGCTAACCACGCCGCTTTTTTTACGTCTGCATCCTTGGCCCTCCATCCTTAGATAG
    T23 Terminator14 GGGAACTGCCAGACATCAAATAAAACAAAAGGCTCAGTCGGAAGACTGGGCCTTTTGTTTTATCTGTTGTTTGTCGGTGA
    ACACTCTCCCG
    T24 Terminator14 AAAGCAAGCTGATAAACCGATACAATTAAAGGCTCCTTTTGGAGCCTTTTTTTTTGGAGATTTTCAACATGAAAAAATTATTAT
    TTGATGAT
    CAGATAGCGGCGGGGAACTGCCAGACATCAAATAAAACAAAAGGCTCAGTCGGAAGACTGGGCCTTTTGTTTTATCTGTTGTTT
    GTCGGTGA ACACTCTCCCG
    T25 Terminator13 AACGCATGAGAAAGCCCCCGGAAGATCACCTTCCGGGGGCTTTTTTATTGCGCTCCTTGGCCCTCCATCCTTAGATAGCTCGGT
    ACCAAATT
    CCAGAAAAGAGGCCTCCCGAAAGGGGGGCCTTTTTTCGTTTTGGTCCTCATAGGCAATACGATCGCATGTCC
    T26 Terminator14 AACGCATGAGAAAGCCCCCGGAAGATCACCTTCCGGGGGCTTTTTTATTGCGCTCCTTGGCCCTCCATCCTTAGATAGTTCAGC
    CAAAAAAC
    TTAAGACCGCCGGTCTTGTCCACTACCTTGCAGTAATGCGGTGGACAGGATCGGCGGTTTTCTTTTCTCTTCTCAATCATAGGC
    AATACGAT CGCATGTCC
    PLux Promoter15 CCTAGGACCTGTAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTACTTTCGAATAAATCTAGA
    PTet Promoter16 CGGTGGAATCCCTATCAGTGATAGAGATTGACATCCCTATCAGTGATAGATATAATGAGCACTCTAGA
    PCym Promoter5 AACAAACAGACAATCTGGTCTGTTTGTATTATGGAAAATTTTTCTGTATAATAGATTCAACAAACAGACAATCTGGTCTG
    TTTGTATTAT
    PPhl Promoter17 AAAAAGAGTTTGACATGATACGAAACGTACCGTATCGTTAAGGTTACTAGAGTCTAGA
    PSal Promoter5 GGGGCCTCGCTTGGGTTATTGCTGGTGCCCGGCCGGGCGCAATATTCATGTTGATGATTTATTATATATCGAGTGGTGTATTTA
    TTTATATT GTTTGCTCCGTTACCGTTATTAAC
    luxR Gene ATGAAAAACATAAATGCCGACGACACATACAGAATAATTAATAAAATTAAAGCTTGTAGAAGCAATAATGATATTAATCAATGC
    TTATCTGA
    TATGACTAAAATGGTACATTGTGAATATTATTTACTCGCGATCATTTATCCTCATTCTATGGTTAAATCTGATATTTCAATCCT
    AGATAATT
    ACCCTAAAAAATGGAGGCAATATTATGATGACGCTAATTTAATAAAATATGATCCTATAGTAGATTATTCTAACTCCAATCATT
    CACCAATT
    AATTGGAATATATTTGAAAACAATGCTGTAAATAAAAAATCTCCAAATGTAATTAAAGAAGCGAAAACATCAGGTCTTATCACT
    GGGTTTAG
    TTTCCCTATTCATACGGCTAACAATGGCTTCGGAATGCTTAGTTTTGCACATTCAGAAAAAGACAACTATATAGATAGTTTATT
    TTTACATG
    CGTGTATGAACATACCATTAATTGTTCCTTCTCTAGTTGATAATTATCGAAAAATAAATATAGCAAATAATAAATCAAACAACG
    ATTTAACC
    AAAAGAGAAAAAGAATGTTTAGCGTGGGCATGCGAAGGAAAAAGCTCTTGGGATATTTCAAAAATATTAGGTTGCAGTGAGCGT
    ACTGTCAC
    TTTCCATTTAACCAATGCGCAAATGAAACTCAATACAACAAACCGCTGCCAAAGTATTTCTAAAGCAATTTTAACAGGAGCAAT
    TGATTGCC CATACTTTAAAAATTAA
    tetR Gene ATGTCCAGATTAGATAAAAGTAAAGTGATTAACAGCGCATTAGAGCTGCTTAATGAGGTCGGAATCGAAGGTTTAACAACCCGT
    AAACTCGC
    CCAGAAGCTAGGTGTAGAGCAGCCTACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCCTTAGCCATTGAGAT
    GTTAGATA
    GGCACCATACTCACTTTTGCCCTTTAGAAGGGGAAAGCTGGCAAGATTTTTTACGTAATAACGCTAAAAGTTTTAGATGTGCTT
    TACTAAGT
    CATCGCGATGGAGCAAAAGTACATTTAGGTACACGGCCTACAGAAAAACAGTATGAAACTCTCGAAAATCAATTAGCCTTTTTA
    TGCCAACA
    AGGTTTTTCACTAGAGAATGCATTATATGCACTCAGCGCTGTGGGGCATTTTACTTTAGGTTGCGTATTGGAAGATCAAGAGCA
    TCAAGTCG
    CTAAAGAAGAAAGGGAAACACCTACTACTGATAGTATGCCGCCATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTG
    CAGAGCCA
    GCCTTCTTATTCGGCCTTGAATTGATCATATGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCCTAA
    cymR Gene ATGAGCCCGAAACGTCGTACCCAGGCAGAACGTGCAATGGAAACCCAGGGTAAACTGATTGCAGCAGCACTGGGTGTTCTGCGT
    GAAAAAGG
    TTATGCAGGTTTTCGTATTGCAGATGTTCCGGGTGCAGCCGGTGTTAGCCGTGGTGCACAGAGCCATCATTTTCCGACCAAACT
    GGAACTGC
    TGCTGGCAACCTTTGAATGGCTGTATGAGCAGATTACCGAACGTAGCCGTGCACGTCTGGCAAAACTGAAACCGGAAGATGATG
    TTATTCAG
    CAGATGCTGGATGATGCAGCAGAATTTTTTCTGGATGATGATTTTAGCATCAGCCTGGATCTGATTGTTGCAGCAGATCGTGAT
    CCGGCACT
    GCGTGAAGGTATTCAGCGTACCGTTGAACGTAATCGTTTTGTTGTTGAAGATATGTGGCTGGGTGTGCTGGTGAGCCGTGGTCT
    GAGCCGTG
    ATGATGCCGAAGATATTCTGTGGCTGATTTTTAACAGCGTTCGTGGTCTGGCAGTTCGTAGCCTGTGGCAGAAAGATAAAGAAC
    GTTTTGAA CGTGTGCGTAATAGCACCCTGGAAATTGCACGTGAACGTTATGCAAAATTCAAACGTTGA
    phlF Gene ATGGCACGTACCCCGAGCCGTAGCAGCATTGGTAGCCTGCGTAGTCCGCATACCCATAAAGCAATTCTGACCAGCACCATTGAA
    ATCCTGAA
    AGAATGTGGTTATAGCGGTCTGAGCATTGAAAGCGTTGCACGTCGTGCCGGTGCAAGCAAACCGACCATTTATCGTTGGTGGAC
    CAATAAAG
    CAGCACTGATTGCCGAAGTGTATGAAAATGAAAGCGAACAGGTGCGTAAATTTCCGGATCTGGGTAGCTTTAAAGCCGATCTGG
    ATTTTCTG
    CTGCGTAATCTGTGGAAAGTTTGGCGTGAAACCATTTGTGGTGAAGCATTTCGTTGTGTTATTGCAGAAGCACAGCTGGACCCT
    GCAACCCT
    GACCCAGCTGAAAGATCAGTTTATGGAACGTCGTCGTGAGATGCCGAAAAAACTGGTTGAAAATGCCATTAGCAATGGTGAACT
    GCCGAAAG
    ATACCAATCGTGAACTGCTGCTGGATATGATTTTTGGTTTTTGTTGGTATCGCCTGCTGACCGAACAGCTGACCGTTGAACAGG
    ATATTGAA GAATTTACCTTCCTGCTGATTAATGGTGTTTGTCCGGGTACACAGCGTTAA
    nahR Gene ATGGAACTGCGTGACCTGGATTTAAACCTGCTGGTGGTGTTCAACCAGTTGCTGGTCGACAGACGCGTCTCTGTCACTGCGGAG
    AACCTGGG
    CCTGACCCAGCCTGCCGTGAGCAATGCGCTGAAACGCCTGCGCACCTCGCTACAGGACCCACTCTTCGTGCGCACACATCAGGG
    AATGGAAC
    CCACACCCTATGCCGCGCATCTGGCCGAGCACGTCACTTCGGCCATGCACGCACTGCGCAACGCCCTACAGCACCATGAAAGCT
    TCGATCCG
    CTGACCAGCGAGCGTACCTTCACCCTGGCCATGACCGACATTGGCGAGATCTACTTCATGCCGCGGCTGATGGATGCGCTGGCT
    CACCAGGC
    CCCCAATTGCGTGATCAGTACGGTGCGCGACAGTTCGATGAGCCTGATGCAGGCCTTGCAGAACGGAACCGTGGACTTGGCCGT
    GGGCCTGC
    TTCCCAATCTGCAAACTGGCTTCTTTCAGCGCCGGCTGCTCCAGAATCACTACGTGTGCCTATGTCGCAAGGACCATCCAGTCA
    CCCGCGAA
    CCCCTGACTCTGGAGCGCTTCTGTTCCTACGGCCACGTGCGTGTCATCGCCGCTGGCACCGGCCACGGCGAGGTGGACACGTAC
    ATGACACG
    GGTCGGCATCCGGCGCGACATCCGTCTGGAAGTGCCGCACTTCGCCGCCGTTGGCCACATCCTCCAGCGCACCGATCTGCTCGC
    CACTGTGC
    CGATATGTTTAGCCGACTGCTGCGTAGAGCCCTTCGGCCTAAGCGCCTTGCCGCACCCAGTCGTCTTGCCTGAAATAGCCATCA
    ACATGTTC
    TGGCATGCGAAGTACCACAAGGACCTAGCCAATATTTGGTTGCGGCAACTGATGTTTGACCTGTTTACGGATTGATAA
    PFde Promoter18 TCAATGTATTGATGCCGTCCATATCATGAATCAAAACAATCCATTTGATCAATATCAAGCTCACTCTTAAGCTTCACTCA
    TCCGCTGCAT
    fdeR Gene ATGCGTTTCAACAAGCTCGACCTCAATCTTCTGGTCGCCCTGGATGCACTGCTCACGGAGATGAGCATCAGCCGCGCCGCCGAA
    AAGATCCA
    TCTGAGCCAGTCGGCCATGAGCAATGCCCTGGCGCGGCTGCGCGAGTATTTCGATGATGAATTGCTGATCCAGGTGGGCCGGCG
    CATGGAGC
    CCACGCCGCGCGCCGAGGTGCTCAAGGATGCGGTGCATGATGTGCTGCGGCGTATCGATGGCTCCATCGCGGCGCTGCCGGCCT
    TCGTGCCG
    GCCGAGTCCACGCGCGAGTTTCGCATCTCGGTTTCGGACTTTACGCTCTCCGTCCTCATCCCCCGGGTGCTGGCGCGCGCGCAC
    GCCGAGGG
    CAAGCACATCCGCTTTGCCCTGATGCCGCAGGTGCAAGACCCGACCCGCTCGCTGGATCGGGCCGAGGTGGACCTGCTGGTCTT
    GCCGCAGG
    AATTCTGCACGCCCGATCATCCTGCCGAAGAGGTCTTCCGCGAACGGCATGTCTGCGTGGTCTGGCGCGACAGTGCGCTGGCGC
    AAGGCGAG
    CTGACGCTGGAACGCTACATGGCCTCAGGCCATGTGGTGATGGTGCCGCCTGGGGCCAATGCGTCGTCGGTGGAGGCGTGGATG
    GCCAGGAA
    GCTGGGCTTTGCGCGCCGGGTGGAAGTGACCAGCTTCAGCTTCGCTTCTGCGCTGGCGCTGGTACAGGGGACGGACCGCATCGC
    CACGGTGC
    ATGCCCGGCTGGCGCAGCTGCTGGCTCCGCAATGGCCGGTGGTGATCAAGGAGAGTCCGCTGTCGCTGGGCGAGATGCGGCAGA
    TGATGCAG
    TGGCATCGCTACCGCAGCAATGATCCTGGCATCCAGTGGCTGCGTCGGGTGTTTCTGGAGAGTGCGCAGGAGATGGATGCGGCG
    CTGCCAGG CATCTGCTGA
    PBAD.10 Promoter CAGACATTGCCGTCACTGCGTCTTTTACTGGCTCTTCTCGCTAACCAAACCGGTAACCCCGCTTATTAAAAGCATTCTGTAACA
    AAGCGGGA
    CCAAAGCCATGACAAAAACGCGTAACAAAAGTGTCTATAATCACGGCAGAAAAGTCCACATTGATTATTTGCACGGCGTCACAC
    TTTGCTAT
    GCCATAGCATTTTTATCCATAAGATTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTATATTTTCTCCATACCCGTTT
    TTTTGGGCTAGCGAATTC
    araC Gene ATGCAATATGGACAATTGGTTTCTTCTCTGAATGGCGGGAGTATGAAAAGTATGGCTGAAGCGCAAAATGATCCCCTGCTGCCG
    GGATACTC
    GTTTAATGCCCATCTGGTGGCGGGTTTAACGCCGATTGAGGCCAACGGTTATCTCGATTTTTTTATCGACCGACCGCTGGGAAT
    GAAAGGTT
    ATATTCTCAATCTCACCATTCGCGGTCAGGGGGTGGTGAAAAATCAGGGACGAGAATTTGTTTGCCGACCGGGTGATATTTTGC
    TGTTCCCG
    CCAGGAGAGATTCATCACTACGGTCGTCATCCGGAGGCTCGCGAATGGTATCACCAGTGGGTTTACTTTCGTCCGCGCGCCTAC
    TGGCATGA
    ATGGCTTAACTGGCCGTCAATATTTGCCAATACGGGGTTCTTTCGCCCGGATGAAGCGCACCAGCCGCATTTCAGCGACCTGTT
    TGGGCAAA
    TCATTAACGCCGGGCAAGGGGAAGGGCGCTATTCGGAGCTGCTGGCGATAAATCTGCTTGAGCAATTGTTACTGCGGCGCATGG
    AAGCGATT
    AACGAGTCGCTCCATCCACCGATGGATAATCGGGTACGCGAGGCTTGTCAGTACATCAGCGATCACCTGGCAGACAGCAATTTT
    GATATCGC
    CAGCGTCGCACAGCATGTTTGCTTGTCGCCGTCGCGTCTGTCACATCTTTTCCGCCAGCAGTTAGGGATTAGCGTCTTAAGCTG
    GCGCGAGG
    ACCAACGTATCAGCCAGGCGAAGCTGCTTTTGAGCACCACCCGGATGCCTATCGCCACCGTCGGTCGCAATGTTGGTTTTGACG
    ATCAACTC
    TATTTCTCGCGGGTATTTAAAAAATGCACCGGGGCCAGCCCGAGCGAGTTCCGTGCCGGTTGTGAAGAAAAAGTGAATGATGTA
    GCCGTCAA GTTGTCATAA
    araE Gene ATGGTTACTATCAATACGGAATCTGCTTTAACGCCACGTTCTTTGCGGGATACGCGGCGTATGAATATGTTTGTTTCGGTAGCT
    GCTGCGGT
    CGCAGGATTGTTATTTGGTCTTGATATCGGCGTAATCGCCGGAGCGTTGCCGTTCATTACCGATCACTTTGTGCTGACCAGTCG
    TTTGCAGG
    AATGGGTGGTTAGTAGCATGATGCTCGGTGCAGCAATTGGTGCGCTGTTTAATGGTTGGCTGTCGTTCCGCCTGGGGCGTAAAT
    ACAGCCTG
    ATGGCGGGGGCCATCCTGTTTGTACTCGGTTCTATAGGGTCCGCTTTTGCGACCAGCGTAGAGATGTTAATCGCCGCTCGTGTG
    GTGCTGGG
    CATTGCTGTCGGGATCGCGTCTTACACCGCTCCTCTGTATCTTTCTGAAATGGCAAGTGAAAACGTTCGCGGTAAGATGATCAG
    TATGTACC
    AGTTGATGGTCACACTCGGCATCGTGCTGGCGTTTTTATCCGATACAGCGTTCAGTTATAGCGGTAACTGGCGCGCAATGTTGG
    GGGTTCTT
    GCTTTACCAGCAGTTCTGCTGATTATTCTGGTAGTCTTCCTGCCAAATAGCCCGCGCTGGCTGGCGGAAAAGGGGCGTCATATT
    GAGGCGGA
    AGAAGTATTGCGTATGCTGCGCGATACGTCGGAAAAAGCGCGAGAAGAACTCAACGAAATTCGTGAAAGCCTGAAGTTAAAACA
    GGGCGGTT
    GGGCACTGTTTAAGATCAACCGTAACGTCCGTCGTGCTGTGTTTCTCGGTATGTTGTTGCAGGCGATGCAGCAGTTTACCGGTA
    TGAACATC
    ATCATGTACTACGCGCCGCGTATCTTCAAAATGGCGGGCTTTACGACCACAGAACAACAGATGATTGCGACTCTGGTCGTAGGG
    CTGACCTT
    TATGTTCGCCACCTTTATTGCGGTGTTTACGGTAGATAAAGCAGGGCGTAAACCGGCTCTGAAAATTGGTTTCAGCGTGATGGC
    GTTAGGCA
    CTCTGGTGCTGGGCTATTGCCTGATGCAGTTTGATAACGGTACGGCTTCCAGTGGCTTGTCCTGGCTCTCTGTTGGCATGACGA
    TGATGTGT
    ATTGCCGGTTATGCGATGAGCGCCGCGCCAGTGGTGTGGATCCTGTGCTCTGAAATTCAGCCGCTGAAATGCCGCGATTTCGGT
    ATTACCTG
    TTCGACCACCACGAACTGGGTGTCGAATATGATTATCGGCGCGACCTTCCTGACACTGCTTGATAGCATTGGCGCTGCCGGTAC
    GTTCTGGC
    TCTACACTGCGCTGAACATTGCGTTTGTGGGCATTACTTTCTGGCTCATTCCGGAAACCAAAAATGTCACGCTGGAACATATCG
    AACGCAAA CTGATGGCAGGCGAGAAGTTGAGAAATATCGGCGTCTGA
    Ptac Promoter10 CTCGAGTGTTGACAATTAATCATCGGCTCGTATAATGTGTGGAATTGTGAGCGCTCACAATTTCACACATCTAGA
    Pnoc Promoter AACAAATACACATGGGCGCATGCCTATTACTGCCCTTGCGATATGGAAGGCAAGCTTTTAGTAACAATAGAAAACTGGGTCCTA
    CTCTCGAA
    GAATGCACTGCGGCGGTCACGTCAACACGTGCTGCACCGTTGAGAATGAATGCTGGGCAGATTGCCAGCGGCGTCATTTTCGGC
    TGTCCCGT CCTCACGGTTTTGCGCTGCATCGCAAGAGATTGGGAA
    nocR Gene ATGACGTCAGCAGCGAATCTGGTGAGGATCACGCAGCCCGCGATCAGCCGGCTGATCAGGGATCTCGAAGAGGAAATTGGGATC
    AGCCTCTT
    CGAAAGAACGGGCAACCGGTTACGTCCTACGCGGGAGGCCGGTATTCTGTTCAAGGAAGTGTCGCGACATTTCAACGGGATTCA
    GCACATCG
    ACAAAGTCGCGGCTGAACTGAAGAAGTCTCATATGGGGTCCCTAAGGGTCGCCTGTTATACAGCGCCAGCTCTGAGTTTTATGT
    CCGGCGTC
    ATTCAGACGTTCATCGCCGATCGGCCCGACGTGTCGGTCTACCTCGACACAGTTCCTTCCCAGACGGTCCTCGAATTGGTCTCG
    CTCCAGCA
    CTACGATCTCGGAATATCGATATTGGCTGGCGACTATCCTGGTCTCACCACCGAACCTGTCCCTTCCTTTCGTGCGGTCTGCCT
    GCTGCCGC
    CGGGGCATCGTCTCGAAGACAAGGAAACTGTTCATGCGACGGACCTTGAAGGAGAGTCATTGATTTGCCTCTCTCCAGTGAGCC
    TTCTACGG
    ATGCAAACGGACGCCGCACTGGACAGCTGCGGCGTCCACTGTAATCGCAGGATAGAAAGTAGTCTGGCGCTGAATCTCTGCGAT
    CTGGTAAG
    CAGGGGAATGGGGGTTGGTATCGTCGACCCCTTCACTGCCGACTACTACAGTGCAAATCCGGTTATTCAGCGCTCCTTTGATCC
    GGTTGTCC
    CCTACCATTTTGCTATAGTTCTTCCGACCGACAGCCCACCGCCGCGCTTGGTTAGCGAGTTCCGGGCAGCGTTGCTTGATGCTT
    TGAAAGCC TTGCCCTATGAAACCATTTGA
    Pocc Promoter AAACGCACCATAACATCTGCTTATTCTTGCCCGGTCATTATGAATTTGACCGAATGCATATCGAATGTAAAGCTCACCCTATAA
    ATCACAAC TCTTCCGGGCCAACCGGGATCAGACGT
    occR Gene ATGAATCTCAGGCAGGTCGAGGCGTTCCGGGCAGTCATGCTGACGGGGCAAATGACGGCGGCGGCTGAACTAATGCTGGTGACT
    CAGCCGGC
    CATCAGTCGCCTAATCAAGGACTTTGAACAGGCGACAAAACTGCAGCTCTTCGAGAGGCGTGGGAACCATATTATCCCGACACA
    GGAGGCAA
    AGACGCTGTGGAAAGAGGTCGATCGGGCGTTCGTCGGGCTTAATCATATAGGCAACCTGGCTGCCGACATCGGCAGGCAGGCAG
    CGGGGACG
    CTCCGCATTGCTGCAATGCCTGCTCTGGCAAACGGCCTCTTGCCGCGGTTTCTTGCTCAGTTCATCCGTGACAGACCAAATCTC
    CAGGTCTC
    CCTAATGGGACTGCCCTCAAGCATGGTCATGGAAGCCGTTGCGTCCGGCAGGGCCGACATCGGTTATGCCGATGGCCCACAGGA
    GCGCCAAG
    GTTTTCTAATCGAAACCCGGTCGCTTCCCGCTGTTGTCGCTGTCCCGATGGGACATCGACTTGCTGGCCTTGACCGTGTCACGC
    CACAGGAC
    CTTGCCGGTGAGCGTATTATAAAACAGGAGACTGGCACTCTCTTCGCCATGCGGGTAGAGGTGGCGATTGGTGGTATTCAACGC
    CGGCCGTC
    AATTGAAGTGAGCCTGTCGCATACTGCGCTAAGTCTCGTCCGCGAAGGCGCCGGGATCGCAATTATCGATCCAGCCGCGGCGAT
    CGAGTTCA
    CGGACAGGATCGTACTGCGACCGTTCTCGATCTTCATTGACGCCGGATTCCTCGAAGTCCGGTCAGCAATTGGCGCTCCCTCAA
    CCATCGTC GATCGTTTCACAACCGAATTCTGGAGGTTTCATGATGACTTGATGAAGCAGAACGGCCTAATGGAGTAA
    PBet Promoter17 AGCGCGGGTGAGAGGGATTCGTTACCAATAGACAATTGATTGGACGTTCAATATAATGCTAGC
    PCin Promoter CCCTTTGTGCGTCCAAACGGACGCACGGCGCTCTAAAGCGGGTCGCGATCTTTCAGATTCGCTCCTCGCGCTTTCAGTCTTTGTTTTGGCGC
    ATGTCGTTATCGCAAAACCGCTGCACACTTTTGCGCGACATGCTCTGATCCCCCTCATCTGGGGGGGCCTATCTGAGGGAATTT
    CCGATCCG GCTCGCCTGAACCATTCTGCTTTCCACGAACTTGAAAACGCT
    P3B5B Promoter5 TTTTGTTCGATTATCGAACAAATTATTGAAATATCGAACAAAACCTCTAAACTACTGTGGCACTGAATCAAAAAATTATA AACCCTGATCAG A
    PTTg Promoter19 CACCCAGCAGTATTTACAAACAACCATGAATGTAAGTATATTCCTTAGCAA
    PVan Promoter20 ATTGGATCCAATTGACAGCTAGCTCAGTCCTAGGTACCATTGGATCCAAT
  • TABLE 6
    RBS sequences used in this study
    Name Strain RBS sequencea (SEQ ID NOs: 226-291) Strength (GFP, au)
    RBSr1 R. sp. IRBG74 ATTTCACACATCTAGAGCTAATCATCTCGTACTAAAGAGGAGAAATTAA 8242
    CCATG
    RBSr2 R. sp. IRBG74 ATTTCACACATCTAGAGCTAATCATCGCGTACTCAGGAGGCAAGTAATG 7181.5
    RBSr3 R. sp. IRBG74 ATTTCACACATCTAGAATTAAAGAGGAGAAATTAACCATG 6238.5
    RBSr4 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTAAAGAGGCAAGTAATG 3618
    RBSr5 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTAAGGAGGCAAGTAATG 3560
    RBSr6 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTCAAGAGGCAAGTAATG 2614.5
    RBSr7 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAAAGAGGCAAGTAATG 2418.5
    RBSr8 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTCAGGAGGCAAGTAATG 1882.5
    RBSr9 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTAATGAGGCAAGTAATG 1593.5
    RBSr10 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTAATGAGGCAAGTAATG 1590
    RBSr11 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTCACGAGGCAAGTAATG 1554
    RBSr12 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTAAAAAGGCAAGTAATG 1138
    RBSr13 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAAAAAGGCAAGTAATG 895.5
    RBSr14 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAAGAAGGCAAGTAATG 632.5
    RBSr15 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTAAATAGGCAAGTAATG 648.5
    RBSr16 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTAATAAGGCAAGTAATG 532
    RBSr17 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCTCGTACTAAAGAGGCAAGTAATG 488
    RBSr18 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTCAATAGGCCAGTAATG 305.5
    RBSr19 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTAAGTAGGCAAGTAATG 242
    RBSr20 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTAACGAGGCAAGTAATG 248
    RBSr21 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTCAGCAGGCAAGTAATG 183
    RBSr22 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAAGTAGGCAAGTAATG 130
    RBSr23 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAATTAGGCAAGTAATG 84.4
    RBSr24 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCTCGTACTAACAAGGCAAGTAATG 75.15
    RBSr25 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTCAATAGGCAAGTAATG 45.45
    RBSr26 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCTCGTACTAAGCACGCAAGTAATG 36
    RBSr27 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCATCGCGTACTAACTACGCAAGTAATG 12.2
    RBSr28 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAAGAACGCAAGTAATG 13
    RBSr29 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAAAAACGCAAGTAATG 4.6
    RBSr30 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCGCGTACTAACAACGCAAGTAATG 2.95
    RBSr31 R. sp. IRBG74 TAACAATTTCACACATCTAGAGCTAATCTTCTCGTACTCATGACGCAAGTAATG 1.45
    RBSr32 R. sp. IRBG74 ATTTCACACATCTAGAATTAAAGAGAAGAAATTAACCATG N/Ab
    RBSr33 R. sp. IRBG74 CTAGTGCGAACTAGCTCATACCGCAGATG N/Ab
    RBSp1 P. protegens Pf-5 CTAGCGCAGGTCCAACGTTTTTCTAAGCAAGGAGGTCATATG 25090
    RBSp2 P. protegens Pf-5 CTAGCGAAGGTCCAACGTTTTTCTAAGCAAGGAGGTCATATG 21590
    RBSp3 P. protegens Pf-5 CTAGCGAAGGTCCAACGTTTTTCTAAGCCAGGAGGTCATATG 19690
    RBSp4 P. protegens Pf-5 CTAGCGCAGGTCCAACGTTTTTCTAAGCCAGGAGGTCATATG 19490
    RBSp5 P. protegens Pf-5 CTAGCGAAGCTCCAACGTTTTTCTAAGCAAGGAGGTCATATG 17990
    RBSp6 P. protegens Pf-5 GAATTCTACACTAACGGACAGGAGGGTCCGATG 14490
    RBSp7 P. protegens Pf-5 GAATTCTAAACTAACGGACAGGAGGGTCCGATG 13390
    RBSp8 P. protegens Pf-5 GAATTCTAAGCTAACGGACAGGAGGGTCCGATG 12790
    RBSp9 P. protegens Pf-5 GAATTCTTAACTAACGGACAGGAGGGTCCGATG 11490
    RBSp10 P. protegens Pf-5 GAATTCTACACTAACGGACAGGAGGGTCGGATG 11090
    RBSp11 P. protegens Pf-5 GAATTCTACGCTAACGGACAGGAGGGTCCGATG 10390
    RBSp12 P. protegens Pf-5 GAATTCTCAACTAACGGACAGGAGGGTCCGATG 9590
    RBSp13 P. protegens Pf-5 GAATTCTAAGCTAACGGACAGGAGGGTCGGATG 8918
    RBSp14 P. protegens Pf-5 GAATTCTCAGCTAACGGACAGGAGGGTCCGATG 8766
    RBSp15 P. protegens Pf-5 GAATTCTCAACTAACGGACAGGAGGGTCCGATG 7596
    RBSp16 P. protegens Pf-5 GAATTCTACGCTAACGGACAGGAGGGTCGGATG 6055
    RBSp17 P. protegens Pf-5 GAATTCTCAACTAACGGACAGGAGATATACATATG 5939
    RBSp18 P. protegens Pf-5 GAATTCTCAGCTAACGGACAGGAGGGTCGGATG 5915
    RBSp19 P. protegens Pf-5 GAATTCTAAACTAACGGACAGGAGGGTCGGATG 4867
    RBSp20 P. protegens Pf-5 GAATTCTCAGCTCACGGACAGGAGGGTCGGATG 4426
    RBSp21 P. protegens Pf-5 GAATTCTCAACTAACGGACAGGAGGGTCGGGATG 4110
    RBSp22 P. protegens Pf-5 GAATTCTACACTCACGGACAGGAGGGTCGGATG 3977
    RBSp23 P. protegens Pf-5 GAATTCTAAGCTCACGGACAGGAGGGTCGGATG 3829
    RBSp24 P. protegens Pf-5 GAATTCTCAACTCACGGACAGGAGGGTCGGATG 3661
    RBSp25 P. protegens Pf-5 GAATTCTACACTAACGGACAGCAGGGTCGGATG 3542
    RBSp26 P. protegens Pf-5 CTAGCGCAGGTCCAACCTTTTTCTAAGCAAGTAGGTCATATG 2139
    RBSp27 P. protegens Pf-5 GAATTCTCAGCTAACGGACAGCAGGGTCGGATG 1265
    RBSp28 P. protegens Pf-5 CTAGCGCAGGTCCAACCTTTTTCTAAGCAACTAGGTCATATG 389
    RBSp29 P. protegens Pf-5 CTAGCGAAGGTCCAACCTTTTTCTAAGCCAGTAGGTCATATG 377
    RBSp30 P. protegens Pf-5 GAATTCTACGCTCACGGACAGCAGGGTCGGATG 221
    RBSp31 P. protegens Pf-5 GAATTCTCCGCTCACGGACAGGAGGGTCCGATG 23.3
    RBSp32 P. protegens Pf-5 CTTCTCGGCCAGCTGACAGGGGAAGCTCGCATG N/Ab
    RBSp33 P. protegens Pf-5 CTTCTCGGCCAGCTGACAGGAGGAAGCTCGC A TG N/Ab
    aThe start codon is underlined.
    bRBSs are rationally designed for the controllers by the RBS Calculator2
  • TABLE 7
    Chemicals used in this study
    Chemicals Source Identifier
    Tryptone Fisher Scientific Cat# BP1421
    Yeast extract BD Bacto Cat# DF0127
    NaCl Fisher Scientific Cat# S271
    CaCl2•2H2O Sigma-Aldrich Cat# C3306
    MgSO4•7H2O Fisher Scientific Cat# M80
    FeCl3 Alfa Aesar Cat# AA1235709
    Na2MoO4•2H2O Sigma-Aldrich Cat# 331058
    NH4CH3CO2 Sigma-Aldrich Cat# A1542
    Na2HPO4 Fisher Scientific Cat# S375
    KH2PO4 Sigma-Aldrich Cat# P9791
    EDTA-Na2 Sigma-Aldrich Cat# E5134
    ZnSO4•7H2O ACROS Organics Cat# AC424605000
    H3BO3 Fisher Scientific Cat# A73
    MnSO4•H2O MP Biomedicals Cat# ICN225099
    CuSO4•5H2O Aldon Corp Cat# CC0535
    CoCl2•6H2O Sigma-Aldrich Cat# C8661
    FeSO4•7H2O Sigma-Aldrich Cat# 215422
    Thiamine hydrochloride ACROS Organics Cat# 148990100
    D-pantothenic acid hemicalcium salt Sigma-Aldrich Cat# P5155
    Biotin Sigma-Aldrich Cat# B4501
    Nicotinic acid Sigma-Aldrich Cat# 72309
    MOPS Fisher Scientific Cat# BP308
    Isopropyl-beta-D-thiogalactoside (IPTG) GoldBio Cat# I2481
    L-arabinose Sigma Cat# A3256
    Anhydrotetracycline hydrochloride (aTc) Sigma Cat# 37919
    N-(3-Oxohexanoyl)-L-homoserine lactone (3OC6HSL) Sigma Cat# K3007
    N-(3-Hydroxytetradecanoyl)-DL-homoserine lactone Sigma Cat# 51481
    (3OC14HSL)
    Naringenin Sigma Cat# N5893
    2,4-Diacetylphloroglucinol (DAPG) Santa Cruz Cat# sc-206518
    Salicylic acid sodium salt Sigma Cat# S3007
    3,4-Dihydroxybenzoic acid (DHBA) Sigma Cat# 37580
    Vanillic acid Sigma Cat# 94770
    Cuminic acid Sigma Cat# 268402
    Nopaline Toronto Research Chemicals Cat# N650600
    Octopine Toronto Research Chemicals Cat# O239850
    Choline chloride Sigma Cat# C7017
    Tris (1M), pH 8.0 Invitrogen Cat# AM9855
    Triton X-100 Sigma-Aldrich Cat# T8787
    Tergitol solution Sigma-Aldrich Cat# NP40S
    DNase I Sigma-Aldrich Cat# 4716728001
    RNA Fragmentation Reagents Invitrogen Cat# AM8740
    T4 Polynucleotide kinase New England Biolabs Cat# M0201
    SUPERase•In Invitrogen Cat# AM2694
    PEG 8000 Sigma-Aldrich Cat# 1546605
    T4 RNA ligase 2, truncated K277Q New England Biolabs Cat# M0351
    SuperScript III reverse transcriptase Invitrogen Cat# 18080044
    CircLigase ssDNA ligase Epicentre Cat# CL4115K
    Phusion High-Fidelity DNA polymerase New England Biolabs Cat# M0530
    Micrococcal nuclease Roche 10107921001

Claims (24)

1. A rhizobium that can fix nitrogen under aerobic free-living conditions, comprising a symbiotic rhizobium having an exogenous nif cluster, wherein the exogenous nif cluster confers nitrogen fixation capability on the symbiotic rhizobium under aerobic free-living conditions, and wherein the rhizobium is not Azorhizobium caulinodans.
2. The rhizobium of claim 1, wherein the exogenous nif cluster is selected from a group consisting of a free-living diazotroph, a symbiotic diazotroph, a photosynthetic Alphaproteobacteria, a Gammaproteobacteria, a cyanobacteria, a firmicutes, a Rhodobacter sphaeroides, and a Rhodopseudomonas palustris.
3. The rhizobium of claim 1, wherein the exogenous nif cluster is an inducible refactored nif cluster.
4. The rhizobium of claim 3, wherein the inducible refactored nif cluster is an inducible refactored Klebsiella nif cluster.
5. The rhizobium of claim 1, wherein the rhizobium is IRBG74.
6. The rhizobium of claim 1, wherein the exogenous nif cluster comprises 6 nif genes or operons.
7. The rhizobium of claim 6, wherein the 6 nif genes or operons are nifHDK(T)Y, nifEN(X), nifJ, nifBQ, nifF, and nifUSVWZM.
8. The rhizobium of claim 6, wherein each nif gene or operon of the exogenous nif cluster is preceded by a T7 promoter.
9. The rhizobium of claim 1, further comprising an endogenous nif cluster.
10. The rhizobium of claim 1, wherein the exogenous nif cluster further comprises a terminator.
11. The rhizobium of claim 8, wherein the T7 promoter has a terminator and wherein the terminator is downstream from the T7 promoter.
12. The rhizobium of claim 11, wherein the exogenous nif cluster is a refactored rhizobium IRBG74 nif cluster.
13-33. (canceled)
34. A method for making a nitrogen-fixing bacterium, the method comprising:
a) identifying a host bacterium;
b) selecting a donor bacterium having a nif cluster based on evolutionary distance between the host bacterium and the donor bacterium;
c) inserting the nif cluster of the donor bacterium to the host bacterium, thereby making a nitrogen-fixing bacterium.
35. The method of claim 34, wherein the evolutionary distance between the host bacterium and the donor bacterium is less than 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.9%, 0.8%, 0.7%, 0.6%, 0.5%, 0.4%, 0.3%, 0.2%, or 0.1% substitutions per site in 16S ribosomal RNA gene sequence.
36. The method of claim 34, wherein the host bacterium and the donor bacterium are in the same genus, family, order, or class.
37-39. (canceled)
40. The method of claim 34, wherein the host bacterium is E. coli and the donor bacterium is K. oxytoca.
41. (canceled)
42. The method of claim 34, wherein the host bacterium is Rhizobium IRBG74, and the donor bacterium is R. sphaeroides.
43. The method of claim 34, wherein the host bacterium is a nonsymbiotic bacterium, e.g., Azotobacter, Beijerinckia, or Clostridium bacterium.
44-45. (canceled)
46. The method of claim 34, wherein the inserted nif cluster is under inducible control.
47-59. (canceled)
US17/440,618 2019-03-19 2020-03-19 Control of nitrogen fixation in rhizobia that associate with cereals Pending US20220162544A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/440,618 US20220162544A1 (en) 2019-03-19 2020-03-19 Control of nitrogen fixation in rhizobia that associate with cereals

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201962820765P 2019-03-19 2019-03-19
US16/746,215 US12281299B2 (en) 2019-03-19 2020-01-17 Control of nitrogen fixation in rhizobia that associate with cereals
PCT/US2020/023646 WO2020191201A1 (en) 2019-03-19 2020-03-19 Control of nitrogen fixation in rhizobia that associate with cereals
US17/440,618 US20220162544A1 (en) 2019-03-19 2020-03-19 Control of nitrogen fixation in rhizobia that associate with cereals

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US16/746,215 Continuation US12281299B2 (en) 2019-03-19 2020-01-17 Control of nitrogen fixation in rhizobia that associate with cereals

Publications (1)

Publication Number Publication Date
US20220162544A1 true US20220162544A1 (en) 2022-05-26

Family

ID=69593791

Family Applications (3)

Application Number Title Priority Date Filing Date
US16/746,215 Active 2042-11-09 US12281299B2 (en) 2019-03-19 2020-01-17 Control of nitrogen fixation in rhizobia that associate with cereals
US17/440,618 Pending US20220162544A1 (en) 2019-03-19 2020-03-19 Control of nitrogen fixation in rhizobia that associate with cereals
US19/174,691 Pending US20250257316A1 (en) 2019-03-19 2025-04-09 Control of nitrogen fixation in rhizobia that associate with cereals

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US16/746,215 Active 2042-11-09 US12281299B2 (en) 2019-03-19 2020-01-17 Control of nitrogen fixation in rhizobia that associate with cereals

Family Applications After (1)

Application Number Title Priority Date Filing Date
US19/174,691 Pending US20250257316A1 (en) 2019-03-19 2025-04-09 Control of nitrogen fixation in rhizobia that associate with cereals

Country Status (4)

Country Link
US (3) US12281299B2 (en)
EP (1) EP3941930A1 (en)
CN (1) CN113710690A (en)
WO (2) WO2020190363A1 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11963530B2 (en) 2018-06-27 2024-04-23 Pivot Bio, Inc. Agricultural compositions comprising remodeled nitrogen fixing microbes
US12391624B2 (en) 2018-07-11 2025-08-19 Pivot Bio, Inc. Temporally and spatially targeted dynamic nitrogen delivery by remodeled microbes
US12421519B2 (en) 2019-01-07 2025-09-23 Pivot Bio, Inc. Plant colonization assays using natural microbial barcodes
US12478068B2 (en) 2020-05-01 2025-11-25 Pivot Bio, Inc. Stable liquid formulations for nitrogen-fixing microorganisms

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
HUE046506T2 (en) 2011-06-16 2020-03-30 Univ California Synthetic gene clusters
US10968446B2 (en) 2012-11-01 2021-04-06 Massachusetts Institute Of Technology Directed evolution of synthetic gene cluster
KR102461443B1 (en) 2015-07-13 2022-10-31 피벗 바이오, 인크. Methods and compositions for improving plant traits
JP2018537119A (en) 2015-10-05 2018-12-20 マサチューセッツ インスティテュート オブ テクノロジー Nitrogen fixation using refactored nif clusters
JP7234116B2 (en) 2017-01-12 2023-03-07 ピボット バイオ, インコーポレイテッド Methods and compositions for improving plant traits
AU2018354338B2 (en) 2017-10-25 2023-10-26 Pivot Bio, Inc. Gene targets for nitrogen fixation targeting for improving plant traits
KR20200088342A (en) 2017-10-25 2020-07-22 피벗 바이오, 인크. Methods and compositions for improving genetically engineered microorganisms that fix nitrogen
WO2020190363A1 (en) 2019-03-19 2020-09-24 Massachusetts Institute Of Technology Control of nitrogen fixation in rhizobia that associate with cereals
AU2020445067A1 (en) 2020-05-01 2022-12-01 Pivot Bio, Inc. Measurement of nitrogen fixation and incorporation
CN113248582A (en) * 2021-05-11 2021-08-13 西安交通大学 Transformation and regulation method of nitrogen-fixing microorganisms
EP4137576A1 (en) * 2021-08-17 2023-02-22 Justus-Liebig-Universität Gießen Chromosomal integrating cassette allowing inducible gene expression for production of compounds via fermentation
CN115011534B (en) * 2022-03-23 2023-11-03 山东农业大学 A mutant strain, construction method and application of stem nodule nitrogen-fixing rhizobium ORS571
WO2023225117A1 (en) * 2022-05-17 2023-11-23 Bioconsortia, Inc. Methods and compositions for refactoring nitrogen fixation clusters
CN116083299B (en) * 2022-12-07 2024-06-11 塔里木大学 Mubase rhizobium for promoting chickpea nodulation and application thereof
WO2025015205A1 (en) 2023-07-11 2025-01-16 Massachusetts Institute Of Technology Replacing synthetic fertilizers by engineering cereal-associated nitrogen-fixing bacteria to release urea
US20250243130A1 (en) * 2024-01-31 2025-07-31 Switch Bioworks, Inc. Phosphate sensing microbial gene switch

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6548289B1 (en) * 1984-12-28 2003-04-15 Land O'lakes, Inc. Biological nitrogen fixation

Family Cites Families (228)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US1520545A (en) 1924-09-04 1924-12-23 Charles K Morganroth Transmission gearing
US4832728A (en) 1981-09-25 1989-05-23 Melamine Chemicals, Inc. Fertilizer compositions, processes of making them, and pocesses of using them
US4782022A (en) 1984-06-04 1988-11-01 Lubrizol Genetics, Inc. Nitrogen fixation regulator genes
DK609885D0 (en) 1985-12-30 1985-12-30 Sven Erik Nielsen PROCEDURE FOR MODIFICATION OF ORGANISMS
US5229291A (en) 1985-12-30 1993-07-20 Novo Industri A/S Rhizobia transformants which symbiotically fixes nitrogen in non-legumes, a material for treating seeds of a non-legume plant, non-legume seeds, a non-legume plant and a method for producing rhizobia transconjungants
CA1335366C (en) 1986-08-19 1995-04-25 Joseph Kloepper Plant growth promoting rhizobacteria for agronomic nonroot crops
EP0292984A2 (en) 1987-05-29 1988-11-30 The General Hospital Corporation Cloned rhizobium meliloti ntrA (rpoN) gene
JPH01225483A (en) 1988-03-04 1989-09-08 Takeshi Uozumi Recombinant plasmid
EP0339830A3 (en) 1988-04-14 1990-01-17 Biotechnica International, Inc. Improved biological nitrogen fixation
EP1103616A3 (en) 1989-02-24 2001-06-27 Monsanto Company Synthetic plant genes and method for preparation
US5188960A (en) 1989-06-27 1993-02-23 Mycogen Corporation Bacillus thuringiensis isolate active against lepidopteran pests, and genes encoding novel lepidopteran-active toxins
US5116506A (en) 1989-06-30 1992-05-26 Oregon State University Support aerated biofilm reactor
US5071743A (en) 1989-10-27 1991-12-10 Her Majesty The Queen In Right Of Canada, As Represented By The National Research Council Of Canada Process for conducting site-directed mutagenesis
US5484956A (en) 1990-01-22 1996-01-16 Dekalb Genetics Corporation Fertile transgenic Zea mays plant comprising heterologous DNA encoding Bacillus thuringiensis endotoxin
US6946587B1 (en) 1990-01-22 2005-09-20 Dekalb Genetics Corporation Method for preparing fertile transgenic corn plants
US5204253A (en) 1990-05-29 1993-04-20 E. I. Du Pont De Nemours And Company Method and apparatus for introducing biological substances into living cells
US5610044A (en) 1990-10-01 1997-03-11 Lam; Stephen T. Microorganisms with mannopine catabolizing ability
US5427785A (en) 1990-11-21 1995-06-27 Research Seeds, Inc. Rhizosheric bacteria
GB2259302A (en) 1991-09-09 1993-03-10 Anil Kumar Bali Mutant nitrogen fixing bacterium
CA2051071A1 (en) 1991-09-10 1993-03-11 Anil K. Bali Ammonia production
WO1993013216A1 (en) 1991-12-24 1993-07-08 The President And Fellows Of Harvard College Site-directed mutagenesis of dna
US5877012A (en) 1993-03-25 1999-03-02 Novartis Finance Corporation Class of proteins for the control of plant pests
FR2718750B1 (en) 1994-04-19 1996-06-14 Pasteur Institut Recombinant proteins of the filamentous hemagglutinin of Bordetella, in particular, B. Pertussis, production and application to the production of foreign proteins or vaccinating active principles.
US6740506B2 (en) 1995-12-07 2004-05-25 Diversa Corporation End selection in directed evolution
US5789166A (en) 1995-12-08 1998-08-04 Stratagene Circular site-directed mutagenesis
US6083499A (en) 1996-04-19 2000-07-04 Mycogen Corporation Pesticidal toxins
US5916029A (en) 1996-06-26 1999-06-29 Liphatech, Inc. Process for producing seeds coated with a microbial composition
US5780270A (en) 1996-07-17 1998-07-14 Promega Corporation Site-specific mutagenesis and mutant selection utilizing antibiotic-resistant markers encoding gene products having altered substrate specificity
EP0931158A1 (en) 1996-09-06 1999-07-28 The Trustees Of The University Of Pennsylvania An inducible method for production of recombinant adeno-associated viruses utilizing t7 polymerase
US6114148C1 (en) 1996-09-20 2012-05-01 Gen Hospital Corp High level expression of proteins
US6063756A (en) 1996-09-24 2000-05-16 Monsanto Company Bacillus thuringiensis cryET33 and cryET34 compositions and uses therefor
JPH10117776A (en) 1996-10-22 1998-05-12 Japan Tobacco Inc Transformation of indica rice
US6017534A (en) 1996-11-20 2000-01-25 Ecogen, Inc. Hybrid Bacillus thuringiensis δ-endotoxins with novel broad-spectrum insecticidal activity
US6713063B1 (en) 1996-11-20 2004-03-30 Monsanto Technology, Llc Broad-spectrum δ-endotoxins
US5942664A (en) 1996-11-27 1999-08-24 Ecogen, Inc. Bacillus thuringiensis Cry1C compositions toxic to lepidopteran insects and methods for making Cry1C mutants
HUP9701446A1 (en) 1997-08-27 1999-05-28 Phylaxia-Pharma Gyógyszer-, Oltóanyag és Agrobiológiai Készítményeket Gyártó és Forgalmazó Rt. Process for production of soil microorganism population
US6218188B1 (en) 1997-11-12 2001-04-17 Mycogen Corporation Plant-optimized genes encoding pesticidal toxins
US6033861A (en) 1997-11-19 2000-03-07 Incyte Genetics, Inc. Methods for obtaining nucleic acid containing a mutation
AU5209399A (en) 1998-07-10 2000-02-01 Cornell Research Foundation Inc. Recombinant constructs and systems for secretion of proteins via type iii secretion systems
DE69931511T2 (en) 1998-10-23 2006-09-28 Mycogen Corp., San Diego PLANT-OPTIMIZED POLYNUCLEOTIDES CODING FOR 15KDA AND 45KDA PESTICIDE PROTEINS
US6489542B1 (en) 1998-11-04 2002-12-03 Monsanto Technology Llc Methods for transforming plants to express Cry2Ab δ-endotoxins targeted to the plastids
JP2003536048A (en) 1999-03-23 2003-12-02 バイオヴェイション リミテッド Protein isolation and analysis
EP2182072B1 (en) 1999-07-27 2012-12-26 Food Industry Research and Development Institute Method for producing isoprenoids
US6248535B1 (en) 1999-12-20 2001-06-19 University Of Southern California Method for isolation of RNA from formalin-fixed paraffin-embedded tissue specimens
JP3859947B2 (en) 2000-08-04 2006-12-20 独立行政法人理化学研究所 Mutation introduction method
US20020061579A1 (en) * 2000-08-09 2002-05-23 Farrand Stephen K. Counter selection strategy for Gram-negative bacteria
US7879540B1 (en) 2000-08-24 2011-02-01 Promega Corporation Synthetic nucleic acid molecule compositions and methods of preparation
CN1289852A (en) 2000-09-26 2001-04-04 国家人类基因组南方研究中心 Human nif gene homogenous protein and its coding sequence
AR035799A1 (en) 2001-03-30 2004-07-14 Syngenta Participations Ag INSECTICIDE TOXINS ISOLATED FROM BACILLUS THURINGIENSIS AND ITS USES.
JP2003033174A (en) 2001-07-10 2003-02-04 Japan Science & Technology Corp Rhizobium with enhanced nitrogen fixation
GB0121126D0 (en) 2001-08-31 2001-10-24 Univ Nottingham Systemic non-nodular endosymbiotic nitrogen fixation in plants
US7084331B2 (en) 2002-01-15 2006-08-01 Society for Techno-Innovation of Agriculture Forestry and Fisheries Rice containing endophytic bacteria and method of producing it
WO2004003148A2 (en) 2002-06-26 2004-01-08 E.I. Du Pont De Nemours And Company Genes encoding proteins with pesticidal activity
US7462760B2 (en) 2002-06-26 2008-12-09 Pioneer Hi-Bred International, Inc. Genes encoding plant protease-resistant pesticidal proteins and method of their use
CN100494185C (en) 2002-08-28 2009-06-03 旭化成制药株式会社 Novel quaternary ammonium compounds
US20060112447A1 (en) 2002-08-29 2006-05-25 Bogdanova Natalia N Nucleotide sequences encoding cry1bb proteins for enhanced expression in plants
US20050266541A1 (en) 2002-11-04 2005-12-01 Harrison F. Dillon Methods and compositions for evolving microbial hydrogen production
CN1500801A (en) 2002-11-18 2004-06-02 中国农业科学院原子能利用研究所 Gene capable of improving the nitrogen fixing ability of combined azotobacter and uses thereof
CA2514041A1 (en) 2003-01-21 2004-08-12 Dow Agrosciences Llc Mixing and matching tc proteins for pest control
NZ570682A (en) 2003-02-20 2009-08-28 Athenix Corp AXMI-006 a delta-endotoxin gene and methods for its use as a pesticide
US20040216186A1 (en) 2003-02-20 2004-10-28 Athenix Corporation AXMI-006, a delta-endotoxin gene and methods for its use
US20040210965A1 (en) 2003-02-20 2004-10-21 Athenix Corporation AXMI-007, a delta-endotoxin gene and methods for its use
US7351881B2 (en) 2003-02-20 2008-04-01 Athenix Corporation AXMI-008, a delta-endotoxin gene and methods for its use
US7355099B2 (en) 2003-02-20 2008-04-08 Athenix Corporation AXMI-004, a delta-endotoxin gene and methods for its use
US20040197917A1 (en) 2003-02-20 2004-10-07 Athenix Corporation AXMI-014, delta-endotoxin gene and methods for its use
US20040210964A1 (en) 2003-02-20 2004-10-21 Athenix Corporation AXMI-009, a delta-endotoxin gene and methods for its use
CN1254533C (en) 2003-06-02 2006-05-03 中国农业大学 Engineering strain of Brasil diazotrophic spirillum DraT* containing multiple copied nifA gene
HU0301909D0 (en) 2003-06-23 2003-08-28 Someus Edward Process for solid fermentation of microorganisms bound to bone black carrier amid for production, storage and uses of granular compositions
RU2382822C2 (en) 2003-07-07 2010-02-27 Монсанто Текнолоджи, Ллс INSECTICIDAL PROTEINS EXTRACTED FROM Bacillus BACTERIA AND USE THEREOF
US7253343B2 (en) 2003-08-28 2007-08-07 Athenix Corporation AXMI-003, a delta-endotoxin gene and methods for its use
US7205450B2 (en) 2003-10-08 2007-04-17 The Regents Of The University Of California DMI1 gene encodes a protein that is required for the early steps of bacterial and fungal symbioses
US20050183161A1 (en) 2003-10-14 2005-08-18 Athenix Corporation AXMI-010, a delta-endotoxin gene and methods for its use
US7214860B2 (en) 2004-02-20 2007-05-08 Pioneer Hi-Bred International, Inc. Methods of using non-plant encoding nucleic acids
AR048747A1 (en) 2004-03-05 2006-05-24 Agrigenetics Inc COMBINATIONS OF CRY1AB AND CRY1FA AS A TOOL FOR CONTROL OF INSECT RESISTANCE
US20090137390A1 (en) 2004-06-30 2009-05-28 Eric Wendell Triplett Materials and methods for enhancing nitrogen fixation in plants
WO2006003026A1 (en) 2004-07-07 2006-01-12 National University Of Ireland, Galway A biofilm reactor
WO2006005100A1 (en) 2004-07-12 2006-01-19 Zebra Holdings Pty Ltd Method and system for promoting microbial nitrogen fixation activity
JP2008510186A (en) 2004-08-10 2008-04-03 日本板硝子株式会社 LCD mirror system and method
CN1746304A (en) 2004-09-10 2006-03-15 中国农业科学院生物技术研究所 The structure of secreting the ammonium engineering bacteria and the application of the sudden change of fixed nitrogen negative regulator gene
US20060096918A1 (en) 2004-11-09 2006-05-11 Semmens Michael J Biofilm wastewater treatment devices
US7485451B2 (en) 2004-11-18 2009-02-03 Regents Of The University Of California Storage stable compositions of biological materials
NZ560935A (en) 2005-01-31 2009-06-26 Athenix Corp AXMI-018, AXMI-020, and AXMI-021, a family of delta-endotoxin genes and methods for their use as pesticides
JP4677568B2 (en) 2005-03-14 2011-04-27 国立大学法人 鹿児島大学 Production method of plants that grow nodules with high nitrogen fixation activity
US7601498B2 (en) 2005-03-17 2009-10-13 Biotium, Inc. Methods of using dyes in association with nucleic acid staining or detection and associated technology
WO2006107761A2 (en) 2005-04-01 2006-10-12 Athenix Corporation Axmi-027, axmi-036 and axmi-038, a family of delta-endotoxin genes and methods for their use
US7622572B2 (en) 2005-05-02 2009-11-24 Athenix Corporation AXMI-028 and AXMI-029, a family of novel delta-endotoxin genes and methods for their use
BRPI0615649A2 (en) 2005-08-31 2011-05-24 Monsanto Technology Llc A method of increasing the accumulation of an insecticide protein in a host cell, to produce an insect pest-resistant plant cell, to control infestation, and to protect a crop, insecticide composition, commodity product, transgenic plant or plant cell, sequence of nucleotide, insecticide protein, plant progeny or seed, vector, host cell and expression cassette
US8389250B2 (en) 2006-01-04 2013-03-05 Metabolic Explorer Methods for producing methionine by culturing a microorganism modified to enhance production of cysteine
EP2001821B1 (en) 2006-03-22 2016-09-07 Adjuvants Plus Inc. The production and use of endophytes as novel inoculants for promoting enhanced plant vigor, health, growth, yield reducing environmental stress and for reducing dependency on chemical pesticides for pest control
US7329736B2 (en) 2006-04-14 2008-02-12 Pioneer Hi-Bred International, Inc. Bacillus thuringiensis cry gene and protein
US7449552B2 (en) 2006-04-14 2008-11-11 Pioneer Hi-Bred International, Inc. Bacillus thuringiensis cry gene and protein
US7888552B2 (en) 2006-05-16 2011-02-15 Monsanto Technology Llc Use of non-agrobacterium bacterial species for plant transformation
NZ594744A (en) 2006-06-14 2013-03-28 Athenix Corp Axmi-031, axmi-039, axmi-040 and axmi-049, a family of delta-endotoxin genes and methods for their use
AR061491A1 (en) 2006-06-15 2008-09-03 Athenix Corp A FAMILY OF PESTICIDE PROTEINS AND METHODS OF THE SAME USE
CA2654656A1 (en) 2006-06-30 2008-01-03 Biogasol Ipr Aps Production of fermentation products in biofilm reactors using microorganisms immobilised on sterilised granular sludge
US8268584B1 (en) 2006-12-01 2012-09-18 University Of Washington Hydrogen production from microbial strains
EA200970559A1 (en) 2006-12-08 2009-12-30 Пайонир Хай-Бред Интернэшнл, Инк. NEW CRYSTAL POLYPEPTIDES FROM BACILLUS THURINGIENSIS ENCODING THEIR POLYNUCLEOTIDES AND COMPOSITIONS OF THESE COMPOUNDS
US8076142B2 (en) 2006-12-21 2011-12-13 Basf Plant Sciences Gmbh Rooted plant assay system
FR2910230B3 (en) 2006-12-22 2009-01-23 Pierre Philippe Claude METHODS OF BIOFERTILIZATION FOR IMPROVING THE STABILITY OF YIELDS OF LARGE AGRONOMIC CROPS.
EP2137211B1 (en) 2007-03-28 2016-08-24 Syngenta Participations AG Insecticidal proteins
US20100267147A1 (en) 2007-04-25 2010-10-21 GM Biosciences, Inc. Site-directed mutagenesis in circular methylated dna
US8609936B2 (en) 2007-04-27 2013-12-17 Monsanto Technology Llc Hemipteran-and coleopteran active toxin proteins from Bacillus thuringiensis
WO2009017124A1 (en) 2007-07-31 2009-02-05 Nihon University Method for production of biofilm
EP2020437A1 (en) 2007-08-03 2009-02-04 Commissariat A L'energie Atomique (Nife)-hydrogenases having an improved resistance to dioxygen, process for obtaining them and their applications
AU2008312468B2 (en) 2007-10-16 2014-07-31 BASF Agricultural Solutions Seed US LLC AXMI-066 and AXMI-076: delta-endotoxin proteins and methods for their use
WO2009060012A2 (en) 2007-11-06 2009-05-14 Basf Se Plant health compositions comprising a beneficial microorganism and a pesticide
US20090162477A1 (en) 2007-12-21 2009-06-25 Daniel Nadel High yield maize derivatives
BRPI0906761A2 (en) 2008-01-15 2015-07-14 Univ Michigan State Polymicrobial Formulations to Increase Plant Productivity
US8518685B2 (en) 2008-03-24 2013-08-27 Tsinghua University Engineered nitrile hydratase-producing bacterium with amidase gene knocked-out, the construction and the use thereof
JP2009232721A (en) 2008-03-26 2009-10-15 Univ Of Miyazaki Plant cultivation method using enterobacter bacterium
US8401798B2 (en) 2008-06-06 2013-03-19 Dna Twopointo, Inc. Systems and methods for constructing frequency lookup tables for expression systems
ES2911327T3 (en) 2008-06-25 2022-05-18 BASF Agricultural Solutions Seed US LLC Toxin genes and procedures for their use
CA3183317A1 (en) 2008-07-02 2010-01-07 BASF Agricultural Solutions Seed US LLC Axmi-115, axmi-113, axmi-005, axmi-163 and axmi-184: insecticidal proteins and methods for their use
EA019574B1 (en) 2008-12-23 2014-04-30 Атеникс Корпорейшн Axmi-150 delta-endotoxin gene and methods for its use
US8709781B2 (en) 2009-01-09 2014-04-29 Syracuse University System and method for the heterologous expression of polyketide synthase gene clusters
CA2748506A1 (en) 2009-01-23 2010-07-29 Pioneer Hi-Bred International, Inc. Novel bacillus thuringiensis gene with lepidopteran activity
CN102369286B (en) 2009-02-05 2014-12-10 阿森尼克斯公司 Variant AXMI-R1 delta-endotoxin genes and methods for their use
WO2010141141A2 (en) 2009-03-11 2010-12-09 Athenix Corporation Axmi-001, axmi-002, axmi-030, axmi-035, and axmi-045: toxin genes and methods for their use
US8728781B2 (en) 2009-03-13 2014-05-20 University Of Washington Through Its Center Of Commercialization Endophytic yeast strains, methods for ethanol and xylitol production, methods for biological nitrogen fixation, and a genetic source for improvement of industrial strains
US20120015806A1 (en) 2009-03-25 2012-01-19 Sitaram Prasad Paikray Novel formulation of microbial consortium based bioinoculant for wide spread use in agriculture practices
JP5873005B2 (en) 2009-04-17 2016-03-01 ダウ アグロサイエンシィズ エルエルシー DIG-3 insecticidal CRY toxin
US8481026B1 (en) 2009-04-17 2013-07-09 Peter J. Woodruff Bacteria with increased trehalose production and method for using the same in bioremediation
US8334366B1 (en) 2009-04-29 2012-12-18 The United States Of America, As Represented By The Secretary Of Agriculture Mutant lycotoxin-1 peptide sequences for insecticidal and cell membrane altering properties
WO2010147880A2 (en) 2009-06-16 2010-12-23 Dow Agrosciences Llc Dig-11 insecticidal cry toxins
UA105046C2 (en) 2009-07-02 2014-04-10 Атенікс Корп. Pesticide gene axmi-205 and its using for plants protection againts insect pests
WO2011029002A2 (en) 2009-09-03 2011-03-10 Advanced Biological Marketing Herbicide-resistant inoculant strains
EP2475765A4 (en) 2009-09-11 2013-07-31 Valent Biosciences Corp Novel bacillus thuringiensis isolate
CN102041241A (en) 2009-10-20 2011-05-04 中国农业科学院生物技术研究所 High-efficiency ammonium-excreting combined azotobacter strain
UA111814C2 (en) 2009-12-16 2016-06-24 ДАУ АГРОСАЙЄНСІЗ ЕлЕлСі TRANSGENIC PLANT CONTAINING Cry1Ab AND Cry2Aa FOR REGULATION OF EUROPEAN MASTER METALLIC METHOD AND METHODS FOR COMBATING ANTI-RESISTANCE
JP2013514769A (en) 2009-12-16 2013-05-02 ダウ アグロサイエンシィズ エルエルシー Combination of CRY1Ca and CRY1Fa proteins for pest resistance management
KR101841298B1 (en) 2009-12-16 2018-03-22 다우 아그로사이언시즈 엘엘씨 Combined use of cry1da and cry1fa proteins for insect resistance management
KR101841296B1 (en) 2009-12-16 2018-03-22 다우 아그로사이언시즈 엘엘씨 Use of cry1da in combination with cry1be for management of resistant insects
WO2011084631A1 (en) 2009-12-16 2011-07-14 Dow Agrosciences Llc Use of cry1ab in combination with cry1be for management of resistant insects
CA2782549A1 (en) 2009-12-16 2011-07-14 Dow Agrosciences Llc Combined use of cry1ca and cry1ab proteins for insect resistance management
RU2607666C2 (en) 2009-12-16 2017-01-10 ДАУ АГРОСАЙЕНСИЗ ЭлЭлСи COMBINED APPLICATION OF PROTEINS Vip3Ab AND Cry1Fa TO GENERATE INSECT RESISTANCE
WO2011099024A1 (en) 2010-02-09 2011-08-18 Patel, Babubhai C. Preparation of novel bacterial based product for providing nuitrition essential for promoting plant growth
WO2011099019A1 (en) 2010-02-09 2011-08-18 Patel, Babubhai, C. Composition and method of preparation of bacterial based product that fix atmospheric nitrogen from air and makes available to plant
EP2536266A2 (en) 2010-02-18 2012-12-26 Athenix Corp. Axmi218, axmi219, axmi220, axmi226, axmi227, axmi228, axmi229, axmi230, and axmi231 delta-endotoxin genes and methods for their use
EP2937419B1 (en) 2010-02-18 2017-08-16 Athenix Corp. Axmi221z, axmi222z, axmi223z, axmi224z, and axmi225z delta-endotoxin genes and methods for their use
MA34242B1 (en) 2010-04-23 2013-05-02 Dow Agrosciences Llc COMBINATIONS INCLUDING PROTEINS CRY34AB / 35AB AND CRY3AA FOR PREVENTING THE DEVELOPMENT OF RESISTANCE IN CORN ROOT CHRYSOMELLES (DIABROTICA SPP.)
CN101880676A (en) 2010-05-20 2010-11-10 黑龙江大学 Construction method of soybean rhizobia genetically engineered strain HD-SFH-01 into nifA gene
US9228240B2 (en) 2010-06-03 2016-01-05 California Institute Of Technology Methods for detecting and quantifying viable bacterial endo-spores
WO2011154960A1 (en) 2010-06-09 2011-12-15 Patel, Babubhai C. Advance material and method of preparation of bacterial formulation using nitrogen fixing bacteria that fix atmoshpheric nitrogen and make available to crop plant
UA111592C2 (en) 2010-07-07 2016-05-25 Сінгента Партісіпейшнс Аг METHOD OF CONTROL ABOUT SOLID WIND PESTS
CN103201388A (en) 2010-08-19 2013-07-10 先锋国际良种公司 Novel bacillus thuringiensis gene with lepidopteran activity against insect pests
US8802934B2 (en) 2010-08-19 2014-08-12 Pioneer Hi Bred International Inc Bacillus thuringiensis gene with lepidopteran activity
CA2822884A1 (en) 2010-12-23 2012-06-28 The Ohio State University Fertilizer composition and method
AU2012214420B2 (en) 2011-02-11 2017-03-02 Monsanto Technology Llc Pesticidal nucleic acids and proteins and uses thereof
CN102690808B (en) 2011-03-23 2017-04-19 北京大学 Construction of prokaryotic gene expression island for purpose of eukaryotic expression
EP2690959B1 (en) 2011-03-31 2016-03-23 Novozymes Biologicals, Inc. Competitive and effective bradyrhizobium japonicum strains
MX346662B (en) 2011-04-07 2017-03-27 Monsanto Technology Llc Insect inhibitory toxin family active against hemipteran and/or lepidopteran insects.
US8513494B2 (en) 2011-04-08 2013-08-20 Chunren Wu Plants and seeds of spring canola variety SCV695971
WO2012142116A2 (en) 2011-04-11 2012-10-18 Targeted Growth, Inc. Identification and use of krp mutants in wheat
US9392790B2 (en) 2011-05-06 2016-07-19 The Research Foundation For The State University Of New York Molecular roadblocks for RpoN binding sites
WO2012162533A2 (en) 2011-05-25 2012-11-29 Sam Houston State University Bioremediation reactor systems
EP2530159A1 (en) 2011-06-03 2012-12-05 Sandoz Ag Transcription terminator sequences
US20130005590A1 (en) 2011-06-06 2013-01-03 The Regents Of The University Of California Synthetic biology tools
HUE046506T2 (en) 2011-06-16 2020-03-30 Univ California Synthetic gene clusters
RU2014107672A (en) 2011-07-28 2015-09-10 Атеникс Корп. AXMI270 GENE AND WAYS OF ITS APPLICATION
UA122657C2 (en) 2011-07-29 2020-12-28 Атенікс Корп. Axmi279 pesticidal gene and methods for its use
CN102417882A (en) 2011-10-25 2012-04-18 中国农业科学院生物技术研究所 Expression method of pseudomonas stutzeri A1501 rpoN gene
CA2854362C (en) 2011-11-04 2018-10-16 International Marketing Partnerships Pty Ltd Microbial inoculants and fertilizer compositions comprising the same
AR083981A1 (en) 2011-11-24 2013-04-10 Consejo Nac Invest Cient Tec NITROGEN FIXING RECOMBINANT BACTERIA CEPA, INOCULATE THAT CONTAINS IT AND APPLICATION METHODS
US9321697B2 (en) 2012-03-03 2016-04-26 Department of Biotechnology Ministry of Science & Technology+Jawaharlal Nehru University Recombinant nitrogen fixing microorganism and uses thereof
WO2013141815A1 (en) 2012-03-21 2013-09-26 Temasek Life Sciences Laboratory Limited Nitrogen-fixing bacterial inoculant for improvement of crop productivity and reduction of nitrous oxide emission
WO2013178663A1 (en) 2012-05-30 2013-12-05 Bayer Cropscience Ag Compositions comprising a biological control agent and an insecticide
CN104470359B (en) 2012-05-30 2017-05-24 拜尔农作物科学股份公司 Compositiions comprising a biological control agent and an insecticide
WO2014042517A2 (en) 2012-09-14 2014-03-20 Universiti Putra Malaysia Biofertilizer
KR20150054944A (en) 2012-09-19 2015-05-20 바이오디스커버리 뉴질랜드 리미티드 Methods of screening for microorganisms that impart beneficial properties to plants
US10968446B2 (en) 2012-11-01 2021-04-06 Massachusetts Institute Of Technology Directed evolution of synthetic gene cluster
JP6024963B2 (en) 2012-11-13 2016-11-16 国立大学法人東京農工大学 Novel Bacillus genus nitrogen-fixing bacteria, plant growth promoter, and plant cultivation method
EP4234696A3 (en) 2012-12-12 2023-09-06 The Broad Institute Inc. Crispr-cas component systems, methods and compositions for sequence manipulation
RU2723946C2 (en) 2013-02-05 2020-06-18 Юниверсити Оф Сэскэтчевэн Endophyte microbial symbions in prenatal plant care
US9234213B2 (en) 2013-03-15 2016-01-12 System Biosciences, Llc Compositions and methods directed to CRISPR/Cas genomic engineering systems
EP2975942B1 (en) 2013-03-21 2018-08-08 Sangamo Therapeutics, Inc. Targeted disruption of t cell receptor genes using engineered zinc finger protein nucleases
WO2014201044A2 (en) 2013-06-10 2014-12-18 The Regents Of The University Of California Plant growth-promoting microorganisms and methods of use thereof
CN103451130B (en) 2013-07-25 2014-11-26 中国农业大学 Nitrogen fixing gene cluster and application thereof
JP6267073B2 (en) 2013-07-26 2018-01-24 株式会社前川製作所 New agricultural use of Escherichia bacteria
JP6241916B2 (en) 2013-08-18 2017-12-06 国立大学法人島根大学 Sweet potato cultivation method
WO2015061764A1 (en) 2013-10-25 2015-04-30 Asilomar Bio, Inc. Strigolactone formulations and uses thereof
EP3834616A1 (en) 2013-12-04 2021-06-16 Newleaf Symbiotics, Inc. Methods and compositions for improving corn yield
JP6296776B2 (en) 2013-12-16 2018-03-20 京都府 Biofertilizer manufacturing method
CA2883596A1 (en) 2014-02-26 2015-08-26 Bioponix Technologies Inc. Continuous bioprocess for organic greenhouse agriculture
IL300797A (en) 2014-05-23 2023-04-01 Bioconsortia Inc Integrated plant breeding methods for complementary pairings of plants and microbial consortia
WO2015184016A2 (en) 2014-05-27 2015-12-03 The Broad Institute, Inc. High-thoughput assembly of genetic elements
CA2950821C (en) 2014-06-10 2024-02-06 Dermtreat Aps Compositions comprising electrohydrodynamically obtained fibres for administration of specific dosages of an active substance to skin or mucosa
GB201413335D0 (en) 2014-07-28 2014-09-10 Azotic Technologies Ltd Agricultural methods
GB201413333D0 (en) 2014-07-28 2014-09-10 Azotic Technologies Ltd Plant inoculation
WO2016100727A1 (en) 2014-12-18 2016-06-23 The Regents Of The University Of California Recombinantly engineered diazotrophs for whole cell hydrocarbon production and methods for making and using them
MX369475B (en) 2015-01-15 2019-11-08 Pioneer Hi Bred Int INSECTICIDAL PROTEINS and METHODS FOR THEIR USE.
EP3261445B1 (en) 2015-02-09 2023-04-05 BioConsortia, Inc. Agriculturally beneficial microbes, microbial compositions, and consortia
US9796957B2 (en) 2015-03-11 2017-10-24 Regents Of The University Of Minnesota Genetically modified diazotrophs and methods of using same
FR3033790B1 (en) 2015-03-19 2018-05-04 Universite Claude Bernard Lyon I USE OF PROANTHOCYANIDINES TO LIMIT DENITRIFICATION
RU2017141632A (en) 2015-05-01 2019-06-03 Индиго Агрикултуре, Инк. ISOLATED COMPLEX ENDOPHITIC COMPOSITIONS AND METHODS OF IMPROVING PLANT SIGNS
NL2014777B1 (en) 2015-05-07 2017-01-26 Ibema Biezenmortel B V Nitrifying micro-organisms for fertilisation.
BR112017024265B1 (en) 2015-05-11 2022-08-09 Mybiotics Pharma Ltd METHODS FOR GROWING A BIOFILM OF PROBIOTIC BACTERIA IN SOLID PARTICLES
JP6887390B2 (en) 2015-06-05 2021-06-16 サステナブル オーガニック ソリューションズ ピーティーワイ リミテッドSustainable Organic Solutions Pty Ltd Methods, coatings and bacterial SOS3 strains for improving plant growth
KR102461443B1 (en) 2015-07-13 2022-10-31 피벗 바이오, 인크. Methods and compositions for improving plant traits
WO2017042833A1 (en) 2015-09-11 2017-03-16 Zydex Industries Pvt. Ltd. Bio-fertilizer composition
JP2018537119A (en) 2015-10-05 2018-12-20 マサチューセッツ インスティテュート オブ テクノロジー Nitrogen fixation using refactored nif clusters
TR201513059A1 (en) 2015-10-20 2019-01-21 Ibrahim Isildak A BIOGRAPHY FORMULATION
KR102689434B1 (en) 2015-11-19 2024-07-29 우니페르시테트 바젤 Bacteria-based protein delivery
AU2016378742A1 (en) 2015-12-21 2018-07-12 Indigo Ag, Inc. Endophyte compositions and methods for improvement of plant traits in plants of agronomic importance
ITUA20163807A1 (en) 2016-05-25 2017-11-25 Univ Degli Studi Di Foggia METHOD FOR THE PRODUCTION OF PROBIOTIC MICROBIAL BIOFILMS AND RELATED USES
WO2018081543A1 (en) 2016-10-28 2018-05-03 Bayer Cropscience Lp Mutants of bacillus and methods for their use
JP7234116B2 (en) 2017-01-12 2023-03-07 ピボット バイオ, インコーポレイテッド Methods and compositions for improving plant traits
CN106658654A (en) 2017-01-22 2017-05-10 北京佰才邦技术有限公司 Soft SIM control method and user terminal
EP3665141A4 (en) 2017-08-09 2021-08-25 Pivot Bio, Inc. Methods and compositions for improving engineered microbes
US10525318B2 (en) 2017-09-12 2020-01-07 Edmond J. Dougherty Timing display device
AU2018354338B2 (en) 2017-10-25 2023-10-26 Pivot Bio, Inc. Gene targets for nitrogen fixation targeting for improving plant traits
KR20200088342A (en) 2017-10-25 2020-07-22 피벗 바이오, 인크. Methods and compositions for improving genetically engineered microorganisms that fix nitrogen
AU2019206571A1 (en) 2018-01-10 2020-07-30 Bayer Cropscience Lp Improved microbes and methods for producing the same
MX2020013875A (en) 2018-06-27 2021-08-11 Pivot Bio Inc Agricultural compositions comprising remodeled nitrogen fixing microbes.
MX2020014295A (en) 2018-06-27 2021-05-27 Pivot Bio Inc Guided microbial remodeling, a platform for the rational improvement of microbial species for agriculture.
WO2020014498A1 (en) 2018-07-11 2020-01-16 Pivot Bio, Inc. Temporally and spatially targeted dynamic nitrogen delivery by remodeled microbes
WO2020023630A1 (en) 2018-07-25 2020-01-30 Convergent Genomics, Inc. Urinary microbiomic profiling
AU2019345144A1 (en) 2018-09-21 2021-04-15 Pivot Bio, Inc. Methods and compositions for improving phosphate solubilization
US20210315212A1 (en) 2018-11-01 2021-10-14 Pivot Bio, Inc. Biofilm compositions with improved stability for nitrogen fixing microbial products
EP3891112A4 (en) 2018-12-07 2022-11-09 Pivot Bio, Inc. Polymer compositions with improved stability for nitrogen fixing microbial products
WO2020132632A2 (en) 2018-12-21 2020-06-25 Pivot Bio, Inc. Methods, compositions, and media for improving plant traits
MX2021007777A (en) 2019-01-07 2021-10-13 Pivot Bio Inc PLANT COLONIZATION ASSAYS THROUGH THE USE OF NATURAL MICROBIAL BAR CODES.
CN113905998A (en) 2019-02-05 2022-01-07 皮沃特生物股份有限公司 Crop yield consistency enhancement by biological nitrogen fixation
WO2020190363A1 (en) 2019-03-19 2020-09-24 Massachusetts Institute Of Technology Control of nitrogen fixation in rhizobia that associate with cereals
AU2020261427A1 (en) 2019-04-24 2021-11-11 Pivot Bio, Inc. Gene targets for nitrogen fixation targeting for improving plant traits
EP3959302A4 (en) 2019-04-25 2023-06-21 Pivot Bio, Inc. HIGH-THROUGHPUT METHODS FOR THE ISOLATION AND CHARACTERIZATION OF LIBRARIES OF AMMONIUM EXCRETING MUTANTS GENERATED BY CHEMICAL MUTAGENESIS
WO2021113352A1 (en) 2019-12-04 2021-06-10 Pivot Bio, Inc. System to deliver a solution with a biological product in a planter assembly
WO2021146209A1 (en) 2020-01-13 2021-07-22 Pivot Bio, Inc. Consortia of microorganisms for spatial and temporal delivery of nitrogen
CN115011534B (en) * 2022-03-23 2023-11-03 山东农业大学 A mutant strain, construction method and application of stem nodule nitrogen-fixing rhizobium ORS571

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6548289B1 (en) * 1984-12-28 2003-04-15 Land O'lakes, Inc. Biological nitrogen fixation

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
A. Willems. The taxonomy of rhizobia: an overview. Plant and Soil (2006), 287:3, 3-14. (Year: 2006) *
Cannon et al. Recombinant plasmid that carries part of the nitrogen fixation (nif) gene cluster of Klebsiella pneumoniae. Proc. Natl. Acad. Sci. USA (1977), 74(7), 2963-2967. (Year: 1977) *
M. Venkateshwaran. Exploring the Feasibility of Transferring Nitrogen Fixation to Cereal Crops. Chapter 42 in B. Lugtenberg (ed.), Principles of Plant-Microbe Interactions (2015), p403-410. (Year: 2015) *
Nelson et al. The complete replicons of 16 Ensifer meliloti strains offer insights into intra- and inter-replicon gene transfer, transposon-associated loci, and repeat elements. Nelson et al. (Microbial Genomics (Apr. 2018), 4; 11 pages. (Year: 2018) *
Temme et al. Refactoring the nitrogen fixation gene cluster from Klebsiella oxytoca. PNAS (2012), 109(18), 7085-7090; Reference V). (Year: 2012) *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11963530B2 (en) 2018-06-27 2024-04-23 Pivot Bio, Inc. Agricultural compositions comprising remodeled nitrogen fixing microbes
US12268212B2 (en) 2018-06-27 2025-04-08 Pivot Bio, Inc. Agricultural compositions comprising remodeled nitrogen fixing microbes
US12290074B2 (en) 2018-06-27 2025-05-06 Pivot Bio, Inc. Agricultural compositions comprising remodeled nitrogen fixing microbes
US12471599B2 (en) 2018-06-27 2025-11-18 Pivot Bio, Inc. Agricultural compositions comprising remodeled nitrogen fixing microbes
US12391624B2 (en) 2018-07-11 2025-08-19 Pivot Bio, Inc. Temporally and spatially targeted dynamic nitrogen delivery by remodeled microbes
US12421519B2 (en) 2019-01-07 2025-09-23 Pivot Bio, Inc. Plant colonization assays using natural microbial barcodes
US12478068B2 (en) 2020-05-01 2025-11-25 Pivot Bio, Inc. Stable liquid formulations for nitrogen-fixing microorganisms

Also Published As

Publication number Publication date
US20250257316A1 (en) 2025-08-14
WO2020191201A1 (en) 2020-09-24
EP3941930A1 (en) 2022-01-26
WO2020191201A9 (en) 2020-10-22
US20200299637A1 (en) 2020-09-24
WO2020190363A1 (en) 2020-09-24
CN113710690A (en) 2021-11-26
US12281299B2 (en) 2025-04-22

Similar Documents

Publication Publication Date Title
US20250257316A1 (en) Control of nitrogen fixation in rhizobia that associate with cereals
Ryu et al. Control of nitrogen fixation in bacteria that associate with cereals
US20220411344A1 (en) Nitrogen fixation using refactored nif clusters
Contesto et al. Effects of rhizobacterial ACC deaminase activity on Arabidopsis indicate that ethylene mediates local root responses to plant growth-promoting rhizobacteria
US10676406B2 (en) Hopanoids producing bacteria and related biofertilizers, compositions, methods and systems
Tittabutr et al. The cloned 1-aminocyclopropane-1-carboxylate (ACC) deaminase gene from Sinorhizobium sp. strain BL3 in Rhizobium sp. strain TAL1145 promotes nodulation and growth of Leucaena leucocephala
Lin et al. Functional exploration of the bacterial type VI secretion system in mutualism: Azorhizobium caulinodans ORS571–Sesbania rostrata as a research model
Yurgel et al. Sinorhizobium meliloti flavin secretion and bacteria-host interaction: role of the bifunctional RibBA protein
US20220127624A1 (en) Inducible Ammonia Production from a Symbiotic Diazotroph, Methods of Creation and Uses Thereof
Perrine-Walker et al. Rhizobium-initiated rice growth inhibition caused by nitric oxide accumulation
Butcher et al. Disruption of the carA gene in Pseudomonas syringae results in reduced fitness and alters motility
Gamez-Reyes et al. The Rhizobium leucaenae CFN 299 pSym plasmid contains genes expressed in free life and symbiosis, as well as two replication systems
Sofi et al. Prospects of nitrogen fixation in rice
US20180179548A1 (en) Molecular biology tools for algal engineering
Wang et al. Targeted Genome Editing of the ACC Deaminase Gene in Bradyrhizobium: Toward Enhanced Plant Growth and Stress Tolerance
Armijo et al. Arabidopsis thaliana interaction with Ensifer meliloti can support plant growth under N-deficiency
Victoria et al. Engineering the highly productive cyanobacterium Synechococcus sp. PCC 11901
US20250324977A1 (en) Engineered bacteria for enhanced crop production
US20250243130A1 (en) Phosphate sensing microbial gene switch
Fink Establishment of tools for genetic modification of the thermophilic methanogenic archaeon Methanothermobacter thermautotrophicus deltaH
Kulakowski et al. Development of Modular Expression and Genome Editing Across Phylogenetically Distinct Diazotrophs
Breitstein Investigation of the promoter region of mdh-sucCDAB operon in Sinorhizobium meliloti
Ghosh et al. Sinorhizobium medicae WSM419 Genes That Improve Symbiosis between Sinorhizobium meliloti Medicago Rm1021 and truncatula Jemalong A17 and in Other Symbiosis Systems
Adolphsen Rhizobial Genes that Contribute to Effective Symbiosis with Legumes
Torrescassana et al. Genomic and functional analyses reveal Pseudomonas granadensis CT364 is a plant growth-promoting endophyte

Legal Events

Date Code Title Description
STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

AS Assignment

Owner name: MASSACHUSETTS INSTITUTE OF TECHNOLOGY, MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:VOIGT, CHRISTOPHER A.;RYU, MIN-HYUNG;SIGNING DATES FROM 20200210 TO 20200212;REEL/FRAME:059042/0824

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

AS Assignment

Owner name: NATIONAL SCIENCE FOUNDATION, VIRGINIA

Free format text: CONFIRMATORY LICENSE;ASSIGNOR:MASSACHUSETTS INSTITUTE OF TECHNOLOGY;REEL/FRAME:070794/0219

Effective date: 20211006

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION