WO2023081764A1 - Hexadepsipeptide compounds and methods of using the same - Google Patents
Hexadepsipeptide compounds and methods of using the same
- Publication number
- WO2023081764A1 (PCT/US2022/079230)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- formula
- sequence
- compound
- polynucleotide
- gene cluster
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K7/00—Peptides having 5 to 20 amino acids in a fully defined sequence; Derivatives thereof
- C07K7/04—Linear peptides containing only normal peptide links
- C07K7/06—Linear peptides containing only normal peptide links having 5 to 11 amino acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Definitions
- the present disclosure provides, inter alia, a compound of Formula (I), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof: wherein in Formula (I),
- R is selected from hydrogen and -OH.
- the present disclosure provides, inter alia, a compound of Formula (10), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
- the present disclosure provides, inter alia, a compound of Formula (11), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
- the compound of Formula (I) is substantially pure. In some embodiments, the compound of Formula (I) is enantiomerically pure.
- the compound of Formula (10) is substantially pure. In some embodiments, the compound of Formula (10) is enantiomerically pure.
- the compound of Formula (11) is substantially pure. In some embodiments, the compound of Formula (11) is enantiomerically pure.
- the disclosure provides for pharmaceutical compositions comprising a therapeutically effective amount of the compound of Formula (I) and one or more pharmaceutically acceptable excipients.
- the pharmaceutical composition comprises a pharmaceutical carrier.
- the disclosure provides for pharmaceutical compositions comprising a therapeutically effective amount of the compound of Formula (10) and one or more pharmaceutically acceptable excipients.
- the pharmaceutical composition comprises a pharmaceutical carrier.
- the disclosure provides for pharmaceutical compositions comprising a therapeutically effective amount of the compound of Formula (11) and one or more pharmaceutically acceptable excipients.
- the pharmaceutical composition comprises a pharmaceutical carrier.
- the compound of Formula (I), Formula (10), and/or Formula (11) is produced by a host cell comprising a heterologous biosynthetic gene cluster comprising at least six nonribosomal peptide synthetase (NRPS) modules and at least four polyketide synthase (PKS) modules, a set of modifying enzymes, precursor biosynthesis enzymes, transporters, and one or more transcriptional regulators.
- the biosynthetic gene cluster is isolated or derived from Streptomyces sp. In some embodiments, the biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL-6131.
- the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1.
- the biosynthetic gene cluster comprises one or more modifications of SEQ ID NO: 1 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the modification comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1.
- the modification comprises insertion of at least one promoter sequence.
- the promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
- the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the biosynthetic gene cluster comprises SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the modification increases synthesis of the compound of any one of Formula (I), Formula (10), and Formula (11) compared to an otherwise equivalent host cell comprising an unmodified biosynthetic gene cluster.
- the host cell is a Streptomyces cell. In some embodiments, the host cell is a Streptomyces albus cell.
- the host cell further comprises a sequence encoding a LmBU operably linked to a constitutive promoter.
- the present disclosure provides a polynucleotide comprising a biosynthetic gene cluster, wherein the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of the compound of Formula (I), Formula (10), and/or Formula (11) when the biosynthetic gene cluster is expressed by a host cell.
- the one or more genes comprise six nonribosomal peptide synthetase (NRPS) modules.
- the six NRPS modules are encoded by sequences comprising a first NRPS open reading frame of SEQ ID NO:2, a second NRPS open reading frame of SEQ ID NO: 3, a third NRPS open reading frame of SEQ ID NO: 4 and a fourth NRPS open reading frame of SEQ ID NO: 5, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the one or more genes comprise four polyketide synthase (PKS) modules.
- the four PKS modules are encoded by polynucleotide sequences comprising a first PKS open reading frame of SEQ ID NO: 6 and a second PKS open reading frame of SEQ ID NO: 7, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the biosynthetic gene cluster comprises a LmBU-encoding gene.
- the LmBU-encoding gene comprises a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the biosynthetic gene cluster comprises a polynucleotide sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the host cell is engineered to express the one or more genes in the biosynthetic cluster, which results in the production of the compound of any one of Formula (I), Formula (10), and Formula (11).
- overexpression of one or more genes in the biosynthetic cluster by the host cell increases the production of the compound of any one of Formula (I), Formula (10), and Formula (11) compared to an otherwise equivalent host cell comprising a biosynthetic gene cluster that does not overexpress one or more genes in the biosynthetic cluster.
- the LmBU is overexpressed.
- overexpression of the LmBU occurs in cis or in trans.
- trans overexpression of LmBU comprises expressing a sequence encoding the LmBU open reading frame under the control of a constitutive ermE promoter, a kasO promoter, or a functional variant or derivative thereof.
- the ermE promoter comprises a sequence of SEQ ID NO: 9.
- the kasO promoter comprises a sequence of SEQ ID NO: 10.
- the biosynthetic gene cluster comprises one or more sequence modifications relative to a biosynthetic gene cluster of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the one or more modifications of the biosynthetic gene cluster comprise a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
- the one or more modifications comprise modifications of a promoter of a gene in the biosynthetic gene cluster.
- the one or more modifications comprise insertion of at least one heterologous promoter in the biosynthetic gene cluster.
- the at least one heterologous promoter is a strong promoter.
- the at least one heterologous promoter is selected from the group consisting of ermE and kasO, or functional variants or derivatives thereof.
- the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- inserting the at least one heterologous promoter into the biosynthetic gene cluster comprises a nucleic acid guided endonuclease.
- the nucleic acid guided endonuclease is in a complex with at least one guide nucleic acid (gNA).
- the nucleic acid guided endonuclease is a CRISPR/Cas endonuclease.
- the CRISPR/Cas endonuclease is Cas9.
- inserting the at least one heterologous promoter into the biosynthetic gene cluster further comprises a donor template comprising a sequence of the heterologous promoter.
- the biosynthetic gene cluster comprises an mbtH gene upstream of the four NRPS open reading frames, and wherein the at least one heterologous promoter is inserted upstream of the mbtH gene.
- the at least one heterologous promoter is one or more of an ermE promoter and kasO promoter.
- the biosynthetic gene cluster comprises a polynucleotide sequence of SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the at least one modification of the biosynthetic gene cluster comprises a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
- At least one modification of the biosynthetic gene cluster comprises replacement of at least one promoter in comparison to the biosynthetic gene cluster of SEQ ID NO: 1.
- the biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
- the biosynthetic gene cluster produces the compound of any one of Formula (I), Formula (10), and Formula (11) in the host cell.
- the present disclosure provides a vector comprising the polynucleotide as described herein.
- the vector is a bacterial artificial chromosomal vector.
- the vector further comprises at least one promoter.
- the vector is suitable for expression in a Streptomyces species cell.
- the present disclosure provides a host cell comprising the polynucleotide as described herein or the vector as described herein.
- the present disclosure provides a host cell, comprising the polynucleotide as described herein.
- the host cell further comprises a polynucleotide comprising a sequence encoding a LmBU operably linked to a constitutive promoter.
- the constitutive promoter is one or more of an ermE promoter and a kasO promoter.
- the LmBU is encoded by a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the host cell is a Streptomyces cell.
- the Streptomyces cell is a Streptomyces lividans or Streptomyces albus cell.
- the present disclosure provides a method of making a polynucleotide comprising a modified biosynthetic gene cluster comprising: a. providing a first E. coli host cell comprising a first vector comprising a sequence of an unmodified biosynthetic gene cluster comprising a target sequence; b. introducing the first vector into a Streptomyces host cell by conjugation; c. providing a second E. coli host cell comprising a second vector comprising: i. a sequence of at least one gNA specific to the target sequence operably linked to a promoter, ii. a sequence encoding a Cas endonuclease; and iii.
- introducing the second vector into a Streptomyces host cell by conjugation whereby introducing the second vector into the Streptomyces host cell produces a double strand break in the target sequence and introduction of a donor template sequence, thereby generating a Streptomyces host cell comprising a modified biosynthetic gene cluster.
- the biosynthetic gene cluster is an unmodified biosynthetic gene cluster.
- the unmodified biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1.
- the polynucleotide sequence of the modified biosynthetic gene cluster comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the Cas endonuclease is selected from Cas9 (also known as Csn1 and Csx12), Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, variants thereof, mutants thereof, and derivatives thereof.
- the endonuclease is a Cas9 endonuclease.
- the unmodified biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the donor template comprises, from 5’ to 3’, a sequence homologous to a sequence 5’ of the target sequence, a sequence of a promoter, and sequence homologous to a sequence 3’ of the target sequence.
- the promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
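The donor template layout described above (5' homology arm, then promoter, then 3' homology arm) can be sketched as a simple string assembly. This is a minimal illustration only: the function name `build_donor_template` and all three sequences are hypothetical placeholders, not the ermE or kasO promoter sequences of SEQ ID NO: 9 or 10, and not the homology arms of any actual construct.

```python
def build_donor_template(upstream_arm: str, promoter: str, downstream_arm: str) -> str:
    """Concatenate, 5' to 3': upstream homology arm, promoter, downstream homology arm.

    All inputs are assumed to be written 5'->3' on the same strand.
    """
    parts = [upstream_arm, promoter, downstream_arm]
    for part in parts:
        # Reject empty parts and anything that is not a plain DNA string.
        if not part or set(part.upper()) - set("ACGT"):
            raise ValueError(f"not a non-empty DNA sequence: {part!r}")
    return "".join(part.upper() for part in parts)


# Hypothetical placeholder sequences, for illustration only.
UPSTREAM_ARM = "ATGCCGTTAGC"      # homologous to the region 5' of the target sequence
PROMOTER = "TTGACAGGCTATAAT"      # stands in for a promoter such as ermE or kasO
DOWNSTREAM_ARM = "GGCATCGATTC"    # homologous to the region 3' of the target sequence

donor = build_donor_template(UPSTREAM_ARM, PROMOTER, DOWNSTREAM_ARM)
print(donor)
```

In practice homology arms are far longer than these toy strings; the sketch only shows the ordering of the three elements.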
- the present disclosure provides a method of making the compound of Formula (I), comprising a. introducing into a host cell a polynucleotide of the present disclosure or a vector of the present disclosure; b. culturing the host cell under conditions sufficient for the synthesis of the compound of Formula (I) by the biosynthetic gene cluster; and c. isolating and purifying the compound of Formula (I).
- the host cell is an Actinobacterial cell or a Streptomyces cell.
- the Streptomyces cell is a Streptomyces albus or Streptomyces lividans cell.
- the host cell comprises a sequence encoding a LmBU operably linked to a constitutive promoter.
- the polynucleotide or vector is introduced into the host cell by conjugation with an E. coli comprising the polynucleotide or vector.
- the compound of Formula (I) is a compound of Formula (10), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
- the compound of Formula (I) is a compound of Formula (11), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
- the present disclosure provides a pharmaceutical composition, comprising a compound of Formula (I), and a pharmaceutically acceptable excipient.
- the present disclosure provides a pharmaceutical composition, comprising a compound of Formula (10), and a pharmaceutically acceptable excipient.
- the present disclosure provides a pharmaceutical composition, comprising a compound of Formula (11), and a pharmaceutically acceptable excipient.
- the present disclosure provides a method of treating a disease or disorder in a subject, comprising administering a compound of Formula (I) or pharmaceutical composition thereof.
- the present disclosure provides a compound of Formula (I) or the pharmaceutical composition thereof, for use in treating a disease or disorder in a subject.
- the present disclosure provides a compound of Formula (I) for use in the manufacture of a medicament for treating a disease or disorder in a subject.
- the present disclosure provides the use of a compound of Formula (I) or the pharmaceutical composition thereof, for the treatment of a disease or disorder.
- the present disclosure provides a compound of Formula (10) or the pharmaceutical composition thereof, for use in treating a disease or disorder in a subject.
- the present disclosure provides a compound of Formula (10) for use in the manufacture of a medicament for treating a disease or disorder in a subject.
- the present disclosure provides the use of a compound of Formula (10) or the pharmaceutical composition thereof, for the treatment of a disease or disorder.
- the present disclosure provides a compound of Formula (11) or the pharmaceutical composition thereof, for use in treating a disease or disorder in a subject.
- the present disclosure provides a compound of Formula (11) for use in the manufacture of a medicament for treating a disease or disorder in a subject.
- the present disclosure provides the use of a compound of Formula (11) or the pharmaceutical composition thereof, for the treatment of a disease or disorder.
- the disease or disorder is cancer.
- the disease or disorder is fibrosis.
- the subject is human.
- FIG. 1 depicts the proposed biosynthesis of a compound of Formula (I) (Formula (10)) from the AZT039 biosynthetic gene cluster (BGC) based on BGC analysis and 2D structure.
- FIGS. 2A-2C depict the target identification of AZT039 compounds.
- FIG. 2A depicts LCMS selected ion chromatograms of target peaks in SA-LT039 vs SA-pdualP expression.
- FIG. 2B depicts an isotopic labeling experiment showing mass shifts upon addition of labeled amino acid precursors. Stable isotope labeling of target peaks shows modified D10-leucine incorporation.
- FIG. 2C depicts LCMS selected-ion chromatograms for LT003-0026 and LT003-0027 in WT strain under different media conditions showing lack of expression.
- FIG. 3A depicts the 2D NMR structure of a compound of Formula (I) (Formula (10)) from 2D NMR correlations.
- FIG. 3B depicts the 2D NMR structure of a compound of Formula (I) (Formula (11)) from 2D NMR correlations.
- FIG. 4A depicts the structure of a compound of Formula (I) (Formula (10)).
- the stereocenter at position 5 (piperazic acid 2) was determined by NOE.
- FIG. 4B depicts the structure of a compound of Formula (I) (Formula (11)).
- the present disclosure relates to a compound of Formula (I), Formula (10), and Formula (11), and to the use of a compound of Formula (I), Formula (10), and Formula (11) in the treatment of diseases or disorders, such as cancer or fibrosis.
- the disclosure relates to the biosynthesis of the compound of Formula (I).
- the disclosure relates to the biosynthesis of the compound of Formula (10).
- the disclosure relates to the biosynthesis of the compound of Formula (11).
- “the compound of Formula (I),” “a compound of Formula (I),” “the compound of Formula (10),” “a compound of Formula (10),” “the compound of Formula (11),” and “a compound of Formula (11)” include all stereoisomers, mixtures of stereoisomers, pharmaceutically acceptable salts, solvates, or tautomers thereof.
- the expressions “one or more of A, B, or C,” “one or more A, B, or C,” “one or more of A, B, and C,” “one or more A, B, and C,” “selected from the group consisting of A, B, and C”, “selected from A, B, and C”, and the like are used interchangeably and all refer to a selection from a group consisting of A, B, and/or C, i.e., one or more As, one or more Bs, one or more Cs, or any combination thereof, unless indicated otherwise.
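As a rough illustration of the selection language above, the following sketch enumerates every way of choosing at least one kind of member from a group of A, B, and C. The helper name `nonempty_selections` is an assumption made for this example; it models only which kinds are present, not how many instances of each are chosen.

```python
from itertools import combinations


def nonempty_selections(options):
    """Return every non-empty combination of the given options."""
    items = list(options)
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]


selections = nonempty_selections(["A", "B", "C"])
for combo in selections:
    print(" + ".join(combo))
```

For three options this yields seven selections: three single members, three pairs, and the full group.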
- “natural product” refers to a compound that is synthesized by a living organism (e.g., bacteria) under normal physiological conditions and the compound can be quantified, and identified as a pathway specific product, using known techniques in the art. If one skilled in the art cannot quantify a compound in, e.g., extracts of native bacterial cells containing a biosynthetic gene cluster after culturing said cells with a growth medium containing the nutrients required to produce a compound, one skilled in the art would reasonably believe that the bacterial cells do not naturally produce the compound (i.e., the compound is not a “natural product”).
- “module” refers to a set of active site domains of a protein that catalyze one or more of the biosynthetic steps leading to the compound of Formula (I), Formula (10), or Formula (11).
- each module may be composed of a protein, or a module may be composed of a plurality of domains.
- heterologous protein domains may be fused together to form a module.
- an open reading frame may be polycistronic, and encode a plurality of distinct proteins, each of which comprise one or more domains or modules.
- an open reading frame may encode a single protein, which has a plurality of modules, each of which comprises a combination of domains.
- compositions are described as having, including, or comprising specific components, it is contemplated that compositions also consist essentially of, or consist of, the recited components. Similarly, where methods or processes are described as having, including, or comprising specific process steps, the processes also consist essentially of, or consist of, the recited processing steps. Further, it should be understood that the order of steps or order for performing certain actions is immaterial so long as the invention remains operable. Moreover, two or more steps or actions can be conducted simultaneously.
- the term “pharmaceutically acceptable salts” refers to derivatives of the compounds of the present disclosure wherein the parent compound is modified by making acid or base salts thereof.
- pharmaceutically acceptable salts include, but are not limited to, mineral or organic acid salts of basic residues such as amines, alkali or organic salts of acidic residues such as carboxylic acids, and the like.
- the pharmaceutically acceptable salts include the conventional non-toxic salts or the quaternary ammonium salts of the parent compound formed, for example, from non-toxic inorganic or organic acids.
- such conventional non-toxic salts include, but are not limited to, those derived from inorganic and organic acids selected from 2-acetoxybenzoic, 2-hydroxyethane sulfonic, acetic, ascorbic, benzene sulfonic, benzoic, bicarbonic, carbonic, citric, edetic, ethane disulfonic, 1,2-ethane sulfonic, fumaric, glucoheptonic, gluconic, glutamic, glycolic, glycollyarsanilic, hexylresorcinic, hydrabamic, hydrobromic, hydrochloric, hydroiodic, hydroxymaleic, hydroxynaphthoic, isethionic, lactic, lactobionic, lauryl sulfonic, maleic, malic, mandelic, methane sulfonic, napsylic, nitric, oxalic, pamoic, pantothenic, phenylacetic
- the pharmaceutically acceptable salt is a sodium salt, a potassium salt, a calcium salt, a magnesium salt, a diethylamine salt, a choline salt, a meglumine salt, a benzathine salt, a tromethamine salt, an ammonia salt, an arginine salt, or a lysine salt.
- compositions include hexanoic acid, cyclopentane propionic acid, pyruvic acid, malonic acid, 3-(4-hydroxybenzoyl)benzoic acid, cinnamic acid, 4-chlorobenzenesulfonic acid, 2-naphthalenesulfonic acid, 4-toluenesulfonic acid, camphorsulfonic acid, 4-methylbicyclo-[2.2.2]-oct-2-ene-1-carboxylic acid, 3-phenylpropionic acid, trimethylacetic acid, tertiary butylacetic acid, muconic acid, and the like.
- the present disclosure also encompasses salts formed when an acidic proton present in the parent compound either is replaced by a metal ion, e.g., an alkali metal ion, an alkaline earth ion, or an aluminum ion; or coordinates with an organic base such as ethanolamine, diethanolamine, triethanolamine, tromethamine, N-methylglucamine, and the like.
- the ratio of the compound to the cation or anion of the salt can be 1:1, or any ratio other than 1:1, e.g., 3:1, 2:1, 1:2, or 1:3.
- the term “treating” or “treat” describes the management and care of a patient for the purpose of combating a disease, condition, or disorder and includes the administration of the compound of Formula (I), Formula (10), or Formula (11) to alleviate the symptoms or complications of a disease, condition or disorder, to eliminate the disease, condition or disorder, or to prevent the disease, condition or disorder.
- the term “treat” can also include treatment of a cell in vitro or an animal model. It is to be appreciated that references to “treating” or “treatment” include the alleviation of established symptoms of a condition.
- “Treating” or “treatment” of a state, disorder or condition therefore includes: (1) preventing or delaying the appearance of clinical symptoms of the state, disorder or condition developing in a human that may be afflicted with or predisposed to the state, disorder or condition but does not yet experience or display clinical or subclinical symptoms of the state, disorder or condition, (2) inhibiting the state, disorder or condition, i.e., arresting, reducing or delaying the development of the disease or a relapse thereof (in case of maintenance treatment) or at least one clinical or subclinical symptom thereof, or (3) relieving or attenuating the disease, i.e., causing regression of the state, disorder or condition or at least one of its clinical or subclinical symptoms.
- the term “pharmaceutically acceptable” refers to those compounds, anions, cations, materials, compositions, carriers, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio.
- the term “pharmaceutically acceptable excipient” means an excipient that is useful in preparing a pharmaceutical composition that is generally safe, non-toxic and neither biologically nor otherwise undesirable, and includes an excipient that is acceptable for veterinary use as well as human pharmaceutical use.
- a “pharmaceutically acceptable excipient” as used in the specification and claims includes both one and more than one such excipient.
- the term “therapeutically effective amount” refers to an amount of a pharmaceutical agent to treat, ameliorate, or prevent an identified disease or condition, or to exhibit a detectable therapeutic or inhibitory effect. The effect can be detected by any assay method known in the art. The precise effective amount for a subject will depend upon the subject’s body weight, size, and health; the nature and extent of the condition; and the therapeutic or combination of therapeutics selected for administration.
- “polynucleotide” and “nucleic acid” are used interchangeably herein and refer to a polymeric form of nucleotides of any length, i.e., ribonucleotides or deoxyribonucleotides or analogs thereof. These terms refer to the primary structure of the molecule and thus encompass double- and single-stranded DNA as well as double- and single-stranded RNA. The term also encompasses modified nucleic acids, such as methylated and/or capped nucleic acids, nucleic acids containing modified bases, backbone modifications, and the like.
- a gene refers to any segment of DNA associated with a biological function.
- a gene includes, but is not limited to, coding sequences and/or regulatory sequences required for its expression. Genes may also comprise non-expressed DNA segments, e.g. forming recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesis from known or predicted sequence information, and can comprise sequences designed to have desired parameters.
- the genomic DNA, prior to modification, is isolated from bacterial cells originally found in soil.
- the term “homologous” or “homolog” or “ortholog” is known in the art and refers to related sequences that share a common ancestor or family member and are determined based on the degree of sequence identity.
- the terms “substantially similar” and “substantially corresponding” are used interchangeably herein.
- the term refers to nucleic acid fragments wherein the difference in one or more nucleotide bases does not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype.
- nucleic acid fragments of the disclosure also refer to modifications of the nucleic acid fragments of the disclosure, such as deletions or insertions of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the original, unmodified fragment.
- “homologous” or “homolog” or “ortholog” or “substantially similar” or “substantially corresponding” may describe the relationship between a gene found in one species, subspecies, variety, or strain and the corresponding or equivalent gene in another species, subspecies,
- “endogenous” and “native” refer to naturally occurring copies of a gene or promoter.
- a naturally occurring gene refers to a gene that is derived from a naturally occurring source.
- a naturally occurring gene refers to a gene that is a wild-type (non-transgenic) gene, whether located in an endogenous environment within its source organism or placed in a “heterologous” environment when introduced into a different organism.
- a “non-naturally occurring” gene is one that has been mutated or otherwise modified or synthesized to have a sequence that differs from a known native gene.
- the modification may be at the protein level (e.g., amino acid substitution).
- the modification can be at the DNA level without any effect on the protein sequence (e.g., codon optimization).
- homologous sequences are compared. “Homologous sequences” or “homologs” or “orthologs” are thought, believed, or known to be functionally related.
- the functional relationships may be indicated in any of a number of ways, including but not limited to: (a) the degree of sequence identity and/or (b) the same or similar biological function. Preferably, both (a) and (b) are indicated.
- Homology can be determined using default parameters using software programs readily available in the art, such as NCBI BLAST (basic local alignment search tool).
- Percentage identity determinations can be performed for nucleic acids using BLASTN or standard nucleotide BLAST using default settings (Match/Mismatch scores 1, -2; Gap costs linear; Expect threshold 10; Word size 28; and Max matches in a query range 0) and for proteins using BLAST using default settings (Expect threshold 10; Word size 3; Max matches in a query range 0; Matrix BLOSUM62; Gap costs Existence 11, Extension 1; and conditional compositional score matrix adjustment).
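To make the percent-identity thresholds used throughout this disclosure concrete, the sketch below computes identity for two sequences that have already been aligned to equal length. It is not BLAST and does not reproduce BLAST scoring or its default parameters; the function names, the toy sequences, and the choice to count identity as matching columns divided by alignment length are assumptions made for illustration. The threshold set shown is one of the two series recited here (the other uses 97% in place of 98%).

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over a pairwise alignment ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # A column counts as a match only if both residues agree and neither is a gap.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


THRESHOLDS = (80, 85, 90, 95, 98, 99)


def thresholds_met(identity: float) -> list:
    """Return the recited identity thresholds that the given value satisfies."""
    return [t for t in THRESHOLDS if identity >= t]


# Toy, hypothetical sequences: one mismatch in ten aligned positions.
reference = "ACGTACGTAC"
variant = "ACGTACGTAT"

identity = percent_identity(reference, variant)
print(identity, thresholds_met(identity))
```

For real sequences, the alignment itself would come from a tool such as BLAST, and the identity it reports should be preferred over this simplified count.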
- nucleotide change refers to, for example, a nucleotide substitution, deletion, and/or insertion, as is well known in the art. For example, mutations contain alterations that produce silent substitutions, additions or deletions without altering the properties or activity of the encoded protein or the manner in which the protein is made.
- heterologous refers to an amino acid or nucleic acid sequence (e.g., a gene or promoter) that is not naturally occurring in a particular organism or is not naturally occurring in a particular context (e.g., a genomic or plasmid location) in a particular organism.
- a native promoter or other nucleic acid sequence of Streptomyces albus may be heterologous when operably linked to a nucleic acid sequence to which it is not operably linked in the wild-type Streptomyces albus, or when the native promoter or other nucleic acid sequence is delivered in a non-native form (e.g., as a heterologous plasmid or heterologous nucleic acid sequence).
- exogenous is used interchangeably with the term “heterologous” and refers to material from a source other than its natural source.
- “exogenous protein” or “exogenous gene” refers to a protein or gene that is derived from a non-natural source or location and that has been artificially supplied to a biological system.
- the azinothricins (AZTs) are a diverse family of cyclic depsipeptides with potent antimicrobial and anticancer properties. Previous reports in the literature have indicated that the AZTs exert their anticancer properties through modulation of the Wnt/β-catenin pathway.
- the general AZT compound scaffold consists of a core hexadepsipeptide with an attached polyketide-derived tail characterized by the presence of a tetrahydropyran ring. Biosynthetically, the AZTs are made via a type I modular non-ribosomal peptide synthetase-polyketide synthase (NRPS-PKS) assembly line.
- the present disclosure provides, inter alia, a compound of Formula (I): or a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
- the compound of Formula (I) is a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer of Formula (I).
- the compound of Formula (I) is a stereoisomer of Formula (I).
- the compound of Formula (I) is a mixture of stereoisomers of Formula (I).
- the compound of Formula (I) is a tautomer of Formula (I).
- the present disclosure provides, inter alia, a compound of Formula (10): or a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
- the compound of Formula (10) is a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer of Formula (10).
- the compound of Formula (10) is a stereoisomer of Formula (10).
- the compound of Formula (10) is a mixture of stereoisomers of Formula (10).
- the compound of Formula (10) is a tautomer of Formula (10).
- the present disclosure provides, inter alia, a compound of Formula (11).
- the compound of Formula (11) is a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer of Formula (11).
- the compound of Formula (11) is a stereoisomer of Formula (11).
- the compound of Formula (11) is a mixture of stereoisomers of Formula (11).
- the compound of Formula (11) is a tautomer of Formula (11).
- the compounds of the disclosure are cyclic peptides.
- the compounds contain a hexadepsipeptide core, which comprises six amino acids macrocyclized through an ester bond, and a polyketide tail.
- the compounds of Formula (I) and Formula (10) comprise ten stereocenters.
- the compounds of Formula (11) comprise nine stereocenters.
- the compound of Formula (I) is Formula IA, wherein each stereocenter is identified with an *:
- each * of Formula IA represents a bond which is either (R) or (S).
- *2 of Formula IA is (R). In some embodiments, *2 of Formula IA is (S).
- *8 of Formula IA is (R). In some embodiments, *8 of Formula IA is (S).
- *9 of Formula IA is (R). In some embodiments, *9 of Formula IA is (S).
- *13 of Formula IA is (R). In some embodiments, *13 of Formula IA is (S).
- *15 of Formula IA is (R) and R is -OH. In some embodiments, *15 of Formula IA is (S) and R is -OH.
- *18 of Formula IA is (R). In some embodiments, *18 of Formula IA is (S).
- *27 of Formula IA is (R). In some embodiments, *27 of Formula IA is (S).
- *32 of Formula IA is (R). In some embodiments, *32 of Formula IA is (S).
- *33 of Formula IA is (R). In some embodiments, *33 of Formula IA is (S).
- *39 of Formula IA is (R). In some embodiments, *39 of Formula IA is (S).
- a1 and a2 represent the stereochemistry of the alkene bond.
- alkene bond a1 is cis. In some embodiments of the compound of Formula IA, alkene bond a1 is trans.
- alkene bond a2 is cis. In some embodiments of the compound of Formula IA, alkene bond a2 is trans.
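- The embodiments above assign each of the ten labelled stereocenters of Formula IA as (R) or (S) and each of the two alkene bonds as cis or trans. The following Python sketch simply enumerates those label combinations to show how many assignments the text spans. The names `STEREOCENTERS`, `ALKENES`, and `enumerate_configurations` are illustrative assumptions, and the count is an upper bound on label combinations only; it says nothing about which combinations are chemically accessible or have been made.

```python
from itertools import product

# Stereocenter labels and alkene labels as listed for Formula IA in the text.
STEREOCENTERS = (2, 8, 9, 13, 15, 18, 27, 32, 33, 39)
ALKENES = ("a1", "a2")


def enumerate_configurations():
    """Yield every (R)/(S) and cis/trans label assignment as a dict."""
    for rs in product("RS", repeat=len(STEREOCENTERS)):
        for geometry in product(("cis", "trans"), repeat=len(ALKENES)):
            config = {f"*{center}": label for center, label in zip(STEREOCENTERS, rs)}
            config.update(dict(zip(ALKENES, geometry)))
            yield config


def count_configurations() -> int:
    """Number of label combinations: 2 per stereocenter, 2 per alkene."""
    return 2 ** len(STEREOCENTERS) * 2 ** len(ALKENES)
```

- With ten stereocenters and two alkene bonds, the enumeration covers 2^10 × 2^2 = 4096 label assignments.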
- the compounds of the disclosure provide a scaffold which can be derivatized to create therapeutic agents.
- the compounds of the disclosure themselves are therapeutic agents.
- the compounds of the disclosure target the Wnt/beta-catenin (or β-catenin) signaling pathway.
- Wnt/β-catenin signaling, a pathway highly conserved through evolution, regulates key cellular functions including proliferation, differentiation, migration, genetic stability, apoptosis, and stem cell renewal.
- the Wnt pathway mediates biological processes but the effect depends on the involvement of β-catenin in signal transduction.
- β-catenin is a core component of the cadherin protein complex, whose stabilization is essential for the activation of Wnt/β-catenin signaling.
- Azinothricin compounds are cyclic hexadepsipeptides that are characterised by a 19-membered cyclodepsipeptide ring composed of 6 unusual amino acids (hexadepsipeptide) and an acyl side chain connected through an amide bond.
- the first member of this class, azinothricin, was reported from Streptomyces X-14950. Because of the strong antitumor and antibacterial activity, significant efforts have been made to identify bacterially produced natural products using the classical culturing approach. However, the discovery of new drug candidates using culturing approaches has been limited.
- BGC: biosynthetic gene cluster
- AZT039 biosynthetic gene cluster
- the BGC in AZT039 comprises a LmBU regulator.
- LmBU regulators are a class of BGC-specific regulators that act as positive regulators of compound expression in many species of Streptomyces.
- the disclosure provides compositions and methods for overexpressing LmBU by cloning it into an integrative plasmid under the control of a constitutive promoter.
- the promoter is a strong heterologous promoter not present in the naturally occurring AZT039 BGC.
- the promoter is added to initiate transcription of RNA, and consequently synthesize a compound of Formula (I), Formula (10), and/or Formula (11).
- the promoter is selected from ermE* and kasO*.
- the disclosure provides polynucleotides comprising a biosynthetic gene cluster comprising one or more genes that contribute to the production of at least a portion of the compound of Formula (I) when the biosynthetic gene cluster is expressed by a host cell.
- Host cells expressing the polynucleotides of the disclosure can be used in the manufacture of the compound of Formula (I).
- the biosynthetic gene cluster (BGC) can be wild type, i.e. not subject to modifications through genetic engineering methods known in the art.
- the BGC is subject to one or more modifications, for example modifications that increase, or result in, expression of the compound of Formula (I) by the host cell.
- the biosynthetic gene cluster involved in the production of a compound of Formula (I), Formula (10), or Formula (11) is isolated or derived from a Streptomyces species of bacteria. Streptomyces is a genus of Actinobacteria and the type genus of the family Streptomycetaceae. Over 500 species of Streptomyces have been described to date, all of which are envisaged as within the scope of the instant disclosure.
- the biosynthetic gene cluster is isolated or derived from Streptomyces sp. NRRL F-6131.
- the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of the compound of Formula (I), Formula (10), or Formula (11) when the biosynthetic gene cluster is expressed by a host cell.
- the biosynthetic gene cluster comprises at least one gene that, together with other genes in the host genome and/or the biosynthetic gene cluster, catalyzes or contributes to at least one biosynthetic step that results in the production of the compound of Formula (I), Formula (10), or Formula (11) from a precursor compound. Exemplary precursor compounds are shown in FIG. 1.
- the biosynthetic gene cluster comprises at least one nonribosomal peptide synthetase module.
- Nonribosomal peptide synthetases are enzymes which, unlike ribosomes, synthesize their own peptidic products independent of messenger RNA.
- NRPS are modular enzymes that catalyze synthesis of important peptide products from a variety of standard and non-proteinogenic amino acid substrates. Typically, each NRPS can synthesize one type of non-ribosomal peptide.
- Nonribosomal peptides often have cyclic and/or branched structures, can contain non-proteinogenic amino acids including D-amino acids, carry modifications like N-methyl and N-formyl groups, or are glycosylated, acylated, halogenated, or hydroxylated.
- the NRPS genes for an individual peptide are frequently organized into operons in bacteria. Functionally related operons may be organized into gene clusters.
- NRPS enzymes are organized in modules, each module comprising multiple catalytic domains that are responsible for incorporation of a single amino acid residue.
- a first domain activates and covalently attaches an amino acid to an integrated carrier protein domain, and the substrates and intermediates are then delivered to neighboring catalytic domains for peptide bond formation or, in some modules, chemical modification.
- the peptide is delivered to a terminal thioesterase domain that catalyzes release of the peptide product.
- the one or more genes comprise at least one nonribosomal peptide synthetase (NRPS) module. In some embodiments, the one or more genes comprise at least 1, 2, 3, 4, 5, 6, or more NRPS modules. In some embodiments, the one or more genes comprise six NRPS modules. Sequences of representative NRPS modules are described in
- the biosynthetic gene cluster comprises a polynucleotide sequence encoding one or more NRPS modules.
- the polynucleotide sequence encoding the NRPS module is selected from the group consisting of SEQ ID NOS: 12-17, or a sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the sequence encoding the NRPS module comprises, or consists essentially of, a polynucleotide sequence selected from the group consisting of SEQ ID NOS: 12-17.
- the biosynthetic gene cluster comprises polynucleotide sequences encoding six NRPS modules.
- the polynucleotide sequences encoding the six NRPS modules comprise sequences of SEQ ID NOS: 12-17, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the polynucleotide sequences encoding six NRPS modules comprise sequences of SEQ ID NOS: 12-17.
- the biosynthetic gene cluster comprises one or more NRPS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 18-23, or a sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the biosynthetic gene cluster comprises one or more NRPS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 18-23.
- the biosynthetic gene cluster comprises six NRPS modules comprising polypeptide sequences of SEQ ID NOS: 18-23, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the biosynthetic gene cluster comprises six NRPS modules comprising polypeptide sequences of SEQ ID NOS: 18-23.
- the six NRPS modules are organized in 1, 2, 3, 4, 5 or 6 open reading frames.
- the open reading frames comprise polynucleotide sequences selected from the group consisting of SEQ ID NOS: 2-5, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the six NRPS modules are organized in 4 open reading frames.
- the six NRPS modules are encoded by sequences comprising a first NRPS open reading frame of SEQ ID NO:2, a second NRPS open reading frame of SEQ ID NO: 3, a third NRPS open reading frame of SEQ ID NO: 4 and a fourth NRPS open reading frame of SEQ ID NO: 5, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the six NRPS modules are encoded by sequences comprising SEQ ID NOS: 2-5.
- one or more of the NRPS open reading frames encode a polypeptide having an amino acid sequence selected from SEQ ID NOS: 38-41, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the one or more NRPS open reading frames encode a polypeptide having a sequence of SEQ ID NO: 38-41, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the NRPS modules and/or the host cells described herein comprise a polynucleotide sequence which encodes a polypeptide having a sequence of SEQ ID NO: 38-41, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of a compound of Formula (I), Formula (10), and/or Formula (11) when the biosynthetic gene cluster is expressed by a host cell.
- the one or more genes comprise at least one polyketide synthase (PKS) module.
- the one or more genes comprise at least one PKS module. In some embodiments, the one or more genes comprise at least 1, 2, 3, 4 or more PKS modules. In some embodiments, the one or more genes comprise four PKS modules. Sequences of representative PKS modules are described in Table 2, below.
- PKS: polyketide synthase; ACP: acyl carrier protein
- Type I polyketide synthase modules comprise several domains with defined functions, separated by short spacer regions.
- An exemplary, but non-limiting Type I PKS protein comprises, from N to C terminus, a starting or loading module comprising an Acyltransferase (AT) and Acyl carrier protein (ACP) domain, an elongation or extending module comprising Keto-synthase (KS), AT, Dehydratase (DH), Enoylreductase (ER) and Ketoreductase (KR) domains, and a termination or releasing domain or module comprising a Thioesterase.
- domain abbreviations: AT, acyltransferase; ACP, acyl carrier protein; KS, keto-synthase; DH, dehydratase; ER, enoylreductase.
- the nascent polyketide chain is passed from one thiol group to the next by trans-acylation reactions, and is released at the end by hydrolysis or cyclization.
- the starter group, for example acetyl-CoA or an analogue thereof, is loaded onto the ACP domain of the starter module in a reaction catalyzed by the starter module’s AT domain.
- the nascent polyketide chain is passed from the ACP domain of the previous module to the KS domain of the current module, in a reaction catalyzed by the KS domain.
- the elongation group is loaded onto the current ACP domain in a reaction catalyzed by the current AT domain.
- the ACP-bound elongation group reacts in a Claisen condensation with the KS-bound polyketide chain under CO2 evolution, leaving a free KS domain and an ACP-bound elongated polyketide chain.
- the reaction takes place at the KSn-bound end of the chain, so that the chain moves out one position and the elongation group becomes the new bound group.
- the fragment of the polyketide chain can be altered stepwise by additional domains. This cycle is repeated for each elongation module, until finally the TE domain hydrolyzes the completed polyketide chain from the ACP-domain of the previous module.
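- The elongation cycle described above can be sketched as simple carbon bookkeeping. The following Python example is a toy model under stated assumptions, not a description of the AZT assembly line: it assumes a two-carbon acetyl starter (the text names acetyl-CoA as one example) and three-carbon malonyl-type extender units, neither of which is specified for the disclosed cluster, and the module names are hypothetical. Each Claisen condensation adds the extender to the chain and releases one carbon as CO2.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ElongationModule:
    name: str                  # hypothetical label, for illustration only
    extender_carbons: int = 3  # assumption: malonyl-type extender (3 carbons)


def chain_carbons(starter_carbons, modules):
    """Return (module name, chain carbon count) after each elongation step."""
    history = []
    carbons = starter_carbons
    for module in modules:
        # Claisen condensation: the extender joins the chain, one carbon leaves as CO2.
        carbons += module.extender_carbons - 1
        history.append((module.name, carbons))
    return history


# Assumed two-carbon acetyl starter passed through three hypothetical modules.
example = chain_carbons(2, [ElongationModule(f"module_{i}") for i in (1, 2, 3)])
```

- Under these assumptions the chain grows by two carbons per elongation module. Reductive domains (KR, DH, ER) change the oxidation state of the chain and not its carbon count, so they are omitted from the model.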
- the biosynthetic gene cluster comprises a polynucleotide sequence encoding one or more PKS modules.
- the polynucleotide sequence encoding the PKS module is selected from the group consisting of SEQ ID NOS: 24-27, or a sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the sequence encoding the PKS module comprises, or consists essentially of, a polynucleotide sequence selected from the group consisting of SEQ ID NOS: 24-27.
- the biosynthetic gene cluster comprises polynucleotide sequences encoding four PKS modules.
- the polynucleotide sequences encoding the four PKS modules comprise sequences of SEQ ID NOS: 24-27, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the polynucleotide sequences encoding four PKS modules comprise sequences of SEQ ID NOS: 24-27.
- the biosynthetic gene cluster comprises one or more PKS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 28-31, or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises one or more PKS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 28-31.
- the biosynthetic gene cluster comprises four PKS modules comprising polypeptide sequences of SEQ ID NOS: 28-31, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the biosynthetic gene cluster comprises four PKS modules comprising sequences of SEQ ID NOS: 28-31.
- the four PKS modules are organized in 1, 2, 3 or 4 open reading frames.
- the open reading frames comprise polynucleotide sequences selected from the group consisting of SEQ ID NOS: 6 and 7, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the four PKS modules are organized in two open reading frames.
- the four PKS modules are encoded by sequences comprising a first PKS open reading frame of SEQ ID NO: 6, and a second PKS open reading frame of SEQ ID NO: 7, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the four PKS modules are encoded by sequences comprising SEQ ID NOS: 6-7.
- one or more of the PKS open reading frames encode a polypeptide having an amino acid sequence selected from SEQ ID NOs: 42-43, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the one or more PKS open reading frames encode a polypeptide having a sequence of SEQ ID NOs: 42-43, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the one or more PKS modules and/or the host cells described herein comprise a polynucleotide sequence which encodes a polypeptide having a sequence of SEQ ID NOs: 42-43, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
- the biosynthetic gene cluster of the disclosure can comprise additional genes involved in the production of a compound of Formula (I), Formula (10), and/or Formula (11) in addition to genes encoding NRPS and PKS proteins or modules.
- the biosynthetic gene cluster can include genes that regulate the expression of other genes in the cluster, such as NRPS and PKS encoding genes, genes involved in the synthesis of precursor compounds that are involved in the synthesis of the compound of Formula (I), Formula (10), and/or Formula (11), and genes involved in the transport of the same.
- the biosynthetic gene cluster further comprises a sequence encoding a LmBU regulator, sometimes referred to herein as the LmBU-encoding gene.
- LmBU-family regulators are transcription factors that have been shown to positively modulate the biosynthesis pathways in streptomycetes bacterial species, for example antibiotic biosynthesis.
- LmBU genes are known to occur in various types of antibiotic gene clusters, including, inter alia, gene clusters encoding the synthesis of lincomycin, where they can regulate expression of genes in the gene cluster.
- the biosynthetic gene cluster comprises a LmBU-encoding gene comprising a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the biosynthetic gene cluster comprises a LmBU-encoding gene comprising a protein coding sequence comprising, or consisting essentially of SEQ ID NO: 8.
- the biosynthetic gene cluster comprises a sequence encoding a LmBU protein comprising a sequence of:
- the biosynthetic gene cluster comprises a polynucleotide sequence encoding a LmBU protein comprising, or consisting essentially of SEQ ID NO: 32.
- the biosynthetic gene cluster comprises an mbtH gene.
- mbtH proteins are a family of small proteins encoded by genes found in many, but not all, non-ribosomal peptide synthetase-encoding gene clusters. Approximately 70 amino acids in length, mbtH proteins are named after mbtH contained in the gene cluster for the siderophore mycobactin in Mycobacterium tuberculosis, which codes for a 71-amino acid protein.
- mbtH genes are involved in the biosynthesis pathways of the gene clusters in which they reside.
- the biosynthetic gene cluster comprises an mbtH gene. In some embodiments the biosynthetic gene cluster comprises four NRPS open reading frames, and the mbtH gene is located upstream of the four NRPS open reading frames. In some embodiments, the biosynthetic gene cluster comprises a mbtH-encoding gene comprising a polynucleotide sequence of SEQ ID NO: 36, or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a mbtH-encoding gene comprising a protein coding sequence comprising, or consisting essentially of SEQ ID NO: 36.
- the biosynthetic gene cluster comprises a mbtH protein comprising a polypeptide sequence of SEQ ID NO: 37, or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a mbtH protein comprising a polypeptide sequence comprising, or consisting essentially of SEQ ID NO: 37.
- the biosynthetic gene cluster is a wild type biosynthetic gene cluster isolated or derived from Streptomyces sp. NRRL F-6131.
- the biosynthetic gene cluster comprises 6 NRPS modules encoded by polynucleotide sequences comprising SEQ ID NOS: 12-17, and 4 PKS modules encoded by sequences comprising SEQ ID NOS: 24-27.
- the 6 NRPS modules are arranged in 4 open reading frames comprising sequences of SEQ ID NOS: 2-5.
- the 4 PKS are arranged in 2 open reading frames comprising sequences of SEQ ID NOS: 6-7.
- the biosynthetic gene cluster further comprises a LmBU-encoding gene comprising a polynucleotide sequence of SEQ ID NO: 8, which is located downstream of the 2 PKS open reading frames, for example as shown in FIG. 1.
- the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1. In some embodiments, the biosynthetic gene cluster consists essentially of a sequence of SEQ ID NO: 1.
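- The SEQ ID NO assignments given so far can be collected into a single lookup table, which makes it easier to check which feature a given identifier refers to. The following Python sketch is only a reading aid: the dictionary keys and the helper names `features_for` and `find_overlaps` are illustrative assumptions, the PKS module polypeptides are entered as SEQ ID NOS: 28-31, and identifiers introduced later in the text (for example, promoter sequences) are not included.

```python
# SEQ ID NO assignments as stated in the text up to this point.
SEQ_ID_MAP = {
    "wild-type BGC (polynucleotide)": [1],
    "NRPS open reading frames (polynucleotide)": [2, 3, 4, 5],
    "PKS open reading frames (polynucleotide)": [6, 7],
    "LmBU-encoding gene (polynucleotide)": [8],
    "NRPS modules (polynucleotide)": list(range(12, 18)),
    "NRPS modules (polypeptide)": list(range(18, 24)),
    "PKS modules (polynucleotide)": list(range(24, 28)),
    "PKS modules (polypeptide)": list(range(28, 32)),
    "LmBU protein (polypeptide)": [32],
    "mbtH gene (polynucleotide)": [36],
    "mbtH protein (polypeptide)": [37],
    "NRPS open reading frames (polypeptide)": list(range(38, 42)),
    "PKS open reading frames (polypeptide)": [42, 43],
}


def features_for(seq_id: int) -> list:
    """Names of all features that list the given SEQ ID NO."""
    return sorted(name for name, ids in SEQ_ID_MAP.items() if seq_id in ids)


def find_overlaps() -> dict:
    """SEQ ID NOs assigned to more than one feature (expected to be none)."""
    owners = {}
    for name, ids in SEQ_ID_MAP.items():
        for seq_id in ids:
            owners.setdefault(seq_id, []).append(name)
    return {seq_id: names for seq_id, names in owners.items() if len(names) > 1}
```

- A call such as `features_for(8)` returns the LmBU-encoding gene entry, and `find_overlaps()` is a quick consistency check that no identifier has been entered under two features.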
- one or more genes of the biosynthetic gene cluster are expressed by a host cell comprising the biosynthetic gene cluster, resulting in the production of a compound of Formula (I), Formula (10), and/or Formula (11).
- the host cell is engineered to express one or more genes in the biosynthetic cluster, which results in the production of a compound of Formula (I), Formula (10), and/or Formula (11).
- overexpression of one or more genes in the biosynthetic cluster by the host cell increases the production of a compound of Formula (I), Formula (10), and/or Formula (11) compared to an otherwise equivalent host cell comprising a biosynthetic gene cluster that does not overexpress one or more genes in the biosynthetic cluster.
- the modified host cell increases the production of a compound of Formula (I), Formula (10), and/or Formula (11) by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% as measured by LCMS.
- the host cell overexpresses the LmBU-encoding gene.
- LmBU can be overexpressed in cis or in trans.
- the promoter of the LmBU-encoding gene in the biosynthetic gene cluster can be modified to increase LmBU expression.
- LmBU can be expressed in trans to the biosynthetic gene cluster by the host cell, for example by using a strong promoter to drive LmBU expression.
- LmBU protein regulates the expression of additional genes in the biosynthetic gene cluster by acting as a transcriptional activator.
- Increasing the expression of LmBU increases the expression of LmBU target genes in the biosynthetic gene cluster, thereby increasing the production of compounds of Formula (I), Formula (10), and/or Formula (11) by the host cell.
- increasing the expression of LmBU increases the production of compounds of Formula (I), Formula (10), and/or Formula (11) by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% as measured by LCMS.
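- The percentage increases recited above compare production by a modified host cell with production by a reference host cell. A minimal Python sketch of that arithmetic follows. It assumes that production is represented by a single LCMS signal per sample (for example, an integrated peak area for the compound) and that the signal scales linearly with the amount of compound; the function name `percent_increase` and these assumptions are illustrative and are not taken from the disclosure.

```python
def percent_increase(reference_signal: float, modified_signal: float) -> float:
    """Percent change of the modified sample relative to the reference sample."""
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    return 100.0 * (modified_signal - reference_signal) / reference_signal
```

- For example, a reference signal of 200 and a modified signal of 250 (arbitrary units) correspond to a 25% increase, and a doubling of the signal corresponds to a 100% increase.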
- the disclosure provides polynucleotides comprising a sequence encoding the LmBU protein and a sequence of a promoter, for the overexpression of LmBU protein in a host cell.
- the LmBU protein can be expressed in trans in a host cell, from a polynucleotide that does not form a part of the biosynthetic gene cluster.
- the host cell can comprise a first vector comprising the biosynthetic gene cluster, and a second vector comprising the sequences of the LmBU protein and a promoter.
- promoter refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA.
- the promoter and the sequence encoding the LmBU protein are operably linked.
- the sequence encoding the LmBU protein comprises a sequence of SEQ ID NO: 8
- the promoter comprises a sequence of SEQ ID NOS: 9-10 as set forth in Table 3.
- Representative promoters that can be used to overexpress genes such as LmBU, either in cis by insertion into the BGC, or in trans, are presented in Table 3 below.
- trans overexpression of the LmBU-encoding gene comprises expressing the LmBU-encoding gene under the control of an ermE promoter, a kasO promoter, or a functional variant or derivative thereof. In some embodiments, trans overexpression of the LmBU-encoding gene comprises expressing the LmBU-encoding gene under the control of a constitutive ermE promoter, or a functional variant or derivative thereof. In some embodiments, the ermE promoter is an ermE* promoter comprising a sequence of SEQ ID NO: 9.
- trans overexpression of the LmBU-encoding gene comprises expressing the LmBU-encoding gene under the control of a constitutive kasO promoter, or a functional variant or derivative thereof.
- the kasO promoter is a kasO* promoter comprising a sequence of SEQ ID NO: 10.
- the disclosure provides polynucleotides comprising biosynthetic gene clusters that have been modified relative to their wild type, or native equivalents, to increase production of a compound of Formula (I), Formula (10), and/or Formula (11) when the genes of the biosynthetic gene cluster are expressed by a host cell.
- the production of a compound of Formula (I), Formula (10), and/or Formula (11) by a polynucleotide comprising biosynthetic gene clusters that has been modified relative to its wild type, or native equivalent is increased by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% compared to the wild type, or native equivalent, as measured by LCMS.
- the modified polynucleotide increases the production of a compound of Formula (I), Formula (10), and/or Formula (11) by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% compared to its wild type, or native equivalent, as measured by LCMS.
- the biosynthetic gene cluster can be modified relative to a biosynthetic gene cluster comprising a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the modification comprises one or more modifications relative to a sequence of SEQ ID NO: 1.
- the biosynthetic gene cluster is modified to overexpress the LmBU protein in a host cell, thereby increasing production of a compound of Formula (I), Formula (10), and/or Formula (11) by the host cell.
- the at least one modification of the biosynthetic gene cluster comprises a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
- the production of a compound of Formula (I), Formula (10), and/or Formula (11) by a biosynthetic gene cluster modified to overexpress the LmBU protein in a host cell is increased by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% compared to production by the biosynthetic gene cluster of SEQ ID NO: 1, as measured by LCMS.
- All modifications are envisaged as within the scope of the instant disclosure.
- modifications include substitutions, deletions, inversions, or insertions of heterologous sequences.
- the one or more modifications of the biosynthetic gene cluster comprise a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1.
- the one or more modifications comprise modifications of a promoter of a gene in the biosynthetic gene cluster.
- a heterologous promoter sequence can be inserted near the coding sequence of one or more genes of the BGC.
- one or more promoters of a gene in the BGC can be replaced with a heterologous promoter.
- Heterologous promoters include, inter alia, strong promoters, constitutive promoters and regulatable promoters.
- Exemplary strong promoters include ermE* (SEQ ID NO: 9) and kasO* (SEQ ID NO: 10) as shown in Table 3.
- replacement of one or more promoters comprises replacement of the LmBU promoter, for example with a promoter shown in Table 3.
- the one or more modifications comprise insertion of at least one heterologous promoter in the biosynthetic gene cluster of SEQ ID NO: 1.
- the at least one heterologous promoter is a strong promoter.
- the at least one heterologous promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
- the at least one heterologous promoter comprises a sequence of SEQ ID NOS: 9-10, or a functional variant or derivative thereof.
- engineered versions of the ermE and kasO promoters used herein are sometimes referred to herein as ermE* and kasO*.
- the at least one heterologous promoter comprises a sequence of SEQ ID NOS: 9-10, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
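- by way of non-limiting illustration, a percent identity check against a recited threshold can be sketched as follows. The sketch assumes the two sequences have already been aligned (gaps written as `-`) and uses the alignment length as the denominator; other identity definitions exist, and the function names and toy sequences are hypothetical and are not SEQ ID NOS: 9-10.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Columns in which both sequences carry a gap are ignored; a column
    counts as a match only when both residues are identical and not gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no usable columns")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(identity: float, threshold: float = 80.0) -> bool:
    """True when the identity is at least the chosen threshold (in percent)."""
    return identity >= threshold


# Toy aligned sequences, for illustration only.
toy_identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
```

- the default threshold of 80% mirrors the lowest value recited above; any of the other recited values can be passed instead.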
- the one or more modifications comprise insertion of at least one promoter in the biosynthetic gene cluster.
- the at least one promoter is inserted upstream of the mbtH gene.
- an ermE* promoter of SEQ ID NO: 9 or a kasO* promoter of SEQ ID NO: 10 is inserted upstream of the mbtH gene in SEQ ID NO: 1.
- the at least one promoter is inserted upstream of the LmBU-encoding gene in the biosynthetic gene cluster of SEQ ID NO: 1.
- a kasO* promoter of SEQ ID NO: 9 is inserted upstream of the LmBU-encoding gene in SEQ ID NO: 1.
- the biosynthetic gene cluster comprises one or more modifications relative to SEQ ID NO: 1.
- the modified biosynthetic gene cluster comprises SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- methods of modifying the biosynthetic gene clusters described herein comprise a nucleic acid guided endonuclease.
- the disclosure provides methods of modifying biosynthetic gene clusters comprising (a) providing a first E. coli host cell comprising a first vector comprising a sequence of an unmodified biosynthetic gene cluster comprising a target sequence; (b) introducing the first vector into a Streptomyces host cell by conjugation; (c) providing a second E. coli host cell comprising a second vector comprising: (i) a sequence of at least one gNA specific to the target sequence operably linked to a promoter, (ii) a sequence encoding a Cas endonuclease; and (iii) a sequence encoding a donor template; and (d) introducing the second vector into a Streptomyces host cell by conjugation; whereby introducing the second vector into the Streptomyces host cell produces a double strand break in the target sequence and introduction of a donor template sequence, thereby generating a Streptomyces host cell comprising a modified biosynthetic gene cluster.
- the unmodified gene cluster comprises SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto, and the one or more modifications are modifications of SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the polynucleotide sequence of the modified biosynthetic gene cluster comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the nucleic acid guided endonuclease is a CRISPR/Cas endonuclease.
- the CRISPR/Cas endonuclease is Cas9.
- Other endonucleases known in the art may be used with the constructs described herein.
- the Cas endonuclease is selected from Cas9 (also known as Csn1 and Csx12), Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, variants thereof, mutants thereof, and derivatives thereof.
- CRISPR/Cas endonuclease refers to an enzymatic system that includes a guide nucleic acid (gNA) containing a nucleotide sequence complementary or substantially complementary to a region of a target polynucleotide, and a protein with nuclease activity.
- the CRISPR/Cas systems include the CRISPR-Cas Type I system, the CRISPR-Cas Type II system, the CRISPR-Cas Type III system, and derivatives thereof.
- CRISPR/Cas systems include genetically modified nuclease systems and/or programmed nuclease systems derived from naturally occurring CRISPR-Cas systems.
- CRISPR-Cas systems can contain genetically modified Cas proteins and/or mutated Cas proteins.
- CRISPR/Cas systems may contain genetically modified and/or programmed gNA.
- a gNA may contain nucleotide sequences in a region other than the region complementary or substantially complementary to a region of a target DNA sequence, sometimes termed a leader RNA.
- a leader RNA can be a crRNA or a derivative thereof, for example, a crRNA:tracrRNA chimera.
- gNAs can be RNAs (gRNAs) or DNAs (gDNAs).
- the gNA forms a complex with the CRISPR/Cas enzyme and the targeting portion of the gNA targets the CRISPR/Cas endonuclease to a specific target sequence in a target DNA polynucleotide.
- the CRISPR/Cas endonuclease then cuts the DNA, producing a double strand break.
- This double strand break can be repaired by non-homologous end joining, resulting in a deletion, or by homology directed repair (HDR) from a donor template. If the donor template includes sequences different from the target DNA polynucleotide, these sequence differences are incorporated into the target DNA polynucleotide.
- the donor template comprises, from 5' to 3', a sequence homologous to a sequence 5' of the target sequence, a sequence of a promoter, and sequence homologous to a sequence 3' of the target sequence.
- the promoter is selected from the group consisting of ermE and kasO, or functional variants or derivatives thereof.
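- by way of non-limiting illustration, the 5'-to-3' layout of the donor template (5' homology arm, promoter, 3' homology arm) and the idealized outcome of homology directed repair can be sketched as below. All names (`DonorTemplate`, `build_donor`, `apply_hdr`), the arm length, and the toy sequences are hypothetical; the placeholder promoter string is not the sequence of ermE or kasO, and the sketch models only a perfect, scarless insertion.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DonorTemplate:
    left_arm: str   # homologous to the sequence 5' of the target site
    promoter: str   # heterologous promoter to be inserted
    right_arm: str  # homologous to the sequence 3' of the target site

    def sequence(self) -> str:
        """Donor sequence written 5' to 3'."""
        return self.left_arm + self.promoter + self.right_arm


def build_donor(target: str, insert_pos: int, promoter: str, arm_len: int) -> DonorTemplate:
    """Take homology arms of arm_len nucleotides on each side of insert_pos."""
    if arm_len <= 0 or insert_pos - arm_len < 0 or insert_pos + arm_len > len(target):
        raise ValueError("homology arms do not fit within the target sequence")
    return DonorTemplate(
        left_arm=target[insert_pos - arm_len:insert_pos],
        promoter=promoter,
        right_arm=target[insert_pos:insert_pos + arm_len],
    )


def apply_hdr(target: str, insert_pos: int, donor: DonorTemplate) -> str:
    """Idealized repair: insert the promoter where both arms match the target."""
    left_ok = target[:insert_pos].endswith(donor.left_arm)
    right_ok = target[insert_pos:].startswith(donor.right_arm)
    if not (left_ok and right_ok):
        raise ValueError("homology arms do not flank the insertion site")
    return target[:insert_pos] + donor.promoter + target[insert_pos:]


# Toy example: insert a placeholder promoter at position 8 of a 16-nt target.
toy_target = "AAAACCCCGGGGTTTT"
toy_donor = build_donor(toy_target, insert_pos=8, promoter="TATAAT", arm_len=4)
toy_edited = apply_hdr(toy_target, 8, toy_donor)
```

- homology arms used in practice are typically far longer than the 4-nt arms of this toy example.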
- the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the sequence of SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto is modified using a CRISPR/Cas endonuclease and a donor template to insert at least one heterologous promoter into the biosynthetic gene cluster.
- the at least one heterologous promoter can be inserted upstream of the mbtH gene in SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto, upstream of the LmBU-encoding gene in SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto, or downstream of the second PKS open reading frame or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- gRNAs comprise a targeting sequence, sometimes referred to as a protospacer, and a scaffold.
- the gRNA is selected from CCTTGACAGACAAATTAGGA (SEQ ID NO: 33), TGTGATTCCACTTTTCGAGT (SEQ ID NO: 34), and CGCCGATGCCCTGTGATTCC (SEQ ID NO: 35).
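- by way of non-limiting illustration, basic properties of the recited gRNA targeting sequences (length, GC fraction, reverse complement) and their location in a target sequence can be checked as sketched below. The three sequences are those recited above; the helper names and the flanking nucleotides in the toy target are hypothetical, and the sketch does not evaluate PAM sites or off-target matches.

```python
GRNA_TARGETING_SEQUENCES = {
    "SEQ ID NO: 33": "CCTTGACAGACAAATTAGGA",
    "SEQ ID NO: 34": "TGTGATTCCACTTTTCGAGT",
    "SEQ ID NO: 35": "CGCCGATGCCCTGTGATTCC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G, T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of G and C nucleotides in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_protospacer(target: str, spacer: str) -> list[tuple[str, int]]:
    """Return (strand, 0-based start) for every exact match in the target.

    "+" hits match the spacer as written; "-" hits match its reverse
    complement, with the start given in coordinates of the target as written.
    """
    target = target.upper()
    hits = []
    for strand, query in (("+", spacer.upper()), ("-", reverse_complement(spacer))):
        start = target.find(query)
        while start != -1:
            hits.append((strand, start))
            start = target.find(query, start + 1)
    return hits


# Toy target: the SEQ ID NO: 33 spacer with hypothetical flanking nucleotides.
spacer_33 = GRNA_TARGETING_SEQUENCES["SEQ ID NO: 33"]
toy_target = "GG" + spacer_33 + "TT"
toy_hits = find_protospacer(toy_target, spacer_33)
```

- each of the three recited targeting sequences is 20 nucleotides long.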
- the CRISPR/Cas endonuclease is a Cas9 endonuclease.
- inserting at least one heterologous promoter into the biosynthetic gene cluster further comprises a donor template comprising a sequence of the heterologous promoter.
- the disclosure provides vectors comprising the polynucleotides of the disclosure.
- the vectors comprise the sequence of the biosynthetic gene cluster of SEQ ID NO: 1, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the vector comprises the sequence of a biosynthetic gene cluster comprising at least one modification relative to SEQ ID NO: 1, for example the insertion of a heterologous promoter.
- Suitable vectors for the cloning and expression of the biosynthetic gene clusters described herein will be known to persons of ordinary skill in the art. For example, suitable vectors for expressing biosynthetic gene clusters in Streptomyces are described in US20200291430A1, the contents of which are incorporated by reference in their entirety herein.
- Exemplary vectors include, inter alia, cloning sites, promoters to direct expression of gene products, and selectable markers for host cells such as Streptomyces and/or E. coli.
- the expression vector further comprises an E. coli and/or Streptomyces origin of replication.
- the expression vector further comprises one or more selectable markers for E. coli and/or Streptomyces.
- a number of antibiotic resistance markers are available for Streptomyces, and include thiostrepton (tsr), kanamycin-neomycin (kmr), apramycin (amr), geneticin, viomycin, hygromycin, bleomycin, chloramphenicol, and the like.
- the expression vector further comprises a gene that stabilizes large plasmids.
- the expression vector is configured to accept an insert comprising more than 10 kb, more than 20 kb, more than 50 kb, and/or more than 100 kb.
- Suitable vectors can be configured to express a product of the biosynthetic gene cluster nucleic acid when the expression vector is present in a host cell, such as a Streptomyces host cell.
- the vector is an expression vector.
- the vector is a shuttle vector.
- shuttle vector refers to a vector constructed so that it can propagate in two different host species, e.g., E. coli and another organism such as Streptomyces.
- the vector is a plasmid or a bacterial artificial chromosome.
- a compound of Formula (I), Formula (10), and/or Formula (11) is synthesized using a semi-synthetic approach. In some embodiments, a compound of Formula (I), Formula (10), and/or Formula (11) is synthesized using a biosynthetic approach.
- the compound is cyclized with the use of a biosynthetic gene cluster (BGC) such as the biosynthetic gene cluster described supra, sometimes referred to herein as the AZT039 biosynthetic gene cluster.
- AZT039 biosynthetic gene cluster refers to a biosynthetic gene cluster isolated or derived from Streptomyces species, which is described in further detail supra.
- the AZT039 biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
- the wild-type AZT039 biosynthetic gene cluster is modified.
- the modified AZT039 biosynthetic gene cluster produces a compound of Formula (I), Formula (10), and/or Formula (11).
- the modification(s) of the BGC is necessary to produce quantifiable levels of the compounds of the disclosure.
- Modifications of the biosynthetic gene cluster can be carried out by any methods known in the art.
- the BGC can be modified using a CRISPR/Cas endonuclease.
- the present disclosure provides a method of making a compound of Formula (I), Formula (10), and/or Formula (11) comprising: a. genome mining to identify a biosynthetic gene cluster; b. modifying the identified biosynthetic gene cluster; c. identifying a target compound; and d. isolating the target compound.
- the genome mining identifies a biosynthetic gene cluster.
- the identified biosynthetic gene cluster is AZT039.
- AZT039 is isolated or derived from Streptomyces strain NRRL F-6131.
- the genome is sequenced prior to modification.
- the biosynthetic gene cluster is modified by overexpression of at least one gene in the cluster.
- the overexpressed gene is LmBU.
- the biosynthetic gene cluster is isolated. In some embodiments, the biosynthetic gene cluster is isolated prior to identifying the target compound. In some embodiments, the biosynthetic gene cluster is isolated prior to isolating the target compound.
- the biosynthetic gene cluster is expressed in a heterologous host.
- the heterologous host is S. albus.
- the biosynthetic gene cluster is further modified. In some embodiments, the biosynthetic gene cluster is further modified by the insertion of one or more strong promoters, using methods provided herein. In some embodiments, the strong promoter is one or more selected from ermE and kasO, or a functional derivative thereof.
- the compound of Formula (I), Formula (10), and/or Formula (11) is isolated from culture.
- the compound of Formula (I), Formula (10), and/or Formula (11) is isolated and then purified.
- the present disclosure provides a method of making a compound of Formula (I), Formula (10), and/or Formula (11) further comprising a step of: (e) purifying the isolated compound.
- the present disclosure provides a method of making a compound of Formula (I), Formula (10), and/or Formula (11), or derivatizing the compound of Formula (I), Formula (10), and/or Formula (11), by solid phase peptide synthesis wherein the amino acid α-N-terminal is protected by an acid or base protecting group.
- Such protecting groups should have the properties of being stable to the conditions of peptide linkage formation while being readily removable without destruction of the growing peptide chain or racemization of any of the chiral centers contained therein.
- Suitable protecting groups are 9-fluorenylmethyloxycarbonyl (Fmoc), t-butyloxycarbonyl (Boc), benzyloxycarbonyl (Cbz), biphenylisopropyloxycarbonyl, t-amyloxycarbonyl, isobornyloxycarbonyl, α,α-dimethyl-3,5-dimethoxybenzyloxycarbonyl, o-nitrophenylsulfenyl, 2-cyano-t-butyloxycarbonyl, and the like.
- side chain protecting groups are, for example: for side chain amino groups (e.g., lysine and arginine), 2,2,5,7,8-pentamethylchroman-6-sulfonyl (pmc), nitro, p-toluenesulfonyl, 4-methoxybenzene-sulfonyl, Cbz, Boc, and adamantyloxycarbonyl; for tyrosine, benzyl, o-bromobenzyloxy-carbonyl, 2,6-dichlorobenzyl, isopropyl, t-butyl (t-Bu), cyclohexyl, cyclopentyl and acetyl (Ac); for serine, t-butyl, benzyl and tetrahydropyranyl; for histidine, trityl, benzyl, Cbz, p-toluenesulfonyl and 2,4-dinitropheny
- the α-C-terminal amino acid is attached to a suitable solid support or resin.
- suitable solid supports useful for the above synthesis are those materials which are inert to the reagents and reaction conditions of the stepwise condensation-deprotection reactions, as well as being insoluble in the media used.
- Solid supports for synthesis of α-C-terminal carboxy peptides may be 4-hydroxymethylphenoxymethyl-copoly(styrene-1% divinylbenzene) or 4-(2',4'-dimethoxyphenyl-Fmoc-aminomethyl)phenoxyacetamidoethyl.
- the α-C-terminal amino acid may be coupled to the resin by means of N,N'-dicyclohexylcarbodiimide (DCC), N,N'-diisopropylcarbodiimide (DIC), or O-benzotriazol-1-yl-N,N,N',N'-tetramethyluronium hexafluorophosphate (HBTU), with or without 4-dimethylaminopyridine (DMAP), 1-hydroxybenzotriazole (HOBT), benzotriazol-1-yloxy-tris(dimethylamino)phosphonium hexafluorophosphate (BOP), or bis(2-oxo-3-oxazolidinyl)phosphine chloride (BOPCl), mediated coupling for from about 1 hour to about 24 hours at a temperature of between 10°C and 50°C in a solvent (e.g., dichloromethane or DMF).
- the Fmoc group is cleaved with a secondary amine (e.g., piperidine) prior to coupling with the α-C-terminal amino acid as described above.
- the coupling of successive protected amino acids may be carried out in an automatic polypeptide synthesizer.
- the α-N-terminus of each amino acid of the growing peptide chain is protected with Fmoc.
- the removal of the Fmoc protecting group from the α-N-terminal side of the growing peptide may be accomplished by treatment with a secondary amine (e.g., piperidine). Each protected amino acid may then be introduced in about 3-fold molar excess, and the coupling may be carried out in DMF.
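- by way of non-limiting illustration, the amount of protected amino acid corresponding to an about 3-fold molar excess can be estimated from the resin mass and loading as sketched below. The function name, the resin loading, and the example amino acid (Fmoc-Ala-OH, molecular weight taken as 311.33 g/mol) are assumptions chosen only to show the arithmetic.

```python
def amino_acid_charge(
    resin_mass_g: float,
    loading_mmol_per_g: float,
    mw_g_per_mol: float,
    molar_excess: float = 3.0,
) -> tuple[float, float]:
    """Return (mmol, mg) of protected amino acid for one coupling.

    The synthesis scale is resin mass times loading; the amino acid is
    charged at molar_excess times that scale. Because 1 mmol of a compound
    of molecular weight M g/mol weighs M mg, the mass in mg is mmol * M.
    """
    if min(resin_mass_g, loading_mmol_per_g, mw_g_per_mol, molar_excess) <= 0:
        raise ValueError("all inputs must be positive")
    scale_mmol = resin_mass_g * loading_mmol_per_g
    amount_mmol = scale_mmol * molar_excess
    return amount_mmol, amount_mmol * mw_g_per_mol


# Hypothetical example: 0.5 g of resin at 0.6 mmol/g, coupling Fmoc-Ala-OH.
mmol_needed, mg_needed = amino_acid_charge(0.5, 0.6, 311.33)
```

- in this example the synthesis scale is 0.3 mmol, so a 3-fold excess corresponds to about 0.9 mmol, or roughly 280 mg, of the protected amino acid.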
- the polypeptide is removed from the resin and deprotected, either successively or in a single operation. Removal of the polypeptide and deprotection may be accomplished in a single operation by treating the resin-bound polypeptide with a cleavage reagent (e.g., thioanisole, water, ethanedithiol, and trifluoroacetic acid).
- the resin may be cleaved by aminolysis with an alkylamine.
- the peptide may be removed by transesterification (e.g. with methanol) followed by aminolysis or by direct transamidation.
- the protected peptide may be purified or taken directly to the next step without purification.
- the removal of the side chain protecting groups may be accomplished using the appropriate cleavage conditions.
- the fully deprotected peptide may be purified by a sequence of chromatographic steps employing one or more of the following types: ion exchange on a weakly basic resin (acetate form); hydrophobic adsorption chromatography on underivatized polystyrene-divinylbenzene (e.g., Amberlite XAD); silica gel adsorption chromatography; ion exchange chromatography on carboxymethylcellulose; partition chromatography (e.g., on Sephadex G-25, LH-20 or countercurrent distribution); high performance liquid chromatography (HPLC), such as reverse-phase HPLC on octyl- or octadecylsilyl-silica bonded phase column packing.
- compounds of the present disclosure can be prepared in a variety of ways using commercially available starting materials, compounds known in the literature, or from readily prepared intermediates, by employing standard synthetic methods and procedures either known to those skilled in the art, or which will be apparent to the skilled artisan in light of the teachings herein.
- Standard synthetic methods and procedures for the preparation of organic molecules and functional group transformations and manipulations can be obtained from the relevant scientific literature or from standard textbooks in the field. Although not limited to any one or several sources, classic texts such as Smith, M.
- the disclosure provides methods of making a compound of Formula (I), Formula (10), and/or Formula (11) in a host cell comprising the biosynthetic gene cluster described herein.
- the host cell does not produce a compound of Formula (I), Formula (10), and/or Formula (11) in the absence of the biosynthetic gene cluster described herein.
- the disclosure provides methods of making the compound of Formula (I), comprising (a) introducing into a host cell the polynucleotides or vectors of the disclosure; (b) culturing the host cell under conditions sufficient for the synthesis of the compound of Formula (I) by the biosynthetic gene cluster; and (c) isolating and purifying the compound of Formula (I).
- the host cell is a Streptomyces cell, such as a Streptomyces coelicolor or Streptomyces albus cell.
- the host cell comprises a sequence encoding a LmBU operably linked to a constitutive promoter.
- the promoter is selected from ermE and kasO or functional variants or derivatives thereof.
- the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
- Methods of introducing polynucleotides and vectors into suitable host cells will be known to persons of ordinary skill in the art, and include electroporation and conjugation with an E. coli cell comprising the polynucleotide or vector.
- Intergeneric conjugation with E. coli allows for the introduction of vectors into Streptomyces species.
- Exemplary vectors for intergeneric conjugation between E. coli and Streptomyces comprise the 760-bp oriT fragment for conjugation, but require the transfer functions to be supplied in trans by the E. coli donor strain.
- Some vectors include the attachment site (attP) and the integrase (int) function of the temperate phage φC31 to facilitate the site-specific integration of the vector at the attB site of the Streptomyces chromosome.
- the disclosure provides host cells, comprising the polynucleotides and vectors described herein.
- the host cell further comprises a polynucleotide comprising a sequence encoding a LmBU operably linked to one or more constitutive promoters, such as ermE* and/or kasO*.
- the sequence encoding the LmBU comprises SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
- the host cell, or host organism, is typically, but not necessarily, a genetically tractable (e.g., culturable under laboratory conditions and manipulable by molecular biological techniques) organism.
- the host organism may be a member of the domain Bacteria, the domain Eukarya, or the domain Archaea.
- the host microorganism is from the domain Bacteria.
- the host organism is a bacterium in the terrabacteria group.
- the host microorganism is from the taxa Actinobacteria, Streptomycetales, or Streptomycetaceae.
- the host is from the genus Streptomyces.
- the host is a Streptomyces expression strain, e.g., as defined herein (e.g., Streptomyces avermitilis, Streptomyces venezuelae, Streptomyces albus, Streptomyces lividans, and Streptomyces coelicolor).
- the host organism is a Streptomyces species.
- the host is Streptomyces albus.
- “Streptomyces expression strains” or “heterologous Streptomyces expression strains” refers to bacterial strains including, but not limited to, commonly used species such as Streptomyces avermitilis, Streptomyces venezuelae, Streptomyces albus, Streptomyces lividans, and Streptomyces coelicolor.
- Streptomyces may be grown in suitable liquid media (e.g., Tryptic Soy Broth (TSB), R2YE and YEME media) at about 28 °C, in baffled Erlenmeyer or similar shaking flask systems. Long term storage of Streptomyces can be accomplished through glycerol stocks.
- the present disclosure provides a pharmaceutical composition comprising one or more compounds of any one of Formula (I), Formula (10), and/or Formula (11) as an active ingredient.
- the present disclosure provides a pharmaceutical composition comprising one or more compounds of any one of Formula (I), Formula (10), and/or Formula (11) and one or more pharmaceutically acceptable carriers, diluents or excipients.
- Pharmaceutically acceptable carriers, diluents or excipients include without limitation any adjuvant, carrier, excipient, glidant, sweetening agent, diluent, preservative, dye/colorant, flavor enhancer, surfactant, wetting agent, dispersing agent, suspending agent, stabilizer, isotonic agent, solvent, or emulsifier.
- composition is intended to encompass a product comprising the specified ingredients in the specified amounts, as well as any product which results, directly or indirectly, from combination of the specified ingredients in the specified amounts.
- compositions comprising any compound described herein in combination with at least one pharmaceutically acceptable excipient or carrier.
- the term “pharmaceutical composition” is a formulation containing the compounds of the present disclosure in a form suitable for administration to a subject.
- the pharmaceutical composition is in bulk or in unit dosage form.
- the unit dosage form is any of a variety of forms, including, for example, a capsule, an IV bag, a tablet, a single pump on an aerosol inhaler or a vial.
- the quantity of active ingredient (e.g., a formulation of a compound of Formula (I), Formula (10), and/or Formula (11)) in a unit dose of composition is an effective amount and is varied according to the particular treatment involved.
- the dosage will also depend on the route of administration.
- A variety of routes of administration are contemplated, including oral, pulmonary, rectal, parenteral, transdermal, subcutaneous, intravenous, intramuscular, intraperitoneal, inhalational, buccal, sublingual, intrapleural, intrathecal, intranasal, and the like.
- Dosage forms for the topical or transdermal administration of a compound of this disclosure include powders, sprays, ointments, pastes, creams, lotions, gels, solutions, patches and inhalants.
- the active compound is mixed under sterile conditions with a pharmaceutically acceptable carrier, and with any preservatives, buffers, or propellants that are required.
- the therapeutically effective amount can be estimated initially either in cell culture assays, e.g., of neoplastic cells, or in animal models, usually rats, mice, rabbits, dogs, or pigs.
- the animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
- Therapeutic/prophylactic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50 % of the population) and LD50 (the dose lethal to 50 % of the population).
- the dose ratio between toxic and therapeutic effects is the therapeutic index, and it can be expressed as the ratio, LD50/ED50.
- Pharmaceutical compositions that exhibit large therapeutic indices are preferred.
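- by way of non-limiting illustration, the therapeutic index defined above as the ratio LD50/ED50 can be computed as sketched below. The function name and the dose values are hypothetical and do not describe any compound of the disclosure; both doses are assumed to be expressed in the same units.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Therapeutic index as the ratio LD50 / ED50 (same dose units for both)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, used only to show the calculation.
example_index = therapeutic_index(ld50=500.0, ed50=25.0)
```

- a larger ratio indicates a wider margin between the dose that is therapeutically effective and the dose that is lethal in 50% of the population.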
- the dosage may vary within this range depending upon the dosage form employed, sensitivity of the patient, and the route of administration.
- Dosage and administration are adjusted to provide sufficient levels of the active agent(s) or to maintain the desired effect.
- Factors which may be taken into account include the severity of the disease state, general health of the subject, age, weight, and gender of the subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and tolerance/response to therapy.
- Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the particular formulation.
- compositions containing active compounds of the present disclosure may be manufactured in a manner that is generally known, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.
- Pharmaceutical compositions may be formulated in a conventional manner using one or more pharmaceutically acceptable carriers comprising excipients and/or auxiliaries that facilitate processing of the active compounds into preparations that can be used pharmaceutically. Of course, the appropriate formulation is dependent upon the route of administration chosen.
- the compounds, or pharmaceutically acceptable salts thereof may be administered orally, nasally, transdermally, pulmonary, inhalationally, buccally, sublingually, intraperitoneally, subcutaneously, intramuscularly, intravenously, rectally, intrapleurally, intrathecally and parenterally.
- the compound is administered orally.
- One skilled in the art will recognize the advantages of certain routes of administration.
- the dosage regimen utilizing the compounds is selected in accordance with a variety of factors including type, species, age, weight, sex and medical condition of the patient; the severity of the condition to be treated; the route of administration; the renal and hepatic function of the patient; and the particular compound or salt thereof employed.
- An ordinarily skilled physician or veterinarian can readily determine and prescribe the effective amount of the drug required to prevent, counter, or arrest the progress of the condition.
- An ordinarily skilled physician or veterinarian can readily determine and prescribe the effective amount of the drug required to counter or arrest the progress of the condition.
- the pharmaceutical compositions of the present disclosure may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels.
- the pharmaceutical compositions may contain additional, compatible, pharmaceutically-active materials such as antipruritics, astringents, local anesthetics or anti-inflammatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers.
- such materials when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention.
- the formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the oligonucleotide(s) of the formulation.
- the compounds described herein, and the pharmaceutically acceptable salts thereof are used in pharmaceutical preparations in combination with a pharmaceutically acceptable carrier or diluent.
- suitable pharmaceutically acceptable carriers include inert solid fillers or diluents and sterile aqueous or organic solutions. The compounds will be present in such pharmaceutical compositions in amounts sufficient to provide the desired dosage amount in the range described herein.
- the compound of Formula (I), Formula (10), and/or Formula (11) can be formulated for oral administration in forms such as, for example, tablets, lozenges, hard or soft capsules, aqueous or oily suspensions, emulsions, dispersible powders, granules, syrups, elixirs, and tinctures.
- the compound of Formula (I), Formula (10), and/or Formula (11) can also be formulated for intravenous (bolus or infusion), intraperitoneal, topical (for example as creams, ointments, gels, or aqueous or oily solutions or suspensions), inhalation (for example as a finely divided powder or a liquid aerosol), insufflation (for example as a finely divided powder), parenteral (for example as a sterile aqueous or oily solution for intravenous, subcutaneous, intramuscular, or intraperitoneal dosing), rectal (for example as a suppository), or transdermal (e.g., patch) administration.
- the present disclosure provides pharmaceutical compositions comprising a compound of Formula (I), Formula (10), and/or Formula (11) combined with a pharmaceutically acceptable carrier.
- suitable pharmaceutically acceptable carriers include, but are not limited to, inert solid fillers or diluents and sterile aqueous or organic solutions.
- Pharmaceutically acceptable carriers are well known to those skilled in the art and include, but are not limited to, from about 0.01 to about 0.1 M phosphate buffer or saline (e.g., about 0.8%).
- Such pharmaceutically acceptable carriers can be aqueous or non-aqueous solutions, suspensions and emulsions.
- non-aqueous solvents suitable for use in the present application include, but are not limited to, propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate.
- Liquid carriers suitable for use in the present application can be used in preparing solutions, suspensions, emulsions, syrups, elixirs and pressurized compounds.
- the active ingredient can be dissolved or suspended in a pharmaceutically acceptable liquid carrier such as water, an organic solvent, a mixture of both or pharmaceutically acceptable oils or fats.
- the liquid carrier can contain other suitable pharmaceutical additives such as solubilizers, emulsifiers, buffers, preservatives, sweeteners, flavoring agents, suspending agents, thickening agents, colors, viscosity regulators, stabilizers or osmo-regulators.
- Liquid carriers suitable for use in the present application include, but are not limited to, water (partially containing additives as above, e.g. cellulose derivatives, preferably sodium carboxymethyl cellulose solution), alcohols (including monohydric alcohols and polyhydric alcohols, e.g. glycols) and their derivatives, and oils (e.g. fractionated coconut oil and arachis oil).
- the carrier can also include an oily ester such as ethyl oleate and isopropyl myristate.
- Sterile liquid carriers are useful in sterile liquid form comprising compounds for parenteral administration.
- the liquid carrier for pressurized compounds disclosed herein can be a halogenated hydrocarbon or other pharmaceutically acceptable propellant.
- Aqueous carriers suitable for use in the present application include, but are not limited to, water, ethanol, alcoholic/aqueous solutions, glycerol, emulsions or suspensions, including saline and buffered media. Oral carriers can be elixirs, syrups, capsules, tablets and the like.
- the formulation of the present disclosure may be in the form of an aqueous solution comprising an aqueous vehicle.
- the aqueous vehicle component may comprise water and at least one pharmaceutically acceptable excipient. Suitable acceptable excipients include those selected from the group consisting of a solubility enhancing agent, chelating agent, preservative, tonicity agent, viscosity/suspending agent, buffer, and pH modifying agent, and a mixture thereof.
- any suitable solubility enhancing agent can be used.
- Examples of a solubility enhancing agent include cyclodextrins, such as those selected from the group consisting of hydroxypropyl-β-cyclodextrin, methyl-β-cyclodextrin, randomly methylated-β-cyclodextrin, ethylated-β-cyclodextrin, triacetyl-β-cyclodextrin, peracetylated-β-cyclodextrin, carboxymethyl-β-cyclodextrin, hydroxyethyl-β-cyclodextrin, 2-hydroxy-3-(trimethylammonio)propyl-β-cyclodextrin, glucosyl-β-cyclodextrin, sulfated β-cyclodextrin (S-β-CD), maltosyl-β-cyclodextrin, β-cyclodextrin sulfobutyl ether,
- Any suitable chelating agent can be used.
- Examples of a suitable chelating agent include those selected from the group consisting of ethylenediaminetetraacetic acid and metal salts thereof, disodium edetate, trisodium edetate, and tetrasodium edetate, and mixtures thereof.
- any suitable preservative can be used.
- Examples of a preservative include those selected from the group consisting of quaternary ammonium salts such as benzalkonium halides (preferably benzalkonium chloride), chlorhexidine gluconate, benzethonium chloride, cetyl pyridinium chloride, benzyl bromide, phenylmercury nitrate, phenylmercury acetate, phenylmercury neodecanoate, merthiolate, methylparaben, propylparaben, sorbic acid, potassium sorbate, sodium benzoate, sodium propionate, ethyl p-hydroxybenzoate, propylaminopropyl biguanide, and butyl-p-hydroxybenzoate, and mixtures thereof.
- the aqueous vehicle may also include a tonicity agent to adjust the tonicity (osmotic pressure).
- the tonicity agent can be selected from the group consisting of a glycol (such as propylene glycol, diethylene glycol, triethylene glycol), glycerol, dextrose, glycerin, mannitol, potassium chloride, and sodium chloride, and a mixture thereof.
- the aqueous vehicle may also contain a viscosity/suspending agent.
- Suitable viscosity/suspending agents include those selected from the group consisting of cellulose derivatives, such as methyl cellulose, ethyl cellulose, hydroxyethylcellulose, polyethylene glycols (such as polyethylene glycol 300, polyethylene glycol 400), carboxymethyl cellulose, hydroxypropylmethyl cellulose, and cross-linked acrylic acid polymers (carbomers), such as polymers of acrylic acid cross-linked with polyalkenyl ethers or divinyl glycol (Carbopols - such as Carbopol 934, Carbopol 934P, Carbopol 971, Carbopol 974 and Carbopol 974P), and a mixture thereof.
- the formulation may contain a pH modifying agent.
- the pH modifying agent is typically a mineral acid or metal hydroxide base, selected from the group of potassium hydroxide, sodium hydroxide, and hydrochloric acid, and mixtures thereof, and preferably sodium hydroxide and/or hydrochloric acid.
- the aqueous vehicle may also contain a buffering agent to stabilize the pH.
- the buffer is selected from the group consisting of a phosphate buffer (such as sodium dihydrogen phosphate and disodium hydrogen phosphate), a borate buffer (such as boric acid, or salts thereof including disodium tetraborate), a citrate buffer (such as citric acid, or salts thereof including sodium citrate), and ε-aminocaproic acid, and mixtures thereof.
- Solid carriers suitable for use in the present application include, but are not limited to, inert substances such as lactose, starch, glucose, methyl-cellulose, magnesium stearate, dicalcium phosphate, mannitol and the like.
- a solid carrier can further include one or more substances acting as flavoring agents, lubricants, solubilizers, suspending agents, fillers, glidants, compression aids, binders or tablet-disintegrating agents; it can also be an encapsulating material.
- the carrier can be a finely divided solid which is in admixture with the finely divided active compound.
- the active compound is mixed with a carrier having the necessary compression properties in suitable proportions and compacted in the shape and size desired.
- the powders and tablets preferably contain up to 99% of the active compound.
- suitable solid carriers include, for example, calcium phosphate, magnesium stearate, talc, sugars, lactose, dextrin, starch, gelatin, cellulose, polyvinylpyrrolidone, low melting waxes and ion exchange resins.
- a tablet may be made by compression or molding, optionally with one or more accessory ingredients.
- Compressed tablets may be prepared by compressing in a suitable machine the active ingredient in a free flowing form such as a powder or granules, optionally mixed with a binder (e.g., povidone, gelatin, hydroxypropylmethyl cellulose), lubricant, inert diluent, preservative, disintegrant (e.g., sodium starch glycolate, cross-linked povidone, cross-linked sodium carboxymethyl cellulose), or surface active or dispersing agent.
- Molded tablets may be made by molding in a suitable machine a mixture of the powdered compound moistened with an inert liquid diluent.
- the tablets may optionally be coated or scored and may be formulated so as to provide slow or controlled release of the active ingredient therein using, for example, hydroxypropyl methylcellulose in varying proportions to provide the desired release profile. Tablets may optionally be provided with an enteric coating, to provide release in parts of the gut other than the stomach.
- Parenteral carriers suitable for use in the present application include, but are not limited to, sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's and fixed oils.
- Intravenous carriers include fluid and nutrient replenishers, electrolyte replenishers such as those based on Ringer's dextrose and the like.
- Preservatives and other additives can also be present, such as, for example, antimicrobials, antioxidants, chelating agents, inert gases and the like.
- Carriers suitable for use in the present application can be mixed as needed with disintegrants, diluents, granulating agents, lubricants, binders and the like using conventional techniques known in the art.
- the carriers can also be sterilized using methods that do not deleteriously react with the compounds, as is generally known in the art.
- Diluents may be added to the formulations of the present invention. Diluents increase the bulk of a solid pharmaceutical composition and/or combination, and may make a pharmaceutical dosage form containing the composition and/or combination easier for the patient and care giver to handle.
- Diluents for solid compositions and/or combinations include, for example, microcrystalline cellulose (e.g., AVICEL), microfine cellulose, lactose, starch, pregelatinized starch, calcium carbonate, calcium sulfate, sugar, dextrates, dextrin, dextrose, dibasic calcium phosphate dihydrate, tribasic calcium phosphate, kaolin, magnesium carbonate, magnesium oxide, maltodextrin, mannitol, polymethacrylates (e.g., EUDRAGIT®), potassium chloride, powdered cellulose, sodium chloride, sorbitol, and talc.
- the pharmaceutical composition may be selected from the group consisting of a solid, powder, liquid and a gel.
- the pharmaceutical composition of the present disclosure is a solid (e.g., a powder, tablet, a capsule, granulates, and/or aggregates).
- the solid pharmaceutical composition comprises one or more excipients known in the art, including, but not limited to, starches, sugars, diluents, granulating agents, lubricants, binders, and disintegrating agents.
- the pharmaceutical compositions of the present disclosure are prepared for oral administration.
- the pharmaceutical compositions are formulated by combining one or more agents and pharmaceutically acceptable carriers. Certain of such carriers enable pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a subject.
- Suitable excipients include, but are not limited to, fillers, such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP).
- such a mixture is optionally ground and auxiliaries are optionally added.
- pharmaceutical compositions are formed to obtain tablets or dragee cores.
- disintegrating agents (e.g., cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof, such as sodium alginate) are added.
- dragee cores are provided with coatings.
- concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures.
- Dyestuffs or pigments may be added to tablets or dragee coatings.
- compositions for oral administration are push-fit capsules made of gelatin.
- Certain of such push-fit capsules comprise one or more pharmaceutical agents of the present invention in admixture with one or more fillers such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers.
- the pharmaceutical compositions for oral administration are soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol.
- one or more compounds disclosed herein, or a pharmaceutically acceptable solvate, hydrate, tautomer, N-oxide, or salt thereof, are dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols.
- stabilizers may be added.
- Solid pharmaceutical compositions that are compacted into a dosage form may include excipients whose functions include helping to bind the active ingredient and other excipients together after compression.
- Binders for solid pharmaceutical compositions and/or combinations include acacia, alginic acid, carbomer (e.g., carbopol), carboxymethylcellulose sodium, dextrin, ethyl cellulose, gelatin, guar gum, gum tragacanth, hydrogenated vegetable oil, hydroxyethyl cellulose, hydroxypropyl cellulose (e.g., KLUCEL), hydroxypropyl methyl cellulose (e.g., METHOCEL), liquid glucose, magnesium aluminum silicate, maltodextrin, methylcellulose, polymethacrylates, povidone (e.g., KOLLIDON, PLASDONE), pregelatinized starch, sodium alginate, and starch.
- the dissolution rate of a compacted solid pharmaceutical composition in the patient’s stomach may be increased by the addition of a disintegrant to the composition and/or combination.
- Disintegrants include alginic acid, carboxymethylcellulose calcium, carboxymethylcellulose sodium (e.g., AC-DI-SOL and PRIMELLOSE), colloidal silicon dioxide, croscarmellose sodium, crospovidone (e.g., KOLLIDON and POLYPLASDONE), guar gum, magnesium aluminum silicate, methyl cellulose, microcrystalline cellulose, polacrilin potassium, powdered cellulose, pregelatinized starch, sodium alginate, sodium starch glycolate (e.g., EXPLOTAB), potato starch, and starch.
- Glidants can be added to improve the flowability of a non-compacted solid composition and/or combination and to improve the accuracy of dosing.
- Excipients that may function as glidants include colloidal silicon dioxide, magnesium trisilicate, powdered cellulose, starch, talc, and tribasic calcium phosphate.
- When a dosage form such as a tablet is made by the compaction of a powdered composition, the composition is subjected to pressure from a punch and die.
- Some excipients and active ingredients have a tendency to adhere to the surfaces of the punch and die, which can cause the product to have pitting and other surface irregularities.
- a lubricant can be added to the composition and/or combination to reduce adhesion and ease the release of the product from the die.
- Lubricants include magnesium stearate, calcium stearate, glyceryl monostearate, glyceryl palmitostearate, hydrogenated castor oil, hydrogenated vegetable oil, mineral oil, polyethylene glycol, sodium benzoate, sodium lauryl sulfate, sodium stearyl fumarate, stearic acid, talc, and zinc stearate.
- Flavoring agents and flavor enhancers make the dosage form more palatable to the patient.
- Flavoring agents and flavor enhancers that may be included in the compositions and/or combinations of the present invention include maltol, vanillin, ethyl vanillin, menthol, citric acid, fumaric acid, ethyl maltol, and tartaric acid.
- Solid and liquid compositions may also be dyed using any pharmaceutically acceptable colorant to improve their appearance and/or facilitate patient identification of the product and unit dosage level.
- a pharmaceutical composition of the present invention is a liquid (e.g., a suspension, elixir and/or solution).
- a liquid pharmaceutical composition is prepared using ingredients known in the art, including, but not limited to, water, glycols, oils, alcohols, flavoring agents, preservatives, and coloring agents.
- Liquid pharmaceutical compositions can be prepared using compounds of the present disclosure, or a pharmaceutically acceptable solvate, hydrate, tautomer, N-oxide, or salt thereof, and any other solid excipients where the components are dissolved or suspended in a liquid carrier such as water, vegetable oil, alcohol, polyethylene glycol, propylene glycol, or glycerin.
- formulations for parenteral administration can contain as common excipients sterile water or saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, hydrogenated naphthalenes and the like.
- biocompatible, biodegradable lactide polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers can be useful excipients to control the release of active compounds.
- Other potentially useful parenteral delivery systems include ethylene-vinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, and liposomes.
- Formulations for inhalation administration contain as excipients, for example, lactose, or can be aqueous solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate and deoxycholate, or oily solutions for administration in the form of nasal drops, or as a gel to be applied intranasally.
- Formulations for parenteral administration can also include glycocholate for buccal administration, methoxysalicylate for rectal administration, or citric acid for vaginal administration.
- Liquid pharmaceutical compositions can contain emulsifying agents to disperse uniformly throughout the composition and/or combination an active ingredient or other excipient that is not soluble in the liquid carrier.
- Emulsifying agents that may be useful in liquid compositions and/or combinations of the present invention include, for example, gelatin, egg yolk, casein, cholesterol, acacia, tragacanth, chondrus, pectin, methyl cellulose, carbomer, cetostearyl alcohol, and cetyl alcohol.
- Liquid pharmaceutical compositions can also contain a viscosity enhancing agent to improve the mouth-feel of the product and/or coat the lining of the gastrointestinal tract.
- Examples of a viscosity enhancing agent include acacia, alginic acid, bentonite, carbomer, carboxymethylcellulose calcium or sodium, cetostearyl alcohol, methyl cellulose, ethylcellulose, gelatin, guar gum, hydroxyethyl cellulose, hydroxypropyl cellulose, hydroxypropyl methyl cellulose, maltodextrin, polyvinyl alcohol, povidone, propylene carbonate, propylene glycol alginate, sodium alginate, sodium starch glycolate, starch, tragacanth, and xanthan gum.
- Sweetening agents such as aspartame, lactose, sorbitol, saccharin, sodium saccharin, sucrose, fructose, mannitol, and invert sugar may be added to improve the taste.
- Preservatives and chelating agents such as alcohol, sodium benzoate, butylated hydroxytoluene, butylated hydroxyanisole, and ethylenediaminetetraacetic acid may be added at levels safe for ingestion to improve storage stability.
- a pharmaceutical composition is prepared for administration by injection (e.g., intravenous, subcutaneous, intramuscular, etc.).
- a pharmaceutical composition comprises a carrier and is formulated in aqueous solution, such as water or physiologically compatible buffers such as Hanks' solution, Ringer's solution, or physiological saline buffer.
- other ingredients are included (e.g., ingredients that aid in solubility or serve as preservatives).
- injectable suspensions are prepared using appropriate liquid carriers, suspending agents and the like.
- compositions for injection are suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.
- Certain solvents suitable for use in pharmaceutical compositions for injection include, but are not limited to, lipophilic solvents and fatty oils, such as sesame oil, synthetic fatty acid esters, such as ethyl oleate or triglycerides, and liposomes.
- Aqueous injection suspensions may contain substances that increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran.
- such suspensions may also contain suitable stabilizers or agents that increase the solubility of the pharmaceutical agents to allow for the preparation of highly concentrated solutions.
- the sterile injectable preparation may also be a sterile injectable solution or suspension in a non-toxic parenterally acceptable diluent or solvent, such as a solution in 1,3-butanediol, or prepared as a lyophilized powder.
- sterile fixed oils may conventionally be employed as a solvent or suspending medium.
- any bland fixed oil may be employed including synthetic mono- or diglycerides.
- fatty acids such as oleic acid may likewise be used in the preparation of injectables.
- Formulations for intravenous administration can comprise solutions in sterile isotonic aqueous buffer.
- the formulations can also include a solubilizing agent and a local anesthetic to ease pain at the site of the injection.
- the ingredients are supplied either separately or mixed together in unit dosage form, for example, as a dry lyophilized powder or water free concentrate in a hermetically sealed container such as an ampule or sachet indicating the quantity of active agent.
- the compound can be dispensed in a formulation with an infusion bottle containing sterile pharmaceutical grade water, saline or dextrose/water.
- an ampule of sterile water for injection or saline can be provided so that the ingredients can be mixed prior to administration.
- Suitable formulations further include aqueous and non-aqueous sterile injection solutions that can contain antioxidants, buffers, bacteriostats, bactericidal antibiotics and solutes that render the formulation isotonic with the bodily fluids of the intended recipient; and aqueous and non-aqueous sterile suspensions, which can include suspending agents and thickening agents.
- a pharmaceutical composition of the present invention is formulated as a depot preparation. Certain such depot preparations are typically longer acting than non-depot preparations. In certain embodiments, such preparations are administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. In certain embodiments, depot preparations are prepared using suitable polymeric or hydrophobic materials (for example an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.
- a pharmaceutical composition of the present invention comprises a sustained-release system.
- a sustained-release system is a semi-permeable matrix of solid hydrophobic polymers.
- sustained-release systems may, depending on their chemical nature, release pharmaceutical agents over a period of hours, days, weeks or months.
- the formulation may further comprise a wetting agent.
- wetting agents include those selected from the group consisting of polyoxypropylene-polyoxyethylene block copolymers (poloxamers), polyethoxylated ethers of castor oils, polyoxyethylenated sorbitan esters (polysorbates), polymers of oxyethylated octyl phenol (Tyloxapol), polyoxyl 40 stearate, fatty acid glycol esters, fatty acid glyceryl esters, sucrose fatty esters, and polyoxyethylene fatty esters, and mixtures thereof.
- the compound of any one of Formula (I), Formula (10), or Formula (11) may be present in the composition in a therapeutically effective amount.
- the compound may be administered at about 0.001 mg/kg to about 100 mg/kg body weight (e.g., about 0.01 mg/kg to about 10 mg/kg or about 0.1 mg/kg to about 5 mg/kg).
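The weight-based ranges above can be turned into absolute amounts with simple arithmetic. The sketch below is illustrative only: the function name and the 70 kg body weight are assumptions made for the example and do not come from the disclosure, and the ranges are stated as approximate ("about") in the text.

```python
def dose_range_mg(body_weight_kg: float,
                  low_mg_per_kg: float = 0.001,
                  high_mg_per_kg: float = 100.0) -> tuple[float, float]:
    """Return the (low, high) absolute dose in mg for a weight-based range."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("low bound exceeds high bound")
    return body_weight_kg * low_mg_per_kg, body_weight_kg * high_mg_per_kg


# The three approximate ranges quoted in the text, in mg/kg.
RANGES_MG_PER_KG = [(0.001, 100.0), (0.01, 10.0), (0.1, 5.0)]

for low, high in RANGES_MG_PER_KG:
    # 70 kg is a hypothetical body weight chosen only for illustration.
    low_mg, high_mg = dose_range_mg(70.0, low, high)
    print(f"{low}-{high} mg/kg -> {low_mg:g}-{high_mg:g} mg")
```

For the hypothetical 70 kg subject, the broadest range spans roughly 0.07 mg to 7000 mg, which shows how wide the stated interval is before the narrower example ranges are applied.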
- a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat or prevent cancer, slow its progression and/or reduce the symptoms associated with the condition.
- a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat cancer, slow its progression and/or reduce the symptoms associated with the condition.
- a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat or prevent fibrosis, slow its progression and/or reduce the symptoms associated with the condition.
- a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat fibrosis, slow its progression and/or reduce the symptoms associated with the condition.
- the size of the dose for therapeutic or prophylactic purposes of a compound of any one of Formula (I), Formula (10), or Formula (11) will naturally vary according to the nature and severity of the conditions, the age and sex of the animal or patient and the route of administration, according to well-known principles of medicine.
- Examples of useful dermatological compositions which can be used to deliver a compound of Formula (I), Formula (10), and/or Formula (11) to the skin are known to the art; for example, see Jacquet et al. (U.S. Pat. No. 4,608,392), Geria (U.S. Pat. No. 4,992,478), Smith et al. (U.S. Pat. No. 4,559,157) and Wortzman (U.S. Pat. No. 4,820,508).
- a “subject” includes a mammal.
- the mammal can be e.g., a human or appropriate non-human mammal, such as primate, mouse, rat, dog, cat, cow, horse, goat, camel, sheep or a pig.
- the subject can also be a bird or fowl.
- the mammal is a human.
- the present disclosure provides a method of treating or preventing a disease or disorder disclosed herein in a subject in need thereof, comprising administering to the subject a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) or a pharmaceutical composition of the present disclosure.
- the present disclosure provides a method of treating cancer in a subject in need thereof, comprising administering to the subject a therapeutically effective amount of the compound of Formula (I), Formula (10), or Formula (11) or a pharmaceutical composition of the present disclosure.
- the present disclosure provides a method of treating fibrosis in a subject in need thereof, comprising administering to the subject a therapeutically effective amount of the compound of Formula (I), Formula (10), or Formula (11) or a pharmaceutical composition of the present disclosure.
- the present disclosure provides the compound of Formula (I) for use in treating cancer in a subject in need thereof.
- the present disclosure provides the compound of Formula (I) for use in treating fibrosis in a subject in need thereof.
- the present disclosure provides use of the compound of any one of Formula (I), Formula (10), or Formula (11) in the manufacture of a medicament for treating a disease or disorder disclosed herein.
- the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) in the manufacture of a medicament for treating cancer in a subject in need thereof.
- the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) in the manufacture of a medicament for treating fibrosis in a subject in need thereof.
- In some embodiments, the present disclosure provides use of the compound of any one of Formula (I), Formula (10), or Formula (11) for the treatment of a disease or disorder disclosed herein.
- the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) for the treatment of cancer.
- the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) for the treatment of fibrosis.
- the disease or disorder is a cancer.
- the cancer is a disease that involves abnormal cell growth with the potential to invade or spread to other parts of the body.
- the cancer is a malignant tumor or neoplasm.
- the cancer is breast cancer, pancreatic cancer, non-small cell lung cancer, ovarian cancer, esophageal cancer, melanoma, lymphoma, uterine cancer, peritoneal cancer, fallopian tube cancer, endometrial cancer, cervical cancer, thyroid cancer, gastric cancer, gastroesophageal junction cancer, urothelial cancer, bladder cancer, oropharynx cancer, hypopharynx cancer, larynx cancer, head and neck cancer, germ cell cancer/tumors, prostate cancer, colon cancer, rectal cancer, kidney cancer, cholangiocarcinoma (bile duct cancer), glioblastoma, leukemia, or non-Hodgkin lymphoma.
- the cancer is Acute Lymphoblastic Leukemia, Acute Myeloid Leukemia, Adrenocortical Carcinoma, AIDS-Related Cancers, Kaposi Sarcoma, Lymphoma, Anal Cancer, Appendix Cancer, Astrocytomas, Childhood Atypical Teratoid/Rhabdoid Tumor, Basal Cell Carcinoma, Skin Cancer (Nonmelanoma), Childhood Bile Duct Cancer, Extrahepatic Bladder Cancer, Bone Cancer, Ewing Sarcoma Family of Tumors, Osteosarcoma and Malignant Fibrous Histiocytoma, Brain Stem Glioma, Brain Tumors, Embryonal Tumors, Germ Cell Tumors, Craniopharyngioma, Ependymoma, Bronchial Tumors, Burkitt Lymphoma (Non-Hodgkin Lymphoma), Carcinoid Tumor, Gastrointestinal Carcinoma of Un
- the disease or disorder is a fibrosis.
- Fibrotic conditions are characterized, in whole or in part, by excess production of fibrotic material. These conditions can include systemic sclerosis, multifocal fibrosclerosis, nephrogenic systemic fibrosis, scleroderma (including morphea, generalized morphea, or linear scleroderma), sclerodermatous graft-vs-host-disease, kidney fibrosis (including glomerular sclerosis, renal tubulointerstitial fibrosis, progressive renal disease or diabetic nephropathy), cardiac fibrosis (e.g., myocardial fibrosis), pulmonary fibrosis (e.g., glomerulosclerosis pulmonary fibrosis, idiopathic pulmonary fibrosis, silicosis, asbestosis, interstitial lung disease, interstitial fibrotic lung disease, and chemotherapy/radiation induced pulmonary fibrosis), oral fibrosis, endomyocardial fibrosis, deltoid fibrosis, pancreatitis, inflammatory bowel disease, Crohn's disease, nodular fasciitis, eosinophilic fasciitis, general fibrosis syndrome characterized by replacement of normal muscle tissue by fibrous tissue in varying degrees, retroperitoneal fibrosis, liver fibrosis, liver cirrhosis, chronic renal failure, myelofibrosis (bone marrow fibrosis), drug induced ergotism, myelodysplastic syndrome, myeloproliferative syndrome, collagenous colitis, acute fibrosis, organ specific fibrosis, and the like.
- the fibrosis is pulmonary fibrosis, liver fibrosis, heart fibrosis, mediastinal fibrosis, retroperitoneal cavity fibrosis, bone marrow fibrosis, or skin fibrosis.
- the fibrotic condition is pulmonary hypertension, chronic obstructive pulmonary disease (COPD), idiopathic pulmonary fibrosis, sarcoidosis, cystic fibrosis, familial pulmonary fibrosis, silicosis, asbestosis, coal worker's pneumoconiosis, carbon pneumoconiosis, or hypersensitivity pneumonitides.
- the fibrosis is cystic fibrosis.
- the subject is a mammal. In some embodiments the mammal is a human.
- the compound of Formula (I) is administered once, twice, three times, four times, or five times per day. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered once daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered twice daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered three times daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered four times daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered five times daily.
- the compound of Formula (I), Formula (10), and/or Formula (11) is administered with a drug holiday. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered without a drug holiday.
- Compounds of Formula (I) were identified as products of the AZT039 biosynthetic gene cluster (BGC), for example using heterologous expression and stable isotope labeling to identify target compounds.
- methods included cloning and conjugation of the AZT039 BGC in S. albus J1074, small scale production and isotope labeling, extraction and LCMS analysis, and large scale production and isolation of compounds.
- Spectroscopic characterization of AZT039 compounds Formula (10) and Formula (11) was performed and 2D structures were obtained.
- Example 2 Proposed Biosynthesis [0357] To initiate discovery of molecules from the AZT family, genome and metagenome mining was performed on publicly deposited (NCBI, JGI, etc.) and internal sequence collections. The mined biosynthetic gene clusters (BGCs) were processed using internal bioinformatic tools followed by analysis of individual BGCs to select for potentially new compound structures. About 190 BGCs were identified. AZT039 was initially found in the genome of Streptomyces sp. NRRL F-6131 as a partial cluster. Streptomyces sp. NRRL F-6131 is the wildtype strain harboring the AZT039 BGC. No characterized AZT molecule was reported from this strain. The genome was resequenced and reassembled, and upon further analysis of the BGC, the resulting compound was predicted to have a unique structure, and AZT039 was prioritized for development.
- AZT039 was first identified from the genome sequence of Streptomyces sp. NRRL F-6131 as a partial gene cluster showing only the NRPS portion of the molecule. In-house in silico reassembly of the deposited genome and antiSMASH analysis showed additional contigs that potentially overlapped the gene cluster but were fragmented. To obtain the full BGC, the strain was ordered from the NRRL collection (https://nrrl.ncaur.usda.gov/). The genome was sequenced using a combination of long-read (ONT) and short-read shotgun (Illumina) platforms, assembled to obtain the full-length sequence of the AZT039 BGC, and annotated using an in-house pipeline.
- the AZT039 gene cluster belongs to the type 1 modular hybrid NRPS-PKS family of BGCs.
- Six NRPS modules corresponding to the core peptide macrocycle are encoded in 4 open reading frames (aztN39, aztO39, aztP39, and aztAG39) (FIG. 1).
- the PKS contains 4 modules encoded in 2 open reading frames (aztAD and aztAE).
- Other genes for precursor biosynthesis, post-PKS modifications, regulation, and transport are distributed downstream and in between the NRPS and PKS core genes.
- aztN contains 7 typical domains belonging to 2 amino acid modules; module 1 contains C-A-T-E domains followed by C-A-T in module 2.
- aztO has 8 domains from modules 3 and 4; module 3 contains C-A-T-E domains followed by C-A-T from module 4.
- Module 5 is encoded in the aztP gene containing C-A-T-Nmt-TE domains.
- aztAG, downstream of the PKS, contains module 6 of the NRPS core with a C (starter) domain, where the PKS chain gets loaded in a typical AZT biosynthesis. Unusually, annotation of module 6 shows a missing A domain. To rule out sequencing errors, specific primers were redesigned for the region and the amplicon was submitted for Sanger sequencing, which confirmed that although the A domain is present, it contains a significant deletion in the middle.
- the presence of E domains (epimerization) in modules 1 and 3 predicts that these substrates may be epimerized into D-amino acids in the final structure.
- the peptide core of the compound was predicted to be ‘1: unknown (mod6)-2: piz (mod1)-3: nOHval (mod2)-4: piz (mod3)-5: 3oh-3mepro (mod4)-6: nOHval (mod5)’, with weaker predictions for positions 3 and 6, which indicates that other kinds of substrates may be incorporated.
- the PKS core is composed of four modules typical of the AZT BGCs.
- the aztAD gene contains the first 3 modules: module 1 contains KS-AT-ACP domains, followed by module 2 containing KS-AT-DH-KR-ACP domains, and module 3 with KS-AT-DH-KR-ACP domains.
- aztAE contains module 4 with KS-AT-ACP domains. Distinct from typical AZT BGCs, module 3 of the AZT039 PKS lacks an ER domain that corresponds to the saturated THP ring in the PKS tail.
- genes predicted to be involved in the biosynthesis or transformation of precursor amino acids are also identified in the AZT039 BGC.
- azt (Z, AA, AB), homologous to ply (Q, R, S) in polyoxypeptin biosynthesis, are proposed to be involved in β-OH-leucine formation.
- azt AQ and CO are involved in the conversion of ornithine to piperazic acid.
- Azt (K, L, M) homologous to ply (C,D,E) are involved in the formation of hydroxamate containing residues.
- AZT039 also contains a set of 3 genes azt (CI, CJ, CK) related to the synthesis of hydroxyphenylglycine, which has not been reported in the peptide core of NRPS.
- a cis-proline hydroxylase (aztX) is also present.
- Other biosynthetic genes identified in the gene cluster include an LmBU regulator, an MbtH (aztl), a pair of ABC transporters, and a pair of P-type ATPase heavy metal translocating transporters.
- Post-assembly modifying genes are also present, including a CYP450 (aztAR) that may be responsible for hydroxylation on piperazic acid residues, and an O-methyltransferase.
- beta-OH-leucine was proposed to be derived from L-leucine via hydroxylation by a cytochrome P450 enzyme (aztAA).
- the incorporation of D10-labeled β-OH-leucine in the compound core was detected as a shift of +8 Da in the MS spectra, consistent with the loss of 1 H upon hydroxylation at the beta position, and a possible exchange of the acidic alpha proton during NRPS assembly.
- Piperazic acid residues are biosynthesized from L-ornithine by the action of two enzymes, ktzT and ktzI, as demonstrated in kutzneride biosynthesis.
- FIGS. 2A-2C show non-production of compounds of Formula (I), Formula (10), or Formula (11) in wild type strains.
- Colonies were grown at 30 °C overnight under apramycin (50 µg/mL) and trimethoprim (10 µg/mL) selection. Colonies were picked into 5 mL of LB broth and grown overnight at 30 °C under the same antibiotic selection. Overnight cultures were screened for the presence of the BGC using 3 primer sets spanning the gene cluster, as well as a primer set designed for the junction of the backbone plasmid and the BGC. For conjugation, 200 µL of overnight-grown E. coli S17 cells containing the LT039 BGC was inoculated into 50 mL of LB broth with antibiotics and grown at 37 °C to an OD600 of 0.6-0.9.
- Solvent A: 0.01% FA in water; solvent B: 0.01% FA in acetonitrile.
- Samples were monitored with UV diode array, ELSD, and Single Quad ESI MS in positive and negative mode.
- MS chromatograms of cultures with and without added D10-leucine were compared and scanned for peaks in the molecular weight range of 700-1200 Da having a shift of +8 Da in the presence of D10-leucine.
- Example 4 Large scale production and isolation of target compounds [0363] Large scale production. 0.5 mL of 2-3-day-old SA-LT039 seed culture prepared as described above was inoculated into 50 mL of R5A media in 250 mL baffled flasks. A total of 5 L of cultures was grown in batches at 28 °C and 220 rpm for 7 days. At day 7, the cultures (mixed mycelia and broth) were extracted twice with equal volumes of 1:1 IPA:chloroform. The extracts were dried under vacuum to yield the crude material (10 grams).
- Example 5 Spectroscopic characterization of AZT039 molecules and 2D structure [0365] 2D NMR and HRMSMS characterization of the compound of Formula (10).
- the compound of Formula (10) was isolated as an amorphous white powder with a molecular formula of C47H68N8O14 (Calc. MW 969.1030 g/mol) determined by high-resolution ESI-Q-TOF mass spectrometry. In positive mode and negative mode, the molecular ion peaks appeared at m/z 969.5037 (M+H)+, 991.4740 (M+Na)+, and 967.4845 (M-H)-.
- the structure was established by interpretation of 1D and 2D NMR data in CDCl3 (FIG. 3).
- the 1H NMR spectrum showed 2 downfield protons at 9.95 and 7.87 ppm, 1 amide doublet proton, 11 protons between 4 and 6 ppm, 4 aromatic protons between 6.5 and 7.5 ppm with a p-substitution pattern, and 3 olefinic protons between 6 and 7.5 ppm.
- the 13C spectrum displayed 7 amide or ester carbons, one ketone carbonyl carbon, 4 oxygen-bearing carbons, 9 carbons attached to nitrogen, and 14 aliphatic carbons.
- the 2D structure of the compound of Formula (10) was proposed to be a novel cyclic hexadepsipeptide consisting of N-hydroxyleucine, 3-hydroxyproline, 5-hydroxypiperazic acid (γ-OH-piperazic acid, piz2), N-hydroxy-p-methoxy-phenylglycine, piperazic acid (piz1), 3-hydroxyleucine (β-hydroxyleucine), and a polyketide side chain lacking the canonical pyran ring (FIG. 4A).
- the amino acid sequence and PKS chain were determined by key HMBC, COSY and TOCSY correlations.
- the presence of β-hydroxyleucine was established by the COSY correlations between the amide NH (δ 7.37 ppm) and the α proton, between the α and β protons (δ 4.94 and 5.40 ppm), and HMBC correlations from the β proton to the isopropyl moiety.
- the N-hydroxyleucine was assigned based on the COSY spin system from the α and β protons to the two methyl groups (δ 0.91 and 0.97 ppm) of the isopropyl moiety, the lack of NH correlation to the α proton, and a weak NOESY cross peak between a hydroxamate proton (δ 7.87 ppm) and a methyl group (δ 0.97 ppm).
- HMBC showed correlations from the two methyl groups to the methine carbon (γ carbon, δ 24.8 ppm), γ proton (δ 1.70 ppm) to β carbon (δ 37.5 ppm), and α proton (δ 5.45 ppm) to β carbon.
- the 5-hydroxypiperazic acid spin system was assigned based on 1H,1H-COSY and TOCSY correlations, with an amide NH with a chemical shift of 4.28 ppm, and a characteristic chemical shift of a proton attached to an oxygenated carbon (δ 3.51 ppm).
- the HMBC showed correlation between the α proton (δ 5.51 ppm) and an oxygen-bearing carbon (γ carbon) at 59 ppm.
- the second piperazic acid unit was also determined based on the 1H,1H-COSY and TOCSY spin system.
- the amino acid residue 3-hydroxyproline showed a 1H,1H-COSY cross peak between the α proton (δ 5.19 ppm) and β proton (δ 4.6 ppm) which is not present in the traditionally reported 3-hydroxy-3-methylproline residue.
- the HMBC data confirmed the 3-hydroxyproline backbone with the key correlations from the α proton to the oxygen-bearing β carbon (δ 72.9 ppm), and the following γ and δ carbons with chemical shifts at 32.2 and 46.1 ppm.
- the hydroxyl group showed 1H,1H-COSY/TOCSY correlations with the α, γ, and δ protons (δ 5.19, 2.21, and diastereotopic 3.24/4.83 ppm).
- the reaction mixture was heated at 80 °C for 3 min and further quenched with 50 µL of 2 N HCl.
- a volume of 300 µL of 50% v/v acetonitrile in LC/MS water was added to the solution.
- the L-FDLA mixtures were analyzed by a standard LC/MS method, and the amino acid configuration was determined based on retention time and MS comparison against the respective amino acid standards.
- the hydroxamate groups in the depsipeptide were reduced to their -NH- form with TiCl3/THF before hydrolysis and Marfey’s reaction.
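The high-resolution mass values reported above for the compound of Formula (10) can be cross-checked against its molecular formula, C47H68N8O14. The following is a minimal sketch, not part of the disclosed method: the function names are hypothetical, and the monoisotopic masses are standard reference values entered here as constants.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes; reference constants,
# not values taken from the patent.
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503207,
    "N": 14.0030740048,
    "O": 15.99491461956,
    "Na": 22.9897692809,
}
ELECTRON = 0.00054857990946  # electron mass (u)


def parse_formula(formula):
    """Turn a simple formula such as 'C47H68N8O14' into {element: count}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def monoisotopic_mass(formula):
    """Neutral monoisotopic mass of the formula."""
    return sum(MONOISOTOPIC[el] * n for el, n in parse_formula(formula).items())


def adduct_mz(formula):
    """m/z of the three singly charged ions reported in the text."""
    m = monoisotopic_mass(formula)
    return {
        "[M+H]+": m + MONOISOTOPIC["H"] - ELECTRON,
        "[M+Na]+": m + MONOISOTOPIC["Na"] - ELECTRON,
        "[M-H]-": m - MONOISOTOPIC["H"] + ELECTRON,
    }


def ppm_error(observed, calculated):
    """Signed mass error in parts per million."""
    return (observed - calculated) / calculated * 1e6


if __name__ == "__main__":
    reported = {"[M+H]+": 969.5037, "[M+Na]+": 991.4740, "[M-H]-": 967.4845}
    for ion, calc in adduct_mz("C47H68N8O14").items():
        print(f"{ion}: calc {calc:.4f}, reported {reported[ion]:.4f}, "
              f"error {ppm_error(reported[ion], calc):+.1f} ppm")
```

Under these assumptions the calculated sodium adduct is approximately 991.4747, close to the reported 991.4740.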
Abstract
The present disclosure provides compounds of Formula (I) or a pharmaceutically acceptable salt, solvate, or tautomer thereof, and uses of the same in treating a disease or disorder (e.g., cancer or fibrosis).
Description
HEXADEPSIPEPTIDE COMPOUNDS AND METHODS OF USING THE SAME
CROSS-REFERENCE TO RELATED APPLICATIONS
[001] The present application claims priority to, and the benefit of, U.S. Provisional Application No. 63/276,322, filed November 5, 2021, the contents of which are incorporated herein by reference in their entirety.
INCORPORATION BY REFERENCE OF SEQUENCE LISTING
[002] The present application is being filed with a Sequence Listing in electronic format. The Sequence Listing is provided as a file entitled “ZYM-018PC_107586-5018_Sequence Listing_ST26”, created on November 3, 2022, and is 318,525 bytes in size. The information in electronic format of the Sequence Listing is incorporated by reference in its entirety.
BACKGROUND
[003] Many examples of semisynthetic derivatives of natural product therapeutics are derived and inspired from cyclic peptides, such as cyclosporine, daptomycin, romidepsin, aplidine, atosiban, caspofungin, telavancin, linaclotide, and pasireotide. These structurally diverse molecules are the active pharmaceutical ingredients of prescribed drugs to treat a range of indications across many therapeutic applications such as immunosuppressants, antibacterials, antifungals, oncology, premature birth, IBS, and Cushing's syndrome. One minimally studied natural product cyclic peptide is verucopeptin, a cyclic hexadepsipeptide which has demonstrated antiproliferative activity with potential applications to oncology. Over the last 10-20 years, the search for new biologically active, bacterially produced natural products using the classical culturing approach has frequently led to the re-discovery of the same metabolites. Recently, methodology has been revolutionized to develop biologically active congeners that arise from evolutionarily related biosynthetic gene clusters. These congeners are structurally related to natural products. These close analogs typically engage the same therapeutic target, yet they often exhibit differences in biological activity, including differences in potency and therapeutic index to general cytotoxicity.
[004] There is an unmet medical need for the discovery of new agents to treat various conditions, including cancer and fibrosis. The present disclosure addresses this unmet need in the art.
SUMMARY
[005] In some aspects, the present disclosure provides, inter alia, a compound of Formula (I), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
wherein in Formula (I),
R is selected from hydrogen and -OH.
[006] In some aspects, the present disclosure provides, inter alia, a compound of Formula (10), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
Formula (10).
[007] In some aspects, the present disclosure provides, inter alia, a compound of Formula (11), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
Formula (11).
[008] In some embodiments, the compound of Formula (I) is substantially pure. In some embodiments, the compound of Formula (I) is enantiomerically pure.
[009] In some embodiments, the compound of Formula (10) is substantially pure. In some embodiments, the compound of Formula (10) is enantiomerically pure.
[010] In some embodiments, the compound of Formula (11) is substantially pure. In some embodiments, the compound of Formula (11) is enantiomerically pure.
[011] In some embodiments, the disclosure provides for pharmaceutical compositions comprising a therapeutically effective amount of the compound of Formula (I) and one or more pharmaceutically acceptable excipients. In some embodiments, the pharmaceutical composition comprises a pharmaceutical carrier.
[012] In some embodiments, the disclosure provides for pharmaceutical compositions comprising a therapeutically effective amount of the compound of Formula (10) and one or more pharmaceutically acceptable excipients. In some embodiments, the pharmaceutical composition comprises a pharmaceutical carrier.
[013] In some embodiments, the disclosure provides for pharmaceutical compositions comprising a therapeutically effective amount of the compound of Formula (11) and one or more pharmaceutically acceptable excipients. In some embodiments, the pharmaceutical composition comprises a pharmaceutical carrier.
[014] In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is produced by a host cell comprising a heterologous biosynthetic gene cluster comprising at least six nonribosomal peptide synthetase (NRPS) modules and at least four polyketide synthase (PKS) modules, a set of modifying enzymes, precursor biosynthesis enzymes, transporters, and one or more transcriptional regulators.
[015] In some embodiments, the biosynthetic gene cluster is isolated or derived from Streptomyces sp. In some embodiments, the biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
[016] In some embodiments, the biosynthetic gene cluster comprises a sequence of SEQ ID NO:1.
[017] In some embodiments, the biosynthetic gene cluster comprises one or more modifications of SEQ ID NO: 1 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
[018] In some embodiments, the modification comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1.
[019] In some embodiments, the modification comprises insertion of at least one promoter sequence.
[020] In some embodiments, the promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
[021] In some embodiments, the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
[022] In some embodiments, the biosynthetic gene cluster comprises SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
[023] In some embodiments, the modification increases synthesis of the compound of any one of Formula (I), Formula (10), and Formula (11) compared to an otherwise equivalent host cell comprising an unmodified biosynthetic gene cluster.
[024] In some embodiments, the host cell is a Streptomyces cell. In some embodiments, the host cell is a Streptomyces albus cell.
[025] In some embodiments, the host cell further comprises a sequence LmBU operably linked to a constitutive promoter.
[026] In some embodiments, the present disclosure provides a polynucleotide comprising a biosynthetic gene cluster, wherein the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of the compound of Formula (I), Formula (10), and/or Formula (11) when the biosynthetic gene cluster is expressed by a host cell.
[027] In some embodiments, the one or more genes comprise six nonribosomal peptide synthetase (NRPS) modules.
[028] In some embodiments, the six NRPS modules are encoded by sequences comprising a first NRPS open reading frame of SEQ ID NO: 2, a second NRPS open reading frame of SEQ ID NO: 3, a third NRPS open reading frame of SEQ ID NO: 4 and a fourth NRPS open reading frame of SEQ ID NO: 5, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[029] In some embodiments, the one or more genes comprise four polyketide synthase (PKS) modules.
[030] In some embodiments, the four PKS modules are encoded by polynucleotide sequences comprising a first PKS open reading frame of SEQ ID NO: 6 and a second PKS open reading frame of SEQ ID NO: 7, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[031] In some embodiments, the biosynthetic gene complex comprises a LmBU-encoding gene.
[032] In some embodiments, the LmBU-encoding gene comprises a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[033] In some embodiments, the biosynthetic gene cluster comprises a polynucleotide sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
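The percent identity thresholds recited above can be illustrated with a small calculation. This is a minimal sketch under stated assumptions: the two sequences are already aligned to equal length with '-' marking gaps, identity is counted as matching columns divided by all alignment columns, and the function names are hypothetical. The disclosure may define identity differently elsewhere, and real comparisons against SEQ ID NO: 1 would use a proper alignment tool.

```python
# Identity thresholds recited in the paragraph above.
THRESHOLDS = (80, 85, 90, 95, 97, 99)


def percent_identity(aligned_a, aligned_b):
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Columns where both sequences have a gap are ignored; a column where
    only one sequence has a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def thresholds_met(identity):
    """Return the recited thresholds that a given identity satisfies."""
    return [t for t in THRESHOLDS if identity >= t]


if __name__ == "__main__":
    # Toy sequences for illustration only; not taken from the Sequence Listing.
    reference = "ACGTACGTAC"
    variant = "ACGTACGTAA"
    identity = percent_identity(reference, variant)
    print(identity, thresholds_met(identity))
```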
[034] In some embodiments, the host cell is engineered to express the one or more genes in the biosynthetic cluster, which results in the production of the compound of any one of Formula (I), Formula (10), and Formula (11).
[035] In some embodiments, overexpression of one or more genes in the biosynthetic cluster by the host cell increases the production of the compound of any one of Formula (I), Formula (10), and Formula (11) compared to an otherwise equivalent host cell comprising a biosynthetic gene cluster that does not overexpress one or more genes in the biosynthetic cluster.
[036] In some embodiments, the LmBU is overexpressed.
[037] In some embodiments, overexpression of the LmBU occurs in cis or in trans.
[038] In some embodiments, trans overexpression of LmBU comprises expressing a sequence encoding the LmBU open reading frame under the control of a constitutive ermE promoter, a kasO promoter, or a functional variant or derivative thereof.
[039] In some embodiments, the ermE promoter comprises a sequence of SEQ ID NO: 9, and the kasO promoter comprises a sequence of SEQ ID NO: 10.
[040] In some embodiments, the biosynthetic gene cluster comprises one or more sequence modifications relative to a biosynthetic gene cluster of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[041] In some embodiments, the one or more modifications of the biosynthetic gene cluster comprise a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
[042] In some embodiments, the one or more modifications comprise modifications of a promoter of a gene in the biosynthetic gene cluster.
[043] In some embodiments, the one or more modifications comprise insertion of at least one heterologous promoter in the biosynthetic gene cluster.
[044] In some embodiments, the at least one heterologous promoter is a strong promoter.
[045] In some embodiments, the at least one heterologous promoter is selected from the group consisting of ermE and kasO, or functional variants or derivatives thereof.
[046] In some embodiments, the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
[047] In some embodiments, inserting the at least one heterologous promoter into the biosynthetic gene cluster comprises a nucleic acid guided endonuclease.
[048] In some embodiments, the nucleic acid guided endonuclease is in a complex with at least one guide nucleic acid (gNA).
[049] In some embodiments, the nucleic acid guided endonuclease is a CRISPR/Cas endonuclease.
[050] In some embodiments, the CRISPR/Cas endonuclease is Cas9.
[051] In some embodiments, inserting the at least one heterologous promoter into the biosynthetic gene cluster further comprises a donor template comprising a sequence of the heterologous promoter.
[052] In some embodiments, the biosynthetic gene cluster comprises an mbtH gene upstream of the four NRPS open reading frames, and wherein the at least one heterologous promoter is inserted upstream of the mbtH gene.
[053] In some embodiments, the at least one heterologous promoter is one or more of an ermE promoter and kasO promoter.
[054] In some embodiments, the biosynthetic gene cluster comprises a polynucleotide sequence of SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
[055] In some embodiments, the at least one modification of the biosynthetic gene cluster comprises a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
[056] In some embodiments, at least one modification of the biosynthetic gene cluster comprises replacement of at least one promoter in comparison to the biosynthetic gene cluster of SEQ ID NO: 1.
[057] In some embodiments, the biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
[058] In some embodiments, the biosynthetic gene cluster produces the compound of any one of Formula (I), Formula (10), and Formula (11) in the host cell.
[059] In some embodiments, the present disclosure provides a vector comprising the polynucleotide as described herein.
[060] In some embodiments, the vector is a bacterial artificial chromosomal vector.
[061] In some embodiments, the vector further comprises at least one promoter.
[062] In some embodiments, the vector is suitable for expression in a Streptomyces species cell.
[063] In some embodiments, the present disclosure provides a host cell comprising the polynucleotide as described herein or the vector as described herein.
[064] In some embodiments, the present disclosure provides a host cell, comprising the polynucleotide as described herein.
[065] In some embodiments, the host cell further comprises a polynucleotide comprising a sequence encoding a LmBU operably linked to a constitutive promoter.
[066] In some embodiments, the constitutive promoter is one or more of an ermE promoter and a kasO promoter.
[067] In some embodiments, the LmBU is encoded by a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[068] In some embodiments, the host cell is a Streptomyces cell.
[069] In some embodiments, the Streptomyces cell is a Streptomyces lividans or Streptomyces albus cell.
[070] In one aspect, the present disclosure provides a method of making a polynucleotide comprising a modified biosynthetic gene cluster comprising: a. providing a first E. coli host cell comprising a first vector comprising a sequence of an unmodified biosynthetic gene cluster comprising a target sequence; b. introducing the first vector into a Streptomyces host cell by conjugation; c. providing a second E. coli host cell comprising a second vector comprising: i. a sequence of at least one gNA specific to the target sequence operably linked to a promoter, ii. a sequence encoding a Cas endonuclease; and iii. a sequence encoding a donor template; and d. introducing the second vector into a Streptomyces host cell by conjugation; whereby introducing the second vector into the Streptomyces host cell produces a double strand break in the target sequence and introduction of a donor template sequence, thereby generating a Streptomyces host cell comprising a modified biosynthetic gene cluster.
[071] In some embodiments, the biosynthetic gene cluster is an unmodified biosynthetic gene cluster. In some embodiments, the unmodified biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1. In some embodiments, the polynucleotide sequence of the modified biosynthetic gene cluster comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[072] In some embodiments, the Cas endonuclease is selected from Cas9 (also known as Csn1 and Csx12), Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, variants thereof, mutants thereof, and derivatives thereof.
[073] In some embodiments, the endonuclease is a Cas9 endonuclease.
[074] In some embodiments, the unmodified biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[075] In some embodiments, the donor template comprises, from 5’ to 3’, a sequence homologous to a sequence 5’ of the target sequence, a sequence of a promoter, and sequence homologous to a sequence 3’ of the target sequence.
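The 5' to 3' layout of the donor template described above (upstream homology arm, promoter, downstream homology arm) can be sketched as a simple string assembly. This is an illustrative sketch only: the function names, the arm length, and the toy sequences are hypothetical, the target sequence is simplified to a single insertion point, and the actual promoter sequences (SEQ ID NO: 9 and SEQ ID NO: 10) are not reproduced here.

```python
DNA_ALPHABET = set("ACGT")


def homology_arms(cluster, insertion_index, arm_length):
    """Return the sequences immediately 5' and 3' of a 0-based insertion point."""
    if arm_length <= 0:
        raise ValueError("arm_length must be positive")
    if not arm_length <= insertion_index <= len(cluster) - arm_length:
        raise ValueError("insertion point is too close to an end of the cluster")
    upstream = cluster[insertion_index - arm_length:insertion_index]
    downstream = cluster[insertion_index:insertion_index + arm_length]
    return upstream, downstream


def build_donor_template(upstream_arm, promoter, downstream_arm):
    """Concatenate 5' arm, promoter, and 3' arm, in that order."""
    for name, seq in (("upstream_arm", upstream_arm),
                      ("promoter", promoter),
                      ("downstream_arm", downstream_arm)):
        if not seq or set(seq.upper()) - DNA_ALPHABET:
            raise ValueError(f"{name} must be a non-empty A/C/G/T sequence")
    return upstream_arm + promoter + downstream_arm


if __name__ == "__main__":
    # Toy sequences for illustration only.
    cluster = "AAAACCCCGGGGTTTT"
    promoter = "TATA"
    up, down = homology_arms(cluster, insertion_index=8, arm_length=4)
    donor = build_donor_template(up, promoter, down)
    # After homology-directed repair, the edited cluster carries the promoter
    # at the insertion point, so the donor sequence appears within it.
    edited = cluster[:8] + promoter + cluster[8:]
    print(donor, donor in edited)
```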
[076] In some embodiments, the promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
[077] In one aspect, the present disclosure provides a method of making the compound of Formula (I), comprising a. introducing into a host cell a polynucleotide of the present disclosure or a vector of the present disclosure; b. culturing the host cell under conditions sufficient for the synthesis of the compound of Formula (I) by the biosynthetic gene cluster; and c. isolating and purifying the compound of Formula (I).
[078] In some embodiments, the host cell is an Actinobacterial cell or a Streptomyces cell.
[079] In some embodiments, the Streptomyces cell is a Streptomyces albus or Streptomyces lividans cell.
[080] In some embodiments, the host cell comprises a sequence encoding a LmBU operably linked to a constitutive promoter.
[081] In some embodiments, the polynucleotide or vector is introduced into the host cell by conjugation with an E. coli comprising the polynucleotide or vector.
[082] In some embodiments, the compound of Formula (I) is a compound of Formula (10), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
[083] In some embodiments, the compound of Formula (I) is a compound of Formula (11), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
[084] In some embodiments, the present disclosure provides a pharmaceutical composition, comprising a compound of Formula (I), and a pharmaceutically acceptable excipient.
[085] In some embodiments, the present disclosure provides a pharmaceutical composition, comprising a compound of Formula (10), and a pharmaceutically acceptable excipient.
[086] In some embodiments, the present disclosure provides a pharmaceutical composition, comprising a compound of Formula (11), and a pharmaceutically acceptable excipient.
[087] In some embodiments, the present disclosure provides a method of treating a disease or disorder in a subject, comprising administering a compound of Formula (I) or pharmaceutical composition thereof.
[088] In some embodiments, the present disclosure provides a compound of Formula (I) or the pharmaceutical composition thereof, for use in treating a disease or disorder in a subject.
[089] In some embodiments, the present disclosure provides a compound of Formula (I) for use in the manufacture of a medicament for treating a disease or disorder in a subject.
[090] In some embodiments, the present disclosure provides the use of a compound of Formula (I) or the pharmaceutical composition thereof, for the treatment of a disease or disorder.
[091] In some embodiments, the present disclosure provides a compound of Formula (10) or the pharmaceutical composition thereof, for use in treating a disease or disorder in a subject.
[092] In some embodiments, the present disclosure provides a compound of Formula (10) for use in the manufacture of a medicament for treating a disease or disorder in a subject.
[093] In some embodiments, the present disclosure provides the use of a compound of Formula (10) or the pharmaceutical composition thereof, for the treatment of a disease or disorder.
[094] In some embodiments, the present disclosure provides a compound of Formula (11) or the pharmaceutical composition thereof, for use in treating a disease or disorder in a subject.
[095] In some embodiments, the present disclosure provides a compound of Formula (11) for use in the manufacture of a medicament for treating a disease or disorder in a subject.
[096] In some embodiments, the present disclosure provides the use of a compound of Formula (11) or the pharmaceutical composition thereof, for the treatment of a disease or disorder.
[097] In some embodiments, the disease or disorder is cancer.
[098] In some embodiments, the disease or disorder is fibrosis.
[099] In some embodiments, the subject is human.
[0100] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure
belongs. In the specification, the singular forms also include the plural unless the context clearly dictates otherwise. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, suitable methods and materials are described below. All publications, patent applications, patents and other references mentioned herein are incorporated by reference. The references cited herein are not admitted to be prior art to the claimed invention. In the case of conflict, the present specification, including definitions, will control. In addition, the materials, methods and examples are illustrative only and are not intended to be limiting. In the case of conflict between the chemical structures and names of the compounds disclosed herein, the chemical structures will control.
[0101] Other features and advantages of the disclosure will be apparent from the following detailed description and claims.
BRIEF DESCRIPTION OF THE DRAWINGS
[0102] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
[0103] FIG. 1 depicts the proposed biosynthesis of a compound of Formula (I) (Formula (10)) from the AZT039 biosynthetic gene cluster (BGC) based on BGC analysis and 2D structure.
[0104] FIGS. 2A-2C depict the target identification of AZT039 compounds. FIG. 2A depicts LCMS selected-ion chromatograms of target peaks in SA-LT039 vs SA-pdualP expression. FIG. 2B depicts an isotopic labeling experiment showing mass shifts upon addition of labeled amino acid precursors. Stable isotope labeling of target peaks shows modified D10-leucine incorporation. FIG. 2C depicts LCMS selected-ion chromatograms for LT003-0026 and LT003-0027 in the WT strain under different media conditions, showing lack of expression.
[0105] FIG. 3A depicts the 2D NMR structure of a compound of Formula (I) (Formula (10)) from 2D NMR correlations.
[0106] FIG. 3B depicts the 2D NMR structure of a compound of Formula (I) (Formula (11)) from 2D NMR correlations.
[0107] FIG. 4A depicts the structure of a compound of Formula (I) (Formula (10)). The stereocenter at position 5 (piperazic acid 2) was determined by NOE.
[0108] FIG. 4B depicts the structure of a compound of Formula (I) (Formula (11)).
DETAILED DESCRIPTION
[0109] The present disclosure relates to a compound of Formula (I), Formula (10), and Formula (11), and to the use of a compound of Formula (I), Formula (10), and Formula (11) in the treatment of diseases or disorders, such as cancer or fibrosis. In some embodiments, the disclosure relates to the biosynthesis of the compound of Formula (I). In some embodiments, the disclosure relates to the biosynthesis of the compound of Formula (10). In some embodiments, the disclosure relates to the biosynthesis of the compound of Formula (11).
Definitions
[0110] Unless otherwise stated, the following terms used in the specification and claims have the following meanings set out below.
[0111] In some embodiments, “the compound of Formula (I),” “a compound of Formula (I),” “the compound of Formula (10),” “a compound of Formula (10),” “the compound of Formula (11),” and “a compound of Formula (11)” include all stereoisomers, mixtures of stereoisomers, pharmaceutically acceptable salts, solvates, or tautomers thereof.
[0112] As used herein, the expressions “one or more of A, B, or C,” “one or more A, B, or C,” “one or more of A, B, and C,” “one or more A, B, and C,” “selected from the group consisting of A, B, and C”, “selected from A, B, and C”, and the like are used interchangeably and all refer to a selection from a group consisting of A, B, and/or C, i.e., one or more As, one or more Bs, one or more Cs, or any combination thereof, unless indicated otherwise.
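As an illustration only, the selection language of the preceding paragraph can be read as covering every non-empty combination of the listed members. The short Python sketch below enumerates those combinations for three placeholder members. The function name is hypothetical, and the sketch ignores multiplicity (e.g., several As of the same kind).

```python
from itertools import combinations


def nonempty_selections(members):
    """Return every non-empty combination of the listed members."""
    result = []
    for size in range(1, len(members) + 1):
        result.extend(combinations(members, size))
    return result


selections = nonempty_selections(["A", "B", "C"])
print(len(selections))  # 7: three singles, three pairs, one triple
```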
[0113] As used herein, “natural product” refers to a compound that is synthesized by a living organism (e.g., bacteria) under normal physiological conditions and that can be quantified, and identified as a pathway-specific product, using techniques known in the art. If one skilled in the art cannot quantify a compound in, e.g., extracts of native bacterial cells containing a biosynthetic gene cluster after culturing said cells with a growth medium containing the nutrients required to produce a compound, one skilled in the art would reasonably believe that the bacterial cells do not naturally produce the compound (i.e., the compound is not a “natural product”). For the avoidance of doubt, the compounds disclosed herein are not “natural products,” because Applicant was not able to quantify the compound of Formula (I), Formula (10), or Formula (11) in the native bacterial cells containing the biosynthetic gene cluster. For example, as described in Example 3 and shown in FIGS. 2A-2C, Applicant was not able to quantify the compound of Formula (I), Formula (10), or Formula (11) in extracts of AZT039 despite culturing said cells under appropriate conditions.
[0114] As used herein, “module” refers to a set of active site domains of a protein that catalyze one or more of the biosynthetic steps leading to the compound of Formula (I), Formula (10), or Formula (11). Thus, each module may be composed of a protein, or a module may be composed of a plurality of domains. In some embodiments, heterologous protein domains may be fused together to form a module. In some embodiments, an open reading frame may be polycistronic, and encode a plurality of distinct proteins, each of which comprises one or more domains or modules. In some embodiments, an open reading frame may encode a single protein, which has a plurality of modules, each of which comprises a combination of domains.
[0115] It is to be understood that, throughout the description, where compositions are described as having, including, or comprising specific components, it is contemplated that compositions also consist essentially of, or consist of, the recited components. Similarly, where methods or processes are described as having, including, or comprising specific process steps, the processes also consist essentially of, or consist of, the recited processing steps. Further, it should be understood that the order of steps or order for performing certain actions is immaterial so long as the invention remains operable. Moreover, two or more steps or actions can be conducted simultaneously.
[0116] It is to be understood that, for the compounds of the present disclosure being capable of further forming salts, all of these forms are also contemplated within the scope of the claimed disclosure.
[0117] As used herein, the term “pharmaceutically acceptable salts” refers to derivatives of the compounds of the present disclosure wherein the parent compound is modified by making acid or base salts thereof. Examples of pharmaceutically acceptable salts include, but are not limited to, mineral or organic acid salts of basic residues such as amines, alkali or organic salts of acidic residues such as carboxylic acids, and the like. The pharmaceutically acceptable salts include the conventional non-toxic salts or the quaternary ammonium salts of the parent compound formed, for example, from non-toxic inorganic or organic acids. For example, such conventional non-toxic salts include, but are not limited to, those derived from inorganic and organic acids selected from 2-acetoxybenzoic, 2-hydroxyethane sulfonic, acetic, ascorbic, benzene sulfonic, benzoic, bicarbonic, carbonic, citric, edetic, ethane disulfonic, 1,2-ethane sulfonic, fumaric, glucoheptonic, gluconic, glutamic, glycolic, glycollyarsanilic, hexylresorcinic, hydrabamic, hydrobromic, hydrochloric, hydroiodic, hydroxymaleic, hydroxynaphthoic, isethionic, lactic, lactobionic, lauryl sulfonic, maleic, malic, mandelic, methane sulfonic, napsylic, nitric, oxalic, pamoic, pantothenic, phenylacetic, phosphoric, polygalacturonic, propionic, salicylic, stearic, subacetic, succinic, sulfamic, sulfanilic, sulfuric, tannic, tartaric, toluene sulfonic, and the commonly occurring amino acids, e.g., glycine, alanine, phenylalanine, arginine, etc.
[0118] In some embodiments, the pharmaceutically acceptable salt is a sodium salt, a potassium salt, a calcium salt, a magnesium salt, a diethylamine salt, a choline salt, a meglumine salt, a benzathine salt, a tromethamine salt, an ammonia salt, an arginine salt, or a lysine salt.
[0119] Other examples of pharmaceutically acceptable salts include salts formed with hexanoic acid, cyclopentane propionic acid, pyruvic acid, malonic acid, 3-(4-hydroxybenzoyl)benzoic acid, cinnamic acid, 4-chlorobenzenesulfonic acid, 2-naphthalenesulfonic acid, 4-toluenesulfonic acid, camphorsulfonic acid, 4-methylbicyclo-[2.2.2]-oct-2-ene-1-carboxylic acid, 3-phenylpropionic acid, trimethylacetic acid, tertiary butylacetic acid, muconic acid, and the like. The present disclosure also encompasses salts formed when an acidic proton present in the parent compound either is replaced by a metal ion, e.g., an alkali metal ion, an alkaline earth ion, or an aluminum ion; or coordinates with an organic base such as ethanolamine, diethanolamine, triethanolamine, tromethamine, N-methylglucamine, and the like. In the salt form, it is understood that the ratio of the compound to the cation or anion of the salt can be 1:1, or any ratio other than 1:1, e.g., 3:1, 2:1, 1:2, or 1:3.
[0120] It is to be understood that all references to pharmaceutically acceptable salts include solvent addition forms (solvates) or crystal forms (polymorphs) as defined herein, of the same salt.
[0121] As used herein, the term “treating” or “treat” describes the management and care of a patient for the purpose of combating a disease, condition, or disorder and includes the administration of the compound of Formula (I), Formula (10), or Formula (11) to alleviate the symptoms or complications of a disease, condition or disorder, to eliminate the disease, condition or disorder, or to prevent the disease, condition or disorder. The term “treat” can also include treatment of a cell in vitro or an animal model. It is to be appreciated that references to “treating” or “treatment” include the alleviation of established symptoms of a condition. “Treating” or “treatment” of a state, disorder or condition therefore includes: (1) preventing or delaying the appearance of clinical symptoms of the state, disorder or condition
developing in a human that may be afflicted with or predisposed to the state, disorder or condition but does not yet experience or display clinical or subclinical symptoms of the state, disorder or condition, (2) inhibiting the state, disorder or condition, i.e., arresting, reducing or delaying the development of the disease or a relapse thereof (in case of maintenance treatment) or at least one clinical or subclinical symptom thereof, or (3) relieving or attenuating the disease, i.e., causing regression of the state, disorder or condition or at least one of its clinical or subclinical symptoms.
[0122] As used herein, the term “pharmaceutically acceptable” refers to those compounds, anions, cations, materials, compositions, carriers, and/or dosage forms which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of human beings and animals without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio.
[0123] As used herein, the term “pharmaceutically acceptable excipient” means an excipient that is useful in preparing a pharmaceutical composition that is generally safe, non-toxic and neither biologically nor otherwise undesirable, and includes an excipient that is acceptable for veterinary use as well as human pharmaceutical use. A “pharmaceutically acceptable excipient” as used in the specification and claims includes both one and more than one such excipient.
[0124] As used herein, the term “therapeutically effective amount” refers to an amount of a pharmaceutical agent to treat, ameliorate, or prevent an identified disease or condition, or to exhibit a detectable therapeutic or inhibitory effect. The effect can be detected by any assay method known in the art. The precise effective amount for a subject will depend upon the subject’s body weight, size, and health; the nature and extent of the condition; and the therapeutic or combination of therapeutics selected for administration.
[0125] All percentages and ratios used herein, unless otherwise indicated, are by weight. Other features and advantages of the present disclosure are apparent from the different examples. The provided examples illustrate different components and methodology useful in practicing the present disclosure. The examples do not limit the claimed disclosure. Based on the present disclosure the skilled artisan can identify and employ other components and methodology useful for practicing the present disclosure.
[0126] The terms “polynucleotide” and “nucleic acid” are used interchangeably herein and refer to a polymeric form of nucleotides of any length, i.e., ribonucleotides or deoxyribonucleotides or analogs thereof. These terms refer to the primary structure of the molecule and thus encompass double- and single-stranded DNA as well as double- and single-stranded RNA. The term also encompasses modified nucleic acids, such as methylated and/or capped nucleic acids, nucleic acids containing modified bases, backbone modifications, and the like.
[0127] As used herein, the term “gene” refers to any segment of DNA associated with a biological function. Thus, a gene includes, but is not limited to, coding sequences and/or regulatory sequences required for its expression. Genes may also comprise non-expressed DNA segments, e.g. forming recognition sequences for other proteins. Genes can be obtained from a variety of sources, including cloning from a source of interest or synthesis from known or predicted sequence information, and can comprise sequences designed to have desired parameters.
[0128] In some embodiments, the genomic DNA, prior to modification, is isolated from bacteria cells originally found in soil.
[0129] As used herein, the term “homologous” or “homolog” or “ortholog” is known in the art and refers to related sequences that share a common ancestor or family member and are determined based on the degree of sequence identity. The terms “substantially similar” and “substantially corresponding” are used interchangeably herein. The term refers to nucleic acid fragments wherein the difference in one or more nucleotide bases does not affect the ability of the nucleic acid fragment to mediate gene expression or produce a certain phenotype. These terms also refer to modifications of the nucleic acid fragments of the disclosure, such as deletions or insertions of one or more nucleotides that do not substantially alter the functional properties of the resulting nucleic acid fragment relative to the original, unmodified fragment. Thus, as will be understood by those skilled in the art, it is to be understood that the present disclosure encompasses more than the specific exemplary sequences. These terms “homologous” or “homolog” or “ortholog” or “substantially similar” or “substantially corresponding” may describe the relationship between a gene found in one species, subspecies, variety, or strain and the corresponding or equivalent gene in another species, subspecies, variety, or strain.
[0130] As used herein, the terms “endogenous” and “native” refer to naturally occurring copies of a gene or promoter.
[0131] As used herein, the term “naturally occurring” refers to a gene that is derived from a naturally occurring source. In some aspects, a naturally occurring gene refers to a gene that is a wild-type (non-transgenic) gene, whether located in an endogenous environment within its
source organism or placed in a “heterologous” environment when introduced into a different organism. Thus, for purposes of this disclosure, a “non-naturally occurring” gene is one that has been mutated or otherwise modified or synthesized to have a sequence that differs from a known native gene. In some aspects, the modification may be at the protein level (e.g., amino acid substitution). In other aspects, the modification can be at the DNA level without any effect on the protein sequence (e.g., codon optimization).
[0132] For the purposes of this disclosure, homologous sequences are compared. “Homologous sequences” or “homologs” or “orthologs” are thought, believed, or known to be functionally related. The functional relationships may be indicated in any of a number of ways, including but not limited to: (a) the degree of sequence identity and/or (b) the same or similar biological function. Preferably, both (a) and (b) are indicated. Homology can be determined using default parameters using software programs readily available in the art, such as NCBI BLAST (basic local alignment search tool).
[0133] Percentage identity determinations can be performed for nucleic acids using BLASTN or standard nucleotide BLAST using default settings (Match/Mismatch scores 1, -2; Gap costs linear; Expect threshold 10; Word size 28; and Max matches in a query range 0) and for proteins using BLAST using default settings (Expect threshold 10; Word size 3; Max matches in a query range 0; Matrix BLOSUM62; Gap costs Existence 11, Extension 1; and conditional compositional score matrix adjustment).
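For illustration, percent identity between two already-aligned sequences can be sketched as follows. This is a simplified calculation and not the BLAST algorithm itself. The function name, the gap character, and the choice of denominator (the number of alignment columns) are assumptions made for the example, and other conventions for the denominator exist.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # A column counts as a match only if the residues agree and neither is a gap.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment: 8 identical columns out of 10.
example = percent_identity("ACGT-ACGTA", "ACGTTACGAA")
print(example)  # 80.0
```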
[0134] As used herein, the term “nucleotide change” refers to, for example, a nucleotide substitution, deletion, and/or insertion, as is well known in the art. For example, mutations contain alterations that produce silent substitutions, additions or deletions without altering the properties or activity of the encoded protein or the manner in which the protein is made.
[0135] As used herein, the term “heterologous” refers to an amino acid or nucleic acid sequence (e.g., a gene or promoter) that is not naturally occurring in a particular organism or is not naturally occurring in a particular context (e.g., a genomic or plasmid location) in a particular organism. For example, a native promoter or other nucleic acid sequence of Streptomyces albus may be heterologous when operably linked to a nucleic acid sequence to which it is not operably linked in the wild-type Streptomyces albus, or when the native promoter or other nucleic acid sequence is delivered in a non-native form (e.g., as a heterologous plasmid or heterologous nucleic acid sequence).
[0136] As used herein, the term “exogenous” is used interchangeably with the term “heterologous” and refers to material from a source other than its natural source. For
example, the term “exogenous protein” or “exogenous gene” refers to a protein or gene that is derived from a non-natural source or location and that has been artificially supplied to a biological system.
[0137] All publications and patent documents cited herein are incorporated herein by reference as if each such publication or document was specifically and individually indicated to be incorporated herein by reference. Citation of publications and patent documents is not intended as an admission that any is pertinent prior art, nor does it constitute any admission as to the contents or date of the same. The invention having now been described by way of written description, those of skill in the art will recognize that the invention can be practiced in a variety of embodiments and that the foregoing description and examples below are for purposes of illustration and not limitation of the claims that follow.
[0138] As used herein, the phrase “compound of the disclosure” refers to those compounds which are disclosed herein, both generically and specifically.
[0139] The azinothricins (AZTs) are a diverse family of cyclic depsipeptides with potent antimicrobial and anticancer properties. Previous reports in the literature have indicated that the AZTs exert their anticancer properties through modulation of the Wnt/beta-catenin pathway. The general AZT compound scaffold consists of a core hexadepsipeptide with an attached polyketide-derived tail characterized by the presence of a tetrahydropyran ring. Biosynthetically, the AZTs are made via a type I modular non-ribosomal peptide synthetase-polyketide synthase (NRPS-PKS) assembly line.
Compounds of the Present Disclosure
[0140] In some aspects, the present disclosure provides, inter alia, a compound of Formula (I):
or a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
[0141] In some embodiments, the compound of Formula (I) is a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer of Formula (I).
[0142] In some embodiments, the compound of Formula (I) is a stereoisomer of Formula (I).
[0143] In some embodiments, the compound of Formula (I) is a mixture of stereoisomers of Formula (I).
[0144] In some embodiments, the compound of Formula (I) is a tautomer of Formula (I).
[0145] In some aspects, the present disclosure provides, inter alia, a compound of Formula (10):
or a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
[0146] In some embodiments, the compound of Formula (10) is a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer of Formula (10).
[0147] In some embodiments, the compound of Formula (10) is a stereoisomer of Formula (10).
[0148] In some embodiments, the compound of Formula (10) is a mixture of stereoisomers of Formula (10).
[0149] In some embodiments, the compound of Formula (10) is a tautomer of Formula (10).
[0150] In some aspects, the present disclosure provides, inter alia, a compound of Formula (11):
or a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
[0151] In some embodiments, the compound of Formula (11) is a stereoisomer, mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer of Formula (11).
[0152] In some embodiments, the compound of Formula (11) is a stereoisomer of Formula (11).
[0153] In some embodiments, the compound of Formula (11) is a mixture of stereoisomers of Formula (11).
[0154] In some embodiments, the compound of Formula (11) is a tautomer of Formula (11).
[0155] In one aspect, the compounds of the disclosure are cyclic peptides. In some embodiments, the compounds contain a hexadepsipeptide core, which comprises 6 amino acids macrocyclized through an ester bond, and a polyketide tail.
[0156] The compounds of Formula (I) and Formula (10) comprise ten stereocenters. The compounds of Formula (11) comprise nine stereocenters.
[0157] In some embodiments, the compound of Formula (I) is Formula IA, wherein each stereocenter is identified with an *:
[0158] In some embodiments, each * of Formula IA represents a bond which is either (R) or (S).
[0159] In some embodiments, *2 of Formula IA is (R). In some embodiments, *2 of Formula IA is (S).
[0160] In some embodiments, *8 of Formula IA is (R). In some embodiments, *8 of Formula IA is (S).
[0161] In some embodiments, *9 of Formula IA is (R). In some embodiments, *9 of Formula IA is (S).
[0162] In some embodiments, *13 of Formula IA is (R). In some embodiments, *13 of Formula IA is (S).
[0163] In some embodiments, *15 of Formula IA is (R) and R is -OH. In some embodiments, *15 of Formula IA is (S) and R is -OH.
[0164] In some embodiments, *18 of Formula IA is (R). In some embodiments, *18 of Formula IA is (S).
[0165] In some embodiments, *27 of Formula IA is (R). In some embodiments, *27 of Formula IA is (S).
[0166] In some embodiments, *32 of Formula IA is (R). In some embodiments, *32 of Formula IA is (S).
[0167] In some embodiments, *33 of Formula IA is (R). In some embodiments, *33 of Formula IA is (S).
[0168] In some embodiments, *39 of Formula IA is (R). In some embodiments, *39 of Formula IA is (S).
[0169] In some embodiments of the compound of Formula IA, a1 and a2 represent the stereochemistry of the alkene bond.
[0170] In some embodiments of the compound of Formula IA, alkene bond a1 is cis. In some embodiments of the compound of Formula IA, alkene bond a1 is trans.
[0171] In some embodiments of the compound of Formula IA, alkene bond a2 is cis. In some embodiments of the compound of Formula IA, alkene bond a2 is trans.
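As a worked illustration of the embodiments above, the following Python sketch counts and enumerates the formal stereochemical assignments of Formula IA, treating each labeled stereocenter as (R) or (S) and each labeled alkene bond as cis or trans. The label lists are taken from the paragraphs above. The function names are hypothetical, and the count is purely combinatorial: it makes no claim that every assignment is chemically distinct, stable, or accessible.

```python
from itertools import product

# Stereocenter and alkene labels as recited for Formula IA.
STEREOCENTER_LABELS = [2, 8, 9, 13, 15, 18, 27, 32, 33, 39]
ALKENE_LABELS = ["a1", "a2"]


def count_assignments(stereocenters, alkenes):
    """Number of formal assignments: two choices per stereocenter and per alkene."""
    return 2 ** len(stereocenters) * 2 ** len(alkenes)


def enumerate_assignments(stereocenters, alkenes):
    """Yield (stereocenter assignment, alkene assignment) pairs as dictionaries."""
    for rs in product("RS", repeat=len(stereocenters)):
        for geometry in product(("cis", "trans"), repeat=len(alkenes)):
            yield dict(zip(stereocenters, rs)), dict(zip(alkenes, geometry))


total = count_assignments(STEREOCENTER_LABELS, ALKENE_LABELS)
print(total)  # 4096 = 2**10 * 2**2
```

With ten stereocenters alone the count is 2**10 = 1024, and the two alkene geometries multiply it by four.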
[0172] In some embodiments, the compounds of the disclosure provide a scaffold which can be derivatized to create therapeutic agents. In some embodiments, the compounds of the disclosure, themselves, are therapeutic agents.
[0173] For example, in some embodiments, the compounds of the disclosure target the Wnt/beta-catenin (or β-catenin) signaling pathway. Wnt/β-catenin signaling, a highly conserved pathway through evolution, regulates key cellular functions including proliferation, differentiation, migration, genetic stability, apoptosis, and stem cell renewal. The Wnt pathway mediates biological processes but the effect depends on the involvement of β-catenin in signal transduction. β-catenin is a core component of the cadherin protein complex, whose stabilization is essential for the activation of Wnt/β-catenin signaling.
[0174] Without wishing to be bound by theory, it is thought that the azinothricin family of compounds targets Wnt/β-catenin signaling, and exhibits strong antitumor and antibacterial activity. Alternatively, or in addition, HIF-1alpha and v-ATPase have recently been suggested to be the direct targets of this family of molecules. Azinothricin compounds are cyclic hexadepsipeptides that are characterised by a 19-membered cyclodepsipeptide ring composed of 6 unusual amino acids (hexadepsipeptide) and an acyl side chain connected through an amide bond. The first member of this class, azinothricin, was reported from Streptomyces X-14950. Because of the strong antitumor and antibacterial activity, significant efforts have been made to identify bacterially-produced natural products using the classical culturing approach. However, the discovery of new drug candidates using culturing approaches has been limited.
[0175] Instead of culturing, Applicant has mined genomes for biosynthetic gene clusters (BGC) similar to those associated with azinothricin, in order to identify potentially new compound structures.
[0176] Using this approach, Applicant identified a biosynthetic gene cluster (BGC), termed AZT039, from the genome of Streptomyces sp. strain NRRL F-6131. The AZT039 BGC had the potential to produce a hexadepsipeptide; however, as described in Example 3, one of ordinary skill in the art would reasonably believe that the AZT039 BGC did not naturally produce the hexadepsipeptide of any of Formula (I), Formula (10), or Formula (11).
[0177] The BGC in AZT039 comprises a LmBU regulator. LmBU regulators are a class of BGC-specific regulators that act as positive regulators of compound expression in many species of Streptomyces.
[0178] In some embodiments, the inability to quantify a compound of Formula (I), Formula (10), or Formula (11) suggests that LmBU expression levels were not sufficient to produce a compound of Formula (I), Formula (10), or Formula (11). Therefore, in one aspect, the disclosure provides compositions and methods for overexpressing LmBU by cloning it into an integrative plasmid under the control of a constitutive promoter. In some embodiments, the promoter is a strong heterologous promoter not present in the naturally occurring AZT039 BGC. In some embodiments, the promoter is added to initiate transcription of RNA, and consequently synthesize a compound of Formula (I), Formula (10), and/or Formula (11). In some embodiments, the promoter is selected from ermE* and kaso*.
Polynucleotides and Vectors
[0179] The disclosure provides polynucleotides comprising a biosynthetic gene cluster comprising one or more genes that contribute to the production of at least a portion of the compound of Formula (I) when the biosynthetic gene cluster is expressed by a host cell. Host cells expressing the polynucleotides of the disclosure can be used in the manufacture of the
compound of Formula (I). In some embodiments, the biosynthetic gene cluster (BGC) can be wild type, i.e. not subject to modifications through genetic engineering methods known in the art. In other embodiments, the BGC is subject to one or more modifications, for example modifications that increase, or result in, expression of the compound of Formula (I) by the host cell.
Biosynthetic Gene Cluster
[0180] In some embodiments, the biosynthetic gene cluster involved in the production of a compound of Formula (I), Formula (10), or Formula (11) is isolated or derived from a Streptomyces species of bacteria. Streptomyces is a genus of Actinobacteria, and the type genus of the family Streptomycetaceae. Over 500 species of Streptomyces have been described to date, all of which are envisaged as within the scope of the instant disclosure. In some embodiments, the biosynthetic gene cluster is isolated or derived from Streptomyces sp. NRRL F-6131.
[0181] In some embodiments, the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of the compound of Formula (I), Formula (10), or Formula (11) when the biosynthetic gene cluster is expressed by a host cell. As a non-limiting example, the biosynthetic gene cluster comprises at least one gene that, together with other genes in the host genome and/or the biosynthetic gene cluster, catalyzes or contributes to at least one biosynthetic step that results in the production of the compound of Formula (I), Formula (10), or Formula (11) from a precursor compound. Exemplary precursor compounds are shown in FIG. 1.
[0182] In some embodiments, the biosynthetic gene cluster comprises at least one nonribosomal peptide synthetase module. Nonribosomal peptide synthetases (NRPS) are enzymes which, unlike ribosomes, synthesize their own peptidic products independent of messenger RNA. NRPS are modular enzymes that catalyze synthesis of important peptide products from a variety of standard and non-proteinogenic amino acid substrates. Typically, each NRPS can synthesize one type of non-ribosomal peptide. Nonribosomal peptides often have cyclic and/or branched structures, can contain non-proteinogenic amino acids including D-amino acids, carry modifications like N-methyl and N-formyl groups, or are glycosylated, acylated, halogenated, or hydroxylated. The NRPS genes for an individual peptide are frequently organized into operons in bacteria. Functionally related operons may be organized into gene clusters. NRPS enzymes are organized in modules, each module comprising multiple catalytic domains that are responsible for incorporation of a single amino acid
residue. In an exemplary NRPS module, a first domain activates and covalently attaches an amino acid to an integrated carrier protein domain, and the substrates and intermediates are then delivered to neighboring catalytic domains for peptide bond formation or, in some modules, chemical modification. In the final module, the peptide is delivered to a terminal thioesterase domain that catalyzes release of the peptide product. All NRPS, and modules thereof, that are capable of contributing to the production of the compound of any one of Formula (I), Formula (10), or Formula (11) are envisaged as within the scope of the instant disclosure.
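The module-by-module logic described above can be sketched as a toy model. This is a minimal illustration, assuming one residue per module and a thioesterase on the final module only. The class name, function name, and residue names (aa1 to aa6) are hypothetical placeholders and do not represent the actual substrates or domain architecture of the AZT039 assembly line.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NrpsModule:
    """One module, responsible for incorporating a single residue."""
    substrate: str                  # placeholder residue name (hypothetical)
    has_thioesterase: bool = False  # True only for the final module


def run_assembly_line(modules: List[NrpsModule]) -> str:
    """Walk the modules in order and return the released peptide as a string."""
    if not modules or not modules[-1].has_thioesterase:
        raise ValueError("the final module must carry the thioesterase domain")
    chain: List[str] = []
    for module in modules:
        # Activation and loading: the residue is attached to the carrier domain.
        loaded_residue = module.substrate
        # Condensation: the growing chain is extended by the loaded residue.
        chain.append(loaded_residue)
    # Release: the terminal thioesterase domain frees the finished product.
    return "-".join(chain)


six_modules = [NrpsModule(f"aa{i}") for i in range(1, 6)]
six_modules.append(NrpsModule("aa6", has_thioesterase=True))
released = run_assembly_line(six_modules)
print(released)  # aa1-aa2-aa3-aa4-aa5-aa6
```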
[0183] In some embodiments, the one or more genes comprise at least one nonribosomal peptide synthetase (NRPS) module. In some embodiments, the one or more genes comprise at least 1, 2, 3, 4, 5, 6, or more NRPS modules. In some embodiments, the one or more genes comprise six NRPS modules. Sequences of representative NRPS modules are described in Table 1, below.
[0184] In some embodiments, the biosynthetic gene cluster comprises a polynucleotide sequence encoding one or more NRPS modules. In some embodiments, the polynucleotide sequence encoding the NRPS module is selected from the group consisting of SEQ ID NOS: 12-17, or a sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the sequence encoding the NRPS module comprises, or consists essentially of, a polynucleotide sequence selected from the group consisting of SEQ ID NOS: 12-17.
[0185] In some embodiments, the biosynthetic gene cluster comprises polynucleotide sequences encoding six NRPS modules. In some embodiments, the polynucleotide sequences encoding the six NRPS modules comprise sequences of SEQ ID NOS: 12-17, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the polynucleotide sequences encoding six NRPS modules comprise sequences of SEQ ID NOS: 12-17.
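The percent identity thresholds recited above can be illustrated with a short calculation. The following Python sketch assumes that the two sequences have already been aligned and that identity is computed as matching positions divided by aligned columns; this definition, the function names, and the toy sequences are assumptions made for illustration, and the disclosure may rely on a different alignment method or definition.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over two pre-aligned, equal-length sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as a match only when both residues are identical and not gaps.
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True when the pair reaches at least the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Toy sequences (hypothetical, not taken from the sequence listing): one mismatch in ten.
reference = "ATGGCCGTTA"
variant = "ATGGCTGTTA"
identity = percent_identity(reference, variant)
print(identity)
```

Under this definition a gap in either sequence counts against identity, so the result depends on the alignment supplied.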
[0186] In some embodiments, the biosynthetic gene cluster comprises one or more NRPS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 18-23, or a sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises one or more NRPS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 18-23. In some embodiments, the biosynthetic gene cluster comprises six NRPS modules comprising polypeptide sequences of SEQ ID NOS: 18-23, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises six NRPS modules comprising polypeptide sequences of SEQ ID NOS: 18-23.
[0187] In some embodiments, the six NRPS modules are organized in 1, 2, 3, 4, 5 or 6 open reading frames. In some embodiments, the open reading frames comprise polynucleotide sequences selected from the group consisting of SEQ ID NOS: 2-5, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the six NRPS modules are organized in 4 open reading frames. In some embodiments, the six NRPS modules are encoded by sequences comprising a first NRPS open reading frame of SEQ ID NO: 2, a second NRPS open reading frame of SEQ ID NO: 3, a third NRPS open reading frame of SEQ ID NO: 4 and a fourth NRPS open reading frame of SEQ ID NO: 5, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the six NRPS modules are encoded by sequences comprising SEQ ID NOS: 2-5. In some embodiments, one or more of the NRPS open reading frames encode a polypeptide having an amino acid sequence selected from SEQ ID NOS: 38-41, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the one or more NRPS open reading frames encode a polypeptide having a sequence of SEQ ID NOS: 38-41, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the NRPS modules and/or the host cells described herein comprise a polynucleotide sequence which encodes a polypeptide having a sequence of SEQ ID NOS: 38-41, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
[0188] In some embodiments, the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of a compound of Formula (I), Formula (10), and/or Formula (11) when the biosynthetic gene cluster is expressed by a host cell. In some embodiments, the one or more genes comprise at least one polyketide synthase (PKS) module.
[0189] In some embodiments, the one or more genes comprise at least one PKS module. In some embodiments, the one or more genes comprise at least 1, 2, 3, 4 or more PKS modules. In some embodiments, the one or more genes comprise four PKS modules. Sequences of representative PKS modules are described in Table 2, below.
[0190] Polyketide synthases (PKS) are a family of multi-domain enzymes or enzyme complexes that produce polyketides, a large class of secondary metabolites, in bacteria, fungi, plants, and a few animal lineages. PKS genes for an individual polyketide are usually organized in a single operon or in a gene cluster. PKSs can be classified into three groups: Type I, Type II, and Type III. Type I polyketide synthases are large, highly modular proteins; Type II polyketide synthases are aggregates of monofunctional proteins; and Type III polyketide synthases do not use acyl carrier protein (ACP) domains. All types of PKSs, and modules thereof, capable of contributing to the production of the compound of Formula (I) are envisaged as within the scope of the instant disclosure.
[0191] Type I polyketide synthase modules comprise several domains with defined functions, separated by short spacer regions. An exemplary, but non-limiting, Type I PKS protein comprises, from N to C terminus, a starting or loading module comprising an acyltransferase (AT) and an acyl carrier protein (ACP) domain, an elongation or extending module comprising ketosynthase (KS), AT, dehydratase (DH), enoylreductase (ER) and ketoreductase (KR) domains, and a termination or releasing domain or module comprising a thioesterase (TE). As the polyketide is synthesized, the nascent polyketide chain is passed from one thiol group to the next by trans-acylation reactions, and is released at the end by hydrolysis or cyclization. At the start, the starter group, for example acetyl-CoA or an analogue thereof, is loaded onto the ACP domain of the starter module in a reaction catalyzed by the starter module’s AT domain. In the polyketide elongation stages, the nascent polyketide chain is passed from the ACP domain of the previous module to the KS domain of the current module, in a reaction catalyzed by the KS domain. The elongation group, usually malonyl-CoA or methylmalonyl-CoA, is loaded onto the current ACP domain in a reaction catalyzed by the current AT domain. The ACP-bound elongation group reacts in a Claisen condensation with the KS-bound polyketide chain under CO2 evolution, leaving a free KS domain and an ACP-bound elongated polyketide chain. The reaction takes place at the KSn-bound end of the chain, so that the chain moves out one position and the elongation group becomes the new bound group. In some cases, the fragment of the polyketide chain can be altered stepwise by additional domains. This cycle is repeated for each elongation module, until finally the TE domain hydrolyzes the completed polyketide chain from the ACP domain of the previous module.
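The elongation cycle described above can be followed by tracking the number of carbon atoms in the ACP-bound chain. The following Python sketch is a simplified bookkeeping model, not a description of any particular PKS of the disclosure: the function name and the example order of extender units are hypothetical, and the optional KR, DH and ER steps are ignored because they change the oxidation state of the chain rather than its carbon count.

```python
# Carbons contributed to the growing chain by each building block. Each extender
# unit loses one carbon as CO2 during the Claisen condensation, so malonyl-CoA
# (3 carbons in the malonyl group) adds 2 and methylmalonyl-CoA (4 carbons) adds 3.
STARTER_CARBONS = {"acetyl-CoA": 2}
EXTENDER_CARBONS = {"malonyl-CoA": 2, "methylmalonyl-CoA": 3}


def elongate(starter: str, extenders: list) -> list:
    """Return the carbon count of the ACP-bound chain after loading and after each elongation."""
    carbons = STARTER_CARBONS[starter]
    history = [carbons]
    for extender in extenders:
        carbons += EXTENDER_CARBONS[extender]  # one condensation per elongation module
        history.append(carbons)
    return history


# Hypothetical order of extender units across three elongation modules.
history = elongate("acetyl-CoA", ["malonyl-CoA", "methylmalonyl-CoA", "malonyl-CoA"])
print(history)  # [2, 4, 7, 9]
```

The length of the returned list is one more than the number of elongation modules, reflecting one loading step followed by one condensation per module.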
[0192] In some embodiments, the biosynthetic gene cluster comprises a polynucleotide sequence encoding one or more PKS modules. In some embodiments, the polynucleotide sequence encoding the PKS module is selected from the group consisting of SEQ ID NOS: 24-27, or a sequence having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the sequence encoding the PKS module comprises, or consists essentially of, a polynucleotide sequence selected from the group consisting of SEQ ID NOS: 24-27.
[0193] In some embodiments, the biosynthetic gene cluster comprises polynucleotide sequences encoding four PKS modules. In some embodiments, the polynucleotide sequences encoding the four PKS modules comprise sequences of SEQ ID NOS: 24-27, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the polynucleotide sequences encoding four PKS modules comprise sequences of SEQ ID NOS: 24-27.
[0194] In some embodiments, the biosynthetic gene cluster comprises one or more PKS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 28-31, or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises one or more PKS modules comprising a polypeptide sequence selected from the group consisting of SEQ ID NOS: 28-31. In some embodiments, the biosynthetic gene cluster comprises four PKS modules comprising polypeptide sequences of SEQ ID NOS: 28-31, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises four PKS modules comprising sequences of SEQ ID NOS: 28-31.
[0195] In some embodiments, the four PKS modules are organized in 1, 2, 3 or 4 open reading frames. In some embodiments, the open reading frames comprise polynucleotide sequences selected from the group consisting of SEQ ID NOS: 6 and 7, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the four PKS modules are organized in two open reading frames. In some embodiments, the four PKS modules are encoded by sequences comprising a first PKS open reading frame of SEQ ID NO: 6, and a second PKS open reading frame of SEQ ID NO: 7, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the four PKS modules are encoded by sequences comprising SEQ ID NOS: 6-7. In some embodiments, one or more of the PKS open reading frames encode a polypeptide having an amino acid sequence selected from SEQ ID NOs: 42-43, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the one or more PKS open reading frames encode a polypeptide having a sequence of SEQ ID NOs: 42-43, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto. In some embodiments, the one or more PKS modules and/or the host cells described herein comprise a polynucleotide sequence which encodes a polypeptide having a sequence of SEQ ID NOs: 42-43, or sequences having at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity thereto.
[0196] The biosynthetic gene cluster of the disclosure can comprise additional genes involved in the production of a compound of Formula (I), Formula (10), and/or Formula (11) in addition to genes encoding NRPS and PKS proteins or modules. For example, the biosynthetic gene cluster can include genes that regulate the expression of other genes in the cluster, such as NRPS and PKS encoding genes, genes involved in the synthesis of precursor compounds that are involved in the synthesis of the compound of Formula (I), Formula (10), and/or Formula (11), and genes involved in the transport of the same.
[0197] In some embodiments, the biosynthetic gene cluster further comprises a sequence encoding a LmBU regulator, sometimes referred to herein as the LmBU-encoding gene. LmBU-family regulators are transcription factors that have been shown to positively modulate biosynthesis pathways, for example antibiotic biosynthesis, in streptomycete bacterial species. LmBU genes are known to occur in various types of antibiotic gene clusters, including, inter alia, gene clusters encoding the synthesis of lincomycin, where they can regulate expression of genes in the gene cluster.
[0198] In some embodiments, the biosynthetic gene cluster comprises a LmBU-encoding gene comprising a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a LmBU-encoding gene comprising a protein coding sequence comprising, or consisting essentially of SEQ ID NO: 8.
[0199] In some embodiments, the biosynthetic gene cluster comprises a sequence encoding a LmBU protein comprising a sequence of:
MDEVSNQGNVRLMHRRMHEPDAQNCQVLTTKTGLRIPQGMAFEEWERAGRQIAGWDSSSWWLGDWLVYGKDHYTDRYQRGIKAVGLRYQTLRNYAWVSRRFEFNRRRPGLTFQHHAELASLPVPEQDLWLDRAEQMNWTTKQLRHAIRAANEERVPEQRQAETTRRLAVPGNRLQWWHEAAEQLGTDLEQWVLATLDQAARQVLENAEGRTGLPG* (SEQ ID NO: 32), or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a polynucleotide sequence encoding a LmBU protein comprising, or consisting essentially of SEQ ID NO: 32.
[0200] In some embodiments, the biosynthetic gene cluster comprises an mbtH gene. mbtH proteins are a family of small proteins encoded by genes found in many, but not all, non-ribosomal peptide synthetase-encoding gene clusters. Approximately 70 amino acids in length, mbtH proteins are named after mbtH contained in the gene cluster for the siderophore mycobactin in Mycobacterium tuberculosis, which codes for a 71-amino acid protein. Without wishing to be bound by theory, it is thought that mbtH genes are involved in the biosynthesis pathways of the gene clusters in which they reside.
[0201] In some embodiments, the biosynthetic gene cluster comprises an mbtH gene. In some embodiments the biosynthetic gene cluster comprises four NRPS open reading frames, and the mbtH gene is located upstream of the four NRPS open reading frames. In some embodiments, the biosynthetic gene cluster comprises a mbtH-encoding gene comprising a polynucleotide sequence of SEQ ID NO: 36, or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a mbtH-encoding gene comprising a protein coding sequence comprising, or consisting essentially of SEQ ID NO: 36. In some embodiments, the biosynthetic gene cluster comprises a mbtH protein comprising a polypeptide sequence of SEQ ID NO: 37, or a sequence having at least about 80%, 85%, 90%, 95%, 97%, or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a mbtH protein comprising a polypeptide sequence comprising, or consisting essentially of SEQ ID NO: 37.
[0202] In some embodiments, the biosynthetic gene cluster is a wild type biosynthetic gene cluster isolated or derived from Streptomyces sp. NRRL F-6131. In some embodiments, the biosynthetic gene cluster comprises 6 NRPS modules encoded by polynucleotide sequences comprising SEQ ID NOS: 12-17, and 4 PKS modules encoded by sequences comprising SEQ ID NOS: 24-27. In some embodiments, the 6 NRPS modules are arranged in 4 open reading frames comprising sequences of SEQ ID NOS: 2-5, and the 4 PKS are arranged in 2 open reading frames comprising sequences of SEQ ID NOS: 6-7. In some embodiments, the biosynthetic gene cluster further comprises a LmBU-encoding gene comprising a polynucleotide sequence of SEQ ID NO: 8, which is located downstream of the 2 PKS open reading frames, for example as shown in FIG. 1.
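The sequence identifiers recited in the preceding paragraphs can be collected into a single lookup structure, which makes it straightforward to check that the counts agree with the text and that no identifier is assigned to two different features. The following Python sketch is a convenience for the reader; the dictionary keys and the function name are hypothetical labels, and only the SEQ ID numbers themselves are taken from the paragraphs above.

```python
# SEQ ID NOS as recited above for the wild type cluster; the keys are hypothetical labels.
CLUSTER_SEQ_IDS = {
    "nrps_orf_dna": [2, 3, 4, 5],
    "pks_orf_dna": [6, 7],
    "lmbu_gene_dna": [8],
    "nrps_module_dna": list(range(12, 18)),      # SEQ ID NOS: 12-17
    "nrps_module_protein": list(range(18, 24)),  # SEQ ID NOS: 18-23
    "pks_module_dna": list(range(24, 28)),       # SEQ ID NOS: 24-27
    "pks_module_protein": list(range(28, 32)),   # SEQ ID NOS: 28-31
    "lmbu_protein": [32],
    "mbth_gene_dna": [36],
    "mbth_protein": [37],
    "nrps_orf_protein": list(range(38, 42)),     # SEQ ID NOS: 38-41
    "pks_orf_protein": [42, 43],
}


def find_reused_ids(seq_ids: dict) -> list:
    """Return SEQ ID numbers that appear under more than one feature label."""
    seen = {}
    reused = set()
    for feature, ids in seq_ids.items():
        for seq_id in ids:
            if seq_id in seen and seen[seq_id] != feature:
                reused.add(seq_id)
            seen.setdefault(seq_id, feature)
    return sorted(reused)


reused = find_reused_ids(CLUSTER_SEQ_IDS)
print(reused)
print(len(CLUSTER_SEQ_IDS["nrps_module_dna"]), len(CLUSTER_SEQ_IDS["nrps_orf_dna"]))
```

A check of this kind would flag, for example, a PKS module polypeptide range that overlapped the NRPS module polypeptide range.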
[0203] In some embodiments, the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1. In some embodiments, the biosynthetic gene cluster consists essentially of a sequence of SEQ ID NO: 1.
[0204] In some embodiments, one or more genes of the biosynthetic gene cluster are expressed by a host cell comprising the biosynthetic gene cluster, resulting in the production of a compound of Formula (I), Formula (10), and/or Formula (11). In some embodiments, the host cell is engineered to express one or more genes in the biosynthetic cluster, which results in the production of a compound of Formula (I), Formula (10), and/or Formula (11).
[0205] In some embodiments, overexpression of one or more genes in the biosynthetic cluster by the host cell increases the production of a compound of Formula (I), Formula (10), and/or Formula (11) compared to an otherwise equivalent host cell comprising a biosynthetic gene cluster that does not overexpress one or more genes in the biosynthetic cluster.
[0206] In some embodiments, the modified host cell increases the production of a compound of Formula (I), Formula (10), and/or Formula (11) by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% as measured by LCMS.
[0207] In some embodiments, the host cell overexpresses the LmBU-encoding gene. LmBU can be overexpressed in cis or in trans. For example, for cis overexpression, the promoter of the LmBU protein in the biosynthetic gene cluster can be modified to increase LmBU expression. Alternatively, or in addition, LmBU can be expressed in trans to the biosynthetic gene cluster by the host cell, for example by using a strong promoter to drive LmBU expression.
[0208] Without wishing to be bound by theory, it is thought that the LmBU protein regulates the expression of additional genes in the biosynthetic gene cluster by acting as a transcriptional activator. Increasing the expression of LmBU increases the expression of LmBU target genes in the biosynthetic gene cluster, thereby increasing the production of compounds of Formula (I), Formula (10), and/or Formula (11) by the host cell.
[0209] In some embodiments, increasing the expression of LmBU increases the production of compounds of Formula (I), Formula (10), and/or Formula (11) by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% as measured by LCMS.
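The percentage increases recited above are relative changes with respect to a reference host cell. The following Python sketch shows the arithmetic under the assumption that production is compared using integrated LCMS peak areas for the compound acquired under identical conditions; the disclosure does not specify the quantification method, and the function name and peak area values are hypothetical.

```python
def percent_increase(baseline_area: float, modified_area: float) -> float:
    """Relative increase in production, in percent, of the modified over the baseline host."""
    if baseline_area <= 0:
        raise ValueError("baseline peak area must be positive")
    return 100.0 * (modified_area - baseline_area) / baseline_area


# Hypothetical integrated peak areas (arbitrary units) for the same compound.
increase = percent_increase(baseline_area=2.0e6, modified_area=3.0e6)
print(increase)  # 50.0
```

On this definition, an increase of 100% corresponds to a doubling of the measured signal.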
[0210] The disclosure provides polynucleotides comprising a sequence encoding the LmBU protein and a sequence of a promoter, for the overexpression of LmBU protein in a host cell. The LmBU protein can be expressed in trans in a host cell, from a polynucleotide that does not form a part of the biosynthetic gene cluster. For example, the host cell can comprise a first vector comprising the biosynthetic gene cluster, and a second vector comprising the sequences of the LmBU protein and a promoter.
[0211] As used herein, “promoter” refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA.
[0212] In some embodiments of the polynucleotides comprising a sequence encoding the LmBU protein and a sequence of a promoter, the two are operably linked. In some embodiments, the sequence encoding the LmBU protein comprises a sequence of SEQ ID NO: 8, and the promoter comprises a sequence of SEQ ID NOS: 9-10 as set forth in Table 3.
[0213] Representative promoters that can be used to overexpress genes such as LmBU, either in cis by insertion into the BGC, or in trans, are presented in Table 3 below.
[0214] Table 3. Promoter Sequences
[0215] In some embodiments, trans overexpression of the LmBU-encoding gene comprises expressing the LmBU-encoding gene under the control of an ermE promoter, a kasO promoter, or a functional variant or derivative thereof. In some embodiments, trans overexpression of the LmBU-encoding gene comprises expressing the LmBU-encoding gene under the control of a constitutive ermE promoter, or a functional variant or derivative thereof. In some embodiments, the ermE promoter is an ermE* promoter comprising a sequence of SEQ ID NO: 9. In some embodiments, trans overexpression of the LmBU-encoding gene comprises expressing the LmBU-encoding gene under the control of a constitutive kasO promoter, or a functional variant or derivative thereof. In some embodiments, the kasO promoter is a kasO* promoter comprising a sequence of SEQ ID NO: 10.
Modified Biosynthetic Gene Clusters
[0216] The disclosure provides polynucleotides comprising biosynthetic gene clusters that have been modified relative to their wild type, or native equivalents, to increase production of a compound of Formula (I), Formula (10), and/or Formula (11) when the genes of the biosynthetic gene cluster are expressed by a host cell. In a non-limiting embodiment, the production of a compound of Formula (I), Formula (10), and/or Formula (11) by a polynucleotide comprising biosynthetic gene clusters that has been modified relative to its wild type, or native equivalent is increased by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% compared to the wild type, or native equivalent, as measured by LCMS.
[0217] In some embodiments, the modified polynucleotide increases the production of a compound of Formula (I), Formula (10), and/or Formula (11) by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% compared to its wild type, or native equivalent, as measured by LCMS.
[0218] The biosynthetic gene cluster can be modified relative to a biosynthetic gene cluster comprising a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the modification comprises one or more modifications relative to a sequence of SEQ ID NO: 1.
[0219] In some embodiments, the biosynthetic gene cluster is modified to overexpress the LmBU protein in a host cell, thereby increasing production of a compound of Formula (I), Formula (10), and/or Formula (11) by the host cell. In some embodiments, the at least one modification of the biosynthetic gene cluster comprises a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1. In a non-limiting embodiment, the production of a compound of Formula (I), Formula (10), and/or Formula (11) by a biosynthetic gene cluster modified to overexpress the LmBU protein in a host cell is increased by about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% compared to production by the biosynthetic gene cluster of SEQ ID NO: 1, as measured by LCMS.
[0220] All modifications are envisaged as within the scope of the instant disclosure. Exemplary, but non-limiting, modifications include substitutions, deletions, inversions, or insertions of heterologous sequences. In some embodiments, the one or more modifications of the biosynthetic gene cluster comprise a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1.
[0221] In some embodiments, the one or more modifications comprise modifications of a promoter of a gene in the biosynthetic gene cluster. For example, a heterologous promoter sequence can be inserted near the coding sequence of one or more genes of the BGC. Alternatively, or in addition, one or more promoters of a gene in the BGC can be replaced with a heterologous promoter. Heterologous promoters include, inter alia, strong promoters, constitutive promoters and regulatable promoters. Exemplary strong promoters include ermE* (SEQ ID NO: 9) and kasO* (SEQ ID NO: 10) as shown in Table 3. In some embodiments, replacement of one or more promoters comprises replacement of the LmBU promoter, for example with a promoter shown in Table 3.
[0222] In some embodiments, the one or more modifications comprise insertion of at least one heterologous promoter in the biosynthetic gene cluster of SEQ ID NO: 1. In some embodiments, the at least one heterologous promoter is a strong promoter. In some embodiments, the at least one heterologous promoter is selected from ermE and kasO, or functional variants or derivatives thereof. In some embodiments, the at least one heterologous promoter comprises a sequence of SEQ ID NOS: 9-10, or a functional variant or derivative thereof. For example, engineered versions of the ermE and kasO promoters used herein are sometimes referred to herein as ermE* and kasO*. In some embodiments, the at least one
heterologous promoter comprises a sequence of SEQ ID NOS: 9-10, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[0223] In some embodiments, the one or more modifications comprise insertion of at least one promoter in the biosynthetic gene cluster. In some embodiments, the at least one promoter is inserted upstream of the mbtH gene. For example, an ermE* promoter of SEQ ID NO: 9 or a kasO* promoter of SEQ ID NO: 10 is inserted upstream of the mbtH gene in SEQ ID NO: 1. Alternatively, or in addition, the at least one promoter is inserted upstream of the LmBU-encoding gene in the biosynthetic gene cluster of SEQ ID NO: 1. For example, a kasO* promoter of SEQ ID NO: 10 is inserted upstream of the LmBU-encoding gene in SEQ ID NO: 1.
[0224] In some embodiments, the biosynthetic gene cluster comprises one or more modifications relative to SEQ ID NO: 1. In some embodiments, the modified biosynthetic gene cluster comprises SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
Methods of Modifying Biosynthetic Gene Clusters
[0225] The disclosure provides methods of modifying the biosynthetic gene clusters described herein. In some embodiments, methods of modifying the biosynthetic gene clusters described herein comprise a nucleic acid guided endonuclease.
The disclosure provides methods of modifying biosynthetic gene clusters comprising (a) providing a first E. coli host cell comprising a first vector comprising a sequence of an unmodified biosynthetic gene cluster comprising a target sequence; (b) introducing the first vector into a Streptomyces host cell by conjugation; (c) providing a second E. coli host cell comprising a second vector comprising: (i) a sequence of at least one gNA specific to the target sequence operably linked to a promoter, (ii) a sequence encoding a Cas endonuclease; and (iii) a sequence encoding a donor template; and (d) introducing the second vector into a Streptomyces host cell by conjugation; whereby introducing the second vector into the Streptomyces host cell produces a double strand break in the target sequence and introduction of a donor template sequence, thereby generating a Streptomyces host cell comprising a modified biosynthetic gene cluster. In some embodiments, the unmodified gene cluster comprises SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto, and the one or more modifications are modifications of SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the polynucleotide sequence of the modified biosynthetic gene cluster
comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[0226] In some embodiments, the nucleic acid guided endonuclease is a CRISPR/Cas endonuclease. In some embodiments, the CRISPR/Cas endonuclease is Cas9. Other endonucleases known in the art may be used with the constructs described herein. In some embodiments, the Cas endonuclease is selected from Cas9 (also known as Csn1 and Csx12), Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, variants thereof, mutants thereof, and derivatives thereof.
[0227] As used herein, “CRISPR/Cas endonuclease” refers to an enzymatic system that includes a guide nucleic acid (gNA) that contains a nucleotide sequence complementary or substantially complementary to a region of a target polynucleotide, and a protein with nuclease activity. The CRISPR/Cas systems include the CRISPR-Cas Type I system, the CRISPR-Cas Type II system, the CRISPR-Cas Type III system, and derivatives thereof. CRISPR/Cas systems include genetically modified nuclease systems and/or programmed nuclease systems derived from naturally occurring CRISPR-Cas systems. CRISPR-Cas systems can contain genetically modified Cas proteins and/or mutated Cas proteins. CRISPR/Cas systems may contain genetically modified and/or programmed gNA.
[0228] As used herein, the term “guide nucleic acid” or “gNA” refers to a nucleic acid that contains a sequence complementary or substantially complementary to a region of a target DNA sequence. A gNA may contain nucleotide sequences in a region other than the region complementary or substantially complementary to a region of a target DNA sequence, sometimes termed a leader RNA. A leader RNA can be a crRNA or a derivative thereof, for example, a crRNA:tracrRNA chimera. gNAs can be RNAs (gRNAs) or DNAs (gDNAs).
[0229] In the CRISPR/Cas systems described herein, the gNA forms a complex with the CRISPR/Cas enzyme and the targeting portion of the gNA targets the CRISPR/Cas endonuclease to a specific target sequence in a target DNA polynucleotide. The CRISPR/Cas endonuclease then cuts the DNA, producing a double strand break. This double strand break can be repaired by non-homologous end joining, resulting in a deletion, or by homology directed repair (HDR) from a donor template. If the donor template includes sequences
different from the target DNA polynucleotide, these sequence differences are incorporated into the target DNA polynucleotide.
[0230] In some embodiments, the donor template comprises, from 5' to 3', a sequence homologous to a sequence 5' of the target sequence, a sequence of a promoter, and a sequence homologous to a sequence 3' of the target sequence. In some embodiments, the promoter is selected from the group consisting of ermE and kasO, or functional variants or derivatives thereof.
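The donor template layout and the outcome of homology directed repair described above can be sketched in silico. The following Python sketch is an idealized string model only: the function names, the toy target sequence, the placeholder promoter, the cut position, and the short homology arm length are all hypothetical and are not the sequences or design parameters of the disclosure.

```python
def build_donor(target: str, cut_index: int, promoter: str, arm_length: int) -> str:
    """Donor = 5' homology arm + promoter + 3' homology arm, taken from around the cut site."""
    if arm_length < 1:
        raise ValueError("arm_length must be at least 1")
    if not (arm_length <= cut_index <= len(target) - arm_length):
        raise ValueError("cut site is too close to an end of the target for this arm length")
    left_arm = target[cut_index - arm_length:cut_index]
    right_arm = target[cut_index:cut_index + arm_length]
    return left_arm + promoter + right_arm


def repair_with_donor(target: str, cut_index: int, donor: str, arm_length: int) -> str:
    """Idealized homology directed repair: copy the donor payload into the break."""
    left_arm = donor[:arm_length]
    right_arm = donor[len(donor) - arm_length:]
    payload = donor[arm_length:len(donor) - arm_length]
    if target[cut_index - arm_length:cut_index] != left_arm:
        raise ValueError("5' homology arm does not match the target")
    if target[cut_index:cut_index + arm_length] != right_arm:
        raise ValueError("3' homology arm does not match the target")
    return target[:cut_index] + payload + target[cut_index:]


# Toy inputs (hypothetical); real homology arms would be far longer than 5 nucleotides.
target = "AAACCCGGGTTTACGTACGT"
promoter = "TTGACA"  # placeholder, not SEQ ID NO: 9 or 10
donor = build_donor(target, cut_index=10, promoter=promoter, arm_length=5)
edited = repair_with_donor(target, cut_index=10, donor=donor, arm_length=5)
print(donor)
print(edited)
```

The edited sequence is the original target with the promoter inserted at the break, which is the sequence difference that the donor template carries between its two homology arms.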
[0231] In some embodiments, the biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the sequence of SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto is modified using a CRISPR/Cas endonuclease and a donor template to insert at least one heterologous promoter into the biosynthetic gene cluster. In some embodiments, the at least one heterologous promoter can be inserted upstream of the mbtH gene in SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto, upstream of the LmBU-encoding gene in SEQ ID NO: 1 or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto, or downstream of the second PKS open reading frame or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[0232] In some embodiments, gRNAs comprise a targeting sequence, sometimes referred to as a protospacer, and a scaffold. In some embodiments, the gRNA is selected from CCTTGACAGACAAATTAGGA (SEQ ID NO: 33), TGTGATTCCACTTTTCGAGT (SEQ ID NO: 34), and CGCCGATGCCCTGTGATTCC (SEQ ID NO: 35).
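Locating a targeting sequence within a target polynucleotide can be sketched as a string search on both strands. The following Python sketch assumes a Cas9 that requires an NGG protospacer adjacent motif immediately 3' of the protospacer, as is the case for the commonly used Streptococcus pyogenes Cas9; the disclosure does not state which Cas9 ortholog or motif is used, and the function names and the flanking nucleotides of the toy target are hypothetical. Only the 20-nucleotide targeting sequence of SEQ ID NO: 33 is taken from the text above.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_sites(target: str, protospacer: str, pam_pattern: str = "[ACGT]GG") -> list:
    """Return (strand, start, pam) for each protospacer occurrence followed by a matching PAM.

    For the "-" strand, start is a coordinate on the reverse complement of the target.
    """
    sites = []
    for strand, seq in (("+", target), ("-", reverse_complement(target))):
        start = seq.find(protospacer)
        while start != -1:
            pam_start = start + len(protospacer)
            pam = seq[pam_start:pam_start + 3]
            if re.fullmatch(pam_pattern, pam):
                sites.append((strand, start, pam))
            start = seq.find(protospacer, start + 1)
    return sites


protospacer = "CCTTGACAGACAAATTAGGA"  # SEQ ID NO: 33 as recited above
# Toy target: hypothetical flanking nucleotides around the protospacer plus a TGG motif.
target = "GGATC" + protospacer + "TGG" + "ACGT"
sites = find_sites(target, protospacer)
print(sites)
```

A site is reported only when the motif check passes, so the same protospacer followed by a different trinucleotide would return an empty list under the stated assumption.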
[0233] In some embodiments, the CRISPR/Cas endonuclease is a Cas9 endonuclease.
[0234] In some embodiments, inserting at least one heterologous promoter into the biosynthetic gene cluster further comprises a donor template comprising a sequence of the heterologous promoter.
Vectors
[0235] The disclosure provides vectors comprising the polynucleotides of the disclosure. In some embodiments, the vectors comprise the sequence of the biosynthetic gene cluster of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto. In some embodiments, the vector comprises the sequence of a biosynthetic gene cluster comprising at least one modification relative to SEQ ID NO: 1, for example the insertion of a heterologous promoter.
[0236] Suitable vectors for the cloning and expression of the biosynthetic gene clusters described herein will be known to persons of ordinary skill in the art. For example, suitable vectors for expressing biosynthetic gene clusters in Streptomyces are described in US20200291430A1, the contents of which are incorporated by reference in their entirety herein.
[0237] Exemplary vectors include, inter alia, cloning sites, promoters to direct expression of gene products, and selectable markers for host cells such as Streptomyces and/or E. coli. In some embodiments, the expression vector further comprises an E. coli and/or Streptomyces origin of replication. In some embodiments, the expression vector further comprises one or more selectable markers for E. coli and/or Streptomyces. A number of antibiotic resistance markers are available for Streptomyces, and include thiostrepton (tsr), kanamycin-neomycin (kmr), apramycin (amr), geneticin, viomycin, hygromycin, bleomycin, chloramphenicol, and the like.
[0238] In some embodiments, the expression vector further comprises a gene that stabilizes large plasmids. In some embodiments, the expression vector is configured to accept an insert comprising more than 10 kb, more than 20 kb, more than 50 kb, and/or more than 100 kb.
[0239] Suitable vectors can be configured to express a product of the biosynthetic gene cluster nucleic acid when the expression vector is present in a host cell, such as a Streptomyces host cell.
[0240] In some embodiments the vector is an expression vector.
[0241] In some embodiments the vector is a shuttle vector. As used herein, the term “shuttle vector” refers to a vector constructed so that it can propagate in two different host species, e.g., E. coli and another organism such as Streptomyces.
[0242] In some embodiments, the vector is a plasmid or a bacterial artificial chromosome.
Synthesis of the Compounds of the Disclosure
[0243] In some embodiments, a compound of Formula (I), Formula (10), and/or Formula (11) is synthesized using a semi-synthetic approach. In some embodiments, a compound of Formula (I), Formula (10), and/or Formula (11) is synthesized using a biosynthetic approach.
[0244] In some embodiments, the compound is cyclized with the use of a biosynthetic gene cluster (BGC) such as the biosynthetic gene cluster described supra, sometimes referred to herein as the AZT039 biosynthetic gene cluster.
[0245] As used herein, “AZT039 biosynthetic gene cluster” refers to a biosynthetic gene cluster isolated or derived from Streptomyces species, which is described in further detail supra. In some embodiments, the AZT039 biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
[0246] In some embodiments, the wild-type AZT039 biosynthetic gene cluster is modified. In some embodiments, the modified AZT039 biosynthetic gene cluster produces a compound of Formula (I), Formula (10), and/or Formula (11). For the avoidance of doubt, the modification(s) of the BGC is necessary to produce quantifiable levels of the compounds of the disclosure. Modifications of the biosynthetic gene cluster can be carried out by any methods known in the art. For example, the BGC can be modified using a CRISPR/Cas endonuclease.
[0247] In some embodiments, the present disclosure provides a method of making a compound of Formula (I), Formula (10), and/or Formula (11) comprising: a. genome mining to identify a biosynthetic gene cluster; b. modifying the identified biosynthetic gene cluster; c. identifying a target compound; and d. isolating the target compound.
[0248] In some embodiments, the genome mining identifies a biosynthetic gene cluster. In some embodiments, the identified biosynthetic gene cluster is AZT039. In some embodiments, AZT039 is isolated or derived from Streptomyces strain NRRL F-6131.
[0249] In some embodiments, the genome is sequenced prior to modification.
[0250] In some embodiments, the biosynthetic gene cluster is modified by overexpression of at least one gene in the cluster. In some embodiments, the overexpressed gene is LmBU.
[0251] In some embodiments, the biosynthetic gene cluster is isolated. In some embodiments, the biosynthetic gene cluster is isolated prior to identifying the target compound. In some embodiments, the biosynthetic gene cluster is isolated prior to isolating the target compound.
[0252] In some embodiments, the biosynthetic gene cluster is expressed in a heterologous host. In some embodiments, the heterologous host is S. albus.
[0253] In some embodiments, the biosynthetic gene cluster is further modified. In some embodiments, the biosynthetic gene cluster is further modified by the insertion of one or more strong promoters, using methods provided herein. In some embodiments, the strong promoter is one or more selected from ermE and kasO, or a functional derivative thereof.
[0254] In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is isolated from culture.
[0255] In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is isolated and then purified.
[0256] In some embodiments, the present disclosure provides a method of making a compound of Formula (I), Formula (10), and/or Formula (11) further comprising a step of: (e) purifying the isolated compound.
[0257] In some embodiments, the present disclosure provides a method of making a compound of Formula (I), Formula (10), and/or Formula (11), or derivatizing the compound of Formula (I), Formula (10), and/or Formula (11), by solid phase peptide synthesis wherein the amino acid α-N-terminal is protected by an acid or base protecting group. Such protecting groups should have the properties of being stable to the conditions of peptide linkage formation while being readily removable without destruction of the growing peptide chain or racemization of any of the chiral centers contained therein. Suitable protecting groups are 9-fluorenylmethyloxycarbonyl (Fmoc), t-butyloxycarbonyl (Boc), benzyloxycarbonyl (Cbz), biphenylisopropyloxycarbonyl, t-amyloxycarbonyl, isobornyloxycarbonyl, α,α-dimethyl-3,5-dimethoxybenzyloxycarbonyl, o-nitrophenylsulfenyl, 2-cyano-t-butyloxycarbonyl, and the like. Side chain protecting groups include, for example: for side chain amino groups (e.g., lysine and arginine), 2,2,5,7,8-pentamethylchroman-6-sulfonyl (Pmc), nitro, p-toluenesulfonyl, 4-methoxybenzenesulfonyl, Cbz, Boc, and adamantyloxycarbonyl; for tyrosine, benzyl, o-bromobenzyloxycarbonyl, 2,6-dichlorobenzyl, isopropyl, t-butyl (t-Bu), cyclohexyl, cyclopentyl and acetyl (Ac); for serine, t-butyl, benzyl and tetrahydropyranyl; for histidine, trityl, benzyl, Cbz, p-toluenesulfonyl and 2,4-dinitrophenyl; for tryptophan, formyl; for aspartic acid and glutamic acid, benzyl and t-butyl; and for cysteine, triphenylmethyl (trityl). In the solid phase peptide synthesis method, the α-C-terminal amino acid is attached to a suitable solid support or resin. Suitable solid supports useful for the above synthesis are those materials which are inert to the reagents and reaction conditions of the stepwise condensation-deprotection reactions, as well as being insoluble in the media used.
Solid supports for synthesis of α-C-terminal carboxy peptides may be 4-hydroxymethylphenoxymethyl-copoly(styrene-1% divinylbenzene) or 4-(2',4'-dimethoxyphenyl-Fmoc-aminomethyl)phenoxyacetamidoethyl resin. The α-C-terminal amino acid may be coupled to the resin by means of N,N'-dicyclohexylcarbodiimide (DCC), N,N'-diisopropylcarbodiimide (DIC), or O-benzotriazol-1-yl-N,N,N',N'-tetramethyluronium hexafluorophosphate (HBTU), with or without 4-dimethylaminopyridine (DMAP), 1-hydroxybenzotriazole (HOBT), benzotriazol-1-yloxy-tris(dimethylamino)phosphonium hexafluorophosphate (BOP), or bis(2-oxo-3-oxazolidinyl)phosphine chloride (BOPCl), mediated coupling for from about 1 hour to about 24 hours at a temperature of between 10°C and 50°C in a solvent (e.g., dichloromethane or DMF). When the solid support is 4-(2',4'-dimethoxyphenyl-Fmoc-aminomethyl)phenoxyacetamidoethyl resin, the Fmoc group is cleaved with a secondary amine (e.g., piperidine) prior to coupling with the α-C-terminal amino acid as described above. The coupling of successive protected amino acids may be carried out in an automatic polypeptide synthesizer. In some embodiments, the α-N-termini of the amino acids of the growing peptide chain are protected with Fmoc. The removal of the Fmoc protecting group from the α-N-terminal side of the growing peptide may be accomplished by treatment with a secondary amine (e.g., piperidine). Each protected amino acid may then be introduced in about 3-fold molar excess, and the coupling may be carried out in DMF. Following completion of synthesis, the polypeptide is removed from the resin and deprotected, either successively or in a single operation. Removal of the polypeptide and deprotection may be accomplished in a single operation by treating the resin-bound polypeptide with a cleavage reagent (e.g., thioanisole, water, ethanedithiol, and trifluoroacetic acid). In cases wherein the α-C-terminal of the polypeptide is an alkylamide, the resin may be cleaved by aminolysis with an alkylamine. Alternatively, the peptide may be removed by transesterification (e.g., with methanol) followed by aminolysis or by direct transamidation. The protected peptide may be purified or taken directly to the next step without purification. The removal of the side chain protecting groups may be accomplished using the appropriate cleavage conditions.
The fully deprotected peptide may be purified by a sequence of chromatographic steps employing one or more of the following types: ion exchange on a weakly basic resin (acetate form); hydrophobic adsorption chromatography on underivatized polystyrene-divinylbenzene (e.g., Amberlite XAD); silica gel adsorption chromatography; ion exchange chromatography on carboxymethylcellulose; partition chromatography (e.g., on Sephadex G-25, LH-20 or countercurrent distribution); high performance liquid chromatography (HPLC), such as reverse-phase HPLC on octyl- or octadecylsilyl-silica bonded phase column packing.
[0258] In some embodiments, compounds of the present disclosure can be prepared in a variety of ways using commercially available starting materials, compounds known in the literature, or from readily prepared intermediates, by employing standard synthetic methods and procedures either known to those skilled in the art, or which will be apparent to the skilled artisan in light of the teachings herein. Standard synthetic methods and procedures for the preparation of organic molecules and functional group transformations and manipulations can be obtained from the relevant scientific literature or from standard textbooks in the field. Although not limited to any one or several sources, classic texts such as Smith, M. B., March, J., March's Advanced Organic Chemistry: Reactions, Mechanisms, and Structure, 5th edition, John Wiley & Sons: New York, 2001; Greene, T.W., Wuts, P.G. M., Protective Groups in Organic Synthesis, 3rd edition, John Wiley & Sons: New York, 1999; R. Larock, Comprehensive Organic Transformations, VCH Publishers (1989); L. Fieser and M. Fieser, Fieser and Fieser's Reagents for Organic Synthesis, John Wiley and Sons (1994); and L. Paquette, ed., Encyclopedia of Reagents for Organic Synthesis, John Wiley and Sons (1995), incorporated by reference herein, are useful and recognized reference textbooks of organic synthesis known to those in the art.
[0259] One of ordinary skill in the art will note that, during the reaction sequences and synthetic scheme described herein, the order of certain steps may be changed, such as the introduction and removal of protecting groups. One of ordinary skill in the art will recognize that certain groups may require protection from the reaction conditions via the use of protecting groups. Protecting groups may also be used to differentiate similar functional groups in molecules. A list of protecting groups and how to introduce and remove these groups can be found in Greene, T.W., Wuts, P.G. M., Protective Groups in Organic Synthesis, 3rd edition, John Wiley & Sons: New York, 1999.
[0260] It is to be understood that one skilled in the art may refer to general reference texts for detailed descriptions of known techniques discussed herein or equivalent techniques. These texts include Ausubel et al., Current Protocols in Molecular Biology, John Wiley and Sons, Inc. (2005); Sambrook et al., Molecular Cloning, A Laboratory Manual (3rd edition), Cold Spring Harbor Press, Cold Spring Harbor, New York (2000); Coligan et al., Current Protocols in Immunology, John Wiley & Sons, N.Y.; Enna et al., Current Protocols in Pharmacology, John Wiley & Sons, N.Y.; Fingl et al., The Pharmacological Basis of Therapeutics (1975), Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA, 18th edition (1990). These texts can, of course, also be referred to in making or using an aspect of the disclosure.
Production of the Compounds of Formula (I), Formula (10), and/or Formula (11) from Host Cells
[0261] The disclosure provides methods of making a compound of Formula (I), Formula (10), and/or Formula (11) in a host cell comprising the biosynthetic gene cluster described herein.
[0262] In some embodiments, the host cell does not produce a compound of Formula (I), Formula (10), and/or Formula (11) in the absence of the biosynthetic gene cluster described herein.
[0263] The disclosure provides methods of making the compound of Formula (I), comprising (a) introducing into a host cell the polynucleotides or vectors of the disclosure; (b) culturing the host cell under conditions sufficient for the synthesis of the compound of Formula (I) by the biosynthetic gene cluster; and (c) isolating and purifying the compound of Formula (I). In some embodiments, the host cell is a Streptomyces cell, such as a Streptomyces coelicolor or Streptomyces albus cell. In some embodiments, the host cell comprises a sequence encoding a LmBU operably linked to a constitutive promoter. In some embodiments, the promoter is selected from ermE and kasO or functional variants or derivatives thereof. In some embodiments, the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
[0264] Methods of introducing polynucleotides and vectors into suitable host cells will be known to persons of ordinary skill in the art, and include electroporation and conjugation with an E. coli cell comprising the polynucleotide or vector.
[0265] Intergeneric conjugation with E. coli allows for the introduction of vectors into Streptomyces species. Exemplary vectors for intergeneric conjugation between E. coli and Streptomyces comprise the 760-bp oriT fragment for conjugation, but require the transfer functions to be supplied in trans by the E. coli donor strain. Some vectors include the attachment site (attP) and the integrase (int) function of the temperate phage φC31 to facilitate the site-specific integration of the vector at the attB site of the Streptomyces chromosome.
Host Cells
[0266] The disclosure provides host cells, comprising the polynucleotides and vectors described herein.
[0267] In some embodiments, for example those embodiments where the BGC is an unmodified BGC of SEQ ID NO: 1, the host cell further comprises a polynucleotide comprising a sequence encoding a LmBU operably linked to one or more constitutive promoters, such as ermE* and/or kasO*. In some embodiments the sequence encoding the LmBU comprises SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
[0268] The host cell, or host organism, is typically, but not necessarily, a genetically tractable (e.g., culturable under laboratory conditions and manipulable by molecular biological techniques) organism. The host organism may be a member of the domain Bacteria, the domain Eukarya, or the domain Archaea. In some embodiments, the host microorganism is from the domain Bacteria. In some embodiments, the host organism is a bacterium in the terrabacteria group. In particular embodiments, the host microorganism is from the taxa Actinobacteria, Streptomycetales, or Streptomycetaceae. In some embodiments, the host is from the genus Streptomyces. In some embodiments, the host is a Streptomyces expression strain, e.g., as defined herein (e.g., Streptomyces avermitilis, Streptomyces venezuelae, Streptomyces albus, Streptomyces lividans, and Streptomyces coelicolor). In some embodiments, the host organism is a Streptomyces species. In some embodiments, the host is Streptomyces albus.
[0269] As used herein, the term “Streptomyces expression strains” or “heterologous Streptomyces expression strains” refers to bacterial strains including, but not limited to, commonly used species such as Streptomyces avermitilis, Streptomyces venezuelae, Streptomyces albus, Streptomyces lividans, and Streptomyces coelicolor.
Methods of culturing host cells will be known to persons of ordinary skill in the art and are described in “Laboratory Maintenance of Streptomyces species,” Curr Protoc Microbiol. 2010 Aug; CHAPTER: Unit 10E.1, the contents of which are incorporated by reference in their entirety herein. For example, Streptomyces may be grown in suitable liquid media (e.g., Tryptic Soy Broth (TSB), R2YE and YEME media) at about 28 °C, in baffled Erlenmeyer or similar shaking flask systems. Long term storage of Streptomyces can be accomplished through glycerol stocks.
Pharmaceutical Compositions
[0270] In some aspects, the present disclosure provides a pharmaceutical composition comprising one or more compounds of any one of Formula (I), Formula (10), and/or Formula (11) as an active ingredient. In some embodiments, the present disclosure provides a pharmaceutical composition comprising one or more compounds of any one of Formula (I), Formula (10), and/or Formula (11) and one or more pharmaceutically acceptable carriers, diluents or excipients. Pharmaceutically acceptable carriers, diluents or excipients include without limitation any adjuvant, carrier, excipient, glidant, sweetening agent, diluent, preservative, dye/colorant, flavor enhancer, surfactant, wetting agent, dispersing agent, suspending agent, stabilizer, isotonic agent, solvent, or emulsifier.
[0271] As used herein, the term “composition” is intended to encompass a product comprising the specified ingredients in the specified amounts, as well as any product which results, directly or indirectly, from combination of the specified ingredients in the specified amounts.
[0272] It is to be understood that the present disclosure also provides pharmaceutical compositions comprising any compound described herein in combination with at least one pharmaceutically acceptable excipient or carrier.
[0273] As used herein, the term “pharmaceutical composition” is a formulation containing the compounds of the present disclosure in a form suitable for administration to a subject. In some embodiments, the pharmaceutical composition is in bulk or in unit dosage form. The unit dosage form is any of a variety of forms, including, for example, a capsule, an IV bag, a tablet, a single pump on an aerosol inhaler or a vial. The quantity of active ingredient (e.g., a formulation of a compound of Formula (I), Formula (10), and/or Formula (11)) in a unit dose of composition is an effective amount and is varied according to the particular treatment involved. One skilled in the art will appreciate that it is sometimes necessary to make routine variations to the dosage depending on the age and condition of the patient. The dosage will also depend on the route of administration. A variety of routes are contemplated, including oral, pulmonary, rectal, parenteral, transdermal, subcutaneous, intravenous, intramuscular, intraperitoneal, inhalational, buccal, sublingual, intrapleural, intrathecal, intranasal, and the like. Dosage forms for the topical or transdermal administration of a compound of this disclosure include powders, sprays, ointments, pastes, creams, lotions, gels, solutions, patches and inhalants. In one embodiment, the active compound is mixed under sterile conditions with a pharmaceutically acceptable carrier, and with any preservatives, buffers, or propellants that are required.
[0274] It is to be understood that, for any compound, the therapeutically effective amount can be estimated initially either in cell culture assays, e.g., of neoplastic cells, or in animal models, usually rats, mice, rabbits, dogs, or pigs. The animal model may also be used to determine the appropriate concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans. Therapeutic/prophylactic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., ED50 (the dose therapeutically effective in 50 % of the population) and LD50 (the dose lethal to 50 % of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index, and it can be expressed as the ratio, LD50/ED50. Pharmaceutical compositions that exhibit large therapeutic indices are preferred. The dosage may vary within this range depending upon the dosage form employed, sensitivity of the patient, and the route of administration.
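The therapeutic index defined above is a simple ratio, sketched below. The function name is hypothetical and the LD50 and ED50 values are invented purely for the arithmetic; they are not data for any compound of the disclosure.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return LD50/ED50; both doses must be positive and in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, for illustration only.
example_index = therapeutic_index(ld50=500.0, ed50=25.0)
```

A larger ratio means a wider margin between the therapeutically effective dose and the lethal dose, which is why large therapeutic indices are described as preferred.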
[0275] Dosage and administration are adjusted to provide sufficient levels of the active agent(s) or to maintain the desired effect. Factors which may be taken into account include the severity of the disease state, general health of the subject, age, weight, and gender of the subject, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or once every two weeks depending on half-life and clearance rate of the particular formulation.
[0276] The pharmaceutical compositions containing active compounds of the present disclosure may be manufactured in a manner that is generally known, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. Pharmaceutical compositions may be formulated in a conventional manner using one or more pharmaceutically acceptable carriers comprising excipients and/or auxiliaries that facilitate processing of the active compounds into preparations that can be used pharmaceutically. Of course, the appropriate formulation is dependent upon the route of administration chosen.
[0277] The compounds, or pharmaceutically acceptable salts thereof, may be administered orally, nasally, transdermally, pulmonary, inhalationally, buccally, sublingually, intraperitoneally, subcutaneously, intramuscularly, intravenously, rectally, intrapleurally, intrathecally and parenterally. In one embodiment, the compound is administered orally. One skilled in the art will recognize the advantages of certain routes of administration.
[0278] The dosage regimen utilizing the compounds is selected in accordance with a variety of factors including type, species, age, weight, sex and medical condition of the patient; the severity of the condition to be treated; the route of administration; the renal and hepatic function of the patient; and the particular compound or salt thereof employed. An ordinarily skilled physician or veterinarian can readily determine and prescribe the effective amount of the drug required to prevent, counter, or arrest the progress of the condition.
[0279] In certain embodiments, the pharmaceutical compositions of the present disclosure may additionally contain other adjunct components conventionally found in pharmaceutical compositions, at their art-established usage levels. Thus, for example, the pharmaceutical compositions may contain additional, compatible, pharmaceutically-active materials such as antipruritics, astringents, local anesthetics or anti-inflammatory agents, or may contain additional materials useful in physically formulating various dosage forms of the compositions of the present invention, such as dyes, flavoring agents, preservatives, antioxidants, opacifiers, thickening agents and stabilizers. However, such materials, when added, should not unduly interfere with the biological activities of the components of the compositions of the present invention. The formulations can be sterilized and, if desired, mixed with auxiliary agents, e.g., lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, colorings, flavorings and/or aromatic substances and the like which do not deleteriously interact with the active compound(s) of the formulation.
[0280] Techniques for formulation and administration of the disclosed compounds of the disclosure can be found in Remington: the Science and Practice of Pharmacy, 19th edition, Mack Publishing Co., Easton, PA (1995). In an embodiment, the compounds described herein, and the pharmaceutically acceptable salts thereof, are used in pharmaceutical preparations in combination with a pharmaceutically acceptable carrier or diluent. Suitable pharmaceutically acceptable carriers include inert solid fillers or diluents and sterile aqueous or organic solutions. The compounds will be present in such pharmaceutical compositions in amounts sufficient to provide the desired dosage amount in the range described herein.
[0281] The compound of Formula (I), Formula (10), and/or Formula (11) can be formulated for oral administration in forms such as, for example, tablets, lozenges, hard or soft capsules, aqueous or oily suspensions, emulsions, dispersible powders, granules, syrups, elixirs, and tinctures. The compound of Formula (I), Formula (10), and/or Formula (11) can also be formulated for intravenous (bolus or infusion), intraperitoneal, topical (for example as creams, ointments, gels, or aqueous or oily solutions or suspensions), inhalation (for example as a finely divided powder or a liquid aerosol), insufflation (for example as a finely divided powder), parenteral (for example as a sterile aqueous or oily solution for intravenous, subcutaneous, intramuscular, or intraperitoneal dosing), rectal (for example as a suppository), or transdermal (e.g., patch) administration.
[0282] In some embodiments, the present disclosure provides pharmaceutical compositions comprising a compound of Formula (I), Formula (10), and/or Formula (11) combined with a pharmaceutically acceptable carrier. In some embodiments, suitable pharmaceutically acceptable carriers include, but are not limited to, inert solid fillers or diluents and sterile aqueous or organic solutions. Pharmaceutically acceptable carriers are well known to those skilled in the art and include, but are not limited to, from about 0.01 to about 0.1 M phosphate buffer or saline (e.g., about 0.8%). Such pharmaceutically acceptable carriers can be aqueous or non-aqueous solutions, suspensions and emulsions. Examples of non-aqueous solvents suitable for use in the present application include, but are not limited to, propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate.
[0283] Liquid carriers suitable for use in the present application can be used in preparing solutions, suspensions, emulsions, syrups, elixirs and pressurized compounds. The active ingredient can be dissolved or suspended in a pharmaceutically acceptable liquid carrier such as water, an organic solvent, a mixture of both or pharmaceutically acceptable oils or fats. The liquid carrier can contain other suitable pharmaceutical additives such as solubilizers, emulsifiers, buffers, preservatives, sweeteners, flavoring agents, suspending agents, thickening agents, colors, viscosity regulators, stabilizers or osmo-regulators.
[0284] Liquid carriers suitable for use in the present application include, but are not limited to, water (partially containing additives as above, e.g. cellulose derivatives, preferably sodium carboxymethyl cellulose solution), alcohols (including monohydric alcohols and polyhydric alcohols, e.g. glycols) and their derivatives, and oils (e.g. fractionated coconut oil and arachis oil). For parenteral administration, the carrier can also include an oily ester such as ethyl oleate and isopropyl myristate. Sterile liquid carriers are useful in sterile liquid form comprising compounds for parenteral administration. The liquid carrier for pressurized compounds disclosed herein can be halogenated hydrocarbon or other pharmaceutically acceptable propellent.
[0285] Aqueous carriers suitable for use in the present application include, but are not limited to, water, ethanol, alcoholic/aqueous solutions, glycerol, emulsions or suspensions, including saline and buffered media. Oral carriers can be elixirs, syrups, capsules, tablets and the like.
[0286] The formulation of the present disclosure may be in the form of an aqueous solution comprising an aqueous vehicle. The aqueous vehicle component may comprise water and at least one pharmaceutically acceptable excipient. Suitable acceptable excipients include those selected from the group consisting of a solubility enhancing agent, chelating agent, preservative, tonicity agent, viscosity/suspending agent, buffer, and pH modifying agent, and a mixture thereof.
[0287] Any suitable solubility enhancing agent can be used. Examples of a solubility enhancing agent include cyclodextrin, such as those selected from the group consisting of hydroxypropyl-β-cyclodextrin, methyl-β-cyclodextrin, randomly methylated-β-cyclodextrin, ethylated-β-cyclodextrin, triacetyl-β-cyclodextrin, peracetylated-β-cyclodextrin, carboxymethyl-β-cyclodextrin, hydroxyethyl-β-cyclodextrin, 2-hydroxy-3-(trimethylammonio)propyl-β-cyclodextrin, glucosyl-β-cyclodextrin, sulfated β-cyclodextrin (S-β-CD), maltosyl-β-cyclodextrin, β-cyclodextrin sulfobutyl ether, branched-β-cyclodextrin, hydroxypropyl-γ-cyclodextrin, randomly methylated-γ-cyclodextrin, and trimethyl-γ-cyclodextrin, and mixtures thereof.
[0288] Any suitable chelating agent can be used. Examples of a suitable chelating agent include those selected from the group consisting of ethylenediaminetetraacetic acid and metal salts thereof, disodium edetate, trisodium edetate, and tetrasodium edetate, and mixtures thereof.
[0289] Any suitable preservative can be used. Examples of a preservative include those selected from the group consisting of quaternary ammonium salts such as benzalkonium halides (preferably benzalkonium chloride), chlorhexidine gluconate, benzethonium chloride, cetyl pyridinium chloride, benzyl bromide, phenylmercury nitrate, phenylmercury acetate, phenylmercury neodecanoate, merthiolate, methylparaben, propylparaben, sorbic acid, potassium sorbate, sodium benzoate, sodium propionate, ethyl p-hydroxybenzoate, propylaminopropyl biguanide, and butyl-p-hydroxybenzoate, and mixtures thereof.
[0290] The aqueous vehicle may also include a tonicity agent to adjust the tonicity (osmotic pressure). The tonicity agent can be selected from the group consisting of a glycol (such as propylene glycol, diethylene glycol, triethylene glycol), glycerol, dextrose, glycerin, mannitol, potassium chloride, and sodium chloride, and a mixture thereof.
[0291] The aqueous vehicle may also contain a viscosity/suspending agent. Suitable viscosity/suspending agents include those selected from the group consisting of cellulose derivatives, such as methyl cellulose, ethyl cellulose, hydroxyethylcellulose, polyethylene glycols (such as polyethylene glycol 300, polyethylene glycol 400), carboxymethyl cellulose, hydroxypropylmethyl cellulose, and cross-linked acrylic acid polymers (carbomers), such as polymers of acrylic acid cross-linked with polyalkenyl ethers or divinyl glycol (Carbopols, such as Carbopol 934, Carbopol 934P, Carbopol 971, Carbopol 974 and Carbopol 974P), and a mixture thereof.
[0292] In order to adjust the formulation to an acceptable pH (typically a pH range of about 5.0 to about 9.0, more preferably about 5.5 to about 8.5, particularly about 6.0 to about 8.5, about 7.0 to about 8.5, about 7.2 to about 7.7, about 7.1 to about 7.9, or about 7.5 to about 8.0), the formulation may contain a pH modifying agent. The pH modifying agent is typically a mineral acid or metal hydroxide base, selected from the group of potassium hydroxide, sodium hydroxide, and hydrochloric acid, and mixtures thereof, and preferably sodium hydroxide and/or hydrochloric acid. These acidic and/or basic pH modifying agents are added to adjust the formulation to the target acceptable pH range. Hence it may not be necessary to use both acid and base - depending on the formulation, the addition of one of the acid or base may be sufficient to bring the mixture to the desired pH range.
[0293] The aqueous vehicle may also contain a buffering agent to stabilize the pH. When used, the buffer is selected from the group consisting of a phosphate buffer (such as sodium dihydrogen phosphate and disodium hydrogen phosphate), a borate buffer (such as boric acid, or salts thereof including disodium tetraborate), a citrate buffer (such as citric acid, or salts thereof including sodium citrate), and ε-aminocaproic acid, and mixtures thereof.
[0294] Solid carriers suitable for use in the present application include, but are not limited to, inert substances such as lactose, starch, glucose, methyl-cellulose, magnesium stearate, dicalcium phosphate, mannitol and the like. A solid carrier can further include one or more substances acting as flavoring agents, lubricants, solubilizers, suspending agents, fillers, glidants, compression aids, binders or tablet-disintegrating agents; it can also be an encapsulating material. In powders, the carrier can be a finely divided solid which is in admixture with the finely divided active compound. In tablets, the active compound is mixed with a carrier having the necessary compression properties in suitable proportions and compacted in the shape and size desired. The powders and tablets preferably contain up to 99% of the active compound. Suitable solid carriers include, for example, calcium phosphate, magnesium stearate, talc, sugars, lactose, dextrin, starch, gelatin, cellulose, polyvinylpyrrolidone, low melting waxes and ion exchange resins. A tablet may be made by compression or molding, optionally with one or more accessory ingredients. Compressed tablets may be prepared by compressing in a suitable machine the active ingredient in a free flowing form such as a powder or granules, optionally mixed with a binder (e.g., povidone, gelatin, hydroxypropylmethyl cellulose), lubricant, inert diluent, preservative, disintegrant (e.g., sodium starch glycolate, cross-linked povidone, cross-linked sodium carboxymethyl cellulose), or surface active or dispersing agent. Molded tablets may be made by molding in a suitable machine a mixture of the powdered compound moistened with an inert liquid diluent. The tablets may optionally be coated or scored and may be formulated so as to provide slow or controlled release of the active ingredient therein using, for example, hydroxypropyl methylcellulose in varying proportions to provide the desired release profile. Tablets may optionally be provided with an enteric coating, to provide release in parts of the gut other than the stomach.
[0295] Parenteral carriers suitable for use in the present application include, but are not limited to, sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's and fixed oils. Intravenous carriers include fluid and nutrient replenishers, electrolyte replenishers such as those based on Ringer's dextrose and the like. Preservatives and other additives can also be present, such as, for example, antimicrobials, antioxidants, chelating agents, inert gases and the like.
[0296] Carriers suitable for use in the present application can be mixed as needed with disintegrants, diluents, granulating agents, lubricants, binders and the like using conventional techniques known in the art. The carriers can also be sterilized using methods that do not deleteriously react with the compounds, as is generally known in the art.
[0297] Diluents may be added to the formulations of the present invention. Diluents increase the bulk of a solid pharmaceutical composition and/or combination, and may make a pharmaceutical dosage form containing the composition and/or combination easier for the patient and care giver to handle. Diluents for solid compositions and/or combinations include, for example, microcrystalline cellulose (e.g., AVICEL), microfine cellulose, lactose, starch, pregelatinized starch, calcium carbonate, calcium sulfate, sugar, dextrates, dextrin, dextrose, dibasic calcium phosphate dihydrate, tribasic calcium phosphate, kaolin, magnesium carbonate, magnesium oxide, maltodextrin, mannitol, polymethacrylates (e.g., EUDRAGIT®), potassium chloride, powdered cellulose, sodium chloride, sorbitol, and talc.
[0298] In various embodiments, the pharmaceutical composition may be selected from the group consisting of a solid, powder, liquid and a gel. In certain embodiments, the pharmaceutical composition of the present disclosure is a solid (e.g., a powder, tablet, a capsule, granulates, and/or aggregates). In certain of such embodiments, the solid pharmaceutical composition comprises one or more excipients known in the art, including, but not limited to, starches, sugars, diluents, granulating agents, lubricants, binders, and disintegrating agents.
[0299] In some embodiments, the pharmaceutical compositions of the present disclosure are prepared for oral administration. In certain of such embodiments, the pharmaceutical compositions are formulated by combining one or more agents and pharmaceutically acceptable carriers. Certain of such carriers enable pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a subject. Suitable excipients include, but are not limited to, fillers, such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). In certain embodiments, such a mixture is optionally ground and auxiliaries are optionally added. In certain embodiments, pharmaceutical compositions are formed to obtain tablets or dragee cores. In certain embodiments, disintegrating agents (e.g., cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof, such as sodium alginate) are added.
[0300] In some embodiments, dragee cores are provided with coatings. In certain such embodiments, concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to tablets or dragee coatings.
[0301] In some embodiments, pharmaceutical compositions for oral administration are push-fit capsules made of gelatin. Certain of such push-fit capsules comprise one or more pharmaceutical agents of the present invention in admixture with one or more fillers such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers. In certain embodiments, the pharmaceutical compositions for oral administration are soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. In certain soft capsules, one or more compounds disclosed herein, or a pharmaceutically acceptable solvate, hydrate, tautomer, N-oxide, or salt thereof, are dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added.
[0302] Solid pharmaceutical compositions that are compacted into a dosage form, such as a tablet, may include excipients whose functions include helping to bind the active ingredient and other excipients together after compression. Binders for solid pharmaceutical compositions and/or combinations include acacia, alginic acid, carbomer (e.g., carbopol), carboxymethylcellulose sodium, dextrin, ethyl cellulose, gelatin, guar gum, gum tragacanth, hydrogenated vegetable oil, hydroxyethyl cellulose, hydroxypropyl cellulose (e.g., KLUCEL), hydroxypropyl methyl cellulose (e.g., METHOCEL), liquid glucose, magnesium aluminum silicate, maltodextrin, methylcellulose, polymethacrylates, povidone (e.g., KOLLIDON, PLASDONE), pregelatinized starch, sodium alginate, and starch.
[0303] The dissolution rate of a compacted solid pharmaceutical composition in the patient’s stomach may be increased by the addition of a disintegrant to the composition and/or combination. Disintegrants include alginic acid, carboxymethylcellulose calcium, carboxymethylcellulose sodium (e.g., AC-DI-SOL and PRIMELLOSE), colloidal silicon dioxide, croscarmellose sodium, crospovidone (e.g., KOLLIDON and POLYPLASDONE), guar gum, magnesium aluminum silicate, methyl cellulose, microcrystalline cellulose, polacrilin potassium, powdered cellulose, pregelatinized starch, sodium alginate, sodium starch glycolate (e.g., EXPLOTAB), potato starch, and starch.
[0304] Glidants can be added to improve the flowability of a non-compacted solid composition and/or combination and to improve the accuracy of dosing. Excipients that may function as glidants include colloidal silicon dioxide, magnesium trisilicate, powdered cellulose, starch, talc, and tribasic calcium phosphate.
[0305] When a dosage form such as a tablet is made by the compaction of a powdered composition, the composition is subjected to pressure from a punch and die. Some excipients and active ingredients have a tendency to adhere to the surfaces of the punch and die, which can cause the product to have pitting and other surface irregularities. A lubricant can be added to the composition and/or combination to reduce adhesion and ease the release of the product from the die. Lubricants include magnesium stearate, calcium stearate, glyceryl monostearate, glyceryl palmitostearate, hydrogenated castor oil, hydrogenated vegetable oil, mineral oil, polyethylene glycol, sodium benzoate, sodium lauryl sulfate, sodium stearyl fumarate, stearic acid, talc, and zinc stearate.
[0306] Flavoring agents and flavor enhancers make the dosage form more palatable to the patient. Common flavoring agents and flavor enhancers for pharmaceutical products that may be included in the composition and/or combination of the present invention include maltol, vanillin, ethyl vanillin, menthol, citric acid, fumaric acid, ethyl maltol, and tartaric acid.
[0307] Solid and liquid compositions may also be dyed using any pharmaceutically acceptable colorant to improve their appearance and/or facilitate patient identification of the product and unit dosage level.
[0308] In certain embodiments, a pharmaceutical composition of the present invention is a liquid (e.g., a suspension, elixir and/or solution). In certain of such embodiments, a liquid pharmaceutical composition is prepared using ingredients known in the art, including, but not limited to, water, glycols, oils, alcohols, flavoring agents, preservatives, and coloring agents.
[0309] Liquid pharmaceutical compositions can be prepared using compounds of the present disclosure, or a pharmaceutically acceptable solvate, hydrate, tautomer, N-oxide, or salt thereof, and any other solid excipients where the components are dissolved or suspended in a liquid carrier such as water, vegetable oil, alcohol, polyethylene glycol, propylene glycol, or glycerin.
[0310] For example, formulations for parenteral administration can contain as common excipients sterile water or saline, polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, hydrogenated naphthalenes and the like. In particular, biocompatible, biodegradable lactide polymer, lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers can be useful excipients to control the release of active compounds. Other potentially useful parenteral delivery systems include ethylene-vinyl acetate copolymer particles, osmotic pumps, implantable infusion systems, and liposomes. Formulations for inhalation administration contain as excipients, for example, lactose, or can be aqueous solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate and deoxycholate, or oily solutions for administration in the form of nasal drops, or as a gel to be applied intranasally. Formulations for parenteral administration can also include glycocholate for buccal administration, methoxysalicylate for rectal administration, or citric acid for vaginal administration.
[0311] Liquid pharmaceutical compositions can contain emulsifying agents to disperse uniformly throughout the composition and/or combination an active ingredient or other excipient that is not soluble in the liquid carrier. Emulsifying agents that may be useful in liquid compositions and/or combinations of the present invention include, for example, gelatin, egg yolk, casein, cholesterol, acacia, tragacanth, chondrus, pectin, methyl cellulose, carbomer, cetostearyl alcohol, and cetyl alcohol.
[0312] Liquid pharmaceutical compositions can also contain a viscosity enhancing agent to improve the mouth-feel of the product and/or coat the lining of the gastrointestinal tract. Such agents include acacia, alginic acid, bentonite, carbomer, carboxymethylcellulose calcium or sodium, cetostearyl alcohol, methyl cellulose, ethylcellulose, gelatin, guar gum, hydroxyethyl cellulose, hydroxypropyl cellulose, hydroxypropyl methyl cellulose, maltodextrin, polyvinyl alcohol, povidone, propylene carbonate, propylene glycol alginate, sodium alginate, sodium starch glycolate, starch, tragacanth, and xanthan gum.
[0313] Sweetening agents such as aspartame, lactose, sorbitol, saccharin, sodium saccharin, sucrose, fructose, mannitol, and invert sugar may be added to improve the taste.
[0314] Preservatives and chelating agents such as alcohol, sodium benzoate, butylated hydroxytoluene, butylated hydroxyanisole, and ethylenediaminetetraacetic acid may be added at levels safe for ingestion to improve storage stability.
[0315] In some embodiments, a pharmaceutical composition is prepared for administration by injection (e.g., intravenous, subcutaneous, intramuscular, etc.). In certain of such embodiments, a pharmaceutical composition comprises a carrier and is formulated in aqueous solution, such as water or physiologically compatible buffers such as Hanks's solution, Ringer's solution, or physiological saline buffer. In certain embodiments, other ingredients are included (e.g., ingredients that aid in solubility or serve as preservatives). In certain embodiments, injectable suspensions are prepared using appropriate liquid carriers, suspending agents and the like. Certain pharmaceutical compositions for injection are presented in unit dosage form, e.g., in ampoules or in multi-dose containers. Certain pharmaceutical compositions for injection are suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents. Certain solvents suitable for use in pharmaceutical compositions for injection include, but are not limited to, lipophilic solvents and fatty oils, such as sesame oil, synthetic fatty acid esters, such as ethyl oleate or triglycerides, and liposomes. Aqueous injection suspensions may contain substances that increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, such suspensions may also contain suitable stabilizers or agents that increase the solubility of the pharmaceutical agents to allow for the preparation of highly concentrated solutions.
[0316] The sterile injectable preparation may also be a sterile injectable solution or suspension in a non-toxic parenterally acceptable diluent or solvent, such as a solution in 1,3-butanediol, or prepared as a lyophilized powder. Among the acceptable vehicles and solvents that may be employed are water, Ringer's solution and isotonic sodium chloride solution. In addition, sterile fixed oils may conventionally be employed as a solvent or suspending medium. For this purpose any bland fixed oil may be employed including synthetic mono- or diglycerides. In addition, fatty acids such as oleic acid may likewise be used in the preparation of injectables. Formulations for intravenous administration can comprise solutions in sterile isotonic aqueous buffer. Where necessary, the formulations can also include a solubilizing agent and a local anesthetic to ease pain at the site of the injection. Generally, the ingredients are supplied either separately or mixed together in unit dosage form, for example, as a dry lyophilized powder or water free concentrate in a hermetically sealed container such as an ampule or sachet indicating the quantity of active agent. Where the compound is to be administered by infusion, it can be dispensed in a formulation with an infusion bottle containing sterile pharmaceutical grade water, saline or dextrose/water. Where the compound is administered by injection, an ampule of sterile water for injection or saline can be provided so that the ingredients can be mixed prior to administration.
[0317] Suitable formulations further include aqueous and non-aqueous sterile injection solutions that can contain antioxidants, buffers, bacteriostats, bactericidal antibiotics and solutes that render the formulation isotonic with the bodily fluids of the intended recipient; and aqueous and non-aqueous sterile suspensions, which can include suspending agents and thickening agents.
[0318] In certain embodiments, a pharmaceutical composition of the present invention is formulated as a depot preparation. Certain such depot preparations are typically longer acting than non-depot preparations. In certain embodiments, such preparations are administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. In certain embodiments, depot preparations are prepared using suitable polymeric or hydrophobic materials (for example an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.
[0319] In certain embodiments, a pharmaceutical composition of the present invention comprises a sustained-release system. A non-limiting example of such a sustained-release system is a semi-permeable matrix of solid hydrophobic polymers. In certain embodiments, sustained-release systems may, depending on their chemical nature, release pharmaceutical agents over a period of hours, days, weeks or months.
[0320] The formulation may further comprise a wetting agent. Suitable classes of wetting agents include those selected from the group consisting of polyoxypropylene-polyoxyethylene block copolymers (poloxamers), polyethoxylated ethers of castor oils, polyoxyethylenated sorbitan esters (polysorbates), polymers of oxyethylated octyl phenol (Tyloxapol), polyoxyl 40 stearate, fatty acid glycol esters, fatty acid glyceryl esters, sucrose fatty esters, and polyoxyethylene fatty esters, and mixtures thereof.
[0321] The compound of any one of Formula (I), Formula (10), or Formula (11) may be present in the composition in a therapeutically effective amount. For example, in some embodiments, the compound may be administered at about 0.001 mg/kg to about 100 mg/kg body weight (e.g., about 0.01 mg/kg to about 10 mg/kg or about 0.1 mg/kg to about 5 mg/kg).
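As an illustration only, a body-weight-based range such as those recited above can be converted into an absolute amount by multiplying each mg/kg value by the subject's body weight. The following Python sketch shows that arithmetic. The function name `dose_range_mg` and the 70 kg example body weight are assumptions introduced here for illustration and are not part of the disclosure.

```python
def dose_range_mg(weight_kg, low_mg_per_kg, high_mg_per_kg):
    """Convert a mg/kg dose range into an absolute range in mg.

    Hypothetical helper for illustration; not part of the disclosure.
    """
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("low_mg_per_kg must not exceed high_mg_per_kg")
    # Absolute amount (mg) = dose (mg/kg) x body weight (kg)
    return (weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg)


# The three ranges recited in the paragraph above, in mg/kg body weight.
RANGES_MG_PER_KG = [(0.001, 100.0), (0.01, 10.0), (0.1, 5.0)]

if __name__ == "__main__":
    weight_kg = 70.0  # assumed example body weight, not from the disclosure
    for low, high in RANGES_MG_PER_KG:
        low_mg, high_mg = dose_range_mg(weight_kg, low, high)
        print(f"{low} to {high} mg/kg -> {low_mg:g} to {high_mg:g} mg")
```

Under the assumed 70 kg body weight, the narrowest recited range (about 0.1 mg/kg to about 5 mg/kg) corresponds to roughly 7 mg to 350 mg.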
[0322] In some embodiments, the composition comprises a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for treating a disease or disorder.
[0323] A therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat or prevent cancer, slow its progression and/or reduce the symptoms associated with the condition.
[0324] A therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat cancer, slow its progression and/or reduce the symptoms associated with the condition.
[0325] A therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat or prevent fibrosis, slow its progression and/or reduce the symptoms associated with the condition.
[0326] A therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) for use in therapy is an amount sufficient to treat fibrosis, slow its progression and/or reduce the symptoms associated with the condition.
[0327] The size of the dose for therapeutic or prophylactic purposes of a compound of any one of Formula (I), Formula (10), or Formula (11) will naturally vary according to the nature and severity of the conditions, the age and sex of the animal or patient and the route of administration, according to well-known principles of medicine.
[0328] Examples of useful dermatological compositions which can be used to deliver a compound of Formula (I), Formula (10), and/or Formula (11) to the skin are known in the art; for example, see Jacquet et al. (U.S. Pat. No. 4,608,392), Geria (U.S. Pat. No. 4,992,478), Smith et al. (U.S. Pat. No. 4,559,157) and Wortzman (U.S. Pat. No. 4,820,508).
Methods of Use
[0329] A “subject” includes a mammal. The mammal can be, e.g., a human or an appropriate non-human mammal, such as a primate, mouse, rat, dog, cat, cow, horse, goat, camel, sheep or pig. The subject can also be a bird or fowl. In one embodiment, the mammal is a human.
[0330] In some embodiments, the present disclosure provides a method of treating or preventing a disease or disorder disclosed herein in a subject in need thereof, comprising administering to the subject a therapeutically effective amount of the compound of any one of Formula (I), Formula (10), or Formula (11) or a pharmaceutical composition of the present disclosure.
[0331] In some embodiments, the present disclosure provides a method of treating cancer in a subject in need thereof, comprising administering to the subject a therapeutically effective amount of the compound of Formula (I), Formula (10), or Formula (11) or a pharmaceutical composition of the present disclosure.
[0332] In some embodiments, the present disclosure provides a method of treating fibrosis in a subject in need thereof, comprising administering to the subject a therapeutically effective amount of the compound of Formula (I), Formula (10), or Formula (11) or a pharmaceutical composition of the present disclosure.
[0333] In some embodiments, the present disclosure provides the compound of Formula (I) for use in treating cancer in a subject in need thereof.
[0334] In some embodiments, the present disclosure provides the compound of Formula (I) for use in treating fibrosis in a subject in need thereof.
[0335] In some embodiments, the present disclosure provides use of the compound of any one of Formula (I), Formula (10), or Formula (11) in the manufacture of a medicament for treating a disease or disorder disclosed herein.
[0336] In some embodiments, the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) in the manufacture of a medicament for treating cancer in a subject in need thereof.
[0337] In some embodiments, the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) in the manufacture of a medicament for treating fibrosis in a subject in need thereof.
[0338] In some embodiments, the present disclosure provides use of the compound of any one of Formula (I), Formula (10), or Formula (11) for the treatment of a disease or disorder disclosed herein.
[0339] In some embodiments, the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) for the treatment of cancer.
[0340] In some embodiments, the present disclosure provides use of the compound of Formula (I), Formula (10), or Formula (11) for the treatment of fibrosis.
[0341] In some embodiments, the disease or disorder is a cancer.
[0342] In some embodiments, the cancer is a disease that involves abnormal cell growth with the potential to invade or spread to other parts of the body.
[0343] In some embodiments, the cancer is a malignant tumor or neoplasm.
[0344] In some embodiments, the cancer is breast cancer, pancreatic cancer, non-small cell lung cancer, ovarian cancer, esophageal cancer, melanoma, lymphoma, uterine cancer, peritoneal cancer, fallopian tube cancer, endometrial cancer, cervical cancer, thyroid cancer, gastric cancer, gastroesophageal junction cancer, urothelial cancer, bladder cancer, oropharynx cancer, hypopharynx cancer, larynx cancer, head and neck cancer, germ cell cancer/tumors, prostate cancer, colon cancer, rectal cancer, kidney cancer, cholangiocarcinoma (bile duct cancer), glioblastoma, leukemia, or non-Hodgkin lymphoma.
[0345] In some embodiments, the cancer is Acute Lymphoblastic Leukemia, Acute Myeloid Leukemia, Adrenocortical Carcinoma, AIDS-Related Cancers, Kaposi Sarcoma, Lymphoma, Anal Cancer, Appendix Cancer, Astrocytomas, Childhood Atypical Teratoid/Rhabdoid Tumor, Basal Cell Carcinoma, Skin Cancer (Nonmelanoma), Childhood Bile Duct Cancer, Extrahepatic Bladder Cancer, Bone Cancer, Ewing Sarcoma Family of Tumors, Osteosarcoma and Malignant Fibrous Histiocytoma, Brain Stem Glioma, Brain Tumors, Embryonal Tumors, Germ Cell Tumors, Craniopharyngioma, Ependymoma, Bronchial Tumors, Burkitt Lymphoma (Non-Hodgkin Lymphoma), Carcinoid Tumor, Gastrointestinal Carcinoma of Unknown Primary, Cardiac (Heart) Tumors, Lymphoma, Primary, Cervical Cancer, Childhood Cancers, Chordoma, Chronic Lymphocytic Leukemia, Chronic Myelogenous Leukemia, Chronic Myeloproliferative Neoplasms, Colon Cancer, Colorectal Cancer, Cutaneous T-Cell Lymphoma, Ductal Carcinoma In Situ, Endometrial Cancer, Ependymoma, Esophageal Cancer, Esthesioneuroblastoma, Ewing Sarcoma, Extracranial Germ Cell Tumor, Extragonadal Germ Cell Tumor, Extrahepatic Bile Duct Cancer, Eye Cancer, Intraocular Melanoma, Retinoblastoma, Fibrous Histiocytoma of Bone, Malignant,
and Osteosarcoma, Gallbladder Cancer, Gastric (Stomach) Cancer, Gastrointestinal Carcinoid Tumor, Gastrointestinal Stromal Tumors, Extragonadal Cancer, Ovarian Cancer, Testicular Cancer, Gestational Trophoblastic Disease, Glioma, Brain Stem Cancer, Hairy Cell Leukemia, Head and Neck Cancer, Heart Cancer, Hepatocellular (Liver) Cancer, Histiocytosis, Langerhans Cell Cancer, Hodgkin Lymphoma, Hypopharyngeal Cancer, Intraocular Melanoma, Islet Cell Tumors, Pancreatic Neuroendocrine Tumors, Kaposi Sarcoma, Kidney Cancer, Renal Cell Cancer, Wilms Tumor and Other Childhood Kidney Tumors, Langerhans Cell Histiocytosis, Laryngeal Cancer, Leukemia, Chronic Lymphocytic Cancer, Chronic Myelogenous Cancer, Hairy Cell Cancer, Lip and Oral Cavity Cancer, Liver Cancer (Primary), Lobular Carcinoma In Situ (LCIS), Lung Cancer, Non-Small Cell Cancer, Small Cell Cancer, Lymphoma, Cutaneous T-Cell (Mycosis Fungoides and Sezary Syndrome), Hodgkin Cancer, Non-Hodgkin Cancer, Macroglobulinemia, Waldenstrom, Male Breast Cancer, Malignant Fibrous Histiocytoma of Bone and Osteosarcoma, Melanoma, Intraocular (Eye) Cancer, Merkel Cell Carcinoma, Mesothelioma, Malignant, Metastatic Squamous Neck Cancer with Occult Primary, Midline Tract Carcinoma Involving NUT Gene, Mouth Cancer, Multiple Endocrine Neoplasia Syndromes, Multiple Myeloma/Plasma Cell Neoplasm, Mycosis Fungoides, Myelodysplastic Syndromes, Myelodysplastic/Myeloproliferative Neoplasms, Myelogenous Leukemia, Chronic, Myeloid Leukemia, Acute, Myeloma Multiple, Chronic Myeloproliferative Neoplasms, Nasal Cavity and Paranasal Sinus Cancer, Nasopharyngeal Cancer, Neuroblastoma, Non-Hodgkin Lymphoma, Non-Small Cell Lung Cancer, Oral Cancer, Oral Cavity Cancer, Lip and Oropharyngeal Cancer, Osteosarcoma and Malignant Fibrous Histiocytoma of Bone, Epithelial Cancer, Low Malignant Potential Tumor, Pancreatic Cancer, Pancreatic Neuroendocrine Tumors (Islet Cell Tumors), Papillomatosis, Paraganglioma, Parathyroid Cancer, Penile Cancer, 
Pharyngeal Cancer, Pheochromocytoma, Pituitary Tumor, Plasma Cell Neoplasm/Multiple Myeloma, Pleuropulmonary Blastoma, Primary Central Nervous System Lymphoma, Rectal Cancer, Renal Cell (Kidney) Cancer, Retinoblastoma, Rhabdomyosarcoma, Salivary Gland Cancer, Sarcoma, Ewing Cancer, Kaposi Cancer, Osteosarcoma (Bone Cancer), Soft Tissue Cancer, Uterine Cancer, Sezary Syndrome, Skin Cancer, Childhood Melanoma, Merkel Cell Carcinoma, Nonmelanoma, Small Cell Lung Cancer, Small Intestine Cancer, Soft Tissue Sarcoma, Squamous Cell Carcinoma, Skin Cancer (Nonmelanoma), Childhood Squamous Neck Cancer with Occult Primary, Metastatic Cancer, Stomach (Gastric) Cancer, T-Cell Lymphoma, Cutaneous Cancer, Testicular Cancer,
Throat Cancer, Thymoma and Thymic Carcinoma, Thyroid Cancer, Transitional Cell Cancer of the Renal Pelvis and Ureter, Unknown Primary, Carcinoma of Childhood, Unusual Cancers of Childhood, Urethral Cancer, Uterine Cancer, Endometrial Cancer, Uterine Sarcoma, Vaginal Cancer, Vulvar Cancer, Waldenstrom Macroglobulinemia, Wilms Tumor, and Women's Cancers.
[0346] In some embodiments, the disease or disorder is a fibrosis.
[0347] Fibrotic conditions are characterized, in whole or in part, by excess production of fibrotic material. These conditions can include systemic sclerosis, multifocal fibrosclerosis, nephrogenic systemic fibrosis, scleroderma (including morphea, generalized morphea, or linear scleroderma), sclerodermatous graft-vs-host-disease, kidney fibrosis (including glomerular sclerosis, renal tubulointerstitial fibrosis, progressive renal disease or diabetic nephropathy), cardiac fibrosis (e.g., myocardial fibrosis), pulmonary fibrosis (e.g., pulmonary fibrosis, glomerulosclerosis pulmonary fibrosis, idiopathic pulmonary fibrosis, silicosis, asbestosis, interstitial lung disease, interstitial fibrotic lung disease, and chemotherapy/radiation induced pulmonary fibrosis), oral fibrosis, endomyocardial fibrosis, deltoid fibrosis, pancreatitis, inflammatory bowel disease, Crohn's disease, nodular fasciitis, eosinophilic fasciitis, general fibrosis syndrome characterized by replacement of normal muscle tissue by fibrous tissue in varying degrees, retroperitoneal fibrosis, liver fibrosis, liver cirrhosis, chronic renal failure, myelofibrosis (bone marrow fibrosis), drug induced ergotism, myelodysplastic syndrome, myeloproliferative syndrome, collagenous colitis, acute fibrosis, organ specific fibrosis, and the like.
[0348] In some embodiments, the fibrosis is pulmonary fibrosis, liver fibrosis, heart fibrosis, mediastinal fibrosis, retroperitoneal cavity fibrosis, bone marrow fibrosis, or skin fibrosis.
[0349] In some embodiments, the fibrotic condition is pulmonary hypertension, chronic obstructive pulmonary disease (COPD), idiopathic pulmonary fibrosis, sarcoidosis, cystic fibrosis, familial pulmonary fibrosis, silicosis, asbestosis, coal worker's pneumoconiosis, carbon pneumoconiosis, hypersensitivity pneumonitides, or pulmonary hypertension.
[0350] In some embodiments, the fibrosis is cystic fibrosis.
[0351] In some embodiments, the subject is a mammal. In some embodiments the mammal is a human.
[0352] In some embodiments, the compound of Formula (I) is administered once, twice, three times, four times, or five times per day. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered once daily. In some
embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered twice daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered three times daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered four times daily. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered five times daily.
[0353] In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered with a drug holiday. In some embodiments, the compound of Formula (I), Formula (10), and/or Formula (11) is administered without a drug holiday.
EXAMPLES
[0354] The disclosure is further described in detail by reference to the following experimental examples. These examples are provided for purposes of illustration only, and are not intended to be limiting unless otherwise specified. Thus, the disclosure should in no way be construed as being limited to the following examples, but rather, should be construed to encompass any and all variations which become evident as a result of the teaching provided herein.
[0355] Without further description, it is believed that one of ordinary skill in the art can, using the preceding description and the following illustrative examples, make and utilize the compositions of the present disclosure and practice the claimed methods. The following working examples therefore are not to be construed as limiting in any way the remainder of the disclosure.
Example 1: General Overview of Compounds Discovery
[0356] Compounds of Formula (I) (including Formula (10) and Formula (11)) were identified as products of the AZT039 biosynthetic gene cluster (BGC), for example using heterologous expression and stable isotope labeling to identify target compounds. In some embodiments, methods included cloning and conjugation of the AZT039 BGC into S. albus J1074, small-scale production and isotope labeling, extraction and LCMS analysis, and large-scale production and isolation of compounds. Spectroscopic characterization of the AZT039 compounds of Formula (10) and Formula (11) was performed and 2D structures were obtained.
Example 2: Proposed Biosynthesis
[0357] To initiate discovery of molecules from the AZT family, genome and metagenome mining was performed on publicly deposited (NCBI, JGI, etc.) and internal sequence collections. The mined biosynthetic gene clusters (BGCs) were processed using internal bioinformatic tools, followed by analysis of individual BGCs to select for potentially new compound structures. About 190 BGCs were identified. AZT039 was initially found in the genome of Streptomyces sp. NRRL F-6131 as a partial cluster. Streptomyces sp. NRRL F-6131 is the wildtype strain harboring the AZT039 BGC. No characterized AZT molecule has been reported from this strain. The genome was resequenced and reassembled, and upon further analysis of the BGC, the resulting compound was predicted to have a unique structure, and AZT039 was prioritized for development.
[0358] AZT039 was first identified from the genome sequence of Streptomyces sp. NRRL F-6131 as a partial gene cluster showing only the NRPS portion of the molecule. In silico reassembly of the deposited genome in-house and antiSMASH analysis showed additional contigs that potentially overlapped the gene cluster but were fragmented. To obtain the full BGC, the strain was ordered from the NRRL collection (https://nrrl.ncaur.usda.gov/). The genome was sequenced using a combination of long-read (ONT) and short-read shotgun (Illumina) platforms, assembled to obtain the full-length sequence of the AZT039 BGC, and annotated using an in-house pipeline.
[0359] BGC analysis. The AZT039 gene cluster belongs to the type 1 modular hybrid NRPS-PKS family of BGCs. Six NRPS modules corresponding to the core peptide macrocycle are encoded in 4 open reading frames (aztN39, aztO39, aztP39, and aztAG39) (FIG. 1). The PKS contains 4 modules encoded in 2 open reading frames (aztAD and aztAE). Other genes for precursor biosynthesis, post-PKS modifications, regulation, and transport are distributed downstream of and in between the NRPS and PKS core genes. aztN contains 7 typical domains belonging to 2 amino acid modules; module 1 contains C-A-T-E domains, followed by C-A-T in module 2. aztO has 8 domains from modules 3 and 4; module 3 contains C-A-T-E domains, followed by C-A-T from module 4. Module 5 is encoded in the aztP gene, which contains C-A-T-Nmt-TE domains. aztAG, downstream of the PKS, contains module 6 of the NRPS core with a C (starter) domain, where the PKS chain is loaded in a typical AZT biosynthesis. Unusually, annotation of module 6 shows a missing A domain. To rule out sequencing errors, specific primers were redesigned for the region and the amplicon was submitted for Sanger sequencing, which confirmed that although the A domain is present, it contains a significant deletion in the middle. The presence of E (epimerization) domains in modules 1 and 3 predicts that these substrates may be epimerized into D-amino acids in the final structure. From the substrate specificity for the A domains (Stachelhaus codes and NRPSPredictor in antiSMASH 4/5), the peptide core of the compound was predicted to be '1: unknown (mod6)-2: piz (mod1)-3: nOHval (mod2)-4: piz (mod3)-5: 3oh-3mepro (mod4)-6: nOHval (mod5)', with weaker predictions for positions 3 and 6, which indicates that other kinds of substrates may be incorporated. The PKS core is composed of four modules typical of the AZT BGCs. The aztAD gene contains the first 3 modules: module 1 contains KS-AT-ACP domains, followed by module 2 containing KS-AT-DH-KR-ACP domains, and module 3 with KS-AT-DH-KR-ACP domains. aztAE contains module 4 with KS-AT-ACP domains. Distinct from typical AZT BGCs, module 3 of the AZT039 PKS lacks an ER domain that corresponds to the saturated THP ring in the PKS tail.
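The module layout described in the BGC analysis can be written down as a small lookup table, which makes the two structural predictions (D-configured residues from the E domains, and the absence of an ER domain in the PKS) easy to check. The following is a minimal sketch only: the dictionary layout and the function names `modules_with` and `predicted_d_residues` are illustrative assumptions, and module 6 lists only the starter C domain because its A domain is described as carrying an internal deletion.

```python
# Domain layout of the AZT039 NRPS and PKS modules, transcribed from the text.
# All variable and function names here are illustrative, not from the source.
NRPS_MODULES = {
    1: {"gene": "aztN", "domains": ["C", "A", "T", "E"]},
    2: {"gene": "aztN", "domains": ["C", "A", "T"]},
    3: {"gene": "aztO", "domains": ["C", "A", "T", "E"]},
    4: {"gene": "aztO", "domains": ["C", "A", "T"]},
    5: {"gene": "aztP", "domains": ["C", "A", "T", "Nmt", "TE"]},
    # Module 6: starter C domain; the A domain is present but carries a deletion,
    # so it is recorded as a note rather than as a functional domain.
    6: {"gene": "aztAG", "domains": ["C"], "note": "A domain with internal deletion"},
}

PKS_MODULES = {
    1: {"gene": "aztAD", "domains": ["KS", "AT", "ACP"]},
    2: {"gene": "aztAD", "domains": ["KS", "AT", "DH", "KR", "ACP"]},
    3: {"gene": "aztAD", "domains": ["KS", "AT", "DH", "KR", "ACP"]},
    4: {"gene": "aztAE", "domains": ["KS", "AT", "ACP"]},
}

# Predicted peptide core: (NRPS module, predicted substrate), in ring order.
PREDICTED_CORE = [
    (6, "unknown"), (1, "piz"), (2, "nOHval"),
    (3, "piz"), (4, "3oh-3mepro"), (5, "nOHval"),
]


def modules_with(domain, modules):
    """Return the module numbers whose domain list contains `domain`."""
    return [n for n, m in sorted(modules.items()) if domain in m["domains"]]


def predicted_d_residues():
    """Residues loaded by modules that carry an epimerization (E) domain."""
    e_modules = set(modules_with("E", NRPS_MODULES))
    return [(n, aa) for n, aa in PREDICTED_CORE if n in e_modules]


if __name__ == "__main__":
    print("NRPS modules with E domain:", modules_with("E", NRPS_MODULES))
    print("Residues predicted to be D-configured:", predicted_d_residues())
    print("PKS modules with ER domain:", modules_with("ER", PKS_MODULES))
```

Keeping the layout as plain data rather than prose makes it straightforward to compare against the annotation of another cluster, should one be available.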
[0360] Without being bound by theory, genes predicted to be involved in the biosynthesis or transformation of precursor amino acids are also identified in the AZT039 BGC. azt (Z, AA, AB), homologous to ply (Q, R, S) in polyoxypeptin biosynthesis, are proposed to be involved in β-OH leucine formation. azt AQ and CO (homologs of ktzI and ktzT from kutzneride biosynthesis) are involved in the conversion of ornithine to piperazic acid. azt (K, L, M), homologous to ply (C, D, E), are involved in the formation of hydroxamate-containing residues. AZT039 also contains a set of 3 genes, azt (CI, CJ, CK), related to the synthesis of hydroxyphenylglycine, which has not been reported in the peptide core of NRPS. A cis-proline hydroxylase (aztX) is also present. Other biosynthetic genes identified in the gene cluster include an LmBU regulator, an MbtH (aztI), a pair of ABC transporters, and a pair of P-type ATPase heavy metal translocating transporters. Post-assembly modifying genes are also present, including a CYP450 (aztAR) that may be responsible for hydroxylation on piperazic acid residues, and an O-methyltransferase.
Example 3. Identification of Product/s of AZT039 BGC
Heterologous expression of the BGC. Stable isotope labeling to detect AZTs. To identify the product of the AZT039 BGC in the extracts, the biosynthetic analysis described in Example 2 was used to find a unique 'marker' biotransformation for the putative products of AZT gene clusters. While the amino acids in the peptide core of AZT molecules vary, two residues are conserved in most structures: β-OH leucine and piperazic acid. The enzymatic routes and genes involved in the formation of these non-proteinogenic amino acids are present in the AZT039 BGC. β-OH-leucine was proposed to be derived from L-leucine via hydroxylation by a cytochrome P450 enzyme (aztAA). The incorporation of D10-labeled β-OH-leucine in the compound core was detected as a shift of +8 Da in the MS spectra, consistent with the loss of one hydrogen upon hydroxylation at the beta position and a possible exchange of the acidic alpha proton during NRPS assembly. Piperazic acid residues are biosynthesized from L-ornithine by the action of two enzymes, ktzT and ktzI, as demonstrated in kutzneride biosynthesis. Homologs of ktzT and ktzI are found in AZT BGCs, and this transformation was used as a secondary test for the presence and the number of Piz residues. This approach was validated in proof-of-concept experiments using strains producing the known compounds verucopeptin and polyoxypeptin. With this strategy, initial production in different media was performed using the wildtype strain at small scale (50 mL) but did not yield any detectable labeled peak. The BGC was then transferred into a more tractable host for expression by cloning the 78 kbp region spanning the BGC into a BAC (performed by Varigen Biosciences) and subsequent conjugation into host strain S. albus J1074. Transferring the BGC was expected to give a cleaner and characterized background and to enable comparison to an empty-vector expression, hence faster identification of target peaks. Upon heterologous production in S. albus, a unique set of LCMS peaks/compounds was observed in SA-LT039 (Streptomyces albus host harboring the cloned AZT039 BGC) which were absent in extracts of the empty-vector expression, SA-pDualP (FIG. 2A). The peaks, m/z 985 [M-H]- (1), 969 [M-H]- (2), 967 [M-H]- (3, Formula (10)), and 951 [M-H]- (4, Formula (11)), exhibited the expected mass shift of +8 Da in cultures where D10-leucine was added, indicating the presence of β-OH-leucine in the molecule (FIG. 2B). An additional mass shift of +9 Da was also observed. While not wishing to be bound by any particular theory, this result indicated the possible incorporation of a non-modified leucine residue.
In parallel, mass shifts of +6 and +12 Da were also observed, confirming the presence of piperazic acid residues. The target peaks were dereplicated by HRMS to confirm that the compounds were novel, and were targeted for downstream isolation and characterization. FIGS. 2A-2C show non-production of compounds of Formula (I), Formula (10), or Formula (11) in wild type strains.
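The labeling arithmetic above can be made explicit with a short calculation. This is a sketch under stated assumptions: it works in nominal (integer) mass units, it treats the +9 Da shift as D10-leucine that keeps all side-chain deuteriums but exchanges the alpha position (an interpretation, not something the text states), and it assumes +6 Da per piperazic acid residue so that +12 Da corresponds to two residues. The function names are hypothetical.

```python
# Nominal mass shifts expected after feeding a deuterated precursor.
# Function names and the per-residue Piz shift are illustrative assumptions.
D10_LEUCINE = 10  # deuterium atoms in the fed leucine


def retained_deuteriums(n_label, lost_on_modification=0, exchanged=0):
    """Deuteriums left on the residue, i.e. the nominal shift in Da."""
    retained = n_label - lost_on_modification - exchanged
    if retained < 0:
        raise ValueError("more deuteriums removed than were present")
    return retained


# beta-OH-leucine: one D lost on beta-hydroxylation, one exchanged at the alpha carbon.
BETA_OH_LEU_SHIFT = retained_deuteriums(D10_LEUCINE, lost_on_modification=1, exchanged=1)

# Leucine without side-chain modification; alpha exchange is ASSUMED here.
UNMODIFIED_LEU_SHIFT = retained_deuteriums(D10_LEUCINE, exchanged=1)

PIZ_SHIFT_PER_RESIDUE = 6  # assumption: +6 Da per labeled piperazic acid


def piz_residue_count(observed_shift):
    """Number of labeled Piz residues implied by an observed shift."""
    if observed_shift % PIZ_SHIFT_PER_RESIDUE:
        raise ValueError("shift is not a multiple of the per-residue shift")
    return observed_shift // PIZ_SHIFT_PER_RESIDUE


# Nominal [M-H]- values reported for peaks 1-4.
REPORTED_PEAKS = {"1": 985, "2": 969, "3 (Formula (10))": 967, "4 (Formula (11))": 951}


def labeled_peaks(peaks, shift):
    """Expected nominal m/z of each peak after incorporating the label."""
    return {name: mz + shift for name, mz in peaks.items()}


if __name__ == "__main__":
    print("beta-OH-Leu shift:", BETA_OH_LEU_SHIFT)
    print("unmodified Leu shift:", UNMODIFIED_LEU_SHIFT)
    print("expected labeled peaks:", labeled_peaks(REPORTED_PEAKS, BETA_OH_LEU_SHIFT))
    print("Piz residues for +12 Da:", piz_residue_count(12))
```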
[0361] Conjugal transfer of LT039 into S. albus. The AZT039 BGC from Streptomyces NRRL F-6131 was carved out of the genome and cloned into the pDualP backbone (Varigen Biosciences). The resulting construct is referred to as pDualP-LT039 to distinguish it from the wildtype BGC. The construct was sequenced in-house for confirmation before further experimentation. For conjugation of the BGC into S. albus, purified pDualP-LT039 plasmid DNA was transformed into E. coli S17 cells by electroporation. Colonies were grown at 30°C overnight under apramycin (50 µg/mL) and trimethoprim (10 µg/mL) selection. Colonies were picked into 5 mL of LB broth and grown overnight at 30°C under the same antibiotic selection. Overnight cultures were screened for the presence of the BGC using 3 primer sets spanning the gene cluster, as well as a primer set designed for the junction of the backbone plasmid and the BGC. For conjugation, 200 µL of an overnight culture of E. coli S17 cells containing the LT039 BGC was inoculated into 50 mL of LB broth with antibiotics and grown at 37°C to an OD600 of 0.6-0.9. Cells were washed three times with 20 mL of LB and resuspended in 500 µL of SOC. To prepare the recipient S. albus strain, 30 µL of spores (stocked at 10^9 CFUs) was diluted into 1 mL of SOC, heat shocked at 50°C for 10 min, and cooled at room temperature. 100 µL of the washed E. coli cells was mixed with 200 µL of heat-shocked spores. 200 µL of the mating mixture was spotted on ISP4-AMC plates containing nystatin (30 µg/mL) and incubated at 30°C for 16 hours. Grown mating spots were scraped into LB and plated on ISP4-AMC plates with nalidixic acid (50 µg/mL), nystatin (30 µg/mL), and apramycin (50 µg/mL) and incubated at 30°C for another 2-4 days. S. albus ex-conjugants were picked into 10 mL of TSB broth containing apramycin (50 µg/mL) and grown for 2 to 3 days in a shaking incubator at 30°C and 220 rpm. Cultures were screened by PCR using the BGC screening primers described above to confirm integration. Confirmed positive strains (SA-LT039) were glycerol stocked and stored for later production studies.
[0362] Extraction and LCMS analysis. Cultures were harvested by solvent extraction using 1:1 volumes of chloroform:IPA (2:1) in a 250 mL separatory funnel. The chloroform layer was dried, and the extracts were resuspended in 0.2 mL of methanol. For LCMS analysis, 25 µL of the methanol extract was injected onto a Phenomenex Kinetex C18 column (2.6 µm, 100 x 4.6 mm). The following general conditions were used for all LCMS profiling and analyses: flow rate, 1 mL/min; solvent gradient, 20-100% B over 1-20 min, 100% B at 20-25 min, ramp to 20% B at 25-30 min. Solvent A: 0.01% FA in water; solvent B: 0.01% FA in acetonitrile. Samples were monitored with UV diode array, ELSD, and single quad ESI MS in positive and negative mode. To identify putative AZT molecules in the extracts, MS chromatograms of cultures with and without added D10-leucine were compared and scanned for peaks in the molecular weight range of 700-1200 Da having a shift of +8 Da in the presence of D10-leucine.
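The comparison of labeled and unlabeled chromatograms amounts to a pairwise search for peaks separated by the expected shift. The sketch below illustrates that search and is not the procedure actually used: the function name `find_labeled_pairs`, the 0.5 Da tolerance, and the use of m/z as a stand-in for molecular weight (reasonable only for singly charged ions) are assumptions, and the peak lists combine the four nominal m/z values reported above with made-up background values.

```python
def find_labeled_pairs(unlabeled_mz, labeled_mz, shift=8.0,
                       mz_range=(700.0, 1200.0), tol=0.5):
    """Pair each in-range unlabeled peak with a labeled peak at +shift.

    Returns a list of (unlabeled, labeled) m/z tuples, sorted by unlabeled m/z.
    """
    low, high = mz_range
    labeled_sorted = sorted(labeled_mz)
    pairs = []
    for mz in sorted(unlabeled_mz):
        if not low <= mz <= high:
            continue  # outside the mass window screened for AZT-like molecules
        target = mz + shift
        for candidate in labeled_sorted:
            if abs(candidate - target) <= tol:
                pairs.append((mz, candidate))
                break  # keep the first match only
    return pairs


if __name__ == "__main__":
    # Hypothetical peak lists: 951/967/969/985 are the nominal values from the
    # text; 523 and 1310 are invented background peaks for illustration.
    unlabeled = [523.0, 951.0, 967.0, 969.0, 985.0, 1310.0]
    labeled = [523.0, 959.0, 975.0, 977.0, 993.0, 1318.0]
    for native, shifted in find_labeled_pairs(unlabeled, labeled):
        print(f"{native:.1f} -> {shifted:.1f}")
```

The same function can be reused for the secondary piperazic acid test by passing `shift=6.0` or `shift=12.0`.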
Example 4: Large scale production and isolation of target compounds
[0363] Large scale production. 0.5 mL of 2-3 day old SA-LT039 seed cultures, prepared as described above, was inoculated into 50 mL of R5A medium in 250 mL baffled flasks. A total of 5 L of culture was grown in batches at 28°C and 220 rpm for 7 days. At day 7, the cultures (mixed mycelia and broth) were extracted twice with equal volumes of 1:1 IPA:chloroform. The extracts were dried under vacuum to yield the crude material (10 grams).
[0364] Isolation and purification. The crude extract was subjected to silica column chromatography to obtain 2.2 g of semicrude extract A and 1.5 g of semicrude extract B. Extracts A and B were then subjected to Sephadex LH-20 fractionation and a series of RP-HPLC purifications to yield 0.37 mg (1), 0.73 mg (2), 19.8 mg (3; Formula (10)), and 19.5 mg (4; Formula (11)). Compounds of Formula (10) and Formula (11) were isolated as major and minor peaks, 3-1 and 2, 4-1 and 2, respectively.
Example 5. Spectroscopic characterization of AZT039 molecules and 2D structure
[0365] 2D NMR and HRMS/MS characterization of the compound of Formula (10). The compound of Formula (10) was isolated as an amorphous white powder with a molecular formula of C47H68N8O14 (Calc. MW 969.1030 g/mol) determined by high-resolution ESI-Q-TOF mass spectrometry. In positive mode and negative mode, the molecular ion peak appeared at m/z 969.5037 (M+H)+, 991.4740 (M+Na)+, and 967.4845 (M-H)-, respectively. The structure was established by interpretation of 1D and 2D NMR data in CDCl3 (FIG. 3). The 1H NMR spectrum showed 2 downfield protons at 9.95 and 7.87 ppm, 1 amide doublet proton, 11 protons between 4 and 6 ppm, 4 aromatic protons between 6.5 and 7.5 ppm with a p-substitution pattern, and 3 olefinic protons between 6 and 7.5 ppm. The 13C spectrum displayed 7 amide or ester carbons, one ketone carbonyl carbon, 4 oxygen-bearing carbons, 9 carbons attached to nitrogen, and 14 aliphatic carbons. By 2D NMR interpretation (COSY, TOCSY, HSQC, and HMBC) and HRMS-MS/MS fragmentation analysis, the 2D structure of the compound of Formula (10) was proposed to be a novel cyclic hexadepsipeptide consisting of N-hydroxyleucine, 3-hydroxyproline, 5-hydroxypiperazic acid (γ-OH-piperazic acid, piz2), N-hydroxy-p-methoxy-phenylglycine, piperazic acid (piz1), 3-hydroxyleucine (β-hydroxyleucine), and a polyketide side chain lacking the canonical pyran ring (FIG. 4A). The amino acid sequence and PKS chain were determined by key HMBC, COSY, and TOCSY correlations. The presence of β-hydroxyleucine was established by the COSY correlations between the amide NH (δ 7.37 ppm) and the α proton and between the α and β protons (δ 4.94 and 5.40 ppm), and by HMBC correlations from the β proton to the isopropyl moiety. The N-hydroxyleucine was assigned based on the COSY spin system from the α and β protons to the two methyl groups (δ 0.91 and 0.97 ppm) of the isopropyl moiety, the lack of an NH correlation to the α proton, and a weak NOESY cross peak between a hydroxamate proton (δ 7.87 ppm) and a methyl group (δ 0.97 ppm). HMBC showed correlations from the two methyl groups to the methine carbon (γ carbon, δ 24.8 ppm), from the γ proton (δ 1.70 ppm) to the β carbon (δ 37.5 ppm), and from the α proton (δ 5.45 ppm) to the β carbon. The 5-hydroxypiperazic acid spin system was assigned based on 1H,1H-COSY and TOCSY correlations, with an amide NH with a chemical shift of 4.28 ppm and a characteristic chemical shift of a proton attached to an oxygenated carbon (δ 3.51 ppm). The HMBC showed a correlation from the α proton (δ 5.51 ppm) to an oxygen-bearing carbon (γ carbon) at 59 ppm. The second piperazic acid unit was also determined based on the 1H,1H-COSY and TOCSY spin system. The amino acid residue 3-hydroxyproline showed a 1H,1H-COSY cross peak between the α proton (δ 5.19 ppm) and the β proton (δ 4.6 ppm), which is not present in the traditionally reported 3-hydroxy-3-methylproline residue. The HMBC data confirmed the 3-hydroxyproline backbone with the key correlations from the α proton to the oxygen-bearing β carbon (δ 72.9 ppm) and the following γ and δ carbons with chemical shifts at 32.2 and 46.1 ppm. The hydroxyl group showed 1H,1H-COSY/TOCSY correlations with the α, γ, and δ protons (δ 5.19, 2.21, and diastereotopic 3.24/4.83 ppm). The presence of two aromatic proton signals integrating for 2 protons each (δ 6.88 and 7.32 ppm, d, 3J = 7.95 Hz) indicated a 1,4-disubstituted (p-substituted) six-membered aromatic ring.
A NOESY correlation between a hydroxamate proton (δ 9.95 ppm) and an α proton (δ 7.02 ppm), the HMBC from the sharp singlet α proton to a quaternary carbon (δ 127.6 ppm), and a methoxy group (δ 3.8 ppm, s) HMBC correlation to a second quaternary carbon (δ 159.5 ppm) confirmed the assignment of the amino acid residue N-hydroxy-p-methoxy-phenylglycine.
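The mass values quoted for the compound of Formula (10) can be cross-checked from the molecular formula alone. The sketch below computes the average molecular weight and the monoisotopic m/z of the three adducts, then reports the deviation of each observed value in ppm. It is an illustration, not the instrument software's calculation: the atomic masses are standard reference values rounded as shown, and the function names are hypothetical. With these inputs the average molecular weight of C47H68N8O14 comes to 969.103 g/mol, matching the calculated value stated above.

```python
import re

# Standard atomic weights (average) and monoisotopic masses, in Da.
AVERAGE_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}
MONOISOTOPIC_MASS = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727646      # mass of H+ (hydrogen atom minus one electron)
SODIUM_ION = 22.9892207  # mass of Na+ (sodium atom minus one electron)


def parse_formula(formula):
    """Turn a simple formula such as 'C47H68N8O14' into {'C': 47, ...}."""
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def formula_mass(formula, table):
    """Sum element masses from `table` over the atoms in `formula`."""
    return sum(table[el] * n for el, n in parse_formula(formula).items())


def adduct_mz(formula):
    """Monoisotopic m/z of the singly charged (M+H)+, (M+Na)+ and (M-H)- ions."""
    m = formula_mass(formula, MONOISOTOPIC_MASS)
    return {"(M+H)+": m + PROTON, "(M+Na)+": m + SODIUM_ION, "(M-H)-": m - PROTON}


def ppm_error(observed, calculated):
    """Relative deviation of an observed m/z from the calculated one, in ppm."""
    return (observed - calculated) / calculated * 1e6


if __name__ == "__main__":
    formula = "C47H68N8O14"
    observed = {"(M+H)+": 969.5037, "(M+Na)+": 991.4740, "(M-H)-": 967.4845}
    print(f"average MW: {formula_mass(formula, AVERAGE_MASS):.4f} g/mol")
    for ion, calc in adduct_mz(formula).items():
        print(f"{ion}: calc {calc:.4f}, obs {observed[ion]:.4f}, "
              f"{ppm_error(observed[ion], calc):+.1f} ppm")
```

The parser handles only flat formulas without parentheses or charges, which is sufficient for the formulas given in this example.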
[0366] 2D NMR and HRMS/MS characterization of the compound of Formula (11). The compound of Formula (11) was isolated as an amorphous white powder with a molecular formula of C47H68N8O13 (Calc. MW 953.1040 g/mol) determined by high-resolution ESI-Q-TOF mass spectrometry. In positive mode and negative mode, the molecular ion peak appeared at m/z 953.5094 (M+H)+, 975.4792 (M+Na)+, and 951.4818 (M-H)-, respectively. By 2D NMR interpretation (COSY, TOCSY, HSQC, and HMBC), the 2D structure of the compound of Formula (11) was proposed to be an analog of the novel cyclic hexadepsipeptide 3-1, consisting of N-hydroxyleucine, 3-hydroxyproline, two piperazic acid residues, N-hydroxy-p-methoxy-phenylglycine, 3-hydroxyleucine (β-hydroxyleucine), and a polyketide side chain identical to that of the compound of Formula (10) (FIG. 3B). In contrast to the compound of Formula (10), neither piperazic acid residue was hydroxylated, based on 1H,1H-COSY and TOCSY correlations. FIG. 3A and FIG. 4B show additional structures of compounds of Formula (I).
[0367] Determination of stereochemistry by Marfey's method. Briefly, 1 mg of the depsipeptide compound was hydrolyzed in 500 µL of 6 N HCl at 115°C for 1 h, and the acid was removed under vacuum. The hydrolysate was dissolved in 500 µL of LC/MS water and dried three times to remove any residual acid. The hydrolysate was cleaned using a 500 mg C18 cartridge and eluted with 10% acetonitrile in water, and the eluate was dried under vacuum. The residue was dissolved in 100 µL of 1 N NaHCO3, and 50 µL of L-FDLA (10 mg/mL in acetone) was added. The reaction mixture was heated at 80°C for 3 min and then quenched with 50 µL of 2 N HCl. A volume of 300 µL of 50% v/v acetonitrile in LC/MS water was added to the solution. The L-FDLA mixtures were analyzed by the standard LC/MS method, and the amino acid configuration was determined based on retention time and MS comparison against the respective amino acid standards. To determine the configuration of the N-hydroxyleucine and N-hydroxy-p-methoxy-phenylglycine, the hydroxamate groups in the depsipeptide were reduced to their -NH- form with TiCl3/THF before hydrolysis and Marfey's reaction. For the standards, 1 mg each of 2S,3R-β-hydroxyleucine/2R,3S-β-hydroxyleucine, D/L-piperazic acid, D-piperazic acid, L-leucine/D-leucine, cis-L-3-hydroxyproline, and S-2-amino-2-(4-methoxyphenyl)acetic acid/R-2-amino-2-(4-methoxyphenyl)acetic acid (synonym of p-methoxy-phenylglycine) was used. The results indicated the following amino acid configuration in the depsipeptide: 2S,3R-β-hydroxyleucine, one single product for D-piperazic acid, L-leucine, cis-L-3-hydroxyproline, and S-2-amino-2-(4-methoxyphenyl)acetic acid.
EQUIVALENTS
[0368] The details of one or more embodiments of the disclosure are set forth in the accompanying description above. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present disclosure, the preferred methods and materials are now described. Other features, objects, and advantages of the disclosure will be apparent from the description and from the claims. In the specification and the appended claims, the singular forms include plural referents unless the
context clearly dictates otherwise. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. All patents and publications cited in this specification are incorporated by reference.
[0369] The foregoing description has been presented only for the purposes of illustration and is not intended to limit the disclosure to the precise form disclosed, but by the claims appended hereto.
Claims
1. A compound of Formula (I), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
wherein in Formula (I),
R is selected from hydrogen and -OH.
3. The compound of claim 1, wherein the compound of Formula (I) is a compound of Formula (11), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof:
4. The compound of any one of claims 1-3, wherein the compound is produced or producible by a host cell comprising a heterologous biosynthetic gene cluster comprising at least about six nonribosomal peptide synthetase (NRPS) modules and at least about four polyketide synthase (PKS) modules, one or more modifying enzymes, precursor biosynthesis enzymes, transporters, and one or more transcriptional regulators.
5. The compound of claim 4, wherein the biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
6. The compound of claim 4 or 5, wherein the biosynthetic gene cluster comprises a nucleic acid sequence of SEQ ID NO: 1 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
7. The compound of any one of claims 4-6, wherein the biosynthetic gene cluster comprises one or more modifications of SEQ ID NO: 1.
8. The compound of any one of claims 4-7, wherein the modification comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1.
9. The compound of claim 7 or 8, wherein the modification comprises insertion of at least one promoter sequence.
10. The compound of claim 9, wherein the promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
11. The compound of claim 10, wherein the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
12. The compound of any one of claims 7-11, wherein the biosynthetic gene cluster comprises SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
13. The compound of any one of claims 7-12, wherein the modification increases synthesis of the compound of any one of Formula (I), Formula (10), and Formula (11) in a host cell compared to an otherwise equivalent host cell comprising an unmodified biosynthetic gene cluster.
14. The compound of any one of claims 7-13, wherein the host cell is a Streptomyces cell.
15. The compound of claim 14, wherein the host cell is a Streptomyces albus cell.
16. The compound of any one of claims 4-15, wherein the host cell further comprises a sequence encoding LmBU operably linked to a constitutive promoter.
17. A recombinant polynucleotide comprising a biosynthetic gene cluster, wherein the biosynthetic gene cluster comprises one or more genes that contribute to the production of at least a portion of the compound of any one of claims 1-3 when the biosynthetic gene cluster is expressed by a host cell.
18. The polynucleotide of claim 17, wherein the one or more genes comprise six nonribosomal peptide synthetase (NRPS) modules.
19. The polynucleotide of claim 18, wherein the six NRPS modules are encoded by polynucleotide sequences comprising a first NRPS open reading frame of SEQ ID NO:2, a second NRPS open reading frame of SEQ ID NO: 3, a third NRPS open reading frame of
SEQ ID NO: 4 and a fourth NRPS open reading frame of SEQ ID NO: 5, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
20. The polynucleotide of any one of claims 17-19, wherein the one or more genes comprise four polyketide synthase (PKS) modules.
21. The polynucleotide of claim 20, wherein the four PKS modules are encoded by polynucleotide sequences comprising a first PKS open reading frame of SEQ ID NO: 6 and a second PKS open reading frame of SEQ ID NO: 7, or sequences having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
22. The polynucleotide of any one of claims 17-21, wherein the biosynthetic gene complex comprises a LmBU-encoding gene.
23. The polynucleotide of claim 22, wherein the LmBU-encoding gene comprises a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
24. The polynucleotide of any one of claims 17-23, wherein the biosynthetic gene cluster comprises a polynucleotide sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
25. The polynucleotide of any one of claims 17-24, wherein the host cell is engineered to express the one or more genes in the biosynthetic cluster, which results in the production of the compound of any one of Formula (I), Formula (10), and Formula (11).
26. The polynucleotide of any one of claims 17-24, wherein overexpression of one or more genes in the biosynthetic cluster by the host cell increases the production of the compound of any one of Formula (I), Formula (10), and Formula (11) compared to an otherwise equivalent host cell comprising a biosynthetic gene cluster that does not overexpress one or more genes in the biosynthetic cluster.
27. The polynucleotide of claim 26, wherein the one or more genes is LmBU, and LmBU is overexpressed.
28. The polynucleotide of claim 27, wherein overexpression of LmBU occurs in cis or in trans.
29. The polynucleotide of claim 28, wherein trans overexpression of the LmBU comprises expressing a sequence encoding the LmBU open reading frame under the control of one or more of a constitutive ermE promoter, a kasO promoter, or a functional variant or derivative thereof.
30. The polynucleotide of claim 29, wherein the ermE promoter comprises a sequence of SEQ ID NO: 9, and the kasO promoter comprises a sequence of SEQ ID NO: 10.
31. The polynucleotide of any one of claims 17-30, wherein the biosynthetic gene cluster comprises one or more sequence modifications relative to a biosynthetic gene cluster of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
32. The polynucleotide of claim 31, wherein the one or more modifications of the biosynthetic gene cluster comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1.
33. The polynucleotide of claim 31 or 32, wherein the one or more modification of the biosynthetic gene cluster comprises a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
34. The polynucleotide of any one of claims 31-33, wherein the one or more modifications comprise modifications of a promoter of a gene in the biosynthetic gene cluster.
35. The polynucleotide of any one of claims 31-33, wherein the one or more modifications comprise insertion of at least one heterologous promoter in the biosynthetic gene cluster.
36. The polynucleotide of claim 35, wherein the at least one heterologous promoter is a strong promoter.
37. The polynucleotide of claim 35 or 36, wherein the at least one heterologous promoter is selected from ermE and kasO, or functional variants or derivatives thereof.
38. The polynucleotide of claim 37, wherein the sequence of the ermE promoter comprises SEQ ID NO: 9 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto, and the sequence of the kasO promoter comprises SEQ ID NO: 10 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
39. The polynucleotide of any one of claims 35-38, wherein inserting the at least one heterologous promoter into the biosynthetic gene cluster comprises a nucleic acid guided endonuclease.
40. The polynucleotide of claim 39, wherein the nucleic acid guided endonuclease is in a complex with at least one guide nucleic acid (gNA).
41. The polynucleotide of claim 39 or 40, wherein the nucleic acid guided endonuclease is a CRISPR/Cas endonuclease.
42. The polynucleotide of claim 41, wherein the CRISPR/Cas endonuclease is Cas9.
43. The polynucleotide of any one of claims 35-42, wherein inserting the at least one heterologous promoter into the biosynthetic gene cluster further comprises a donor template comprising a sequence of the heterologous promoter.
44. The polynucleotide of any one of claims 35-43, wherein the biosynthetic gene cluster comprises an mbtH gene upstream of the four NRPS open reading frames, and wherein the at least one heterologous promoter is inserted upstream of the mbtH gene.
45. The polynucleotide of claim 44, wherein the at least one heterologous promoter is one or more of an ermE promoter and kasO promoter.
46. The polynucleotide of claim 44 or 45, wherein the biosynthetic gene cluster comprises a polynucleotide sequence of SEQ ID NO: 11 or a sequence having at least about 80%, 85%, 90%, 95%, 98%, or 99% sequence identity thereto.
47. The polynucleotide of any one of claims 31-46, wherein the at least one modification of the biosynthetic gene cluster comprises a modification that results in overexpression of the LmBU-encoding gene in comparison to the expression of the LmBU-encoding gene by the biosynthetic gene cluster of SEQ ID NO: 1.
48. The polynucleotide of claim 31, wherein the at least one modification of the biosynthetic gene cluster comprises replacement of at least one promoter in comparison to the biosynthetic gene cluster of SEQ ID NO: 1.
49. The polynucleotide of any one of claims 17-48, wherein the biosynthetic gene cluster is isolated or derived from Streptomyces strain NRRL F-6131.
50. The polynucleotide of any one of claims 17-49, wherein the biosynthetic gene cluster produces the compound of any one of Formula (I), Formula (10), and Formula (11) in the host cell.
51. A vector comprising the polynucleotide of any one of claims 17-50.
52. The vector of claim 51, wherein the vector is a bacterial artificial chromosomal vector.
53. The vector of claim 51 or 52, wherein the vector further comprises at least one promoter.
54. The vector of any one of claims 51-53, wherein the vector is suitable for expression in a Streptomyces species cell.
55. A host cell comprising the polynucleotide of any one of claims 17-50 or the vector of any one of claims 51-54.
56. The host cell of claim 55, wherein the host cell further comprises a sequence encoding LmBU operably linked to a constitutive promoter.
57. The host cell of claim 56, wherein the constitutive promoter is one or more of an ermE promoter and a kasO promoter.
58. The host cell of claim 56 or 57, wherein the LmBU is encoded by a polynucleotide sequence of SEQ ID NO: 8, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
59. The host cell of any one of claims 55-58, wherein the host cell is a Streptomyces cell.
60. The host cell of claim 59, wherein the Streptomyces cell is a Streptomyces lividans or Streptomyces albus cell.
61. A method of making a polynucleotide comprising a modified biosynthetic gene cluster comprising: a. providing a first E. coli host cell comprising a first vector comprising a sequence of an unmodified biosynthetic gene cluster comprising a target sequence; b. introducing the first vector into a Streptomyces host cell by conjugation; c. providing a second E. coli host cell comprising a second vector comprising: i. a sequence of at least one guide nucleic acid (gNA) specific to the target sequence operably linked to a promoter, ii. a sequence encoding a Cas endonuclease; and iii. a sequence encoding a donor template; and d. introducing the second vector into the Streptomyces host cell by conjugation; whereby introducing the second vector into the Streptomyces host cell produces a double strand break in the target sequence and introduction of a donor template sequence, thereby generating a Streptomyces host cell comprising a modified biosynthetic gene cluster; wherein the polynucleotide sequence of the modified biosynthetic gene cluster comprises a substitution, deletion, inversion, or insertion of one or more nucleotides relative to SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
62. The method of claim 61, wherein the Cas endonuclease is selected from a Cas9 (also known as Csn1 and Csx12), Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, Csf4, homologues thereof, variants thereof, mutants thereof, and derivatives thereof.
63. The method of claim 61 or 62, wherein the Cas endonuclease is a Cas9 endonuclease.
64. The method of any one of claims 61-63, wherein the unmodified biosynthetic gene cluster comprises a sequence of SEQ ID NO: 1, or a sequence having at least about 80%, 85%, 90%, 95%, 97% or 99% identity thereto.
65. The method of any one of claims 61-64, wherein the donor template comprises, from 5’ to 3’, a sequence homologous to a sequence 5’ of the target sequence, a sequence of a promoter, and a sequence homologous to a sequence 3’ of the target sequence.
66. The method of claim 65, wherein the promoter is selected from ermE and kasO or functional variants or derivatives thereof.
67. A method of making a compound of Formula (I), comprising: a. introducing into a host cell the polynucleotide comprising a polynucleotide gene cluster of any one of claims 17-50 or the vector of any one of claims 51-54; b. culturing the host cell under conditions suitable for the synthesis of the compound of Formula (I) by the biosynthetic gene cluster; and c. isolating and purifying the compound of Formula (I).
68. The method of claim 67, wherein the host cell is an Actinobacterial cell or a Streptomyces cell.
69. The method of claim 68, wherein the Streptomyces cell is a Streptomyces albus or Streptomyces lividans cell.
70. The method of any one of claims 67-69, wherein the host cell comprises a polynucleotide sequence encoding LmBU operably linked to a constitutive promoter.
71. The method of any one of claims 67-70, wherein the polynucleotide or vector is introduced into the host cell by conjugation with an E. coli cell comprising the polynucleotide or vector.
72. The method of any one of claims 67-71, wherein the compound of Formula (I) is a compound of Formula (10), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
73. The method of any one of claims 67-71, wherein the compound of Formula (I) is a compound of Formula (11), or a stereoisomer, a mixture of stereoisomers, a pharmaceutically acceptable salt, solvate, or tautomer thereof.
74. A pharmaceutical composition, comprising the compound of any one of claims 1-3, and a pharmaceutically acceptable excipient.
75. A method of treating a disease or disorder in a subject, comprising administering to the subject the compound of any one of claims 1-3 or the pharmaceutical composition of claim 74.
76. The compound of any one of claims 1-3 or the pharmaceutical composition of claim 74, for use in treating a disease or disorder in a subject.
77. A compound of any one of claims 1-3 for use in the manufacture of a medicament for treating a disease or disorder in a subject.
78. Use of a compound of any one of claims 1-3 or the pharmaceutical composition of claim 74 for the treatment of a disease or disorder.
79. The method, use, or compound of any one of claims 75-78, wherein the disease or disorder is cancer.
80. The method, use, or compound of any one of claims 75-78, wherein the disease or disorder is fibrosis.
81. The method, use, or compound of any one of claims 75-80, wherein the subject is human.
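As an illustration only, the donor template layout recited in claim 65 (a 5’ homology arm, a promoter sequence, and a 3’ homology arm, in that order) can be sketched in Python. The function name, its parameters, the arm length, and the toy sequences below are all hypothetical; they are not taken from the disclosure and do not correspond to SEQ ID NOs: 1 or 9-11.

```python
def assemble_donor_template(cluster: str, target_start: int, target_end: int,
                            promoter: str, arm_len: int = 10) -> str:
    """Return 5' homology arm + promoter + 3' homology arm (5' to 3').

    cluster      -- sequence containing the target site (hypothetical input)
    target_start -- 0-based index of the first base of the target sequence
    target_end   -- 0-based index one past the last base of the target sequence
    promoter     -- promoter sequence to place between the arms
    arm_len      -- length of each homology arm (illustrative value only)
    """
    if not (0 <= target_start <= target_end <= len(cluster)):
        raise ValueError("target coordinates fall outside the cluster sequence")
    if target_start < arm_len or len(cluster) - target_end < arm_len:
        raise ValueError("not enough flanking sequence for the homology arms")

    # Sequence immediately 5' of the target sequence.
    five_prime_arm = cluster[target_start - arm_len:target_start]
    # Sequence immediately 3' of the target sequence.
    three_prime_arm = cluster[target_end:target_end + arm_len]
    return five_prime_arm + promoter + three_prime_arm


# Toy example: a 43-base placeholder "cluster" with a 3-base target site.
toy_cluster = "A" * 10 + "C" * 10 + "GGG" + "T" * 10 + "A" * 10
toy_promoter = "TATAAT"  # placeholder, not an ermE or kasO promoter sequence
donor = assemble_donor_template(toy_cluster, 20, 23, toy_promoter)
print(donor)
```

The sketch only concatenates strings in the claimed 5’ to 3’ order; choosing a real target site, arm length, and promoter is outside what it shows.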
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202163276322P | 2021-11-05 | 2021-11-05 | |
| US63/276,322 | 2021-11-05 | | |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023081764A1 (en) | 2023-05-11 |
Family
ID=86242184
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2022/079230 Ceased WO2023081764A1 (en) | 2021-11-05 | 2022-11-03 | Hexadepsipeptide compounds and methods of using the same |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2023081764A1 (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5665703A (en) * | 1994-04-07 | 1997-09-09 | Kyowa Hakko Kogyo Co., Ltd. | GE3 compound |
| US20090215789A1 (en) * | 2006-03-29 | 2009-08-27 | Thallion Pharmaceuticals, Inc. | Cyclic hexadepsipeptides, processes for their production and their use as pharmaceuticals |
| US20150080312A1 (en) * | 2012-05-07 | 2015-03-19 | Piramal Enterprises Limited | Hexadepsipeptide analogues as anticancer compounds |
| WO2022178403A1 (en) * | 2021-02-22 | 2022-08-25 | Lodo Therapeutics Corporation | Hexadepsipeptide compounds and methods of using the same |
Non-Patent Citations (2)
| Title |
|---|
| UCHIHATA YUKI, ANDO NORITAKA, IKEDA YOKO, KONDO SHINICHI, HAMADA MASA, UMEZAWA KAZUO: "Isolation of a Novel Cyclic Hexadepsipeptide Pipalamycin from Streptomyces as an Apoptosis-inducing Agent.", THE JOURNAL OF ANTIBIOTICS, NATURE PUBLISHING GROUP UK, LONDON, vol. 55, no. 1, 1 January 2002 (2002-01-01), London, pages 1 - 5, XP093065939, ISSN: 0021-8820, DOI: 10.7164/antibiotics.55.1 * |
| YANHUA DU;YEMIN WANG;TINGTING HUANG;MEIFENG TAO;ZIXIN DENG;SHUANGJUN LIN: "Identification and characterization of the biosynthetic gene cluster of polyoxypeptin A, a potent apoptosis inducer", BMC MICROBIOLOGY, BIOMED CENTRAL LTD., GB, vol. 14, no. 1, 8 February 2014 (2014-02-08), GB , pages 30, XP021179850, ISSN: 1471-2180, DOI: 10.1186/1471-2180-14-30 * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Liu et al. | Rational construction of genome-reduced Burkholderiales chassis facilitates efficient heterologous production of natural products from proteobacteria | |
| JP5789190B2 (en) | New gene cluster | |
| Evans et al. | Nucleotide sequence and genetic analysis of the Azotobacter chroococcum nifUSVWZM gene cluster, including a new gene (nifP) which encodes a serine acetyltransferase | |
| US20170081690A1 (en) | Moenomycin biosynthesis-related compositions and methods of use thereof | |
| CN110305881B (en) | A biosynthetic gene cluster of polyketide neoenterocins and its application | |
| US20240140992A1 (en) | Hexadepsipeptide compounds and methods of using the same | |
| WO2023081764A1 (en) | Hexadepsipeptide compounds and methods of using the same | |
| EA020118B1 (en) | Nucleic acid molecule of a biosynthetic cluster encoding non ribosomal peptide synthases and uses thereof | |
| Baltz | Biosynthesis and genetic engineering of lipopeptides in Streptomyces roseosporus | |
| US9630911B2 (en) | Genes for biosynthesis of tetracycline compounds and uses thereof | |
| CN105777870B (en) | Novel thiostrepton analogue and its preparation method and use | |
| US10590159B2 (en) | Lincomycin biosynthetic intermediates, method for preparation, and use thereof | |
| CN102911957B (en) | Biosynthesis gene cluster of griseoviridin and viridogrisein and application of biosynthesis gene cluster | |
| Zhang et al. | In vivo production of thiopeptide variants | |
| US20140228278A1 (en) | Antibiotics and methods for manufacturing the same | |
| CN116239607A (en) | A novel derivative of maytansine and its biosynthesis method and application | |
| US8329430B2 (en) | Polymyxin synthetase and gene cluster thereof | |
| KR100861697B1 (en) | Production Method of Ansamycin Derivative Using AHVA Biosynthetic Gene Mutation and Novel Ansamycin Derivatives | |
| CN107641146B (en) | High-yield production strain of salinomycin and analogues thereof, preparation method of salinomycin and analogues thereof and application of salinomycin and analogues thereof | |
| CN105755076B (en) | Method for obtaining SANSANMYCIN structural analogue by mutation synthesis | |
| Weijia et al. | Construction and heterologous expression of the di-AFN A1 biosynthetic gene cluster in Streptomyces model strains | |
| Stegmann et al. | Precursor-directed biosynthesis for the generation of novel glycopetides | |
| KR20130097538A (en) | Chejuenolide biosynthetic gene cluster from hahella chejuensis | |
| CN102260644A (en) | Mutant strain of Streptomyces flaveolus and construction method and application thereof | |
| CN102174539A (en) | Piericidin A1 biosynthetic gene cluster |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 22891046 Country of ref document: EP Kind code of ref document: A1 |
| | 32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established | Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 22891046 Country of ref document: EP Kind code of ref document: A1 |