[go: up one dir, main page]

WO2024178007A1 - Mafb promoter for gene therapy for diabetes - Google Patents

Mafb promoter for gene therapy for diabetes Download PDF

Info

Publication number
WO2024178007A1
WO2024178007A1 PCT/US2024/016536 US2024016536W WO2024178007A1 WO 2024178007 A1 WO2024178007 A1 WO 2024178007A1 US 2024016536 W US2024016536 W US 2024016536W WO 2024178007 A1 WO2024178007 A1 WO 2024178007A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
vector
promoter
acid molecule
mafb
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2024/016536
Other languages
French (fr)
Inventor
George GITTES
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Pittsburgh
Original Assignee
University of Pittsburgh
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Pittsburgh filed Critical University of Pittsburgh
Publication of WO2024178007A1 publication Critical patent/WO2024178007A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P3/00Drugs for disorders of the metabolism
    • A61P3/08Drugs for disorders of the metabolism for glucose homeostasis
    • A61P3/10Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/82Translation products from oncogenes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N5/00Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
    • C12N5/06Animal cells or tissues; Human cells or tissues
    • C12N5/0602Vertebrate cells
    • C12N5/0676Pancreatic cells
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • A61K48/0058Nucleic acids adapted for tissue specific expression, e.g. having tissue specific promoters as part of a contruct
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2510/00Genetically modified cells
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14143Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/34Vector systems having a special element relevant for transcription being a transcription initiation element

Definitions

  • Sequence Listing is submitted as an XML file in the form of the file named “Sequences” (25,742 bytes), which was created on February 2, 2024, which is incorporated by reference herein.
  • This relates to the field of alpha cells, specifically to Musculoaponeurotic fibrosarcoma oncogene homolog (Maf)B promoters that can be used to express a heterologous protein in alpha cells.
  • Musculoaponeurotic fibrosarcoma oncogene homolog (Maf)B promoters that can be used to express a heterologous protein in alpha cells.
  • a mammalian pancreas is composed of two subclasses of tissue: the exocrine cells of the acinar tissue and the endocrine cells of the islets of Langerhans.
  • the exocrine cells produce digestive enzymes that are secreted through the pancreatic duct to the intestine.
  • the islet cells produce polypeptide hormones that are involved in carbohydrate metabolism.
  • the islands of endocrine tissue that exist within the adult mammalian pancreas are termed the islets of Langerhans.
  • Adult mammalian islets are composed of five major cell types, the alpha (a), beta (3), delta (8), pancreatic polypeptide (PP), and ghrelin (s) cells. These cells are distinguished by their production of glucagon, insulin, somatostatin, pancreatic polypeptide, and ghrelin, respectively.
  • Adeno-associated virus encoding PDX1 and MafA can be infused through the pancreatic duct, such as by using endoscopic retrograde cholangiopancreatography (ERCP), to transdifferentiate a cells into 0 cells (see PCT Publication No. WO 2015/164218, incorporated herein by reference).
  • ERCP endoscopic retrograde cholangiopancreatography
  • promoters that can be used to achieve high levels of these proteins, and/or other proteins, in alpha cells.
  • MafB promoters are disclosed herein, wherein the MafB promoter consists essentially of, or consists of a nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter, or the nucleotide sequence of SEQ ID NO: 1.
  • recombinant nucleic acid molecules include a MafB promoter operably linked to a heterologous nucleic acid molecule encoding a protein.
  • vectors include these recombinant nucleic acid molecules, and host cells transformed with these vectors. In more aspects, methods of using these vectors and host cells are disclosed. Methods for treating diabetes in a subject are also provided.
  • FIG. 1 provides a schematic diagram of a construct including the human MafB promoter, and evidence of its use to target human alpha cells. Results are provided from three days after transfection of human alpha cells in the lower panels. Expression is compared to a similar construct that includes the cytomegalovirus (CMV) promoter instead of the MafB promoter.
  • CMV cytomegalovirus
  • FIG. 2 provides two schematic diagrams of constructs that include the human MafB promoter, that can be used to target human alpha cells. Two schematic diagrams of control constructs, that include the CMV promoter, are also provided.
  • FIG. 3 is a bar graph showing the results of an Insulin ELISA. Results are from three days after transfection of human alpha cells with constructs shown in FIG. 2.
  • FIG. 4 provides graphs showing the daily blood glucose (top) and daily insulin requirement (bottom) from a monkey administered Tina-059, shown in FIG. 2.
  • FIG. 5 is a graph of the results from a glucose tolerance test in a monkey following treatment with Tina-059.
  • FIG. 6 is a graph of the insulin levels in the same monkey as Fig. 5 30 minutes after glucose administration following treatment with Tina-059.
  • FIG. 7 is a schematic diagram of pTina-055.
  • This vector includes a human MafB promoter, a chimeric intron, a Kozak sequence, a nucleic acid molecule encoding human PDX1 (without a STOP codon), P2A, WPRE, and a poly(A) sequence.
  • This vector also includes a nucleic acid molecule encoding human MafA (without a STOP codon) T2A and a nucleic acid molecule encoding human MafB.
  • FIG. 8 is a schematic diagram of pTina-059.
  • This vector includes a human MafB promoter, a chimeric intron, a Kozak sequence, a nuclei acid encoding human PDX1 (no STOP CODON), P2A, WPRE, and a poly(A) sequence.
  • This vector also includes a nucleic acid molecule encoding human MafA (with a STOP codon) but does not include T2A. It does not include a nucleic acid molecule encoding human MafB.
  • FIG. 9 is a graph of a glucose tolerance tests (GTT) for a first monkey (M263) that received a pancreatic ductal infusion of AAV6 carrying pTina-059.
  • GTT glucose tolerance tests
  • M263 received mycophenolate mofetil (MMF) for immunosuppression.
  • FIG. 10 is a graph of a GTT for a second monkey (Ml 88) that received a pancreatic ductal infusion of AAV6 carrying Tina-059.
  • nucleic and amino acid sequences listed are shown using standard letter abbreviations for nucleotide bases, and three letter code for amino acids, as defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood as included by any reference to the displayed strand.
  • SEQ ID NO: 1 is a nucleic acid sequence of an exemplary MafB promoter.
  • SEQ ID NO: 2 is an amino acid sequence of and exemplary human MafA.
  • SEQ ID NO: 3 is an amino acid sequence of an exemplary mouse MafA.
  • SEQ ID NO: 4 is an amino acid sequence of an exemplary human PDX1.
  • SEQ ID NO: 5 is an amino acid sequence of an exemplary mouse PDX1.
  • SEQ ID NOs: 6-8 are the nucleic acid sequences of exemplary connectors.
  • SEQ ID NO: 9 is an amino acid sequence of an exemplary human MafB.
  • SEQ ID NO: 10 is an amino acid sequence of am exemplary mouse MafB.
  • SEQ ID NO: 11 is an amino acid sequence of an exemplary Ngn3.
  • SEQ ID NO: 12 is the nucleic acid sequence a portion of an exemplary MafB promoter.
  • SEQ ID NO: 13 is a nucleic acid sequence of a chimeric intron.
  • SEQ ID NO: 14 is a nucleic acid sequence of a Kozak sequence.
  • SEQ ID NO: 15 is a nucleic acid sequence encoding human PDX1, without a STOP codon.
  • SEQ ID NO: 16 is a nucleic acid sequence of a P2A connector.
  • SEQ ID NO: 17 is a nucleic acid sequence encoding human MafA, without a STOP codon.
  • SEQ ID NO: 18 is a nucleic acid sequence of a T2A connector.
  • SEQ ID NO: 19 is a nucleic acid sequence of human MafA, with a STOP codon.
  • SEQ ID NO: 20 is a nucleic acid sequence of a Woodchuck hepatitis virus Posttranscriptional
  • SEQ ID NO: 21 is a nucleic acid sequence encoding a poly A tail.
  • MafB is a transcription factor that is expressed in alpha cells in mice, but not in mouse beta cells. However, in humans and non-human primates it is expressed in both alpha cells and beta cells.
  • Gene therapy constructs were engineered in which a shortened form of the MafB promoter drives expression of PDX1 and MafA, with or without additionally driving MafB production. This construct is able to convert human and mouse alpha cell lines into insulin-producing cells.
  • MafB promoters that can be operably linked to a heterologous nucleic acid molecule encoding a protein.
  • Recombinant nucleic acid molecules including such a MafB promoter operably linked to the heterologous nucleic acid molecule encoding a protein provide a high level of expression of the protein in alpha cells.
  • Vectors such as adeno-associated virus vectors including a disclosed MafB promoter operably linked to a heterologous nucleic acid molecule encoding PDX1 and MafA, and optionally MafB itself, can be infused through the pancreatic duct, such as by using endoscopic retrograde cholangiopancreatography (ERCP), to transdifferentiate a cells into functional 0 cells.
  • ERCP endoscopic retrograde cholangiopancreatography
  • PCT Publication No. WO 2015/164218 discloses that adeno- associated virus encoding PDX1 and MafA can be infused through the pancreatic duct, such as by using endoscopic retrograde cholangiopancreatography (ERCP), to reprogram a cells into 0 cells, which secrete insulin.
  • ERCP endoscopic retrograde cholangiopancreatography
  • the new beta cells are immunologically unrecognized for an extended period, resulting in persistent euglycemia without further interventions.
  • the immune system in treated subjects was reverted to a naive state in which the immune cells were not actively being exposed to beta-cell autoantigens.
  • the presently disclosed promoters can be used to achieve high levels of expression of MafA and PDX1 and can be used to increase the efficiency of these methods.
  • a beta cell includes single or plural beta cells and can be considered equivalent to the phrase “at least one beta cell.”
  • the term “comprises” means “includes.” It is further to be understood that any and all base sizes or amino acid sizes, and all molecular weight or molecular mass values, given for nucleic acids or polypeptides are approximate, and are provided for descriptive purposes, unless otherwise indicated. Although many methods and materials similar or equivalent to those described herein can be used, particular suitable methods and materials are described herein. In case of conflict, the present specification, including explanations of terms, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting. To facilitate review of the various aspects, the following explanations of terms are provided:
  • Alpha (a) cells Mature glucagon producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans.
  • Beta (P) cells Mature insulin producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans,
  • Delta (8) cells Mature somatostatin producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans.
  • PP cells Mature pancreatic polypeptide (PP) producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans.
  • Adeno-associated virus A small, replication-defective, non-enveloped virus that infects humans and some other primate species. AAV is not known to cause disease and elicits a very mild immune response. Gene therapy vectors that utilize AAV can infect both dividing and quiescent cells and can persist in an extrachromosomal state without integrating into the genome of the host cell. These features make AAV an attractive viral vector for gene therapy. There are currently 14 recognized serotypes of AAV (AAV1-13 and AAV-DJ).
  • Administration To provide or give a subject an agent by any effective route.
  • routes of administration include, but are not limited to, oral, injection (such as subcutaneous, intramuscular, intradermal, intraperitoneal, intravenous, and intratumoral), sublingual, rectal, transdermal, intranasal, vaginal and inhalation routes.
  • administration is to a pancreatic duct.
  • Agent Any polypeptide, compound, small molecule, organic compound, salt, polynucleotide, vector, or other molecule of interest.
  • Agent can include a therapeutic agent, a diagnostic agent or a pharmaceutical agent.
  • a therapeutic agent is a substance that demonstrates some therapeutic effect by restoring or maintaining health, such as by alleviating the symptoms associated with a disease or physiological disorder, or delaying (including preventing) progression or onset of a disease, such as T1D.
  • a therapeutic agent can include a disclosed MafB promoter operably linked to a nucleic acid molecule encoding a polypeptide.
  • Amino acid substitution The replacement of one amino acid in a polypeptide with a different amino acid.
  • Animal Living multi-cellular vertebrate organisms, a category that includes, for example, mammals and birds.
  • mammal includes both human and non-human mammals.
  • subject includes both human and veterinary subjects.
  • Anti-diabetic lifestyle modifications Changes to lifestyle, habits, and practices intended to alleviate the symptoms of diabetes or pre-diabetes. Obesity and sedentary lifestyle may both independently increase the risk of a subject developing type 2 diabetes, so anti-diabetic lifestyle modifications include those changes that will lead to a reduction in a subject’s body mass index (BMI), increase physical activity, or both. Specific, non-limiting examples include the lifestyle interventions described in Diabetes Care, 22(4):623-34 at pages 626-27, herein incorporated by reference.
  • Conservative Substitutions Modifications of a polypeptide that involve the substitution of one or more amino acids for amino acids having similar biochemical properties that do not result in change or loss of a biological or biochemical function of the polypeptide are designated “conservative” substitutions. These conservative substitutions are likely to have minimal impact on the activity of the resultant protein.
  • Table 1 shows amino acids that can be substituted for an original amino acid in a protein, and which are regarded as conservative substitutions.
  • One or more conservative changes, or up to ten conservative changes can be made in the polypeptide without changing a biochemical function of the protein, such as PDX1 or MafA.
  • Consists essentially of: With regarding to nucleic acid molecules, a term that indicates that additional nucleotides are not included in the part of the molecule, but that other agents (such as labels, altered forms of a nucleotide, or chemical compounds) can be included. “Consists of’ indicates that the exact nucleotide sequence is present in that part of a molecule.
  • Degenerate variant refers to a polynucleotide encoding a polypeptide (such as PDX1 or MafA) that includes a sequence that is degenerate as a result of the genetic code. There are 20 natural amino acids, most of which are specified by more than one codon. Therefore, all degenerate nucleotide sequences encoding a peptide are included as long as the amino acid sequence of the peptide encoded by the nucleotide sequence is unchanged.
  • Diabetes mellitus A group of metabolic diseases in which a subject has high blood sugar, either because the pancreas does not produce enough insulin, or because cells do not respond to the insulin that is produced.
  • Type 1 diabetes results from the body's failure to produce insulin. This form has also been called ' insulin-dependent diabetes mellitus” (IDDM) or "juvenile diabetes". Type 1 diabetes mellitus is characterized by loss of the insulin-producing P cells, leading to insulin deficiency. This type can be further classified as immune-mediated or idiopathic. Type 2 diabetes results from insulin resistance, a condition in which cells fail to use insulin properly, sometimes combined with an absolute insulin deficiency. This form is also called “non insulin-dependent diabetes mellitus” (NIDDM) or "adult-onset diabetes.” The defective responsiveness of body tissues to insulin is believed to involve the insulin receptor.
  • NIDDM non insulin-dependent diabetes mellitus
  • Diabetes mellitus is characterized by recurrent or persistent hyperglycemia, and is diagnosed by demonstrating any one of: a. Fasting plasma glucose level > 7.0 mmol/1 (126 mg/dl); b. Plasma glucose > 11.1 mmol/1 (200 mg/dL) two hours after a 75 g oral glucose load as in a glucose tolerance test; c. Symptoms of hyperglycemia and casual plasma glucose > 11.1 mmol/1 (200 mg/dl); d. Glycated hemoglobin (Hb A1C) > 6.5%
  • Differentiation The process whereby a first cell acquires specialized structural and/or functional features characteristic of a certain type of mature cells. Similarly, “differentiate” refers to this process. Typically, during differentiation, cellular structure alters and tissue-specific proteins appear.
  • the term "differentiated pancreatic endocrine cell” refers to cells expressing a protein characteristic of the specific pancreatic endocrine cell type.
  • a differentiated pancreatic endocrine cell includes an a cell, a 0 cell, a 8 cell, and a PP cell, which express glucagon, insulin, somatostatin, and pancreatic polypeptide, respectively.
  • Endocrine Tissue which secretes regulatory hormones directly into the bloodstream without the need for an associated duct system.
  • Enhancer A nucleic acid sequence that increases the rate of transcription by increasing the activity of a promoter.
  • Expand A process by which the number or amount of cells is increased due to cell division. Similarly, the terms “expansion” or “expanded” refers to this process.
  • the terms “proliferate,” “proliferation” or “proliferated” may be used interchangeably with the words “expand,” “expansion,” or “expanded.”
  • Proteins may be expressed and remain intracellular, become a component of the cell surface membrane, or be secreted into the extracellular matrix or medium.
  • Exocrine Secretory tissue which distributes its products, such as enzymes, via an associated duct network.
  • the exocrine pancreas is the part of the pancreas that secretes enzymes required for digestion.
  • the exocrine cells of the pancreas include the centroacinar cells and basophilic cells, which produce secretin and cholecystokinin.
  • Expression Control Sequences Nucleic acid sequences that regulate the expression of a heterologous nucleic acid sequence to which it is operatively linked. Expression control sequences are operatively linked to a nucleic acid sequence when the expression control sequences control and regulate the transcription and, as appropriate, translation of the nucleic acid sequence.
  • expression control sequences can include appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, maintenance of the correct reading frame of that gene to permit proper translation of mRNA, and stop codons.
  • control sequences is intended to include, at a minimum, components whose presence can influence expression, and can also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences. Expression control sequences can include a promoter.
  • Expression vector A vector comprising a recombinant polynucleotide comprising expression control sequences operatively linked to a nucleotide sequence to be expressed.
  • An expression vector comprises sufficient cis- acting elements for expression; other elements for expression can be supplied by the host cell or in an in vitro expression system.
  • Non-limiting examples of expression vectors include cosmids, plasmids (e.g., naked or contained in liposomes) and viruses (e.g., lentiviruses, retroviruses, adenoviruses, and adeno-associated viruses) that incorporate the recombinant polynucleotide.
  • a polynucleotide can be inserted into an expression vector that contains a promoter sequence which facilitates the efficient transcription of the inserted genetic sequence of the host.
  • the expression vector typically contains an origin of replication, a promoter, as well as specific nucleic acid sequences that allow phenotypic selection of the transformed cells.
  • Glucagon A pancreatic enzyme produced by the pancreatic a cells in vivo.
  • Exemplary glucagon amino acid sequences are shown in GENBANK® accession Nos: NP_002045.1 (pro-protein) (human); NP 032126.1 (mouse), both incorporated by reference.
  • the term “glucagon” also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the function, such as binding to the glucagon receptor.
  • Glucagon is encoded by nucleic acid corresponding to GENBANK® Accession No: NM_002054.5 (human); NM_008100.4 (mouse), both incorporated by reference as available on January 4, 2022.
  • the glucagon protein encoded by the glucagon gene which includes a glucagon promoter in vivo.
  • Glucagon is expressed as a preproprotein that is cleaved into four distinct mature peptides, one of which is glucagon.
  • Glucagon is a pancreatic hormone that counteracts the glucose-lowering action of insulin by stimulating glycogenolysis and gluconeogenesis.
  • Glucagon is a ligand for a specific G-protein linked receptor whose signaling pathway controls cell proliferation.
  • a heterologous sequence is a sequence that is not normally (in the wild-type sequence) found adjacent to a second sequence.
  • the sequence is from a different genetic source, such as a virus or organism, than the second sequence.
  • a protein that is heterologous to a MafB promoter is any protein other than MafB (it is not MafB).
  • Host cells Cells in which a vector can be propagated and its DNA expressed.
  • the cell may be prokaryotic or eukaryotic.
  • the term also includes any progeny of the subject host cell. It is understood that all progeny may not be identical to the parental cell since there may be mutations that occur during replication. However, such progeny are included when the term “host cell” is used.
  • Insulin A protein hormone involved in the regulation of blood sugar levels that is produced by pancreatic beta cells. In vivo, insulin is produced as a precursor proinsulin, consisting of the B and A chains of insulin linked together via a connecting C-peptide. Insulin itself includes only the B and A chains. Exemplary nucleic acid sequences encoding insulin are provided in GENBANK® Accession No: NM_000207.3 (human) and NM_008386.4 (mouse), as available on December 28, 2021, and are incorporated by reference herein. The term insulin also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the structure of function.
  • Islets of Langerhans Small discrete clusters of pancreatic endocrine tissue. In vivo, in an adult mammal, the islets of Langerhans are found in the pancreas as discrete clusters (islands) of pancreatic endocrine tissue surrounded by the pancreatic exocrine (or acinar) tissue. In vivo, the islets of Langerhans consist of the a cells, cells, 6 cells, PP cells, and e cells. Histologically, in rodents, the islets of Langerhans consist of a central core of 0 cells surrounded by an outer layer of a cells, 8 cells, and PP cells. The structure of human islets of Langerhans is different and distinct from rodents. The islets of Langerhans are sometimes referred to herein as “islets.”
  • Isolated An “isolated” biological component (such as a nucleic acid, peptide or protein) has been substantially separated, produced apart from, or purified away from other biological components in the cell of the organism in which the component naturally occurs, i.e., other chromosomal and extrachromosomal DNA and RNA, and proteins. Nucleic acids, peptides and proteins which have been “isolated” thus include nucleic acids and proteins purified by standard purification methods. The term also embraces nucleic acids, peptides and proteins prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids. An isolated cell type has been substantially separated from other cell types, such as a different cell type that occurs in an organ. A purified cell or component can be at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% pure.
  • Label A detectable compound or composition that is conjugated directly or indirectly to another molecule, such as an antibody or a protein, to facilitate detection of that molecule.
  • molecule such as an antibody or a protein
  • labels include fluorescent tags, enzymatic linkages, and radioactive isotopes.
  • MafA Musculoaponeurotic fibrosarcoma oncogene homolog A
  • MafA is a transcription factor that binds RIPE3b, a conserved enhancer element that regulates pancreatic beta cell-specific expression of the insulin gene (INS; MIM 176730) (Olbrot et al., 2002).
  • MafA is referred in the art as aliases; v-maf musculoaponeurotic fibrosarcoma oncogene homolog A (avian), hMafA; RIPE3bl; MafA.
  • Exemplary MafA proteins are the MafA protein of GENBANK® Accession No: NM_194350 (mouse) (SEQ ID NOs:3 and 32 of U.S. Published Patent Application No. 2011/0280842) or NP_963883.2 (Human)(SEQ ID NOs: 33 and 32 of U.S. Published Patent Application No. 2011/0280842); GenelD No: 389692, which are all incorporated by reference.
  • the term MafA also encompasses species variants, homologues, allelic forms. mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions that do not adversely affecting the structure of function.
  • MafA refers to a polypeptide having a naturally occurring amino acid sequence of a MafA" protein or a fragment, variant, or derivative thereof retains the ability of the naturally occurring protein to bind to DNA and activate gene transcription of Glut2 and pyruvate carboxylase, and other genes such as Glut2, PDX-1, Nkx6.1, GLP-1 receptor, prohormone convertase-1/3 as disclosed in Wang et al., Diabetologia. 2007 February; 50(2): 348- 358, which is incorporated herein by reference.
  • Exemplary MafA nucleic acids are GENBANK® Accession No: NM_201589 (human) (SEQ ID NO:36 of U.S. Published Patent Application No. 2011/0280842) and GENBANK® Accession No: NM_194350 (mouse) (SEQ ID NO: 39 of U.S. Published Patent Application No. 2011/0280842), which are all incorporated by reference.
  • allelic variants of the MafA sequences that may exist in the population, it will be appreciated that, as is the case for virtually all proteins, a variety of changes can be introduced into the sequences of SEQ ID NO: 3 of U.S. Published Patent Application No. 2011/0280842 or SEQ ID NO: 33 of U.S.
  • Musculoaponeurotic fibrosarcoma oncogene homolog B (MafB): A basic leucine zipper (bZIP) transcription factor that plays an important role in the regulation of lineage-specific hematopoiesis.
  • MafB is an encoded nuclear protein that represses ETS1 -mediated transcription of erythroid-specific genes in myeloid cells.
  • MafB is also a negative regulator of cellular induction of type I interferon (IFN) and other pro-inflammatory cytokines, and thus is believed to protect against the development of auto-inflammatory disorders in multiple mammalian tissues.
  • IFN type I interferon
  • MafB belongs to the large Maf family of transcription factors, which consists of MafB, c-Maf, MafA and NrL.
  • MafB expression is generally restricted to alpha cells. However, in humans and non-human primates, MafB is expressed in beta cells (see Cyphert et al., Diabetes 2019;68(2):337-348, released on-line November 13, 2018).
  • Exemplary nucleic acid molecules encoding human MafB, and the encoded human MafB protein sequence are provided in GENBANK® Accession No. AF134157.1, August 31, 1999, incorporated herein by reference.
  • Exemplary nucleic acid molecules encoding murine MafB, and the encoded murine MafB protein sequence are provided in GENBANK® Accession No: NM_010658.3, December 21, 2022, incorporated herein by reference.
  • the MafB gene is disclosed at Gene ID No. 9935, updated January 22, 2023, incorporated herein by reference, as available from the National Library of Medicine, available at the NCBI website.
  • Mammal This term includes both human and non-human mammals. Similarly, the term “subject” includes both human and veterinary subjects.
  • Neurogenin (Ngn)-3 Neurogenin-3 (also known as NEUROG3) is expressed in endocrine progenitor cells and is required for endocrine cell development in the pancreas and intestine. It belongs to a family of basic helix-loop-helix transcription factors involved in the determination of neural precursor cells in the neuroectoderm. Ngn3 is referred in the art as aliases; Neurogenin 3; Atoh5; Math4B; bHLHa7; NEUR0G3. Exemplary Ngn3 proteins are provided in GENBANK® Accession No: NM 009719 (mouse) and SEQ ID NO:2 of U.S. Published Patent Application No.
  • Ngn3 also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the structure of function.
  • Human Ngn3 is encoded by nucleic acid corresponding to GENBANK® Accession No: NM 020999 (human), SEQ ID NO:35 of U.S. Published Patent Application No.
  • Ngn3 refers to a polypeptide having a naturally occurring amino acid sequence of a Ngn3 protein or a fragment, variant, or derivative thereof that retains the ability of the naturally occurring protein to bind to DNA and activate gene transcription of NeuroD, Delta-like l(Dlll), HeyL, insulinoma-assiciated-1 (IA1), Nk2.2, Notch, HesS, Isll, Somatostatin receptor 2 (Sstr2) and other genes as disclosed in Serafimidis et al., Stem cells; 2008; 26; 3-16, which is incorporated herein in its entirety by reference.
  • IA1 insulinoma-assiciated-1
  • Sstr2 Somatostatin receptor 2
  • Ngn3 sequences that may exist in the population, it will be appreciated that, as is the case for virtually all proteins, a variety of changes can be introduced into a wild-type sequence (listed above in GENBANK® enteries) without substantially altering the functional (biological) activity of the polypeptides. Such variants are included within the scope of the terms "Ngn3 ", "Ngn3 protein", etc.
  • Nucleic acid A polymer composed of nucleotide units (ribonucleotides, deoxyribonucleotides, related naturally occurring structural variants, and synthetic non-naturally occurring analogs thereof) linked via phosphodiester bonds, related naturally occurring structural variants, and synthetic non-naturally occurring analogs thereof.
  • nucleotide polymers in which the nucleotides and the linkages between them include non-naturally occurring synthetic analogs, such as, for example and without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O- methyl ribonucleotides, peptide-nucleic acids (PNAs), and the like.
  • oligonucleotide typically refers to short polynucleotides, generally no greater than about 50 nucleotides. It will be understood that when a nucleotide sequence is represented by a DNA sequence (i.e., A, T, G, C), this also includes an RNA sequence (i.e., A, U, G, C) in which “U” replaces “T.”
  • nucleotide sequences the left-hand end of a singlestranded nucleotide sequence is the 5 '-end; the left-hand direction of a double- stranded nucleotide sequence is referred to as the 5’-direction.
  • the direction of 5' to 3' addition of nucleotides to nascent RNA transcripts is referred to as the transcription direction.
  • the DNA strand having the same sequence as an mRNA is referred to as the “coding strand;” sequences on the DNA strand having the same sequence as an mRNA transcribed from that DNA and which are located 5' to the 5'-end of the RNA transcript are referred to as “upstream sequences;” sequences on the DNA strand having the same sequence as the RNA and which are 3' to the 3' end of the coding RNA transcript are referred to as “downstream sequences.”
  • cDNA refers to a DNA that is complementary or identical to an mRNA, in either single stranded or double stranded form.
  • Encoding refers to the inherent property of specific sequences of nucleotides in a polynucleotide, such as a gene, a cDNA, or an mRNA, to serve as templates for synthesis of other polymers and macromolecules in biological processes having either a defined sequence of nucleotides (i.e., rRNA, tRNA and mRNA) or a defined sequence of amino acids and the biological properties resulting therefrom.
  • a gene encodes a protein if transcription and translation of mRNA produced by that gene produces the protein in a cell or other biological system.
  • coding strand the nucleotide sequence of which is identical to the mRNA sequence and is usually provided in sequence listings
  • non-coding strand used as the template for transcription
  • a "nucleotide sequence encoding an amino acid sequence" includes all nucleotide sequences that are degenerate versions of each other and that encode the same amino acid sequence. Nucleotide sequences that encode proteins and RNA may include introns.
  • a first sequence is an “antisense” with respect to a second sequence if a polynucleotide whose sequence is the first sequence specifically hybridizes with a polynucleotide whose sequence is the second sequence.
  • sequence relationships between two or more nucleotide sequences or amino acid sequences include “reference sequence,” “selected from,” “comparison window,” “identical,” “percentage of sequence identity,” “substantially identical,” “complementary,” and “substantially complementary.”
  • sequence comparison For sequence comparison of nucleic acid sequences, typically one sequence acts as a reference sequence, to which test sequences are compared.
  • test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters are used.
  • Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443, 1970, by the search for similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci.
  • PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360, 1987. The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153, 1989.
  • a reference sequence is compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps.
  • PILEUP can be obtained from the GCG sequence analysis software package, such as version 7.0 (Devereaux el al., Nuc. Acids Res. 12:387-395, 1984.
  • BLAST Altschul et al., J. Mol. Biol. 215:403-410, 1990 and Altschul et al., Nucleic Acids Res. 25:3389-3402, 1977.
  • Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/).
  • the BLASTP program (for amino acid sequences) uses as defaults a word length (W) of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915, 1989).
  • a first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence.
  • a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence.
  • operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame.
  • ORF open reading frame: A series of nucleotide triplets (codons) coding for amino acids without any termination codons. These sequences are usually translatable into a peptide.
  • Pancreatic endocrine cell An endocrine cell of pancreatic origin that produces one or more pancreatic hormone, such as insulin, glucagon, somatostatin, or pancreatic polypeptide.
  • pancreatic hormone such as insulin, glucagon, somatostatin, or pancreatic polypeptide.
  • Subsets of pancreatic endocrine cells include the a (glucagon producing), (insulin producing) 3 (somatostatin producing) or PP (pancreatic polypeptide producing) cells. Additional subsets produce more than one pancreatic hormone, such as, but not limited to, a cell that produces both insulin and glucagon, or a cell that produces insulin, glucagon, and somatostatin, or a cell that produces insulin and somatostatin.
  • Pancreas duodenal homeobox protein (PDX)l PDX1 protein is a transcriptional activator of several genes, including insulin, somatostatin, glucokinase, islet amyloid polypeptide, and glucose transporter type 2 (GLUT2).
  • PDX1 is a nuclear protein is involved in the early development of the pancreas and plays a major role in glucose-dependent regulation of insulin gene expression. Defects in the gene encoding the PDX1 protein are a cause of pancreatic agenesis, which can lead to early-onset insulindependent diabetes mellitus (NIDDM), as well as maturity onset diabetes of the young type 4 (M0DY4).
  • NIDDM early-onset insulindependent diabetes mellitus
  • M0DY4 maturity onset diabetes of the young type 4
  • PDX1 is referred in the art as aliases; pancreatic and duodenal homeobox 1, IDX-1, STF-1, PDX-1, M0DY4, Ipfl.
  • Exemplary PDX1 proteins are shown in GENBANK® Accession No. NM 008814 (mouse) (SEQ ID NO:1 of U.S. Published Patent Application No. 2011/0280842) or GENBANK® Accession No. NP 000200.1 (Human)(SEQ ID NO: 31 of U.S. Published Patent Application No. 2011/0280842), or Gene ID: 3651, which are all incorporated herein by reference.
  • PDX1 also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the structure of function.
  • Exemplary nucleic acid sequences are shown in GENBANK® Accession No NM 000209 (human) (SEQ ID NO:34 of U.S. Published Patent Application No. 2011/0280842) or GENBANK® Accession No NM 008814 (mouse)(SEQ ID NO: 37 of U.S. Published Patent Application No. 2011/0280842), which are all incorporated by reference.
  • PDX1 refers to a polypeptide having a naturally occurring amino acid sequence of a PDX1 protein or a fragment, variant, or derivative thereof that at least in part retains the ability of the naturally occurring protein to bind to DNA and activate gene transcription of insulin, somatostatin, glucokinase, islet amyloid polypeptide, and glucose transporter type 2 (GLUT2).
  • allelic variants of the PDX1 sequences that may exist in the population, it will be appreciated that, as is the case for virtually all proteins, a variety of changes can be introduced into a wild type sequence (see the listed GENBANK® enteries) without substantially altering the functional (biological) activity of the polypeptides. Such variants are included within the scope of the terms "PDX1", "PDX1 protein", etc.
  • the listed GENBANK® Accession Nos. and U.S. Published Patent Application No. 2011/0280842 are incorporated by reference herein.
  • parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle.
  • pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle.
  • physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like
  • solid compositions e.g., powder, pill, tablet, or capsule forms
  • conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate.
  • compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.
  • non-toxic auxiliary substances such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.
  • Pharmaceutical agent A chemical compound or a composition capable of inducing a desired therapeutic or prophylactic effect when properly administered to a subject or a cell.
  • “Incubating” includes a sufficient amount of time for a drug to interact with a cell.
  • “Contacting” includes incubating a drug in solid or in liquid form with a cell.
  • Pre-diabetes A state in which some, but not all, of the criteria for diabetes are met.
  • a subject can have impaired fasting glycemia or impaired fasting glucose (IFG).
  • IGF impaired fasting glucose
  • Subjects with fasting glucose levels from 110 to 125 mg/dl (6.1 to 6.9 mmol/1) are considered to have impaired fasting glucose.
  • Subjects with plasma glucose at or above 140 mg/dL (7.8 mmol/1), but not over 200 mg/dl (11.1 mmol/1), two hours after a 75 g oral glucose load are considered to have impaired glucose tolerance.
  • Predisposition for diabetes A subject that is at high risk for developing diabetes.
  • risk factors include: genetic factors (e.g., carrying alleles that result in a higher occurrence of diabetes than in the average population or having parents or siblings with diabetes); overweight (e.g., body mass index (BMI) greater or equal to 25 kg/m.sup.2); habitual physical inactivity, race/ethnicity (e.g., African-American, Hispanic-American, Native Americans, Asian-Americans, Pacific Islanders); previously identified impaired fasting glucose or impaired glucose tolerance, hypertension (e.g., greater or equal to 140/90 mmHg in adults); HDL cholesterol greater or equal to 35 mg/dl; triglyceride levels greater or equal to 250 mg/dl; a history of gestational diabetes or delivery of a baby over nine pounds; and/or polycystic ovary syndrome. See, e.g., "Report of the Expert Committee on the Diagnosis and Class
  • Polypeptide A polymer in which the monomers are amino acid residues that are joined together through amide bonds. When the amino acids are alpha-amino acids, either the L-optical isomer or the D- optical isomer can be used, the L-isomers being preferred.
  • polypeptide or “protein” as used herein is intended to encompass any amino acid sequence and include modified sequences such as glycoproteins.
  • polypeptide is specifically intended to cover naturally occurring proteins, as well as those that are recombinantly or synthetically produced.
  • polypeptide fragment refers to a portion of a polypeptide which exhibits at least one useful epitope.
  • functional fragments of a polypeptide refers to all fragments of a polypeptide that retain an activity of the polypeptide.
  • Biologically functional fragments can vary in size from a polypeptide fragment as small as an epitope capable of binding an antibody molecule to a large polypeptide capable of participating in the characteristic induction or programming of phenotypic changes within a cell.
  • An “epitope” is a region of a polypeptide capable of binding an immunoglobulin generated in response to contact with an antigen. Thus, smaller peptides containing the biological activity of insulin, or conservative variants of the insulin, are thus included as being of use.
  • soluble refers to a form of a polypeptide that is not inserted into a cell membrane.
  • substantially purified polypeptide refers to a polypeptide which is substantially free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated.
  • the polypeptide is at least 50%, for example at least 80% free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated.
  • the polypeptide is at least 90% free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated.
  • the polypeptide is at least 95% free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated.
  • Conservative substitutions replace one amino acid with another amino acid that is similar in size, hydrophobicity, etc. Variations in the cDNA sequence that result in amino acid changes, whether conservative or not, should be minimized in order to preserve the functional and immunologic identity of the encoded protein.
  • the immunologic identity of the protein may be assessed by determining if it is recognized by an antibody; a variant that is recognized by such an antibody is immunologically conserved. Any cDNA sequence variant will preferably introduce no more than twenty, and preferably fewer than ten amino acid substitutions into the encoded polypeptide. Variant amino acid sequences may, for example, be 80, 90 or even 95% or 98% identical to the native amino acid sequence.
  • Polynucleotide A nucleic acid sequence (such as a linear sequence) of any length. Therefore, a polynucleotide includes oligonucleotides, and also gene sequences found in chromosomes.
  • An “oligonucleotide” is a plurality of joined nucleotides joined by native phosphodiester bonds.
  • An oligonucleotide is a polynucleotide of between 6 and 300 nucleotides in length.
  • An oligonucleotide analog refers to moieties that function similarly to oligonucleotides but have non-naturally occurring portions.
  • oligonucleotide analogs can contain non-naturally occurring portions, such as altered sugar moieties or inter-sugar linkages, such as a phosphorothioate oligodeoxynucleotide.
  • Functional analogs of naturally occurring polynucleotides can bind to RNA or DNA, and include peptide nucleic acid (PNA) molecules.
  • PNA peptide nucleic acid
  • Preventing a disease (such as Type 1 diabetes) refers to inhibiting the full development of a disease in a subject with a pre-disposition to develop the disease.
  • Treating refers to a therapeutic intervention that ameliorates a sign or symptom of a disease or pathological condition after it has begun to develop.
  • “Ameliorating” refers to the reduction in the number or severity of signs or symptoms of a disease.
  • a promoter is an array of nucleic acid control sequences which direct transcription of a nucleic acid.
  • a promoter includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of a polymerase II type promoter, a TATA element.
  • a promoter can optionally include distal enhancer or repressor elements which can be located as much as several thousand base pairs from the start site of transcription. Also included arc those promoter elements which arc sufficient to render promoterdependent gene expression controllable for cell-type specific, tissue-specific, or inducible by external signals or agents; such elements may be located in the 5' or 3' regions of the gene. Both constitutive and inducible promoters are included (see for example, Bitter et al., Methods in Enzymology 153:516-544, 1987).
  • purified does not require absolute purity; rather, it is intended as a relative term.
  • a purified peptide, protein, virus, or other active compound is one that is isolated in whole or in part from naturally associated proteins and other contaminants.
  • substantially purified refers to a peptide, protein, virus or other active compound that has been isolated from a cell, cell culture medium, or other crude preparation and subjected to fractionation to remove various components of the initial preparation, such as proteins, cellular debris, and other components.
  • a recombinant nucleic acid is one that has a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, such as by genetic engineering techniques.
  • a recombinant nucleic acid can include a nucleic acid that has a non-coding function (such as a promoter. origin of replication, ribosome-binding site, etc.).
  • a recombinant protein is one encoded for by a recombinant nucleic acid molecule.
  • a recombinant virus is a virus comprising sequence (such as genomic sequence) that is non-naturally occurring or made by artificial combination of at least two sequences of different origin.
  • the term “recombinant” also includes nucleic acids, proteins and viruses that have been altered solely by addition, substitution, or deletion of a portion of a natural nucleic acid molecule, protein or virus.
  • “recombinant AAV” refers to an AAV particle in which a recombinant nucleic acid molecule (such as a recombinant nucleic acid molecule comprising a MafB promoter and encoding PDX1 and MafA) has been packaged.
  • a host cell that comprises the recombinant nucleic acid is referred to as a “recombinant host cell.”
  • Selectable Marker A gene, RNA, or protein that when expressed, confers upon cells a selectable phenotype, such as resistance to a cytotoxic or cytostatic agent (e.g., antibiotic resistance), nutritional prototrophy, or expression of a particular protein that can be used as a basis to distinguish cells that express the protein from cells that do not. Proteins whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance (“detectable markers”) constitute a subset of selectable markers.
  • a selectable marker linked to expression control elements native to a gene that is normally expressed selectively or exclusively in pluripotent cells makes it possible to identify and select specific cells of interest.
  • selectable marker genes can be used, such as neomycin resistance gene (neo), puromycin resistance gene (puro), guanine phosphoribosyl transferase (gpt), dihydrofolate reductase (DHFR), adenosine deaminase (ada), puromycin-N-acetyltransf erase (PAC), hygromycin resistance gene (hyg), multidrug resistance gene (mdr), thymidine kinase (TK), hypoxanthine-guanine phosphoribosyltransferase (HPRT), and hisD gene.
  • neomycin resistance gene neo
  • puro puro
  • DHFR dihydrofolate reductase
  • ada puromycin-N-acetyltransf erase
  • hygromycin resistance gene hygromycin resistance gene
  • mdr
  • Detectable markers include green fluorescent protein (GFP) blue, sapphire, yellow, red, orange, and cyan fluorescent proteins and variants of any of these. Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also of use.
  • GFP green fluorescent protein
  • Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also of use.
  • selectable marker as used herein can refer to a gene or to an expression product of the gene, e.g., an encoded protein.
  • Sequence identity of amino acid sequences The similarity between amino acid sequences is expressed in terms of the similarity between the sequences, otherwise referred to as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are. Homologs or variants of a polypeptide will possess a relatively high degree of sequence identity when aligned using standard methods.
  • NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al., J. Mol. Biol. 215:403, 1990) is available from several sources, including the National Center for Biotechnology Information (NCBI, Bethesda, MD) and on the internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. A description of how to determine sequence identity using this program is available on the NCBI website on the internet.
  • Homologs and variants of proteins are typically characterized by possession of at least about 75%, for example at least about 80%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity counted over the full length alignment with the amino acid sequence of the antibody using the NCBI Blast 2.0, gapped blastp set to default parameters.
  • the Blast 2 sequences function is employed using the default BLOSUM62 matrix set to default parameters, (gap existence cost of 11, and a per residue gap cost of 1).
  • the alignment should be performed using the Blast 2 sequences function, employing the PAM30 matrix set to default parameters (open gap 9, extension gap 1 penalties). Proteins with even greater similarity to the reference sequences will show increasing percentage identities when assessed by this method, such as at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity. When less than the entire sequence is being compared for sequence identity, homologs and variants will typically possess at least 80% sequence identity over short windows of 10-20 amino acids, and may possess sequence identities of at least 85% or at least 90% or 95% depending on their similarity to the reference sequence.
  • Specific binding agent An agent that binds substantially only to a defined target.
  • a 0 cell specific binding agent is an agent that binds substantially to a 0 cell
  • a pancreatic endocrine cell specific binding agent is an agent that binds substantially only to pancreatic endocrine cells or a subset thereof (and not to pancreatic exocrine cells).
  • a pancreatic exocrine cell specific binding agent is an agent that binds substantially to exocrine cells.
  • the specific binding agent is a monoclonal or polyclonal antibody that specifically binds a type of pancreatic cell.
  • the term "specifically binds" refers, with respect to a cell, such as a pancreatic endocrine cell, to the preferential association of an antibody or other ligand, in whole or part, with a cell or tissue bearing that antigen and not to cells or tissues lacking that antigen. It is, of course, recognized that a certain degree of non-specific interaction may occur between a molecule and a non-target cell or tissue. Nevertheless, specific binding may be distinguished as mediated through specific recognition of the antigen. Although selectively reactive antibodies bind antigen, they may do so with low affinity. On the other hand, specific binding results in a much stronger association between the antibody (or other ligand) and cells bearing the antigen than between the bound antibody (or other ligand) and cells lacking the antigen.
  • Specific binding typically results in greater than 2-fold, such as greater than 5-fold, greater than 10-fold, or greater than 100- fold increase in amount of bound antibody or other ligand (per unit time) to a cell or tissue expressing the target epitope as compared to a cell or tissue lacking this epitope.
  • Specific binding to a protein under such conditions requires an antibody that is selected for its specificity for a particular protein.
  • a variety of immunoassay formats are appropriate for selecting antibodies or other ligands specifically immunoreactive with a particular protein.
  • solid-phase ELISA immunoassays are routinely used to select monoclonal antibodies specifically immunoreactive with a protein. See Harlow & Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York (1988), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity.
  • Subject Any mammal, such as humans, non-human primates, pigs, sheep, cows, rodents and the like which is to be the recipient of the particular treatment.
  • a subject is a human subject or a murine subject.
  • Therapeutic agent Used in a generic sense, it includes treating agents, prophylactic agents, and replacement agents.
  • a therapeutic agent can be a nucleic acid molecule encoding MafA and PDX-1, or a vector encoding these factors.
  • Therapeutically effective amount or Effective amount A quantity of a specified pharmaceutical or therapeutic agent (e.g., a recombinant AAV) sufficient to achieve a desired effect in a subject, or in a cell, being treated with the agent, such as increasing insulin production.
  • a specified pharmaceutical or therapeutic agent e.g., a recombinant AAV
  • the effective amount of the agent will be dependent on several factors, including, but not limited to the subject or cells being treated, and the manner of administration of the therapeutic composition.
  • Transdifferentiate A process wherein a differentiated cell of one type, such as an alpha cell, differentiates into a differentiated cell of another type, such as a beta cells.
  • a virus or vector “transduces” a cell when it transfers nucleic acid into the cell.
  • a cell is “transformed” or “transfected” by a nucleic acid transduced into the cell when the DNA becomes stably replicated by the cell, either by incorporation of the nucleic acid into the cellular genome, or by episomal replication.
  • transfection Numerous methods of transfection are known to those skilled in the art, such as: chemical methods (e.g., calcium-phosphate transfection), physical methods (e.g., electroporation, microinjection, particle bombardment), fusion (e.g., liposomes), receptor-mediated endocytosis (e.g., DNA-protein complexes, viral envelope/capsid-DNA complexes) and by biological infection by viruses such as recombinant viruses.
  • Methods for the introduction of genes into the pancreatic endocrine cells are known (e.g. see U.S. Patent No. 6,110,743). These methods can be used to transduce a pancreatic endocrine cell produced by the methods described herein, or an artificial islet produced by the methods described herein.
  • Genetic modification of the target cell is an indicium of successful transfection.
  • Genetically modified cells refers to cells whose genotypes have been altered as a result of cellular uptakes of exogenous nucleotide sequence by transfection.
  • a reference to a transfected cell or a genetically modified cell includes both the particular cell into which a vector or polynucleotide is introduced and progeny of that cell.
  • Transgene An exogenous gene supplied by a vector, such as AAV.
  • Vector A nucleic acid molecule as introduced into a host cell, thereby producing a transformed host cell.
  • a vector may include nucleic acid sequences that permit it to replicate in the host cell, such as an origin of replication.
  • a vector may also include one or more therapeutic genes and/or selectable marker genes and other genetic elements known in the art.
  • a vector can transduce, transform or infect a cell, thereby causing the cell to express nucleic acids and/or proteins other than those native to the cell.
  • a vector optionally includes materials to aid in achieving entry of the nucleic acid into the cell, such as a viral particle, liposome, protein coating or the like.
  • the vector is an AAV vector.
  • MafB promoters are disclosed herein that can be linked to a nucleic acid molecule encoding a heterologous protein, and used to treat diabetes in a subject.
  • a recombinant nucleic acid molecule comprising a MafB promoter operably linked to a nucleic acid molecule encoding a heterologous protein
  • the MafB promoter consists essentially of: a) a nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter; or b) the nucleotide sequence of SEQ ID NO: 1.
  • the MafB promoter consists of a) the nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter; or b) the nucleotide sequence of SEQ ID NO: 1.
  • Vectors arc also disclosed that include the comprising the recombinant nucleic acid molecule.
  • the vector is a viral vector.
  • the viral vector can be a lentivirus vector, an adenovirus vector or an adeno-associated virus (AAV) vector, such as, but not limited to, an AAV6 vector.
  • AAV adeno-associated virus
  • the MafB promoter is operably linked to a nucleic acid molecule encoding MafA and/or a nucleic acid molecule encoding PDX1.
  • the MafB promoter can also be operably linked to a nucleic acid molecule encoding MafB.
  • the recombinant nucleic acid molecule includes both the nucleic acid molecule encoding MafA and the nucleic acid molecule encoding PDX1, and includes a connector between the nucleic acid molecule encoding MafA and the nucleic acid molecule encoding PDX1.
  • the connector is a 2 A connector.
  • host cells transformed with the disclosed recombinant nucleic acid molecule or the disclosed vectors.
  • the host cell can be a mammalian host cell, such as a human host cell or a non-human primate host cell.
  • the host cell can be an alpha cell, such as a human alpha cell.
  • methods for producing a heterologous protein in a host cell that include transforming a host cell with an effective amount of a disclosed vector, thereby producing the heterologous protein in the host cell.
  • the host cell can be a mammalian host cell, such as a human host cell or a non-human primate host cell.
  • compositions that include an effective amount of a disclosed recombinant nucleic acid molecule or the vector and a pharmaceutically acceptable carrier.
  • These compositions can also include a contrast dye for endoscopic retrograde cholangiopancreatography.
  • the contrast dye can be a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye.
  • the contrast dye can be lopromid, loglicinate, or loxaglinate.
  • Methods are also disclosed for producing pancreatic beta cells in a subject, that include administering to the subject a composition comprising an effective amount of a disclosed vector, wherein the MafB promoter is operably linked to the nucleic acid molecule encoding PDX1 and a nucleic acid molecule encoding MafA, wherein the vector does not encode Neurogenin 3 (Ngn3), wherein the subject is not administered any other nucleic acid encoding Ngn3, wherein the vector is administered intraductally into a pancreatic duct of the subject.
  • Ngn3 Neurogenin 3
  • the MafB promoter is further operably linked to a nucleic acid molecule encoding MafB.
  • the method includes the use of endoscopic retrograde cholangiopancreatography (ERCP).
  • the subject is not administered an immunosuppressive agent.
  • the subject is administered an immunosuppressive agent.
  • the subject is a human.
  • the subject has Type 1 diabetes.
  • the composition further includes a contrast dye for endoscopic retrograde cholangiopancreatography.
  • the contrast dye can be a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high-viscosity dye.
  • the contrast dye can be lopromid, loglicinate, or loxaglinate.
  • compositions include a disclosed vector, wherein the MafB promoter is operably linked to the nucleic acid molecule encoding PDX1 and the nucleic acid molecule encoding MafaA. These compositions are of use for treating diabetes in a subject.
  • MafB promoters that include a portion of the full-length MafB promoter.
  • An exemplary human MafB promoter (348 nucleotides) consists of, or consists essentially of: GGACCTGCAGGGTACGGGGGTGGGCCCAGGCTGGCCGTCCCACTCCGCTCCAGCTCCGCCAGGCCTCCGC CGCTCCGGGGCCGCTCGGGCCGCCAGTCAGCTGACGCGGGGGGCGGGGGAGCTGTCAGGCGCGCCCCGCCCT GCGCCGCTGGGCCGCGCGGAGGCCGTGCAGCTATTGGCCCGCGCACCGGGCCGCCCGGGCCCCCGCACCCCAGT GACATCAGGAGGCGATAAAAGGCTGCGGCGCCGCCGGATCCAGCACAGCTGCACCGCCGAGCTGCGAGCGGC TGCGAGCGAGCGAGCGTAAGAGCAAGAGCTAGAGAGCGAGCAACGGGCACTCGCCCCA ( SEQ ID NO : 1 )
  • variants of this promoter is of use, such as promoters at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1 provided the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell, such as an alpha cell.
  • the promoter can include at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1, provided the promoter functions, such that a heterologous nucleic acid operably linked to the promoter can be expressed when transferred into a host cell.
  • the variants of the promoter of SEQ ID NO: 1 are about 360 to about 340 nucleotides in length, such as 348 nucleotides in length.
  • the promoter is at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1 and 360, 355, 350, or 348 nucleotides in length, and the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell that expresses glucagon.
  • the promoter includes a nucleotide sequence identical to SEQ ID NO: 1 and is 360, 355, 350, or 348 nucleotides in length, and the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell that expresses glucagon.
  • the promoter includes at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1, and is 348 nucleotides in length, and the promoter functions, such that a heterologous nucleic acid operably linked to the promoter can be expressed when transferred into a host cell.
  • the promoter is 348 nucleotides in length.
  • the promoter is 350 nucleotides in length. In more aspects, the promoter is 355 nucleotides in length. In further aspects, the promoter is 360 nucleotides in length. Other lengths are also of use, such as 349, 351, 352, 353, 354, 356, 357, 358 and 359 nucleotides.
  • the promoter can be, for example, 348 to 360 nucleotides in length, such as 348 to 355 nucleotides in length or 348 to 350 nucleotides in length.
  • the variant of the promoter includes CCAGGCCTC. In other aspects, the variant of the promoter includes CCAGGCCTC. In further aspects, the variant of the promoter includes AGTCAGCTGAC (SEQ ID NO: 12). In more aspects, the variant of the promoter includes GGGGGCGGG. In yet other aspects, the variant of the promoter includes CCCCGCCC.
  • the variant of the promoter includes two of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In more aspects, the variant of the promoter includes three of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In other aspects, the variant of the promoter includes four of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In more aspects, the variant of the promoter includes all of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC.
  • the promoters is at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1 and includes one, two, three, four, or all five of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In more aspects, the promoters is at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1, one, two, three, four, or all five of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC, and is 360, 355, 350, or 348 nucleotides in length.
  • the promoter includes at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1, wherein the nucleic acid substitutions are not in one, two, three, four, or all five of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In some examples, the promoter includes at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1 wherein the nucleic acid substitutions are not in CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC.
  • the promoter can include the nucleic acid sequence set forth as SEQ ID NO: 1 and an additional 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides of the full-length human MafB promoter.
  • another nucleic acid sequence can be linked to the promoter, such as an enhancer.
  • the promoter can be linked to a Kozak sequence.
  • the promoter can be linked to an intron between a promoter and a Kozak sequence, wherein the intron works as an enhancer.
  • disclosed promoters can be operably linked to a heterologous nucleic acid molecule, such as a nucleic acid encoding a heterologous protein.
  • the heterologous protein is PDX1 or MafA.
  • the heterologous nucleic acid encodes PDX1 and MafA.
  • the heterologous nucleic acid encodes PDX1 and/or MafA and does not encode Ngn3.
  • the nucleic acid molecule also encodes MafB.
  • the heterologous nucleic acid can encode a selectable marker, which includes, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers").
  • a selectable marker which includes, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers").
  • a nucleic acid molecule that encodes drug resistance or provides a function that can be used to purify cells.
  • Selectable markers include neomycin resistance gene (neo), puromycin resistance gene (puro), guanine phosphoribosyl transferase (gpt), dihydrofolate reductase (DHFR), adenosine deaminase (ada), puromycin-N-acetyltransferase (PAC), hygromycin resistance gene (hyg), multidrug resistance gene (mdr), thymidine kinase (TK), hypoxanthine- guanine phosphoribosyltransferase (HPRT), and hisD gene.
  • neomycin resistance gene neo
  • puro puro
  • DHFR dihydrofolate reductase
  • ada puromycin-N-acetyltransferase
  • PAC puromycin-N-acetyltransferase
  • Detectable markers include green fluorescent protein (GFP) blue, sapphire, yellow, red, orange, and cyan fluorescent proteins and variants of any of these.
  • Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also selectable makers.
  • the promoter functions such that PDX1 and/or MafA transcripts are produced. In more aspects, the promoter functions such that MafB is also produced.
  • the promoter is operably linked to a nucleic acid molecule encoding PDX1 and a nucleic acid molecule encoding MafA, but is not operably linked to a nucleic acid molecule encoding Ngn3.
  • the promoter is operably linked to a nucleic acid encoding PDX1 and a nucleic acid encoding MafA, and a nucleic acid molecule encoding MafB, but is not operably linked to a nucleic acid encoding Ngn3.
  • U.S. Published Patent Application No. 2011/0280842 incorporated by reference herein, provides PDX1, MafA and Ngn3 amino acid and nucleic acid sequences.
  • the promoter is operably linked to a heterologous nucleic acid molecule encoding MafA, such as the amino acid sequence set forth as: MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCSSVPSSPSF CAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMSGYQHHLNPEALN LTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDMGAGHHHGAHHAAHHH HAAHHHHHHHHGGAGHGGGAGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVIRLKQKR RTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLAGRGGPG SAGGAGFPREPSPPQAGPGGAKGTADFFL
  • MafA is a beta cell specific and glucose regulated transcription factor for insulin gene expression.
  • a disclosed promoter can be operably linked to a heterologous nucleic acid molecule encoding a MafA protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 2 or SEQ ID NO: 3, wherein the protein functions as a transcription factor.
  • the heterologous nucleic acid molecule can encode a MafA protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 2 or SEQ ID NO: 3, wherein the protein functions as a transcription factor.
  • the disclosed promoter can be operably linked to a nucleic acid molecule that encodes a protein that has the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 3.
  • a disclosed promoter can be operably linked to a heterologous nucleic acid molecule encoding a human PDX1 amino acid sequence including the amino acid sequence set forth as: MNGEEQYYAATQLYKDPCAFQRGPAPEFSASPPACLYMGRQPPPPPPHPFPGALGALEQGSPPDISP YEVPPLADDPAVAHLHHHLPAQLALPHPPAGPFPEGAEPGVLEEPNRVQLPFPWMKSTKAHAWKG QWAGGAYAAEPEENKRTRTAYTRAQLLELEKEFLFNKYISRPRRVELAVMLNLTERHIKIWFQNRR MKWKKEEDKKRGGGTAVGGGGVAEPEQDCAVTSGEELLALPPPPPPGGAVPPAAPVAAREGRLPP GLSASPQPSSVAPRRPQEPR
  • PDX1 is a transcriptional activator of several genes, including insulin, somatostatin, glucokinase, islet amyloid polypeptide, and glucose transporter type 2.
  • the heterologous nucleic acid molecule can encode a PDX1 protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 4 or SEQ ID NO: 5, wherein the protein functions as a transcription factor.
  • the heterologous nucleic acid molecule can encode a PDX1 protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 4 or SEQ ID NO: 5, wherein the protein functions as a transcription factor.
  • the disclosed promoter can be operably linked to a nucleic acid molecule that encodes a protein that has the amino acid sequence of SEQ ID NO: 4 or SEQ ID NO: 5.
  • the heterologous nucleic acid molecule can encode both PDX1 and MafA.
  • the heterologous nucleic acid molecules encoding PDX1 and MafA are separated by a connector.
  • the connector is 2A.
  • the nucleic acid sequence of the 2A connector is shown below: CGCGCCAAGCGCGGCTCCGGCGCCACCAACTTCTCCCTGCTGAAGCAG (SEQ ID NO: 6).
  • Nucleic acid sequences of additional exemplary connectors are: CGCGCCAAGCGCGGCTCCGGCCAGTGCACCAACTACGCCCTGCTGAAGCTGGCCGGCGACGTG GAGTCCAACCCCGGCCCC (SEQ ID NO: 7); and CGCGCCAAGCGCGGCTCCGGCGAGGGCAGAGGAAGTCTTCTAACATGCGGTGACGTGGAGGA GAATCCCGGCCCT (SEQ ID NO: 8).
  • Suitable connectors also include a nucleic acid sequence with at most 1, 2, 3, 4, or 5 substitutions in one of SEQ ID NO: 6-8.
  • Suitable connectors arc, for example, 40 to 90 nucleotides in length, such as 45 to 85 nucleotides in length, such as 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 nucleotides in length.
  • These connectors can be included in a heterologous nucleic acid molecule between a nucleic acid molecule encoding MafA and a nucleic acid molecule encoding PDX1.
  • the nucleic acid molecule encoding MafA can be 5' to the nucleic acid molecule encoding PDX1.
  • the nucleic acid molecule encoding MafA can be 3’ to the nucleic acid molecule encoding PDX1.
  • a disclosed promoter further can be operably linked to a heterologous nucleic acid molecule encoding a MafB amino acid sequence including the amino acid sequence set forth as: MAAELSMGPELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPVGSVSSTPLSTPCSSVPS SPS FSPTEQKTHLEDLYWMASNYQQMNPEALNLTPEDAVEAL IGSHPVPQPLQSFDSFRGAHHHHHHPHPHHA YPGAGVAHDELGPHAHPHHHHHHQASPPP SSAASPAQQLPTSHPGPGPHATASATAAGGNGSVEDRFSDDQL VSMSVRELNRHLRGFTKDEVIRLKHKRRTLKNRGYAQSCRYKRVQQKHHLENEKTQL IQQVEQLKQEVSRLA RERDAYKVKCEKLANSGFREAGSTSDSP S SPEFFL
  • the heterologous nucleic acid molecule encodes PDX1 and MafA, and optionally MafB, but does not encode Ngn3, for example, the Ngn3 protein of GENBANK® Accession No: NM 009719 (mouse), February 15, 2015 and GENBANK® Accession No: NP 033849.3 (Human), February 15, 2015.
  • Ngn3 protein An exemplary Ngn3 protein is shown below: MAPHPLDALT1QVSPETQQPFPGASDHEVLSSNSTPPSPTLIPRDCSEAEVGDCRGTSRKLRARRGGR NRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTKIETLRFAHNYIW ALTQTLR IADHSFYGPE PPVPCGELGS PGGGSNGDWG SIYSPVSQAGNLSPTASLEEFPGLQVPSSP SYLLPGALVF SDFL (SEQ ID NO: 11)
  • the heterologous nucleic acid molecule does not encode a protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 11, wherein the protein functions as a transcription factor.
  • the heterologous nucleic acid molecule does not encode a protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 11, wherein the protein functions as a transcription factor.
  • the heterologous nucleic acid molecule encodes MafA and PDX1, and encodes Ngn3, or just encodes Ngn3.
  • the heterologous nucleic acid molecule encodes a protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 11, wherein the protein functions as a transcription factor.
  • the heterologous nucleic acid molecule can encode a protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 11, wherein the protein functions as a transcription factor.
  • the heterologous nucleic acid molecule can encode a protein with the amino acid sequence of SEQ ID NO: 11.
  • a connector can be included between nucleic acid molecules encoding proteins.
  • the connector can be a 2A connection.
  • the vector can include a connector, such as T2A or P2A.
  • Exemplary nucleic acid sequences for these connectors are provided as SEQ ID NO: 16 and SEQ ID NO: 18, respectively.
  • polyadenylation signal may be of use to include a polyadenylation signal to effect proper termination and polyadenylation of the gene transcript.
  • exemplary polyadenylation signals have been isolated from beta globin, bovine growth hormone, SV40, and the herpes simplex virus thymidine kinase genes.
  • An exemplary nucleic acid sequence for a poly adenylation signal is provided as SEQ ID NO: 21.
  • a vector including the promoter and the heterologous nucleic acid.
  • other transcription control sequences such as one or more enhancer elements, which are binding recognition sites for one or more transcription factors that increase transcription above that observed for the promoter alone, can be included in the vector.
  • Procedures for producing vectors and cloning can be found, for example, in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 4 th ed, Cold Spring Harbor, New York, 2012) and Ausubel et al. (In Current Protocols in Molecular Biology, John Wiley & Sons, New York, through supplement 104, 2013.
  • Additional regulatory elements can be included, such as, but not limited to, enhancer elements.
  • the additional regulatory element can be the WPRE, see for example, SEQ ID NO: 20.
  • Vectors are disclosed herein that include one or more of SEQ ID NOs: 13-20.
  • Vectors include, but are not limited to, Tina-055 and Tina-059, as shown in TIGS. 7 and 8, respectively.
  • viral vectors are known in the art, including polyoma; SV40 (Madzak et al., 1992, J. Gen. Virol., 73:15331536); adenovirus (Berkner, 1992, Cur. Top. Microbiol. Immunol., 158:39-6; Hopkins et al., 1988, Bio Techniques, 6:616-629; Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et al., 1992, Proc. Nad. Acad. Sci.
  • the vector can be a viral vector.
  • Suitable vectors include retrovirus vectors, orthopox vectors, avipox vectors, fowlpox vectors, capripox vectors, suipox vectors, adenoviral vectors, herpes virus vectors, alpha virus vectors, baculovirus vectors, Sindbis virus vectors, vaccinia virus vectors, lentivirus vectors and poliovirus vectors.
  • Specific exemplary vectors are poxvirus vectors, such as vaccinia virus, fowlpox virus and a highly attenuated vaccinia virus (MVA), adenovirus, baculovirus, yeast, and the like.
  • MVA highly attenuated vaccinia virus
  • the vector can be a retroviral vector, a lentiviral vector, an adenovirus vector.
  • Adeno-associated virus vectors AAV are disclosed in additional detail below, and are of use in the disclosed methods.
  • the AAV vector can be AAV6.
  • the vector is an attenuated adenovirus vector, such as the vector described by Stratford- Perricaudet et aL (J. Clin. Invest., 90:626-630 1992; La Salle et al., Science 259:988-990, 1993); and a defective adeno-associated virus vector (Samulski et al., J. Virol., 61:3096-3101, 1987; Samulski et al., J. Virol., 63:3822-3828, 1989; Lebkowski et al., Mol. Cell. Biol., 8:3988-3996, 1988).
  • the vector includes a disclosed promoter operably linked to a nucleic acid encoding PDX1 and MafA, and optionally MafB. In more aspects, the vector does not include a nucleic acid encoding Ngn3.
  • the vector can be, for example, a lentivirus vector, an adenovirus vector or an adeno-associated virus (AAV) vector.
  • AAV vector is an AAV6 vector.
  • the AAV vector is an AAVDJ vector.
  • the AAV vector is an AAKP1 vector.
  • a vector of use includes a gene encoding a selectable marker, which includes, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers").
  • a selectable marker which includes, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers").
  • detectable markers include, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance.
  • genes of use such as genes that encode drug resistance or provide a function that can be used to purify cells.
  • Selectable markers include neomycin resistance gene (neo), puromycin resistance gene (puro), guanine phosphoribosyl transferase (gpt), dihydrofolate reductase (DHFR), adenosine deaminase (ada), puromycin-N-acetyltransferase (PAC), hygromycin resistance gene (hyg), multidrug resistance gene (mdr), thymidine kinase (TK), hypoxanthine-guanine phosphoribosyltransferase (HPRT), and hisD gene.
  • neomycin resistance gene neo
  • puro puro
  • DHFR dihydrofolate reductase
  • ada puromycin-N-acetyltransferase
  • PAC puromycin-N-acetyltransferase
  • Detectable markers include green fluorescent protein (GFP) blue, sapphire, yellow, red, orange, and cyan fluorescent proteins and variants of any of these.
  • Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also selectable makers.
  • the disclosed MafB promoter operably linked to a heterologous nucleic acid molecule, can be included in adenoviral vectors and/or adeno-associated viral vectors.
  • AAV belongs to the family Parvoviridae and the genus Dependovirus.
  • AAV is a small, non-enveloped virus that packages a linear, single- stranded DNA genome. Both sense and antisense strands of AAV DNA are packaged into AAV capsids with equal frequency.
  • the AAV includes a disclosed promoter operably linked to a nucleic acid encoding PDX1 and MafA, but does not include a nucleic acid encoding Ngn3.
  • recombinant vectors such as recombinant adenovirus vectors and recombinant adeno- associated virus vectors comprising a nucleic acid molecule disclosed herein.
  • the AAV is AAV6.
  • the AAV is A A VKP1.
  • the AAV is AAVDJ.
  • the AAV serotype can be any other suitable AAV serotype, such as AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9, AAV 10, AAV11 or AAV12, or a hybrid of two or more AAV serotypes (such as, but not limited to AAV2/1, AAN n, AAV2/8 or AAV2/9).
  • the AAV vector can transduce one or more pancreatic cell types, see Published U.S. Patent Application No. US-2022-0354969-A1.
  • the AAV vector can be AAV9, AAVDJ. AAVKP1 , AAV2.7m8, AAV 10, AAVshlO. AAV1, AAV7, AAV218, AAVPHPeB, AAVhu3?, AAVAnc80, AAVrhS, AAVLK03, AAVKP3, AAV2G9. or AAVPHPB.
  • the AAV genome is characterized by two inverted terminal repeats (ITRs) that flank two open reading frames (ORFs).
  • ITRs inverted terminal repeats
  • ORFs open reading frames
  • the first 125 nucleotides of the ITR are a palindrome, which folds upon itself to maximize base pairing and forms a T-shaped hairpin structure.
  • the other 20 bases of the ITR called the D sequence, remain unpaired.
  • the ITRs are cis-acting sequences important for AAV DNA replication; the ITR is the origin of replication and serves as a primer for second- strand synthesis by DNA polymerase.
  • the double-stranded DNA formed during this synthesis which is called replicating-form monomer, is used for a second round of self -priming replication and forms a replicating-form dimer.
  • These double- stranded intermediates are processed via a strand displacement mechanism, resulting in single-stranded DNA used for packaging and double-stranded DNA used for transcription.
  • Located within the ITR are the Rep binding elements and a terminal resolution site (TRS). These features are used by the viral regulatory protein Rep during AAV replication to process the doublestranded intermediates.
  • the ITR is also essential for AAV genome packaging, transcription, negative regulation under non-permissive conditions, and site-specific integration (Daya and Berns, Clin Microbiol Rev 21(4):583-593, 2008). In some aspects, these elements are included in the AAV vector.
  • the left ORF of AAV contains the Rep gene, which encodes four proteins - Rep78, Rep 68, Rep52 and Rep40.
  • the right ORF contains the Cap gene, which produces three viral capsid proteins (VP1, VP2 and VP3).
  • the AAV capsid contains 60 viral capsid proteins arranged into an icosahedral symmetry. VP1, VP2 and VP3 are present in a 1: 1:10 molar ratio (Daya and Berns, Clin Microbiol Rev 21(4):583-593, 2008). In some aspects, these elements are included in the AAV vector.
  • AAV vectors can be used for gene therapy.
  • Exemplary AAV of use are AAV2, AAV5, AAV6, AAV8 and AAV9.
  • the AAV is an AAV6 vector.
  • the AAV is AAVKP1.
  • the AAV is AAVDJ.
  • the AAV serotype can be any other suitable AAV serotype, such as AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9, AAV10, AAV11 or AAV12, or a hybrid of two or more AAV serotypes (such as, but not limited to AAV2/1, PJWin, AAV2/8 or AAV2/9).
  • the AAV vector can transduce one or more pancreatic cell types, see Published U.S. Patent Application No. US-2022-0354969-A1.
  • the AAV vector can be AAV9, AAVDJ, AAVK.P1, AAV2.7m8, AAV 10, AAVshlO.
  • Adenovirus, AAV2 and AAV8 are capable of transducing cells in the pancreas.
  • any of an AAV2 or AAV8 vector can be used in the methods disclosed herein.
  • AAV6 and AAV9 vectors are also of use.
  • the AAV vector is an AAV6 vector.
  • AAV infects humans and some other primate species, it is not known to cause disease and elicits a very mild immune response.
  • Gene therapy vectors that utilize AAV can infect both dividing and quiescent cells and persist in an extrachromosomal state without integrating into the genome of the host cell. Because of the advantageous features of AAV, the present disclosure contemplates the use of an AAV vector including a disclosed promoter for the methods disclosed herein.
  • AAV possesses several additional desirable features for a gene therapy vector, including the ability to bind and enter target cells, enter the nucleus, the ability to be expressed in the nucleus for a prolonged period of time, and low toxicity.
  • AAV can be used to transfect cells, and suitable vector are known in the art, see for example, U.S. Published Patent Application No. 2014/0037585, incorporated herein by reference.
  • Methods for producing AAV vectors suitable for gene therapy are well known in the art (see, for example, U.S. Published Patent Application Nos. 2012/0100606; 2012/0135515; 2011/0229971; and 2013/0072548; and Ghosh et al., Gene Ther 13(4):321-329, 2006), and can be utilized with the methods disclosed herein.
  • AAV8 vectors are disclosed, for example, in U.S. Patent No. 8,692,332, which is incorporated by reference herein.
  • An exemplary AAV8 nucleic acid sequence is shown in Fig. 1 and SEQ ID NO: 1 of U.S. Patent No. 8,692,332. It is disclosed that AAV nucleic acid sequence can be greater than about 90%, 95%, 98% or 99% identical to this nucleic acid sequence.
  • the location and sequence of the capsid, rep 68/78, rep 40/52, VP1, VP2 and VP3 are disclosed in this U.S. Patent No. 8,692,332.
  • the location and hypervariable regions of AAV8 are also provided.
  • vectors of use in the methods disclosed herein can contain nucleic acid sequences encoding an intact AAV capsid which may be from a single AAV serotype (e.g., AAV2, AAV, 6, AAV8 or AAV9, such as AAV6).
  • AAV2 AAV2, AAV, 6, AAV8 or AAV9, such as AAV6
  • vectors of use also can be recombinant, and thus can contain sequences encoding artificial capsids which contain one or more fragments of the AAV6 capsid fused to heterologous AAV or non-AAV capsid proteins (or fragments thereof).
  • an AAV vector may have a capsid protein comprising one or more of the AAV8 capsid regions selected from the VP2 and/or VP3, or from VP1, or fragments thereof selected from amino acids 1 to 184, amino acids 199 to 259; amino acids 274 to 446; amino acids 603 to 659; amino acids 670 to 706; amino acids 724 to 738 of the AAV8 capsid, see SEQ ID NO: 2 of U.S. Patent No. 8,692,332.
  • the AAV may contain one or more of the AAV serotype 8 capsid protein hypervariable regions, for example aa 185- 198; aa 260-273; aa447-477; aa495-602; aa660-669; and aa707-723 of the AAV8 capsid set forth in SEQ ID NO: 2 of U.S. Patent No. 8,692,332.
  • an AAV is generated having an AAV serotype 6 capsid.
  • a host cell which can be cultured that contains a nucleic acid sequence encoding an adeno-associated virus (AAV) serotype 6 capsid protein, or fragment thereof, as defined herein; a functional rep gene; a minigene composed of, at a minimum, AAV inverted terminal repeats (ITRs) and a transgene, such as a transgene encoding PDX1 and MafA; and sufficient helper functions to permit packaging in the AAV6 capsid protein.
  • AAV adeno-associated virus
  • ITRs AAV inverted terminal repeats
  • transgene such as a transgene encoding PDX1 and MafA
  • sufficient helper functions to permit packaging in the AAV6 capsid protein.
  • the components required to be cultured in the host cell to package an AAV minigene in an AAV capsid may be provided to the host cell in trans.
  • any one or more of the required components may be provided by a stable host cell which has been engineered to contain one or more of the required components using methods known to those of skill in the art.
  • a stable host cell will contain the required component(s) under the control of an inducible promoter.
  • the required component(s) can be under the control of a constitutive promoter. Examples of suitable inducible and constitutive promoters are provided below. Similar methods can be used to generate an AAV2, AAV8 or AAV9 vector and/or virions.
  • a selected stable host cell may contain selected component(s) under the control of a constitutive promoter and other selected component(s) under the control of one or more inducible promoters.
  • a stable host cell may be generated which is derived from 293 cells (which contain El helper functions under the control of a constitutive promoter), but which contains the rep and/or cap proteins under the control of inducible promoters. Still other stable host cells may be generated by one of skill in the art.
  • the minigene, rep sequences, cap sequences, and helper functions required for producing an AAV can be delivered to the packaging host cell in the form of any genetic element which transfer the sequences carried thereon.
  • the selected genetic element may be delivered by any suitable method, including those described herein.
  • the methods used to construct vectors are known to those with skill in nucleic acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., Sambrook et al. (Molecular Cloning: A Laboratory Manual, 4 th ed, Cold Spring Harbor, New York, 2012. Similarly, methods of generating AAV virions are known, and the selection of a suitable method is not a limitation on the present invention. See, e.g., K. Fisher et al, J.
  • selected AAV components can be readily isolated using techniques available to those of skill in the art from an AAV serotype, including AAV8.
  • AAV may be isolated or obtained from academic, commercial, or public sources (e.g., the American Type Culture Collection, Manassas, Va.).
  • the AAV sequences may be obtained through synthetic or other suitable means by reference to published sequences such as are available in the literature or in databases such as, e.g., GENBANK®.
  • the adenovirus and AAV vectors disclosed herein include a disclosed promoter operably linked to a nucleic acid encoding PDX1 and MafA. In some aspects, the adenovirus and AAV vectors do not include a nucleic acid encoding Ngn3.
  • the promoter consists essentially of, or consists of, SEQ ID NO: 1.
  • the vector includes a heterologous nucleic acid molecule encoding PDX1 and/or MafA, operably linked to this promoter.
  • the vector includes a nucleic acid molecule encoding MafB, operably linked to this promoter.
  • the vector includes a heterologous nucleotide acid molecule encoding PDX1 and/or MafA, and optionally MafB, operably linked to the promoter, but does not encode Ngn3.
  • the promoter functions such that both PDX1 and/or MafA transcripts are produced. In more aspects, the promoter functions such that both PDX1 and/or MafA transcripts, and MafB transcripts are produced.
  • the promoter functions such that all of PDX1, MafA and MafB transcripts are produced.
  • the promoter is operably linked to a nucleic acid encoding PDX1 and a nucleic acid encoding MafA, but is not operably linked to a nucleic acid encoding Ngn3.
  • the promoter is operably linked to a nucleic acid encoding PDX1 and a nucleic acid encoding MafA, and a nucleic acid molecule encoding MafB, but is not operably linked to a nucleic acid encoding Ngn3.
  • a nucleic acid molecule encoding Ngn3 can be absent from the vector, and may not be administered with the vector.
  • a vector including a disclosed MafB promoter and the heterologous nucleic acid such as an AAV vector, for example and AAV6 vector, does not include a nucleic acid encoding Ngn3.
  • host cells can be produced that are transformed with these recombinant nucleic acid molecules.
  • These host cells can be mammalian host cells, such as non-human primate, mouse, and human host cells.
  • the host cells can be alpha cells. Methods are disclosed for producing a protein in a host cell, that include transforming the host cell with an effective amount of the vector, thereby producing the protein in the host cell.
  • the method can include isolating the protein.
  • the heterologous protein can be isolated and purified by standard methods including, but not limited to, chromatography (e.g., ion exchange, affinity, size exclusion, and hydroxyapatite chromatography), gel filtration, centrifugation, or differential solubility, ethanol precipitation, immunoaffinity purification, or by any other available technique for the purification of proteins (See, e.g., Bose K (Ed.), Textbook on Cloning, Expression and Purification of Recombinant Proteins, Sprinter Press, Singapore, 2022; Simpson et al. (Eds.), Basic methods in Protein Purification and Analysis: A Laboratory Manual, New York: Cold Spring Harbor Laboratory Press, 2009, Higgins, S. J.
  • the protein can be isolated by binding it to an affinity column comprising a specific binding agent, such as antibodies that were raised against that protein and were affixed to a stationary support.
  • affinity tags such as an influenza coat sequence, poly-histidine, or glutathionc-S-transfcrasc can be attached to the protein by standard recombinant techniques to allow for easy purification by passage over the appropriate affinity column.
  • Protease inhibitors such as phenyl methyl sulfonyl fluoride (PMSF), leupeptin, pepstatin or aprotinin can be added at any or all stages in order to reduce or eliminate degradation of the polypeptide or protein during the purification process.
  • Protease inhibitors can be used when cells must be lysed in order to isolate and purify the expressed polypeptide or protein.
  • One of ordinary skill in the art will appreciate that the exact purification technique will vary depending on the character of the protein to be purified, the type of the cells from which the polypeptide or protein is expressed, and the composition of the medium in which the cells were grown.
  • the disclosed MafB promoter provides a high level of expression in alpha cells, providing increased transdifferentiation to beta cells.
  • PDX1 and MafA expression can be increased in alpha cells, and optionally MafB, providing increased transdifferentiation into beta cells.
  • PDX1 and MafA expression are increased in alpha cells.
  • PDX1, MafA, and MafB expression can be increased in alpha cells.
  • the methods include administering to the subject a vector, such as an adenovirus vector or an AAV vector, encoding heterologous PDX1 and MafA, and optionally MafB.
  • a vector such as an adenovirus vector or an AAV vector
  • the vector does not include a nucleic acid encoding Ngn3.
  • the subject is not administered any other nucleic acid encoding Ngn3. Exemplary methods are disclosed, for example, in U.S. Patent No. 10,071,172, incorporated herein by reference.
  • a vector such as an adenovirus or an AAV vector can be formulated into a pharmaceutical composition and will generally be administered locally or systemically.
  • the vector is administered directly to the pancreas.
  • the subject has diabetes, such as Type 1 diabetes.
  • methods for producing pancreatic beta cells from pancreatic alpha cells in a subject. These methods include administering to the subject a vector comprising a disclosed MafB promoter operably linked to a nucleic acid molecule encoding PDX1 and MafA, and optionally MafB.
  • the vector does not encode Ngn3 and the subject is not administered any other nucleic acid encoding Ngn3.
  • the vector is administered intraductally into a pancreatic duct of the subject.
  • the subject has diabetes or pre-diabetes.
  • methods are provided for treating diabetes Type 1 or pre-diabetes in a subject.
  • the subject can be any mammalian subject, including human and veterinary subjects.
  • the subject can be a child or an adult.
  • the method can include selecting a subject of interest, such as a subject with diabetes.
  • the subject can also be administered insulin.
  • the method can include measuring beta cell number.
  • a subject with diabetes may be clinically diagnosed by a fasting plasma glucose (FPG) concentration of greater than or equal to 7.0 millimole per liter (mmol/L) (126 milligram per deciliter (mg/dL)), or a plasma glucose concentration of greater than or equal to 11.1 mmol/L (200 mg/dL) at about two hours after an oral glucose tolerance test (OGTT) with a 75 gram (g) load, or in a patient with classic symptoms of hyperglycemia or hyperglycemic crisis, a random plasma glucose concentration of greater than or equal to 11.1 mmol/L (200 mg/dL), or HbAlc levels of greater than or equal to 6.5%.
  • FPG fasting plasma glucose
  • a subject with pre-diabetes may be diagnosed by impaired glucose tolerance (IGT).
  • IGT impaired glucose tolerance
  • An OGTT two-hour plasma glucose of greater than or equal to 140 mg/dL and less than 200 mg/dL (7.8-11.0 mM), or a fasting plasma glucose (FPG) concentration of greater than or equal to 100 mg/dE and less than 125 mg/dE (5.6-6.9 mmol/L), or HbAlc levels of greater than or equal to 5.7% and less than 6.4% (5.7-6.4%) is considered to be IGT, and indicates that a subject has pre-diabetes. Additional information can be found in Standards of Medical Care in Diabetes — 2010 (American Diabetes Association, Diabetes Care 33:S 11 -61, 2010, incorporated herein by reference).
  • the disclosed methods produce pancreatic beta cells in a subject. Generally, these cells produce insulin.
  • the subject is a subject with Type 1 diabetes and the pancreatic beta cells produced by the disclosed methods are not recognized by the immune system of the subject.
  • T cell and/or B cells do not produce an immune response to the pancreatic beta cells produced by the disclosed methods.
  • the subject does not mount an autoimmune response to the pancreatic beta cells produced by the disclosed methods.
  • the subject does not have immune destruction of the pancreatic beta cells, and does not exhibit an increased lymphocyte invasion of the islets.
  • the disclosed method transdifferentiate alpha cells into beta cells.
  • Appropriate doses of a disclosed vector depend on the subject being treated (e.g., human or nonhuman primate or other mammal), age and general condition of the subject to be treated, the severity of the condition being treated, the mode of administration of the AAV vector/virion, among other factors.
  • An appropriate effective amount can be readily determined by one of skill in the art. Thus, a "therapeutically effective amount" will fall in a relatively broad range that can be determined through clinical trials.
  • the method can include measuring an outcome, such as insulin production, improvement in a fasting plasma glucose tolerance test, or pancreatic beta cell number.
  • the method can include administering other therapeutic agents, such as insulin.
  • the method can also include having the subject make lifestyle modifications.
  • a therapeutically effective dose can be on the order of from about 10 5 to 10 16 of the AAV virions, such as 10 8 to 10 14 AAV virions, such as AAV6 virions.
  • the dose depends on the efficiency of transduction, the disclosed promoter that is selected, the stability of the message and the protein encoded thereby, and clinical factors. Effective dosages can be readily established by one of ordinary skill in the art through routine trials establishing dose response curves.
  • Dosage treatment may be a single dose schedule or a multiple dose schedule to ultimately deliver the amount specified above.
  • the subject may be administered as many doses as appropriate.
  • the subject may be given, e.g., 10 5 to 10 16 AAV virions in a single dose, or two, four, five, six or more doses that collectively result in delivery of, e.g., 10 5 to 10 16 AAV virions.
  • One of skill in the art can readily determine an appropriate number of doses to administer.
  • the AAV is administered at a dose of about 1 x 10” to about 1 x 10 14 viral particles (vp)/kg. In some examples, the AAV is administered at a dose of about 1 x 10 12 to about 8 x 10 13 vp/kg. In other examples, the AAV is administered at a dose of about 1 x 10 13 to about 6 x 10 13 vp/kg.
  • the AAV is administered at a dose of at least about 1 x 10 11 , at least about 5 x 10 11 , at least about 1 x 10 12 , at least about 5 x 10 12 , at least about 1 x 10 13 , at least about 5 x 10 13 , or at least about 1 x 10 14 vp/kg.
  • the AAV is administered at a dose of no more than about 5 x 10 11 , no more than about 1 x 10 12 , no more than about 5 x 10 12 , no more than about 1 x 10 13 , no more than about 5 x 10 13 , or no more than about 1 x 10 14 vp/kg.
  • the AAV is administered at a dose of about 1 x 1012 vp/kg.
  • the AAV can be administered in a single dose, or in multiple doses (such as 2, 3, 4, 5, 6, 7, 8, 9 or 10 doses) as needed for the desired therapeutic results, such as the production of 0 cells and/or treatment of Type 1 diabetes.
  • the AAV can be an AAV6.
  • compositions include sufficient genetic material to produce a therapeutically effective amount of PDX1 and MafA.
  • AAV virions will be present in the subject compositions in an amount sufficient to provide a therapeutic effect, such as the production of pancreatic beta cells and/or the treatment of diabetes when given in one or more doses.
  • AAV virions can be provided as lyophilized preparations and diluted in a stabilizing compositions for immediate or future use. Alternatively, the AAV virions can be provided immediately after production and stored for future use.
  • the pharmaceutical compositions can contain the vector, such as the AAV vector, and/or virions, and a pharmaceutically acceptable excipient.
  • excipients include any pharmaceutical agent that does not itself induce the production of antibodies harmful to the individual receiving the composition, and which may be administered without undue toxicity.
  • Pharmaceutically acceptable excipients include, but are not limited to, liquids such as water, saline, glycerol and ethanol.
  • the excipient can be pluronic acid (PF-68, also called Polaxamer 188).
  • Pharmaceutically acceptable salts can be included therein, for example, mineral acid salts such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of organic acids such as acetates, propionates, malonates, benzoates, and the like. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles.
  • mineral acid salts such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like
  • organic acids such as acetates, propionates, malonates, benzoates, and the like
  • auxiliary substances such as wetting or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles.
  • the excipients confer a protective effect on the AAV virion such that loss of AAV virions, as well as transduceability resulting from formulation procedures, packaging, storage, transport, and the like, is minimized.
  • These excipient compositions are therefore considered “virion-stabilizing" in the sense that they provide higher AAV virion titers and higher transduceability levels than their non-protected counterparts, as measured using standard assays, see, for example, Published U.S. Application No. 2012/0219528, incorporated herein by reference.
  • These Compositions therefore demonstrate "enhanced transduccability levels" as compared to compositions lacking the particular excipients described herein, and are therefore more stable than their non-protected counterparts.
  • Exemplary excipients that can used to protect the AAV virion from activity degradative conditions include, but are not limited to, detergents, proteins, e.g., ovalbumin and bovine serum albumin, amino acids, e.g., glycine, polyhydric and dihydric alcohols, such as but not limited to polyethylene glycols (PEG) of varying molecular weights, such as PEG-200, PEG-400, PEG-600, PEG-1000, PEG-1450, PEG-3350, PEG- 6000, PEG-8000 and any molecular weights in between these values, with molecular weights of 1500 to 6000 preferred, propylene glycols (PG), sugar alcohols, such as a carbohydrate, preferably, sorbitol.
  • PEG polyethylene glycols
  • PG propylene glycols
  • sugar alcohols such as a carbohydrate, preferably, sorbitol.
  • the detergent when present, can be an anionic, a cationic, a zwitterionic or a nonionic detergent.
  • An exemplary detergent is a nonionic detergent.
  • One suitable type of nonionic detergent is a sorbitan ester, e.g., polyoxyethylenesorbitan monolaurate (TWEENO-20) polyoxyethylenesorbitan monopalmitate (TWEEN®- 40), polyoxyethylenesorbitan monostearate (TWEEN®-60), polyoxyethylenesorbitan tristearate (TWEEN®- 65), polyoxyethylenesorbitan monooleate (TWEEN®-80), polyoxyethylenesorbitan trioleate (TWEEN®- 85), such as TWEEN®-20 and/or TWEEN®-80.
  • TWEENO-20 polyoxyethylenesorbitan monolaurate
  • TWEEN®- 40 polyoxyethylenesorbitan monopalmitate
  • TWEEN®-60 polyoxyethylenesorbitan monostearate
  • TWEEN®- 65 poly
  • excipients are commercially available from a number of vendors, such as Sigma, St. Louis, Mo.
  • the amount of the various excipients present in any of the disclosed compositions varies and is readily determined by one of skill in the art.
  • a protein excipient such as BSA, if present, will can be present at a concentration of between 1.0 weight (wt.) % to about 20 wt. %, preferably 10 wt. %.
  • an amino acid such as glycine
  • it can be present at a concentration of about 1 wt. % to about 5 wt. %.
  • a carbohydrate, such as sorbitol, if present, can be present at a concentration of about 0.1 wt % to about 10 wt. %, such as between about 0.5 wt. % to about 15 wt. %, or about 1 wt. % to about 5 wt. %.
  • polyethylene glycol it can generally be present on the order of about 2 wt. % to about 40 wt. %, such as about 10 wt. % top about 25 wt. %.
  • propylene glycol is used in the subject formulations, it will typically be present at a concentration of about 2 wt. % to about 60 wt. %, such as about 5 wt.
  • an aqueous virion-stabilizing formulation comprises a carbohydrate, such as sorbitol, at a concentration of between 0.1 wt. % to about 10 wt. %, such as between about 1 wt. % to about 5 wt.
  • Virions are generally present in the composition in an amount sufficient to provide a therapeutic effect when given in one or more doses, as defined above.
  • the pharmaceutical compositions can include a contrast dye, for administration in addition to the viral vector, such an adenoviral vector, including a disclosed MafB promoter operably linked to a heterologous nucleic acid molecule encoding PDX1 and MafA, and optionally MafB.
  • the contrast dye can be a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye.
  • the dye is lopromid, loglicinate, or loxaglinate.
  • a pharmaceutical composition including a) an adcno-associatcd virus vector, such as AAV6, comprising a disclosed MafB promoter operably linked to a nucleic acid molecule encoding PDX1 and a nucleic acid encoding MafA, and optionally MafB, and further optionally wherein the vector does not encode Ngn3; b) a buffer; and c) a contrast dye for endoscopic retrograde cholangiopancreatography.
  • the pharmaceutical composition does not include a nucleic acid encoding Ngn3.
  • Any of the AAV vectors disclosed herein, including any of the disclosed promoters, can be included in this composition.
  • the AAV vector can be encapsulated in a virion.
  • the AAV vector can be an AAV6 vector.
  • the composition can be formulated for administration to the pancreatic duct.
  • compositions including a viral vector can be delivered to humans or other animals by any means, including orally, intravenously, intramuscularly, intraperitoneally, intranasally, intradermally, intrathecally, subcutaneously, via inhalation or via suppository.
  • the composition is administered into the pancreatic duct of a subject in vivo.
  • ERCP Endoscopic Retrograde Cholangiopancreatography
  • ERCP is an endoscopic technique that involves the placement of a side- viewing instrument (generally either an endoscope or duodenoscope) within the descending duodenum. The procedure eliminates the need for invasive surgical procedures for administration to the pancreatic duct.
  • the patient will generally lie on their side on an examining table.
  • the patient will then be given medication to help numb the back of the patient's throat, and a sedative to help the patient relax during the examination.
  • the patient then swallows the endoscope.
  • the thin, flexible endoscope is passed carefully through the alimentary canal of the patient.
  • the physician guides the endoscope through the patient's esophagus, stomach, and the first part of the small intestine known as the duodenum. Because of the endoscope's relatively small diameter, most patients can tolerate the unusualness of having the endoscope advanced through the opening of their mouth.
  • This location is called the papilla of Vater, or also commonly referred to as the ampulla of Vater.
  • the papilla of Vater is a small mound of tissue looking and acting similarly to a nipple.
  • the papilla of Vater emits a substance known as bile into the small intestine, as well as pancreatic secretions that contain digestive enzymes.
  • Bile is a combination of chemicals made in the liver and is necessary in the act of digestion. Bile is stored and concentrated in the gallbladder between meals.
  • the gallbladder squeezes the bile through the common bile duct and subsequently through the papilla of Vater.
  • the pancreas secretes enzymes in response to a meal, and the enzymes help digest carbohydrates, fats, and proteins.
  • the patient will be instructed (or manually maneuvered) to lie flat on their stomach once the endoscope reaches the papilla of Vater.
  • the distal end of the endoscope is positioned proximate the papilla of Vater.
  • a catheter is then advanced through the endoscope until the distal tip of the catheter emerges from the opening at the endoscope's distal end.
  • the distal end of the catheter is guided through the endoscope's orifice to the papilla of Vater (located between the sphincter of Oddi) leading to the common bile duct and the pancreatic duct.
  • the pancreatic duct proper can be entered.
  • ERCP catheters can be constructed from Teflon, polyurethane and polyaminde. ERCP catheters also can also be constructed from resin comprised of nylon and PEBA (see U.S. Patent No. 5,843,028), and can be construed for use by a single operator (see U.S. Patent No. 7,179,252). At times, a spring wire guide may be placed in the lumen of the catheter to assist in cannulation of the ducts. A stylet, used to stiffen the catheter, must first be removed prior to spring wire guide insertion.
  • a dual or multi-lumen ERCP catheter in which one lumen could be utilized to accommodate the spring wire guide or diagnostic or therapeutic device, and in which a second lumen could be utilized for contrast media and/or dye infusion and or for administration of a pharmaceutical composition including a viral vector, such an adenoviral vector, that includes a disclosed promoter, such as operably linked to a heterologous nucleic acid molecule encoding PDX1 and MafA.
  • a contrast dye is administered to the subject.
  • a contrast dye can be included in a pharmaceutical composition.
  • the contrast dye can be a low- osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye.
  • the dye is lopromid, loglicinate, or loxaglinate.
  • Endoscopes have been designed for the delivery of more than one liquid solution, such as a first liquid composition including a viral vector, such an adenoviral vector, and a second liquid composition including dye, see U. S. Patent No. 7,597,662, which is incorporated herein by reference.
  • the pharmaceutical composition including a viral vector, such an adenoviral vector, and the dye can be delivered in the same or separate liquid compositions.
  • Methods and devices for using biliary catheters for accessing the biliary tree for ERCP procedures are disclosed in U.S. Patent No. 5,843,028, U.S. Patent No. 5,397,302 U.S. Pat. No. 5,320,602, which are incorporated by reference herein.
  • the vector is administered using a viral infusion technique into a pancreatic duct. Suitable methods are disclosed, for example, in Guo et al. Laboratory Invest. 93: 1241-1253, 2013, incorporated by reference herein.
  • the subject can be administered an effective amount of another agent, such as an immunosuppressive agent.
  • the immunosuppressive agent can be a steroid.
  • the immunosuppressive agent can be a non-steroidal anti-inflammatory agent.
  • the immunosuppressive agent can be an antibody.
  • the immunosuppressive agent can be a calcineurin inhibitor.
  • the immunosuppressive agent can be a mammalian target inhibitor (mTOR inhibiter).
  • the immunosuppressive agent can be a glucocorticoid.
  • the immunosuppressive agent can be an alkylating agent.
  • Immunosuppressive agents are disclosed, for example, in Hussain and Khan, Encyclopedia of Infection and Immunity 2022: 726-740, doi: 10.1016/B978-0-12- 818731-9.00068-9, April 8, 2022. Combinations of immunosuppressive agents are also of use.
  • Suitable immunosuppressive agents include, but are not limited to, mycophenolate mofetil (MMF), Rituximab, anti-Thymocyte Globulin (ATG), rapamycin. Methylprednisolone, and prednisolone. Combinations of these agents arc also of use.
  • the subject is administered Rituximab.
  • the subject is administered MMF.
  • the subject is administered Rituximab and MMF.
  • the immunosuppressive agent can be cortisol, cyclosporine-A, sirolimus, everolimus, mizoribine, lefunomide, azathioprine, methotrexate, or tacrolimus.
  • a gene therapy construct was engineered in which a shortened form of the MafB promoter drives expression of PDX1 and MafA, with or without additionally driving MafB production.
  • This construct was able to convert human and mouse alpha cell lines into insulin-producing cells.
  • the advantage over using the glucagon promoter is that when alpha cells are being converted into beta cells, the glucagon promoter will shut off, which could prevent full conversion.
  • MafB since in human and non-human primates, MafB is expressed in both alpha and beta cells, the MafB promoter does not turn off during conversion of an alpha cell into a beta cell. It was found that the conversion of human alpha cell lines into insulin-producing cells was much more durable than using either the glucagon promoter or the CMV promoter. It was also found that MafB driving PDX1 and MafA packaged in an AAV and infused into the pancreatic duct could induce formation of insulin-producing cells in non-human primates.
  • a human MafB promoter was selected as a 348 bp sequence upstream of the human MafB open reading frame.
  • the AAV vector, AAV-hMafBp-hPM, (Tina-059, see FIG. 2) contains human PDX1 and MafA open reading frames connected by a P2A linker under control of this human MafB promoter sequence plus a chimeric intron.
  • AAV serotype 6, DJ from Cell Biolabs
  • KPI Tinford University
  • the recombinant AAV vectors used for in vivo and in vitro infection were generated by transfection of human embryonic kidney 293T cells using polyethylenimine (PEI) max at 3:1 ratio to total DNA amount.
  • PEI polyethylenimine
  • rAAV vectors were harvested 3-4 days after transfection.
  • rAAV vectors were released from cells by three frcczc-thaw cycles, and non-cncapsidatcd DNA was removed by digestion with Bcnzonasc.
  • rAAV vectors were first purified by chloroform extraction and PEG-aqueous partition. Then rAAV vectors were further purified by 2 cycles of iodixanol ultracentrifugation.
  • rAAV were collected and dialyzed in PBS-0.001%PF- 68 buffer, then they were concentrated with a protein concentrator 50,000 MWCO tube and sterile-filtered. The viral titer was determined with gene of interest specific Taqman probe set.
  • Cell culture conditions Human Alpha Cells (Celprogen, Cat#35002-05) were cultured in human alpha cell complete media with serum (Celprogen, Cat#M35002-05S). The cell culture expansion extracellular matrix (Celprogen, Cat#E35002-05) were used for all human alpha cell culture experiments.
  • Virus infusion solution for non-human primates: Virus infusion solution was calculated with 1.7 mL/kg NHP body weight. The rAAV (1-5E+13 vg) was diluted with sterile saline-0.001% PF-68 and 2-4 mL indigo carmine solution (10 mg/mL). The virus solution was sterile-filtered before use.
  • Example 2 Virus infusion solution was calculated with 1.7 mL/kg NHP body weight. The rAAV (1-5E+13 vg) was diluted with sterile saline-0.001% PF-68 and 2-4 mL indigo carmine solution (10 mg/mL). The virus solution was sterile-filtered before use.
  • Example 2 Example 2
  • the MafB regulatory elements was investigated for driving the expression of PDX1 and MafA.
  • immature 0 cells express MafB, but then the expression ends when the cells mature, whereas both immature and mature a cells express MafB.
  • MafB was expressed equally in both mature a cells and mature 0 cells. This expression in both types of cells can be a distinct advantage over glucagon regulatory elements when driving PDX1 and MafA because the expression construct will not be shut off when transitioning into 0 cells or 0-like cells.
  • FIG. 1 a plasmid construct (FIG. 1) was made in which the MafB promoter, along with an intron sequence, which helps increase expression, drove luciferase and GFP to demonstrate activity of the promoter.
  • a human alpha cell line CELPROGEN®
  • CELPROGEN® human alpha cell line
  • luciferase graph
  • the CMV promoter is adequate to drive conversion of alpha cells into beta cells.
  • FIG. 2 A similar construct was created, with different combinations of PDX1, MafA, and MafB coding regions (FIG. 2).
  • the plasmids were designed to allow a direct comparison of the MafB promoter with the CMV promoter in transforming human alpha cells into insulin-secreting cells.
  • the MafB promoter induced the production and secretion of insulin in human alpha cells more strongly, when compared with the CMV promoter (Tina-036-1, CMV, compared with Tina-055, MafB, and then Tina-007, CMV, compared with Tina-059, MafB).
  • one of the plasmids (Tina-055) was converted into an AAV virus and surgery was performed to infuse the virus into the pancreatic duct of a diabetic non-human primate.
  • FIG. 4 the blood glucose trends and insulin requirement trends.
  • the glucose tolerance curves showed a dramatic improvement at two months, with a near normal fasting blood glucose (time zero) and a much more rapid decline in glucose levels after glucose challenge.
  • the animals did not receive insulin for 24 hours prior to the glucose tolerance test, so there is essentially no residual exogenous insulin circulating.
  • insulin levels were measured in the blood 30 minutes after glucose challenge. Consistent with the glucose tolerance test shown in FIG. 5, the circulating (endogenous) levels were substantially higher after 1 month compared with the presurgical diabetic state.
  • FIG. 1 shows that the MafB promoter drives luciferase in alpha cells equivalent to the CMV promoter.
  • FIG. 2 provides a schematic diagram of MafB promoter constructs that were generated, Tina-055 and Tina-059, driving PDX1 and MafA, plus or minus driving MafB.
  • FIG. 3 provides evidence that human alpha cells transfected with Tina-055 make more insulin than cells transfected with a similar construct that utilizes the CMV promoter.
  • FIG. 4 provide results from a monkey that received an AAV with the MafB promoter driving pdxl and mafA infusion (surgery) into the pancreas. The insulin requirement and blood glucose improved.
  • FIG. 5 provides evidenced that the fasting glucose and glucose tolerance improved in this monkey, to such an extent that the fasting glucose was normal.
  • FIG. 6 shows the circulating monkey insulin levels during a glucose tolerance test, with time zero being fasting, for this animal.
  • pTina-055 and pTina-059 are shown in FIG. 7 and FIG. 8, respectively.
  • pTina-055 encodes human MafA without a STOP codon, includes the connector T2A, and encodes human mafB sequence.
  • pTina-059 encodes human MafA with a STOP codon, but does not include the connector T2A, and does not encode a human MafB sequence.
  • the following nucleic acid sequences were utilized in pTina- 055.
  • human MAFB promoter GGACCTGCAGGGTACGGGGGTGGGCCCAGGCTGGCCGTCCCACTCCGCTCCAGCTCCGCCA
  • GGATCCAGCACAGCTGCACCGCCGAGCTGCGAGCGGCTGCGAGCGAGAGAGCGTAAGAGCAA GAGAGCTAGAGAGCGAGCAACGGGCACTCGCCCCA SEQ ID NO: 1
  • GCCACCATGA (SEQ ID NO: 14) human PDX1 (no STOP codon) ATGAACGGCGAGGAGCAGTACTACGCGGCCACGCAGCTTTACAAGGACCCATGCGCGTTCCAG CGAGGCCCGGCGCCGGAGTTCAGCGCCAGCCCCTGCGTGCCTGTACATGGGCCGCCAGCCC CCGCCGCCGCACCCGTTCCCTGGCGCCCTGGGCGCGCTGGAGCAGGGCAGCCCCCCG GACATCTCCCCGTACGAGGTGCCCCCTCGCCGACGACCCCGCGGTGGCGCACCTTCACCACCACCACC
  • FIGS. 9 and 10 provide glucose tolerance test (GTT) results from two monkeys that received a pancreatic duct infusion of AAV6 carrying Tina-059.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Biomedical Technology (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Diabetes (AREA)
  • Biophysics (AREA)
  • Medicinal Chemistry (AREA)
  • Public Health (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Veterinary Medicine (AREA)
  • Animal Behavior & Ethology (AREA)
  • Cell Biology (AREA)
  • Endocrinology (AREA)
  • Virology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Oncology (AREA)
  • Emergency Medicine (AREA)
  • Epidemiology (AREA)
  • Hematology (AREA)
  • Obesity (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • General Chemical & Material Sciences (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)

Abstract

MafB promoters are disclosed herein. In addition, disclosed are recombinant nucleic acid molecules that include a MafB promoter operably linked to a nucleic acid molecule encoding at least one heterologous protein. Vectors including these recombinant nucleic acid molecules, and host cells transformed with these vectors, are also disclosed. Methods of use for these promoters, recombinant nucleic acid molecules, vectors and host cells are disclosed.

Description

MAFB PROMOTER FOR GENE THERAPY FOR DIABETES
CROSS REFERENCE TO RELATED APPLICATIONS
This claims the benefit of U.S. Provisional Application No. 63/486,202, filed February 21, 2023, which is incorporated by reference herein in its entirety.
SEQUENCE LISTING
The Sequence Listing is submitted as an XML file in the form of the file named “Sequences” (25,742 bytes), which was created on February 2, 2024, which is incorporated by reference herein.
FIELD OF THE DISCLOSURE
This relates to the field of alpha cells, specifically to Musculoaponeurotic fibrosarcoma oncogene homolog (Maf)B promoters that can be used to express a heterologous protein in alpha cells.
BACKGROUND
A mammalian pancreas is composed of two subclasses of tissue: the exocrine cells of the acinar tissue and the endocrine cells of the islets of Langerhans. The exocrine cells produce digestive enzymes that are secreted through the pancreatic duct to the intestine. The islet cells produce polypeptide hormones that are involved in carbohydrate metabolism. The islands of endocrine tissue that exist within the adult mammalian pancreas are termed the islets of Langerhans. Adult mammalian islets are composed of five major cell types, the alpha (a), beta (3), delta (8), pancreatic polypeptide (PP), and ghrelin (s) cells. These cells are distinguished by their production of glucagon, insulin, somatostatin, pancreatic polypeptide, and ghrelin, respectively.
Adeno-associated virus encoding PDX1 and MafA can be infused through the pancreatic duct, such as by using endoscopic retrograde cholangiopancreatography (ERCP), to transdifferentiate a cells into 0 cells (see PCT Publication No. WO 2015/164218, incorporated herein by reference). However, there remains a need for promoters that can be used to achieve high levels of these proteins, and/or other proteins, in alpha cells.
SUMMARY OF THE DISCLOSURE
MafB promoters are disclosed herein, wherein the MafB promoter consists essentially of, or consists of a nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter, or the nucleotide sequence of SEQ ID NO: 1.
In further aspects, recombinant nucleic acid molecules are disclosed that include a MafB promoter operably linked to a heterologous nucleic acid molecule encoding a protein. Also disclosed are vectors include these recombinant nucleic acid molecules, and host cells transformed with these vectors. In more aspects, methods of using these vectors and host cells are disclosed. Methods for treating diabetes in a subject are also provided.
The foregoing and other features and advantages of the invention will become more apparent from the following detailed description of several aspects which proceeds with reference to the accompanying figures.
BRIEF DESCRIPTION OF THE FIGURES
FIG. 1 provides a schematic diagram of a construct including the human MafB promoter, and evidence of its use to target human alpha cells. Results are provided from three days after transfection of human alpha cells in the lower panels. Expression is compared to a similar construct that includes the cytomegalovirus (CMV) promoter instead of the MafB promoter.
FIG. 2 provides two schematic diagrams of constructs that include the human MafB promoter, that can be used to target human alpha cells. Two schematic diagrams of control constructs, that include the CMV promoter, are also provided.
FIG. 3 is a bar graph showing the results of an Insulin ELISA. Results are from three days after transfection of human alpha cells with constructs shown in FIG. 2.
FIG. 4 provides graphs showing the daily blood glucose (top) and daily insulin requirement (bottom) from a monkey administered Tina-059, shown in FIG. 2.
FIG. 5 is a graph of the results from a glucose tolerance test in a monkey following treatment with Tina-059.
FIG. 6 is a graph of the insulin levels in the same monkey as Fig. 5 30 minutes after glucose administration following treatment with Tina-059.
FIG. 7 is a schematic diagram of pTina-055. This vector includes a human MafB promoter, a chimeric intron, a Kozak sequence, a nucleic acid molecule encoding human PDX1 (without a STOP codon), P2A, WPRE, and a poly(A) sequence. This vector also includes a nucleic acid molecule encoding human MafA (without a STOP codon) T2A and a nucleic acid molecule encoding human MafB.
FIG. 8 is a schematic diagram of pTina-059. This vector includes a human MafB promoter, a chimeric intron, a Kozak sequence, a nuclei acid encoding human PDX1 (no STOP CODON), P2A, WPRE, and a poly(A) sequence. This vector also includes a nucleic acid molecule encoding human MafA (with a STOP codon) but does not include T2A. It does not include a nucleic acid molecule encoding human MafB.
FIG. 9 is a graph of a glucose tolerance tests (GTT) for a first monkey (M263) that received a pancreatic ductal infusion of AAV6 carrying pTina-059. For the GTT the monkey was given an intravenous (IV) bolus of glucose and then blood samples were measured at serial time points to follow the response of the beta cells. This monkey had a normal GTT at one month. M263 received mycophenolate mofetil (MMF) for immunosuppression. FIG. 10 is a graph of a GTT for a second monkey (Ml 88) that received a pancreatic ductal infusion of AAV6 carrying Tina-059. For the GTT the monkey was given an intravenous (IV) bolus of glucose and then blood samples were measured at serial time points to follow the response of the beta cells. This monkey did not achieve a normal GTT, but continued to improve over time. M188 received MMF and Rituximab for immunosuppression.
SEQUENCES
The nucleic and amino acid sequences listed are shown using standard letter abbreviations for nucleotide bases, and three letter code for amino acids, as defined in 37 C.F.R. 1.822. Only one strand of each nucleic acid sequence is shown, but the complementary strand is understood as included by any reference to the displayed strand.
SEQ ID NO: 1 is a nucleic acid sequence of an exemplary MafB promoter.
SEQ ID NO: 2 is an amino acid sequence of and exemplary human MafA.
SEQ ID NO: 3 is an amino acid sequence of an exemplary mouse MafA.
SEQ ID NO: 4 is an amino acid sequence of an exemplary human PDX1.
SEQ ID NO: 5 is an amino acid sequence of an exemplary mouse PDX1.
SEQ ID NOs: 6-8 are the nucleic acid sequences of exemplary connectors.
SEQ ID NO: 9 is an amino acid sequence of an exemplary human MafB.
SEQ ID NO: 10 is an amino acid sequence of am exemplary mouse MafB.
SEQ ID NO: 11 is an amino acid sequence of an exemplary Ngn3.
SEQ ID NO: 12 is the nucleic acid sequence a portion of an exemplary MafB promoter.
SEQ ID NO: 13 is a nucleic acid sequence of a chimeric intron.
SEQ ID NO: 14 is a nucleic acid sequence of a Kozak sequence.
SEQ ID NO: 15 is a nucleic acid sequence encoding human PDX1, without a STOP codon.
SEQ ID NO: 16 is a nucleic acid sequence of a P2A connector.
SEQ ID NO: 17 is a nucleic acid sequence encoding human MafA, without a STOP codon.
SEQ ID NO: 18 is a nucleic acid sequence of a T2A connector.
SEQ ID NO: 19 is a nucleic acid sequence of human MafA, with a STOP codon.
SEQ ID NO: 20 is a nucleic acid sequence of a Woodchuck hepatitis virus Posttranscriptional
Regulatory Element (WPRE).
SEQ ID NO: 21 is a nucleic acid sequence encoding a poly A tail.
DETAILED DESCRIPTION OF SEVERAL ASPECTS
MafB is a transcription factor that is expressed in alpha cells in mice, but not in mouse beta cells. However, in humans and non-human primates it is expressed in both alpha cells and beta cells. Gene therapy constructs were engineered in which a shortened form of the MafB promoter drives expression of PDX1 and MafA, with or without additionally driving MafB production. This construct is able to convert human and mouse alpha cell lines into insulin-producing cells.
Disclosed herein are MafB promoters that can be operably linked to a heterologous nucleic acid molecule encoding a protein. Recombinant nucleic acid molecules including such a MafB promoter operably linked to the heterologous nucleic acid molecule encoding a protein provide a high level of expression of the protein in alpha cells. Vectors, such as adeno-associated virus vectors including a disclosed MafB promoter operably linked to a heterologous nucleic acid molecule encoding PDX1 and MafA, and optionally MafB itself, can be infused through the pancreatic duct, such as by using endoscopic retrograde cholangiopancreatography (ERCP), to transdifferentiate a cells into functional 0 cells.
PCT Publication No. WO 2015/164218, incorporated herein by reference, discloses that adeno- associated virus encoding PDX1 and MafA can be infused through the pancreatic duct, such as by using endoscopic retrograde cholangiopancreatography (ERCP), to reprogram a cells into 0 cells, which secrete insulin. The new beta cells are immunologically unrecognized for an extended period, resulting in persistent euglycemia without further interventions. In addition, the immune system in treated subjects was reverted to a naive state in which the immune cells were not actively being exposed to beta-cell autoantigens. The presently disclosed promoters can be used to achieve high levels of expression of MafA and PDX1 and can be used to increase the efficiency of these methods.
Terms
Unless otherwise noted, technical terms are used according to conventional usage. Definitions of many common terms in molecular biology may be found in Krebs et al. (eds.), Lewin ’s genes XII, published by Jones & Bartlett Learning, 2017. As used herein, the singular forms “a,” “an,” and “the,” refer to both the singular as well as plural, unless the context indicates otherwise. For example, the term “a beta cell” includes single or plural beta cells and can be considered equivalent to the phrase “at least one beta cell.” As used herein, the term “comprises” means “includes.” It is further to be understood that any and all base sizes or amino acid sizes, and all molecular weight or molecular mass values, given for nucleic acids or polypeptides are approximate, and are provided for descriptive purposes, unless otherwise indicated. Although many methods and materials similar or equivalent to those described herein can be used, particular suitable methods and materials are described herein. In case of conflict, the present specification, including explanations of terms, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting. To facilitate review of the various aspects, the following explanations of terms are provided:
About: Unless context indicated otherwise, “about” refers to plus or minus 5% of a reference value. For example, “about” 100 refers to 95 to 105.
Alpha (a) cells: Mature glucagon producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans. Beta (P) cells: Mature insulin producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans,
Delta (8) cells: Mature somatostatin producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans.
PP cells: Mature pancreatic polypeptide (PP) producing endocrine cells. In vivo, these cells are found in the pancreatic islets of Langerhans.
Adeno-associated virus (AAV): A small, replication-defective, non-enveloped virus that infects humans and some other primate species. AAV is not known to cause disease and elicits a very mild immune response. Gene therapy vectors that utilize AAV can infect both dividing and quiescent cells and can persist in an extrachromosomal state without integrating into the genome of the host cell. These features make AAV an attractive viral vector for gene therapy. There are currently 14 recognized serotypes of AAV (AAV1-13 and AAV-DJ).
Administration: To provide or give a subject an agent by any effective route. Exemplary routes of administration include, but are not limited to, oral, injection (such as subcutaneous, intramuscular, intradermal, intraperitoneal, intravenous, and intratumoral), sublingual, rectal, transdermal, intranasal, vaginal and inhalation routes. In some aspects, administration is to a pancreatic duct.
Agent: Any polypeptide, compound, small molecule, organic compound, salt, polynucleotide, vector, or other molecule of interest. Agent can include a therapeutic agent, a diagnostic agent or a pharmaceutical agent. A therapeutic agent is a substance that demonstrates some therapeutic effect by restoring or maintaining health, such as by alleviating the symptoms associated with a disease or physiological disorder, or delaying (including preventing) progression or onset of a disease, such as T1D. A therapeutic agent can include a disclosed MafB promoter operably linked to a nucleic acid molecule encoding a polypeptide.
Amino acid substitution: The replacement of one amino acid in a polypeptide with a different amino acid.
Animal: Living multi-cellular vertebrate organisms, a category that includes, for example, mammals and birds. The term mammal includes both human and non-human mammals. Similarly, the term “subject” includes both human and veterinary subjects.
Anti-diabetic lifestyle modifications: Changes to lifestyle, habits, and practices intended to alleviate the symptoms of diabetes or pre-diabetes. Obesity and sedentary lifestyle may both independently increase the risk of a subject developing type 2 diabetes, so anti-diabetic lifestyle modifications include those changes that will lead to a reduction in a subject’s body mass index (BMI), increase physical activity, or both. Specific, non-limiting examples include the lifestyle interventions described in Diabetes Care, 22(4):623-34 at pages 626-27, herein incorporated by reference.
Conservative Substitutions: Modifications of a polypeptide that involve the substitution of one or more amino acids for amino acids having similar biochemical properties that do not result in change or loss of a biological or biochemical function of the polypeptide are designated “conservative” substitutions. These conservative substitutions are likely to have minimal impact on the activity of the resultant protein. Table 1 shows amino acids that can be substituted for an original amino acid in a protein, and which are regarded as conservative substitutions.
TABLE
Original Residue Conservative Substitutions
Ala ser
Arg lys
Asn gin; his
Asp glu
Cys ser
Gin asn
Glu asp
Gly pro
His asn; gin
He leu; val
Leu ile; val
Lys arg; gin; glu
Met leu; ile
Phe met; leu; tyr
Ser thr
Thr ser
Trp tyr
Tyr trp; phe
Val ile; leu
One or more conservative changes, or up to ten conservative changes (such as two substituted amino acids, three substituted amino acids, four substituted amino acids, or five substituted amino acids, etc.) can be made in the polypeptide without changing a biochemical function of the protein, such as PDX1 or MafA.
Consists essentially of: With regarding to nucleic acid molecules, a term that indicates that additional nucleotides are not included in the part of the molecule, but that other agents (such as labels, altered forms of a nucleotide, or chemical compounds) can be included. “Consists of’ indicates that the exact nucleotide sequence is present in that part of a molecule.
Degenerate variant: In the context of the present disclosure, a “degenerate variant” refers to a polynucleotide encoding a polypeptide (such as PDX1 or MafA) that includes a sequence that is degenerate as a result of the genetic code. There are 20 natural amino acids, most of which are specified by more than one codon. Therefore, all degenerate nucleotide sequences encoding a peptide are included as long as the amino acid sequence of the peptide encoded by the nucleotide sequence is unchanged. Diabetes mellitus: A group of metabolic diseases in which a subject has high blood sugar, either because the pancreas does not produce enough insulin, or because cells do not respond to the insulin that is produced. Type 1 diabetes results from the body's failure to produce insulin. This form has also been called ' insulin-dependent diabetes mellitus" (IDDM) or "juvenile diabetes". Type 1 diabetes mellitus is characterized by loss of the insulin-producing P cells, leading to insulin deficiency. This type can be further classified as immune-mediated or idiopathic. Type 2 diabetes results from insulin resistance, a condition in which cells fail to use insulin properly, sometimes combined with an absolute insulin deficiency. This form is also called “non insulin-dependent diabetes mellitus” (NIDDM) or "adult-onset diabetes." The defective responsiveness of body tissues to insulin is believed to involve the insulin receptor. Diabetes mellitus is characterized by recurrent or persistent hyperglycemia, and is diagnosed by demonstrating any one of: a. Fasting plasma glucose level > 7.0 mmol/1 (126 mg/dl); b. Plasma glucose > 11.1 mmol/1 (200 mg/dL) two hours after a 75 g oral glucose load as in a glucose tolerance test; c. Symptoms of hyperglycemia and casual plasma glucose > 11.1 mmol/1 (200 mg/dl); d. Glycated hemoglobin (Hb A1C) > 6.5%
Differentiation: The process whereby a first cell acquires specialized structural and/or functional features characteristic of a certain type of mature cells. Similarly, “differentiate” refers to this process. Typically, during differentiation, cellular structure alters and tissue-specific proteins appear. The term "differentiated pancreatic endocrine cell" refers to cells expressing a protein characteristic of the specific pancreatic endocrine cell type. A differentiated pancreatic endocrine cell includes an a cell, a 0 cell, a 8 cell, and a PP cell, which express glucagon, insulin, somatostatin, and pancreatic polypeptide, respectively.
Endocrine: Tissue which secretes regulatory hormones directly into the bloodstream without the need for an associated duct system.
Enhancer: A nucleic acid sequence that increases the rate of transcription by increasing the activity of a promoter.
Expand: A process by which the number or amount of cells is increased due to cell division. Similarly, the terms “expansion” or “expanded” refers to this process. The terms "proliferate," "proliferation" or "proliferated" may be used interchangeably with the words "expand," "expansion," or "expanded."
Expressed: Translation of a nucleic acid into a protein. Proteins may be expressed and remain intracellular, become a component of the cell surface membrane, or be secreted into the extracellular matrix or medium.
Exocrine: Secretory tissue which distributes its products, such as enzymes, via an associated duct network. The exocrine pancreas is the part of the pancreas that secretes enzymes required for digestion. The exocrine cells of the pancreas include the centroacinar cells and basophilic cells, which produce secretin and cholecystokinin.
Expression Control Sequences: Nucleic acid sequences that regulate the expression of a heterologous nucleic acid sequence to which it is operatively linked. Expression control sequences are operatively linked to a nucleic acid sequence when the expression control sequences control and regulate the transcription and, as appropriate, translation of the nucleic acid sequence. Thus, expression control sequences can include appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, maintenance of the correct reading frame of that gene to permit proper translation of mRNA, and stop codons. The term “control sequences” is intended to include, at a minimum, components whose presence can influence expression, and can also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences. Expression control sequences can include a promoter.
Expression vector: A vector comprising a recombinant polynucleotide comprising expression control sequences operatively linked to a nucleotide sequence to be expressed. An expression vector comprises sufficient cis- acting elements for expression; other elements for expression can be supplied by the host cell or in an in vitro expression system. Non-limiting examples of expression vectors include cosmids, plasmids (e.g., naked or contained in liposomes) and viruses (e.g., lentiviruses, retroviruses, adenoviruses, and adeno-associated viruses) that incorporate the recombinant polynucleotide.
A polynucleotide can be inserted into an expression vector that contains a promoter sequence which facilitates the efficient transcription of the inserted genetic sequence of the host. The expression vector typically contains an origin of replication, a promoter, as well as specific nucleic acid sequences that allow phenotypic selection of the transformed cells.
Glucagon: A pancreatic enzyme produced by the pancreatic a cells in vivo. Exemplary glucagon amino acid sequences are shown in GENBANK® accession Nos: NP_002045.1 (pro-protein) (human); NP 032126.1 (mouse), both incorporated by reference. The term “glucagon” also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the function, such as binding to the glucagon receptor. Glucagon is encoded by nucleic acid corresponding to GENBANK® Accession No: NM_002054.5 (human); NM_008100.4 (mouse), both incorporated by reference as available on January 4, 2022. The glucagon protein encoded by the glucagon gene, which includes a glucagon promoter in vivo. Glucagon is expressed as a preproprotein that is cleaved into four distinct mature peptides, one of which is glucagon. Glucagon is a pancreatic hormone that counteracts the glucose-lowering action of insulin by stimulating glycogenolysis and gluconeogenesis. Glucagon is a ligand for a specific G-protein linked receptor whose signaling pathway controls cell proliferation.
Heterologous: A heterologous sequence is a sequence that is not normally (in the wild-type sequence) found adjacent to a second sequence. In one aspect, the sequence is from a different genetic source, such as a virus or organism, than the second sequence. A protein that is heterologous to a MafB promoter is any protein other than MafB (it is not MafB).
Host cells: Cells in which a vector can be propagated and its DNA expressed. The cell may be prokaryotic or eukaryotic. The term also includes any progeny of the subject host cell. It is understood that all progeny may not be identical to the parental cell since there may be mutations that occur during replication. However, such progeny are included when the term “host cell” is used.
Insulin: A protein hormone involved in the regulation of blood sugar levels that is produced by pancreatic beta cells. In vivo, insulin is produced as a precursor proinsulin, consisting of the B and A chains of insulin linked together via a connecting C-peptide. Insulin itself includes only the B and A chains. Exemplary nucleic acid sequences encoding insulin are provided in GENBANK® Accession No: NM_000207.3 (human) and NM_008386.4 (mouse), as available on December 28, 2021, and are incorporated by reference herein. The term insulin also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the structure of function.
Islets of Langerhans: Small discrete clusters of pancreatic endocrine tissue. In vivo, in an adult mammal, the islets of Langerhans are found in the pancreas as discrete clusters (islands) of pancreatic endocrine tissue surrounded by the pancreatic exocrine (or acinar) tissue. In vivo, the islets of Langerhans consist of the a cells, cells, 6 cells, PP cells, and e cells. Histologically, in rodents, the islets of Langerhans consist of a central core of 0 cells surrounded by an outer layer of a cells, 8 cells, and PP cells. The structure of human islets of Langerhans is different and distinct from rodents. The islets of Langerhans are sometimes referred to herein as “islets.”
Isolated: An “isolated” biological component (such as a nucleic acid, peptide or protein) has been substantially separated, produced apart from, or purified away from other biological components in the cell of the organism in which the component naturally occurs, i.e., other chromosomal and extrachromosomal DNA and RNA, and proteins. Nucleic acids, peptides and proteins which have been “isolated” thus include nucleic acids and proteins purified by standard purification methods. The term also embraces nucleic acids, peptides and proteins prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids. An isolated cell type has been substantially separated from other cell types, such as a different cell type that occurs in an organ. A purified cell or component can be at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% pure.
Label: A detectable compound or composition that is conjugated directly or indirectly to another molecule, such as an antibody or a protein, to facilitate detection of that molecule. Specific, non-limiting examples of labels include fluorescent tags, enzymatic linkages, and radioactive isotopes.
Musculoaponeurotic fibrosarcoma oncogene homolog A (MafA): MafA is a transcription factor that binds RIPE3b, a conserved enhancer element that regulates pancreatic beta cell-specific expression of the insulin gene (INS; MIM 176730) (Olbrot et al., 2002). MafA is referred in the art as aliases; v-maf musculoaponeurotic fibrosarcoma oncogene homolog A (avian), hMafA; RIPE3bl; MafA. Exemplary MafA proteins are the MafA protein of GENBANK® Accession No: NM_194350 (mouse) (SEQ ID NOs:3 and 32 of U.S. Published Patent Application No. 2011/0280842) or NP_963883.2 (Human)(SEQ ID NOs: 33 and 32 of U.S. Published Patent Application No. 2011/0280842); GenelD No: 389692, which are all incorporated by reference. The term MafA also encompasses species variants, homologues, allelic forms. mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions that do not adversely affecting the structure of function. The term "MafA", or "MafA" protein” as used herein refers to a polypeptide having a naturally occurring amino acid sequence of a MafA" protein or a fragment, variant, or derivative thereof retains the ability of the naturally occurring protein to bind to DNA and activate gene transcription of Glut2 and pyruvate carboxylase, and other genes such as Glut2, PDX-1, Nkx6.1, GLP-1 receptor, prohormone convertase-1/3 as disclosed in Wang et al., Diabetologia. 2007 February; 50(2): 348- 358, which is incorporated herein by reference. Exemplary MafA nucleic acids are GENBANK® Accession No: NM_201589 (human) (SEQ ID NO:36 of U.S. Published Patent Application No. 2011/0280842) and GENBANK® Accession No: NM_194350 (mouse) (SEQ ID NO: 39 of U.S. Published Patent Application No. 2011/0280842), which are all incorporated by reference. In addition to naturally-occurring allelic variants of the MafA sequences that may exist in the population, it will be appreciated that, as is the case for virtually all proteins, a variety of changes can be introduced into the sequences of SEQ ID NO: 3 of U.S. Published Patent Application No. 2011/0280842 or SEQ ID NO: 33 of U.S. Published Patent Application No. 2011/0280842 (referred to as "wild type" sequences) without substantially altering the functional (biological) activity of the polypeptides. Such variants are included within the scope of the terms "MafA", "MafA protein", etc. U.S. Published Patent Application No. 2011/0280842 and all of the GENBANK entries are incorporated herein by reference.
Musculoaponeurotic fibrosarcoma oncogene homolog B (MafB): A basic leucine zipper (bZIP) transcription factor that plays an important role in the regulation of lineage-specific hematopoiesis. MafB is an encoded nuclear protein that represses ETS1 -mediated transcription of erythroid-specific genes in myeloid cells. MafB is also a negative regulator of cellular induction of type I interferon (IFN) and other pro-inflammatory cytokines, and thus is believed to protect against the development of auto-inflammatory disorders in multiple mammalian tissues. MafB belongs to the large Maf family of transcription factors, which consists of MafB, c-Maf, MafA and NrL. In adult pancreatic islets in mice, MafB expression is generally restricted to alpha cells. However, in humans and non-human primates, MafB is expressed in beta cells (see Cyphert et al., Diabetes 2019;68(2):337-348, released on-line November 13, 2018). Exemplary nucleic acid molecules encoding human MafB, and the encoded human MafB protein sequence, are provided in GENBANK® Accession No. AF134157.1, August 31, 1999, incorporated herein by reference. Exemplary nucleic acid molecules encoding murine MafB, and the encoded murine MafB protein sequence, are provided in GENBANK® Accession No: NM_010658.3, December 21, 2022, incorporated herein by reference. The MafB gene is disclosed at Gene ID No. 9935, updated January 22, 2023, incorporated herein by reference, as available from the National Library of Medicine, available at the NCBI website.
Mammal: This term includes both human and non-human mammals. Similarly, the term “subject” includes both human and veterinary subjects.
Neurogenin (Ngn)-3: Neurogenin-3 (also known as NEUROG3) is expressed in endocrine progenitor cells and is required for endocrine cell development in the pancreas and intestine. It belongs to a family of basic helix-loop-helix transcription factors involved in the determination of neural precursor cells in the neuroectoderm. Ngn3 is referred in the art as aliases; Neurogenin 3; Atoh5; Math4B; bHLHa7; NEUR0G3. Exemplary Ngn3 proteins are provided in GENBANK® Accession No: NM 009719 (mouse) and SEQ ID NO:2 of U.S. Published Patent Application No. 2011/0280842, both incorporated by reference herein or GENBANK® Accession No: NP 033849.3 (Human) and SEQ ID NO: 32 of U.S. Published Patent Application No. 2011/0280842, both incorporated by reference herein; GenelD No: 50674. The term Ngn3 also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the structure of function. Human Ngn3 is encoded by nucleic acid corresponding to GENBANK® Accession No: NM 020999 (human), SEQ ID NO:35 of U.S. Published Patent Application No. 2011/0280842 or NM 009719 (mouse), SEQ ID NO: 38 of U.S. Published Patent Application No. 2011/0280842. U.S. Published Patent Application No. 2011/0280842 and these GENBANK® Accession Nos. are incorporated by reference herein. The term "Ngn3", or "Ngn3 protein" as used herein refers to a polypeptide having a naturally occurring amino acid sequence of a Ngn3 protein or a fragment, variant, or derivative thereof that retains the ability of the naturally occurring protein to bind to DNA and activate gene transcription of NeuroD, Delta-like l(Dlll), HeyL, insulinoma-assiciated-1 (IA1), Nk2.2, Notch, HesS, Isll, Somatostatin receptor 2 (Sstr2) and other genes as disclosed in Serafimidis et al., Stem cells; 2008; 26; 3-16, which is incorporated herein in its entirety by reference. In addition to naturally-occurring allelic variants of the Ngn3 sequences that may exist in the population, it will be appreciated that, as is the case for virtually all proteins, a variety of changes can be introduced into a wild-type sequence (listed above in GENBANK® enteries) without substantially altering the functional (biological) activity of the polypeptides. Such variants are included within the scope of the terms "Ngn3 ", "Ngn3 protein", etc.
Nucleic acid: A polymer composed of nucleotide units (ribonucleotides, deoxyribonucleotides, related naturally occurring structural variants, and synthetic non-naturally occurring analogs thereof) linked via phosphodiester bonds, related naturally occurring structural variants, and synthetic non-naturally occurring analogs thereof. Thus, the term includes nucleotide polymers in which the nucleotides and the linkages between them include non-naturally occurring synthetic analogs, such as, for example and without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O- methyl ribonucleotides, peptide-nucleic acids (PNAs), and the like. Such polynucleotides can be synthesized, for example, using an automated DNA synthesizer. The term “oligonucleotide” typically refers to short polynucleotides, generally no greater than about 50 nucleotides. It will be understood that when a nucleotide sequence is represented by a DNA sequence (i.e., A, T, G, C), this also includes an RNA sequence (i.e., A, U, G, C) in which “U” replaces “T.”
Conventional notation is used herein to describe nucleotide sequences: the left-hand end of a singlestranded nucleotide sequence is the 5 '-end; the left-hand direction of a double- stranded nucleotide sequence is referred to as the 5’-direction. The direction of 5' to 3' addition of nucleotides to nascent RNA transcripts is referred to as the transcription direction. The DNA strand having the same sequence as an mRNA is referred to as the “coding strand;” sequences on the DNA strand having the same sequence as an mRNA transcribed from that DNA and which are located 5' to the 5'-end of the RNA transcript are referred to as “upstream sequences;” sequences on the DNA strand having the same sequence as the RNA and which are 3' to the 3' end of the coding RNA transcript are referred to as “downstream sequences.”
“cDNA” refers to a DNA that is complementary or identical to an mRNA, in either single stranded or double stranded form.
“Encoding” refers to the inherent property of specific sequences of nucleotides in a polynucleotide, such as a gene, a cDNA, or an mRNA, to serve as templates for synthesis of other polymers and macromolecules in biological processes having either a defined sequence of nucleotides (i.e., rRNA, tRNA and mRNA) or a defined sequence of amino acids and the biological properties resulting therefrom. Thus, a gene encodes a protein if transcription and translation of mRNA produced by that gene produces the protein in a cell or other biological system. Both the coding strand, the nucleotide sequence of which is identical to the mRNA sequence and is usually provided in sequence listings, and non-coding strand, used as the template for transcription, of a gene or cDNA can be referred to as encoding the protein or other product of that gene or cDNA. Unless otherwise specified, a "nucleotide sequence encoding an amino acid sequence" includes all nucleotide sequences that are degenerate versions of each other and that encode the same amino acid sequence. Nucleotide sequences that encode proteins and RNA may include introns.
A first sequence is an “antisense” with respect to a second sequence if a polynucleotide whose sequence is the first sequence specifically hybridizes with a polynucleotide whose sequence is the second sequence.
Terms used to describe sequence relationships between two or more nucleotide sequences or amino acid sequences include “reference sequence,” “selected from,” “comparison window,” “identical,” “percentage of sequence identity,” “substantially identical,” “complementary,” and “substantially complementary.”
For sequence comparison of nucleic acid sequences, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters are used. Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, for example, by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443, 1970, by the search for similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444, 1988, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see for example, Current Protocols in Molecular Biology (Ausubel et al., eds 1995 supplement)).
One example of a useful algorithm is PILEUP. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360, 1987. The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153, 1989. Using PILEUP, a reference sequence is compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence analysis software package, such as version 7.0 (Devereaux el al., Nuc. Acids Res. 12:387-395, 1984.
Another example of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and the BLAST 2.0 algorithm, which are described in Altschul et al., J. Mol. Biol. 215:403-410, 1990 and Altschul et al., Nucleic Acids Res. 25:3389-3402, 1977. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The BLASTN program (for nucleotide sequences) uses as defaults a word length (W) of 11, alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. The BLASTP program (for amino acid sequences) uses as defaults a word length (W) of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915, 1989).
Operably linked: A first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame.
ORF (open reading frame): A series of nucleotide triplets (codons) coding for amino acids without any termination codons. These sequences are usually translatable into a peptide.
Pancreatic endocrine cell: An endocrine cell of pancreatic origin that produces one or more pancreatic hormone, such as insulin, glucagon, somatostatin, or pancreatic polypeptide. Subsets of pancreatic endocrine cells include the a (glucagon producing), (insulin producing) 3 (somatostatin producing) or PP (pancreatic polypeptide producing) cells. Additional subsets produce more than one pancreatic hormone, such as, but not limited to, a cell that produces both insulin and glucagon, or a cell that produces insulin, glucagon, and somatostatin, or a cell that produces insulin and somatostatin.
Pancreas duodenal homeobox protein (PDX)l: PDX1 protein is a transcriptional activator of several genes, including insulin, somatostatin, glucokinase, islet amyloid polypeptide, and glucose transporter type 2 (GLUT2). PDX1 is a nuclear protein is involved in the early development of the pancreas and plays a major role in glucose-dependent regulation of insulin gene expression. Defects in the gene encoding the PDX1 protein are a cause of pancreatic agenesis, which can lead to early-onset insulindependent diabetes mellitus (NIDDM), as well as maturity onset diabetes of the young type 4 (M0DY4). PDX1 is referred in the art as aliases; pancreatic and duodenal homeobox 1, IDX-1, STF-1, PDX-1, M0DY4, Ipfl. Exemplary PDX1 proteins are shown in GENBANK® Accession No. NM 008814 (mouse) (SEQ ID NO:1 of U.S. Published Patent Application No. 2011/0280842) or GENBANK® Accession No. NP 000200.1 (Human)(SEQ ID NO: 31 of U.S. Published Patent Application No. 2011/0280842), or Gene ID: 3651, which are all incorporated herein by reference. The term PDX1 also encompasses species variants, homologues, allelic forms, mutant forms, and equivalents thereof, including conservative substitutions, additions, deletions therein not adversely affecting the structure of function. Exemplary nucleic acid sequences are shown in GENBANK® Accession No NM 000209 (human) (SEQ ID NO:34 of U.S. Published Patent Application No. 2011/0280842) or GENBANK® Accession No NM 008814 (mouse)(SEQ ID NO: 37 of U.S. Published Patent Application No. 2011/0280842), which are all incorporated by reference. The term "PDX1", or "PDX1 protein" as used herein refers to a polypeptide having a naturally occurring amino acid sequence of a PDX1 protein or a fragment, variant, or derivative thereof that at least in part retains the ability of the naturally occurring protein to bind to DNA and activate gene transcription of insulin, somatostatin, glucokinase, islet amyloid polypeptide, and glucose transporter type 2 (GLUT2). In addition to naturally-occurring allelic variants of the PDX1 sequences that may exist in the population, it will be appreciated that, as is the case for virtually all proteins, a variety of changes can be introduced into a wild type sequence (see the listed GENBANK® enteries) without substantially altering the functional (biological) activity of the polypeptides. Such variants are included within the scope of the terms "PDX1", "PDX1 protein", etc. The listed GENBANK® Accession Nos. and U.S. Published Patent Application No. 2011/0280842 are incorporated by reference herein.
Pharmaceutically acceptable carriers: The pharmaceutically acceptable carriers useful in this invention are conventional. Remington: The Science and Practice of Pharmacy, 22nd ed., London, UK: Pharmaceutical Press, 2013, describes compositions and formulations suitable for pharmaceutical delivery of the fusion proteins herein disclosed.
In general, the nature of the carrier will depend on the particular mode of administration being employed. For instance, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e.g., powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.
Pharmaceutical agent: A chemical compound or a composition capable of inducing a desired therapeutic or prophylactic effect when properly administered to a subject or a cell. “Incubating” includes a sufficient amount of time for a drug to interact with a cell. “Contacting” includes incubating a drug in solid or in liquid form with a cell.
Pre-diabetes: A state in which some, but not all, of the criteria for diabetes are met. For example, a subject can have impaired fasting glycemia or impaired fasting glucose (IFG). Subjects with fasting glucose levels from 110 to 125 mg/dl (6.1 to 6.9 mmol/1) are considered to have impaired fasting glucose. Subjects with plasma glucose at or above 140 mg/dL (7.8 mmol/1), but not over 200 mg/dl (11.1 mmol/1), two hours after a 75 g oral glucose load are considered to have impaired glucose tolerance.
Predisposition for diabetes: A subject that is at high risk for developing diabetes. A number of risk factors are known to those of skill in the art and include: genetic factors (e.g., carrying alleles that result in a higher occurrence of diabetes than in the average population or having parents or siblings with diabetes); overweight (e.g., body mass index (BMI) greater or equal to 25 kg/m.sup.2); habitual physical inactivity, race/ethnicity (e.g., African-American, Hispanic-American, Native Americans, Asian-Americans, Pacific Islanders); previously identified impaired fasting glucose or impaired glucose tolerance, hypertension (e.g., greater or equal to 140/90 mmHg in adults); HDL cholesterol greater or equal to 35 mg/dl; triglyceride levels greater or equal to 250 mg/dl; a history of gestational diabetes or delivery of a baby over nine pounds; and/or polycystic ovary syndrome. See, e.g., "Report of the Expert Committee on the Diagnosis and Classification of Diabetes Mellitus" and "Screening for Diabetes" Diabetes Care 25(1): S5-S24 (2002).
Polypeptide: A polymer in which the monomers are amino acid residues that are joined together through amide bonds. When the amino acids are alpha-amino acids, either the L-optical isomer or the D- optical isomer can be used, the L-isomers being preferred. The terms "polypeptide" or “protein” as used herein is intended to encompass any amino acid sequence and include modified sequences such as glycoproteins. The term “polypeptide” is specifically intended to cover naturally occurring proteins, as well as those that are recombinantly or synthetically produced.
The term “polypeptide fragment” refers to a portion of a polypeptide which exhibits at least one useful epitope. The term “functional fragments of a polypeptide” refers to all fragments of a polypeptide that retain an activity of the polypeptide. Biologically functional fragments, for example, can vary in size from a polypeptide fragment as small as an epitope capable of binding an antibody molecule to a large polypeptide capable of participating in the characteristic induction or programming of phenotypic changes within a cell. An “epitope” is a region of a polypeptide capable of binding an immunoglobulin generated in response to contact with an antigen. Thus, smaller peptides containing the biological activity of insulin, or conservative variants of the insulin, are thus included as being of use.
The term "soluble" refers to a form of a polypeptide that is not inserted into a cell membrane.
The term “substantially purified polypeptide” as used herein refers to a polypeptide which is substantially free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated. In one aspect, the polypeptide is at least 50%, for example at least 80% free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated. In another aspect, the polypeptide is at least 90% free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated. In yet another aspect, the polypeptide is at least 95% free of other proteins, lipids, carbohydrates or other materials with which it is naturally associated.
Conservative substitutions replace one amino acid with another amino acid that is similar in size, hydrophobicity, etc. Variations in the cDNA sequence that result in amino acid changes, whether conservative or not, should be minimized in order to preserve the functional and immunologic identity of the encoded protein. The immunologic identity of the protein may be assessed by determining if it is recognized by an antibody; a variant that is recognized by such an antibody is immunologically conserved. Any cDNA sequence variant will preferably introduce no more than twenty, and preferably fewer than ten amino acid substitutions into the encoded polypeptide. Variant amino acid sequences may, for example, be 80, 90 or even 95% or 98% identical to the native amino acid sequence.
Polynucleotide: A nucleic acid sequence (such as a linear sequence) of any length. Therefore, a polynucleotide includes oligonucleotides, and also gene sequences found in chromosomes. An “oligonucleotide” is a plurality of joined nucleotides joined by native phosphodiester bonds. An oligonucleotide is a polynucleotide of between 6 and 300 nucleotides in length. An oligonucleotide analog refers to moieties that function similarly to oligonucleotides but have non-naturally occurring portions. For example, oligonucleotide analogs can contain non-naturally occurring portions, such as altered sugar moieties or inter-sugar linkages, such as a phosphorothioate oligodeoxynucleotide. Functional analogs of naturally occurring polynucleotides can bind to RNA or DNA, and include peptide nucleic acid (PNA) molecules.
Preventing, treating or ameliorating a disease: “Preventing” a disease (such as Type 1 diabetes) refers to inhibiting the full development of a disease in a subject with a pre-disposition to develop the disease. “Treating” refers to a therapeutic intervention that ameliorates a sign or symptom of a disease or pathological condition after it has begun to develop. “Ameliorating” refers to the reduction in the number or severity of signs or symptoms of a disease.
Promoter: A promoter is an array of nucleic acid control sequences which direct transcription of a nucleic acid. A promoter includes necessary nucleic acid sequences near the start site of transcription, such as, in the case of a polymerase II type promoter, a TATA element. A promoter can optionally include distal enhancer or repressor elements which can be located as much as several thousand base pairs from the start site of transcription. Also included arc those promoter elements which arc sufficient to render promoterdependent gene expression controllable for cell-type specific, tissue-specific, or inducible by external signals or agents; such elements may be located in the 5' or 3' regions of the gene. Both constitutive and inducible promoters are included (see for example, Bitter et al., Methods in Enzymology 153:516-544, 1987).
Purified: The term “purified” does not require absolute purity; rather, it is intended as a relative term. Thus, for example, a purified peptide, protein, virus, or other active compound is one that is isolated in whole or in part from naturally associated proteins and other contaminants. In certain aspects, the term “substantially purified” refers to a peptide, protein, virus or other active compound that has been isolated from a cell, cell culture medium, or other crude preparation and subjected to fractionation to remove various components of the initial preparation, such as proteins, cellular debris, and other components.
Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally occurring or has a sequence that is made by an artificial combination of two otherwise separated segments of sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, by the artificial manipulation of isolated segments of nucleic acids, such as by genetic engineering techniques. A recombinant nucleic acid can include a nucleic acid that has a non-coding function (such as a promoter. origin of replication, ribosome-binding site, etc.). A recombinant protein is one encoded for by a recombinant nucleic acid molecule. In addition, a recombinant virus is a virus comprising sequence (such as genomic sequence) that is non-naturally occurring or made by artificial combination of at least two sequences of different origin. The term “recombinant” also includes nucleic acids, proteins and viruses that have been altered solely by addition, substitution, or deletion of a portion of a natural nucleic acid molecule, protein or virus. As used herein, “recombinant AAV” refers to an AAV particle in which a recombinant nucleic acid molecule (such as a recombinant nucleic acid molecule comprising a MafB promoter and encoding PDX1 and MafA) has been packaged. A host cell that comprises the recombinant nucleic acid is referred to as a “recombinant host cell.”
Selectable Marker: A gene, RNA, or protein that when expressed, confers upon cells a selectable phenotype, such as resistance to a cytotoxic or cytostatic agent (e.g., antibiotic resistance), nutritional prototrophy, or expression of a particular protein that can be used as a basis to distinguish cells that express the protein from cells that do not. Proteins whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers") constitute a subset of selectable markers. The presence of a selectable marker linked to expression control elements native to a gene that is normally expressed selectively or exclusively in pluripotent cells makes it possible to identify and select specific cells of interest. A variety of selectable marker genes can be used, such as neomycin resistance gene (neo), puromycin resistance gene (puro), guanine phosphoribosyl transferase (gpt), dihydrofolate reductase (DHFR), adenosine deaminase (ada), puromycin-N-acetyltransf erase (PAC), hygromycin resistance gene (hyg), multidrug resistance gene (mdr), thymidine kinase (TK), hypoxanthine-guanine phosphoribosyltransferase (HPRT), and hisD gene. Detectable markers include green fluorescent protein (GFP) blue, sapphire, yellow, red, orange, and cyan fluorescent proteins and variants of any of these. Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also of use. The term "selectable marker” as used herein can refer to a gene or to an expression product of the gene, e.g., an encoded protein.
Sequence identity of amino acid sequences: The similarity between amino acid sequences is expressed in terms of the similarity between the sequences, otherwise referred to as sequence identity. Sequence identity is frequently measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are. Homologs or variants of a polypeptide will possess a relatively high degree of sequence identity when aligned using standard methods.
Methods of alignment of sequences for comparison are well known in the art. Various programs and alignment algorithms are described in: Smith and Waterman, Adv. Appl. Math. 2:482, 1981; Needleman and Wunsch, J. Mol. Biol. 48:443, 1970; Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85:2444, 1988; Higgins and Sharp, Gene 73:237, 1988; Higgins and Sharp, CABIOS 5: 151, 1989; Corpet et al.. Nucleic Acids Research 16: 10881, 1988; and Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85:2444, 1988. Altschul et al., Nature Genet. 6: 119, 1994, presents a detailed consideration of sequence alignment methods and homology calculations. The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al., J. Mol. Biol. 215:403, 1990) is available from several sources, including the National Center for Biotechnology Information (NCBI, Bethesda, MD) and on the internet, for use in connection with the sequence analysis programs blastp, blastn, blastx, tblastn and tblastx. A description of how to determine sequence identity using this program is available on the NCBI website on the internet.
Homologs and variants of proteins, such as MafA or PDX1 are typically characterized by possession of at least about 75%, for example at least about 80%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity counted over the full length alignment with the amino acid sequence of the antibody using the NCBI Blast 2.0, gapped blastp set to default parameters. For comparisons of amino acid sequences of greater than about 30 amino acids, the Blast 2 sequences function is employed using the default BLOSUM62 matrix set to default parameters, (gap existence cost of 11, and a per residue gap cost of 1). When aligning short peptides (fewer than around 30 amino acids), the alignment should be performed using the Blast 2 sequences function, employing the PAM30 matrix set to default parameters (open gap 9, extension gap 1 penalties). Proteins with even greater similarity to the reference sequences will show increasing percentage identities when assessed by this method, such as at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity. When less than the entire sequence is being compared for sequence identity, homologs and variants will typically possess at least 80% sequence identity over short windows of 10-20 amino acids, and may possess sequence identities of at least 85% or at least 90% or 95% depending on their similarity to the reference sequence. Methods for determining sequence identity over such short windows are available at the NCBI website on the internet. One of skill in the art will appreciate that these sequence identity ranges are provided for guidance only; it is entirely possible that strongly significant homologs could be obtained that fall outside of the ranges provided.
Specific binding agent: An agent that binds substantially only to a defined target. Thus, a 0 cell specific binding agent is an agent that binds substantially to a 0 cell, and a pancreatic endocrine cell specific binding agent is an agent that binds substantially only to pancreatic endocrine cells or a subset thereof (and not to pancreatic exocrine cells). Similarly, a pancreatic exocrine cell specific binding agent is an agent that binds substantially to exocrine cells. In one aspect, the specific binding agent is a monoclonal or polyclonal antibody that specifically binds a type of pancreatic cell.
The term "specifically binds" refers, with respect to a cell, such as a pancreatic endocrine cell, to the preferential association of an antibody or other ligand, in whole or part, with a cell or tissue bearing that antigen and not to cells or tissues lacking that antigen. It is, of course, recognized that a certain degree of non-specific interaction may occur between a molecule and a non-target cell or tissue. Nevertheless, specific binding may be distinguished as mediated through specific recognition of the antigen. Although selectively reactive antibodies bind antigen, they may do so with low affinity. On the other hand, specific binding results in a much stronger association between the antibody (or other ligand) and cells bearing the antigen than between the bound antibody (or other ligand) and cells lacking the antigen. Specific binding typically results in greater than 2-fold, such as greater than 5-fold, greater than 10-fold, or greater than 100- fold increase in amount of bound antibody or other ligand (per unit time) to a cell or tissue expressing the target epitope as compared to a cell or tissue lacking this epitope. Specific binding to a protein under such conditions requires an antibody that is selected for its specificity for a particular protein. A variety of immunoassay formats are appropriate for selecting antibodies or other ligands specifically immunoreactive with a particular protein. For example, solid-phase ELISA immunoassays are routinely used to select monoclonal antibodies specifically immunoreactive with a protein. See Harlow & Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York (1988), for a description of immunoassay formats and conditions that can be used to determine specific immunoreactivity.
Subject: Any mammal, such as humans, non-human primates, pigs, sheep, cows, rodents and the like which is to be the recipient of the particular treatment. In two non-limiting examples, a subject is a human subject or a murine subject.
Therapeutic agent: Used in a generic sense, it includes treating agents, prophylactic agents, and replacement agents. A therapeutic agent can be a nucleic acid molecule encoding MafA and PDX-1, or a vector encoding these factors.
Therapeutically effective amount or Effective amount: A quantity of a specified pharmaceutical or therapeutic agent (e.g., a recombinant AAV) sufficient to achieve a desired effect in a subject, or in a cell, being treated with the agent, such as increasing insulin production. The effective amount of the agent will be dependent on several factors, including, but not limited to the subject or cells being treated, and the manner of administration of the therapeutic composition.
Transdifferentiate: A process wherein a differentiated cell of one type, such as an alpha cell, differentiates into a differentiated cell of another type, such as a beta cells.
Transduced and Transformed: A virus or vector “transduces” a cell when it transfers nucleic acid into the cell. A cell is “transformed” or “transfected” by a nucleic acid transduced into the cell when the DNA becomes stably replicated by the cell, either by incorporation of the nucleic acid into the cellular genome, or by episomal replication.
Numerous methods of transfection are known to those skilled in the art, such as: chemical methods (e.g., calcium-phosphate transfection), physical methods (e.g., electroporation, microinjection, particle bombardment), fusion (e.g., liposomes), receptor-mediated endocytosis (e.g., DNA-protein complexes, viral envelope/capsid-DNA complexes) and by biological infection by viruses such as recombinant viruses. Methods for the introduction of genes into the pancreatic endocrine cells are known (e.g. see U.S. Patent No. 6,110,743). These methods can be used to transduce a pancreatic endocrine cell produced by the methods described herein, or an artificial islet produced by the methods described herein.
Genetic modification of the target cell is an indicium of successful transfection. "Genetically modified cells" refers to cells whose genotypes have been altered as a result of cellular uptakes of exogenous nucleotide sequence by transfection. A reference to a transfected cell or a genetically modified cell includes both the particular cell into which a vector or polynucleotide is introduced and progeny of that cell.
Transgene: An exogenous gene supplied by a vector, such as AAV. Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in the host cell, such as an origin of replication. A vector may also include one or more therapeutic genes and/or selectable marker genes and other genetic elements known in the art. A vector can transduce, transform or infect a cell, thereby causing the cell to express nucleic acids and/or proteins other than those native to the cell. A vector optionally includes materials to aid in achieving entry of the nucleic acid into the cell, such as a viral particle, liposome, protein coating or the like. In some aspects herein, the vector is an AAV vector.
Although many methods and materials similar or equivalent to those described herein can be used, particular suitable methods and materials are described below. In case of conflict, the present specification, including explanations of terms, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
Overview of Several Aspects
MafB promoters are disclosed herein that can be linked to a nucleic acid molecule encoding a heterologous protein, and used to treat diabetes in a subject.
In some aspects, disclosed is a recombinant nucleic acid molecule comprising a MafB promoter operably linked to a nucleic acid molecule encoding a heterologous protein, wherein the MafB promoter consists essentially of: a) a nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter; or b) the nucleotide sequence of SEQ ID NO: 1. In further aspects, the MafB promoter consists of a) the nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter; or b) the nucleotide sequence of SEQ ID NO: 1.
Vectors arc also disclosed that include the comprising the recombinant nucleic acid molecule. In some aspects, the vector is a viral vector. The viral vector can be a lentivirus vector, an adenovirus vector or an adeno-associated virus (AAV) vector, such as, but not limited to, an AAV6 vector.
In some aspects, the MafB promoter is operably linked to a nucleic acid molecule encoding MafA and/or a nucleic acid molecule encoding PDX1. The MafB promoter can also be operably linked to a nucleic acid molecule encoding MafB. In specific non-limiting examples, the recombinant nucleic acid molecule includes both the nucleic acid molecule encoding MafA and the nucleic acid molecule encoding PDX1, and includes a connector between the nucleic acid molecule encoding MafA and the nucleic acid molecule encoding PDX1. In a specific non-limiting example, the connector is a 2 A connector.
In more aspects, provided are host cells transformed with the disclosed recombinant nucleic acid molecule or the disclosed vectors. The host cell can be a mammalian host cell, such as a human host cell or a non-human primate host cell. The host cell can be an alpha cell, such as a human alpha cell.
In further aspects, methods are disclosed for producing a heterologous protein in a host cell that include transforming a host cell with an effective amount of a disclosed vector, thereby producing the heterologous protein in the host cell. The host cell can be a mammalian host cell, such as a human host cell or a non-human primate host cell.
In some aspects, compositions are provided that include an effective amount of a disclosed recombinant nucleic acid molecule or the vector and a pharmaceutically acceptable carrier. These compositions can also include a contrast dye for endoscopic retrograde cholangiopancreatography. The contrast dye can be a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye. The contrast dye can be lopromid, loglicinate, or loxaglinate.
Methods are also disclosed for producing pancreatic beta cells in a subject, that include administering to the subject a composition comprising an effective amount of a disclosed vector, wherein the MafB promoter is operably linked to the nucleic acid molecule encoding PDX1 and a nucleic acid molecule encoding MafA, wherein the vector does not encode Neurogenin 3 (Ngn3), wherein the subject is not administered any other nucleic acid encoding Ngn3, wherein the vector is administered intraductally into a pancreatic duct of the subject. These methods transduce alpha cells to transdifferentiate into pancreatic beta cells in the subject. In some aspects, the MafB promoter is further operably linked to a nucleic acid molecule encoding MafB.
In more aspects, the method includes the use of endoscopic retrograde cholangiopancreatography (ERCP). In further aspects, the subject is not administered an immunosuppressive agent. In other aspects, the subject is administered an immunosuppressive agent. In more aspects, the subject is a human. In some aspects, the subject has Type 1 diabetes. In yet other aspects, the composition further includes a contrast dye for endoscopic retrograde cholangiopancreatography. The contrast dye can be a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high-viscosity dye. The contrast dye can be lopromid, loglicinate, or loxaglinate.
Compositions are provided that include a disclosed vector, wherein the MafB promoter is operably linked to the nucleic acid molecule encoding PDX1 and the nucleic acid molecule encoding MafaA. These compositions are of use for treating diabetes in a subject.
Promoters, Recombinant Nucleic Acid Molecules, and Vectors
Disclosed herein are MafB promoters that include a portion of the full-length MafB promoter.
An exemplary human MafB promoter (348 nucleotides) consists of, or consists essentially of: GGACCTGCAGGGTACGGGGGTGGGCCCAGGCTGGCCGTCCCACTCCGCTCCAGCGCTCCGCCAGGCCTCCGC CGCTCCGGGGCCGCTCGGGCCGCCAGTCAGCTGACGCGGGGGGCGGGGGAGCTGTCAGGCGCGCCCCGCCCT GCGCCGCTGGGCCGCGGAGGCCGTGCAGCTATTGGCCCGCGCACCGGGCCGCCCGGGCCCCCGCACCCCAGT GACATCAGGAGGCGATAAAAGGCTGCGGCGCCGCCGGATCCAGCACAGCTGCACCGCCGAGCTGCGAGCGGC TGCGAGCGAGAGAGCGTAAGAGCAAGAGAGCTAGAGAGCGAGCAACGGGCACTCGCCCCA ( SEQ ID NO : 1 )
One of skill in the art will readily appreciate that variants of this promoter is of use, such as promoters at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1 provided the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell, such as an alpha cell. In additional aspects, the promoter can include at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1, provided the promoter functions, such that a heterologous nucleic acid operably linked to the promoter can be expressed when transferred into a host cell.
In some aspects, the variants of the promoter of SEQ ID NO: 1 are about 360 to about 340 nucleotides in length, such as 348 nucleotides in length. In some non-limiting examples, the promoter is at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1 and 360, 355, 350, or 348 nucleotides in length, and the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell that expresses glucagon. In some non-limiting examples, the promoter includes a nucleotide sequence identical to SEQ ID NO: 1 and is 360, 355, 350, or 348 nucleotides in length, and the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell that expresses glucagon. In additional aspects, the promoter includes at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1, and is 348 nucleotides in length, and the promoter functions, such that a heterologous nucleic acid operably linked to the promoter can be expressed when transferred into a host cell. In aspects, the promoter is 348 nucleotides in length. In other aspects, the promoter is 350 nucleotides in length. In more aspects, the promoter is 355 nucleotides in length. In further aspects, the promoter is 360 nucleotides in length. Other lengths are also of use, such as 349, 351, 352, 353, 354, 356, 357, 358 and 359 nucleotides. The promoter can be, for example, 348 to 360 nucleotides in length, such as 348 to 355 nucleotides in length or 348 to 350 nucleotides in length.
In some aspects, the variant of the promoter includes CCAGGCCTC. In other aspects, the variant of the promoter includes CCAGGCCTC. In further aspects, the variant of the promoter includes AGTCAGCTGAC (SEQ ID NO: 12). In more aspects, the variant of the promoter includes GGGGGCGGG. In yet other aspects, the variant of the promoter includes CCCCGCCC.
In some aspects, the variant of the promoter includes two of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In more aspects, the variant of the promoter includes three of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In other aspects, the variant of the promoter includes four of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In more aspects, the variant of the promoter includes all of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In some aspects, the promoters is at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1 and includes one, two, three, four, or all five of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In more aspects, the promoters is at least 95%, 96%, 97%, 98%, 99% identical to SEQ ID NO: 1, one, two, three, four, or all five of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC, and is 360, 355, 350, or 348 nucleotides in length.
In additional aspects, the promoter includes at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1, wherein the nucleic acid substitutions are not in one, two, three, four, or all five of CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC. In some examples, the promoter includes at most 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1 nucleic acid substitutions in SEQ ID NO: 1 wherein the nucleic acid substitutions are not in CCAGGCCTC, CCAGGCCTC, SEQ ID NO: 12, GGGGGCGGG, and CCCCGCCC.
Additional nucleotides can be added, provided the promoter functions, such that a heterologous nucleic acid operably linked to the promoter is expressed when transferred into a host cell. The promoter can include the nucleic acid sequence set forth as SEQ ID NO: 1 and an additional 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides of the full-length human MafB promoter.
In some aspects, another nucleic acid sequence can be linked to the promoter, such as an enhancer. The promoter can be linked to a Kozak sequence. The promoter can be linked to an intron between a promoter and a Kozak sequence, wherein the intron works as an enhancer.
In other aspects, disclosed promoters can be operably linked to a heterologous nucleic acid molecule, such as a nucleic acid encoding a heterologous protein. In some aspects, the heterologous protein is PDX1 or MafA. In further aspects, the heterologous nucleic acid encodes PDX1 and MafA. In yet other aspects, the heterologous nucleic acid encodes PDX1 and/or MafA and does not encode Ngn3. In more aspects, the nucleic acid molecule also encodes MafB.
The heterologous nucleic acid can encode a selectable marker, which includes, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers"). There are other nucleic acid molecules of use, such as a nucleic acid molecule that encodes drug resistance or provides a function that can be used to purify cells. Selectable markers include neomycin resistance gene (neo), puromycin resistance gene (puro), guanine phosphoribosyl transferase (gpt), dihydrofolate reductase (DHFR), adenosine deaminase (ada), puromycin-N-acetyltransferase (PAC), hygromycin resistance gene (hyg), multidrug resistance gene (mdr), thymidine kinase (TK), hypoxanthine- guanine phosphoribosyltransferase (HPRT), and hisD gene. Detectable markers include green fluorescent protein (GFP) blue, sapphire, yellow, red, orange, and cyan fluorescent proteins and variants of any of these. Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also selectable makers.
In some aspects, the promoter functions such that PDX1 and/or MafA transcripts are produced. In more aspects, the promoter functions such that MafB is also produced.
In specific non-limiting examples, the promoter is operably linked to a nucleic acid molecule encoding PDX1 and a nucleic acid molecule encoding MafA, but is not operably linked to a nucleic acid molecule encoding Ngn3. In specific non-limiting examples, the promoter is operably linked to a nucleic acid encoding PDX1 and a nucleic acid encoding MafA, and a nucleic acid molecule encoding MafB, but is not operably linked to a nucleic acid encoding Ngn3. U.S. Published Patent Application No. 2011/0280842, incorporated by reference herein, provides PDX1, MafA and Ngn3 amino acid and nucleic acid sequences.
In some aspects, the promoter is operably linked to a heterologous nucleic acid molecule encoding MafA, such as the amino acid sequence set forth as: MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCSSVPSSPSF CAPSPGTGGGGGAGGGGGSSQAGGAPGPPSGGPGAVGGTSGKPALEDLYWMSGYQHHLNPEALN LTPEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGPGFAGGGGADDMGAGHHHGAHHAAHHH HAAHHHHHHHHHHGGAGHGGGAGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVIRLKQKR RTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLAGRGGPG SAGGAGFPREPSPPQAGPGGAKGTADFFL
(human MafA, SEQ ID NO: 2, GENBANK® Accession No. NP_963883.2, May 10, 2014, incorporated herein by reference), or
MAAELAMGAELPSSPLAIEYVNDFDLMKFEVKKEPPEAERFCHRLPPGSLSSTPLSTPCSSVPSSPSF CAPSPGTGGGAGGGGSAAQAGGAPGPPSGGPGTVGGASGKAVLEDLYWMSGYQHHLNPEALNLT PEDAVEALIGSGHHGAHHGAHHPAAAAAYEAFRGQSFAGGGGADDMGAGHHHGAHHTAHHHHS AHHHHHHHHHHGGSGHHGGGAGHGGGGAGHHVRLEERFSDDQLVSMSVRELNRQLRGFSKEEVI RLKQKRRTLKNRGYAQSCRFKRVQQRHILESEKCQLQSQVEQLKLEVGRLAKERDLYKEKYEKLA GRGGPGGAGGAGFPREPSPAQAGPGAAKGAPDFFL
(mouse MafA, SEQ ID NO: 3, GENBANK® Accession No. NP_919331, April 26, 2014, incorporated herein by reference. MafA is a beta cell specific and glucose regulated transcription factor for insulin gene expression.
A disclosed promoter can be operably linked to a heterologous nucleic acid molecule encoding a MafA protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 2 or SEQ ID NO: 3, wherein the protein functions as a transcription factor. The heterologous nucleic acid molecule can encode a MafA protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 2 or SEQ ID NO: 3, wherein the protein functions as a transcription factor. The disclosed promoter can be operably linked to a nucleic acid molecule that encodes a protein that has the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 3.
In some aspects, a disclosed promoter can be operably linked to a heterologous nucleic acid molecule encoding a human PDX1 amino acid sequence including the amino acid sequence set forth as: MNGEEQYYAATQLYKDPCAFQRGPAPEFSASPPACLYMGRQPPPPPPHPFPGALGALEQGSPPDISP YEVPPLADDPAVAHLHHHLPAQLALPHPPAGPFPEGAEPGVLEEPNRVQLPFPWMKSTKAHAWKG QWAGGAYAAEPEENKRTRTAYTRAQLLELEKEFLFNKYISRPRRVELAVMLNLTERHIKIWFQNRR MKWKKEEDKKRGGGTAVGGGGVAEPEQDCAVTSGEELLALPPPPPPGGAVPPAAPVAAREGRLPP GLSASPQPSSVAPRRPQEPR
(human PDX1, SEQ ID NO: 4, GENBANK Accession No. NP_000200.1, March 15, 2015, incorporated herein by reference), or
MNSEEQYYAATQLYKDPCAFQRGPVPEFSANPPACLYMGRQPPPPPPPQFTSSLGSLEQGSPPDISPY EVPPLASDDPAGAHLHHHLPAQLGLAHPPPGPFPNGTEPGGLEEPNRVQLPFPWMKSTKAHAWKG QWAGGAYTAEPEENKRTRTAYTRAQLLELEKEFLFNKYISRPRRVELAVMLNLTERHIKIWFQNRR MKWKKEEDKKRSSGTPSGGGGGEEPEQDCAVTSGEELLAVPPLPPPGGAVPPGVPAAVREGLLPSG LSVSPQPSSIAPLRPQEPR
(mouse PDX1, SEQ ID NO: 5, GENBANK Accession No: NM_008814.3, February 15, 2015, incorporated herein by reference). PDX1 is a transcriptional activator of several genes, including insulin, somatostatin, glucokinase, islet amyloid polypeptide, and glucose transporter type 2.
The heterologous nucleic acid molecule can encode a PDX1 protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 4 or SEQ ID NO: 5, wherein the protein functions as a transcription factor. The heterologous nucleic acid molecule can encode a PDX1 protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 4 or SEQ ID NO: 5, wherein the protein functions as a transcription factor. The disclosed promoter can be operably linked to a nucleic acid molecule that encodes a protein that has the amino acid sequence of SEQ ID NO: 4 or SEQ ID NO: 5.
The heterologous nucleic acid molecule can encode both PDX1 and MafA. In some aspects, the heterologous nucleic acid molecules encoding PDX1 and MafA are separated by a connector. In specific non-limiting examples, the connector is 2A. The nucleic acid sequence of the 2A connector is shown below: CGCGCCAAGCGCGGCTCCGGCGCCACCAACTTCTCCCTGCTGAAGCAG (SEQ ID NO: 6). Nucleic acid sequences of additional exemplary connectors are: CGCGCCAAGCGCGGCTCCGGCCAGTGCACCAACTACGCCCTGCTGAAGCTGGCCGGCGACGTG GAGTCCAACCCCGGCCCC (SEQ ID NO: 7); and CGCGCCAAGCGCGGCTCCGGCGAGGGCAGAGGAAGTCTTCTAACATGCGGTGACGTGGAGGA GAATCCCGGCCCT (SEQ ID NO: 8). Suitable connectors also include a nucleic acid sequence with at most 1, 2, 3, 4, or 5 substitutions in one of SEQ ID NO: 6-8. Suitable connectors arc, for example, 40 to 90 nucleotides in length, such as 45 to 85 nucleotides in length, such as 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 nucleotides in length. These connectors can be included in a heterologous nucleic acid molecule between a nucleic acid molecule encoding MafA and a nucleic acid molecule encoding PDX1. The nucleic acid molecule encoding MafA can be 5' to the nucleic acid molecule encoding PDX1. The nucleic acid molecule encoding MafA can be 3’ to the nucleic acid molecule encoding PDX1.
In some aspects, a disclosed promoter further can be operably linked to a heterologous nucleic acid molecule encoding a MafB amino acid sequence including the amino acid sequence set forth as: MAAELSMGPELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPVGSVSSTPLSTPCSSVPS SPS FSPTEQKTHLEDLYWMASNYQQMNPEALNLTPEDAVEAL IGSHPVPQPLQSFDSFRGAHHHHHHHHPHPHHA YPGAGVAHDELGPHAHPHHHHHHQASPPP SSAASPAQQLPTSHPGPGPHATASATAAGGNGSVEDRFSDDQL VSMSVRELNRHLRGFTKDEVIRLKHKRRTLKNRGYAQSCRYKRVQQKHHLENEKTQL IQQVEQLKQEVSRLA RERDAYKVKCEKLANSGFREAGSTSDSP S SPEFFL
(human MafB, SEQ ID NO: 9, GENBANK® Accession No. AF134157.1, August 31, 1999, incorporated herein by reference), or MAAELSMGQELPTSPLAMEYVNDFDLLKFDVKKEPLGRAERPGRPCTRLQPAGSVSSTPLSTPCS SVPS SPS FSPTEPKTHLEDLYWMASNYQQMNPEALNLTPEDAVEAL IGSHPVPQPLQSFDGFRSAHHHHHHHHPHPHHG YPGAGVTHDDLGQHAHPHHHHHHQASPPP SSAASPAQQLPTSHPGPGPHATAAATAAGGNGSVEDRFSDDQL VSMSVRELNRHLRGFTKDEVIRLKQKRRTLKNRGYAQSCRYKRVQQKHHLENEKTQL IQQVEQLKQEVSRLA RERDAYKVKCEKLANSGFREAGSTSDSP S SPEFFL
(mouse MafB, SEQ ID NO: 10, GENBANK® Accession No: NM_010658.3, December 21, 2022, incorporated herein by reference).
In some aspects, the heterologous nucleic acid molecule encodes PDX1 and MafA, and optionally MafB, but does not encode Ngn3, for example, the Ngn3 protein of GENBANK® Accession No: NM 009719 (mouse), February 15, 2015 and GENBANK® Accession No: NP 033849.3 (Human), February 15, 2015. An exemplary Ngn3 protein is shown below: MAPHPLDALT1QVSPETQQPFPGASDHEVLSSNSTPPSPTLIPRDCSEAEVGDCRGTSRKLRARRGGR NRPKSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTKIETLRFAHNYIW ALTQTLR IADHSFYGPE PPVPCGELGS PGGGSNGDWG SIYSPVSQAGNLSPTASLEEFPGLQVPSSP SYLLPGALVF SDFL (SEQ ID NO: 11)
Thus, in some aspects, the heterologous nucleic acid molecule does not encode a protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 11, wherein the protein functions as a transcription factor. The heterologous nucleic acid molecule does not encode a protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 11, wherein the protein functions as a transcription factor.
However, in some aspects, the heterologous nucleic acid molecule encodes MafA and PDX1, and encodes Ngn3, or just encodes Ngn3. Thus, in some aspects, the heterologous nucleic acid molecule encodes a protein that has an amino acid sequence at least 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 11, wherein the protein functions as a transcription factor. The heterologous nucleic acid molecule can encode a protein that includes at most 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions in SEQ ID NO: 11, wherein the protein functions as a transcription factor. The heterologous nucleic acid molecule can encode a protein with the amino acid sequence of SEQ ID NO: 11.
A connector can be included between nucleic acid molecules encoding proteins. The connector can be a 2A connection. In aspects, the vector can include a connector, such as T2A or P2A. Exemplary nucleic acid sequences for these connectors are provided as SEQ ID NO: 16 and SEQ ID NO: 18, respectively.
It may be of use to include a polyadenylation signal to effect proper termination and polyadenylation of the gene transcript. Exemplary polyadenylation signals have been isolated from beta globin, bovine growth hormone, SV40, and the herpes simplex virus thymidine kinase genes. An exemplary nucleic acid sequence for a poly adenylation signal is provided as SEQ ID NO: 21.
Also disclosed arc a vector including the promoter and the heterologous nucleic acid. Optionally, other transcription control sequences, such as one or more enhancer elements, which are binding recognition sites for one or more transcription factors that increase transcription above that observed for the promoter alone, can be included in the vector. Procedures for producing vectors and cloning can be found, for example, in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 4th ed, Cold Spring Harbor, New York, 2012) and Ausubel et al. (In Current Protocols in Molecular Biology, John Wiley & Sons, New York, through supplement 104, 2013.
Additional regulatory elements can be included, such as, but not limited to, enhancer elements. The additional regulatory element can be the WPRE, see for example, SEQ ID NO: 20.
Vectors are disclosed herein that include one or more of SEQ ID NOs: 13-20. Vectors include, but are not limited to, Tina-055 and Tina-059, as shown in TIGS. 7 and 8, respectively.
Numerous viral vectors are known in the art, including polyoma; SV40 (Madzak et al., 1992, J. Gen. Virol., 73:15331536); adenovirus (Berkner, 1992, Cur. Top. Microbiol. Immunol., 158:39-6; Berliner et al., 1988, Bio Techniques, 6:616-629; Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et al., 1992, Proc. Nad. Acad. Sci. USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 68:143-155; Wilkinson et al., 1992, Nucl. Acids Res., 20:2233-2239; Stratford-Perricaudet et al., 1990, Hum. Gene Then, 1:241-256); vaccinia virus (Mackett et al., 1992, Biotechnology, 24:495-499); adeno-associated virus (Muzyczka, 1992, Curr. Top. Microbiol. Immunol., 158:91-123; On et al., 1990, Gene, 89:279-282); herpes viruses, including HSV and EBV (Margolskee, 1992, Curr. Top. Microbiol. Immunol., 158:67-90; Johnson et al., 1992, J. Virol., 66:29522965; Fink et al., 1992, Hum. Gene Ther. 3:11-19; Breakfield et al., 1987, Mol. Neurobiol., 1:337- 371; Fresse et al., 1990, Biochem. Pharmacol., 40:2189-2199); Sindbis viruses (H. Herweijer et al., 1995, Human Gene Therapy 6:1161-1167; U.S. Pat. Nos. 5,091,309 and 5,2217,879); alphaviruses (S. Schlesinger, 1993, Trends Biotechnol. 11:18-22; I. Frolov et al., 1996, Proc. Natl. Acad. Sci. USA 93:11371-11377); and retroviruses of avian (Brandyopadhyay et al., 1984, Mol. Cell Biol., 4:749-754; Pctropouplos et al., 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol. Immunol., 158:1-24; Miller et al., 1985, Mol. Cell Biol., 5:431-437; Sorge et al., 1984, Mol. Cell Biol., 4:1730-1737; Mann et al., 1985, J. Virol., 54:401-407), and human origin (Page et al., 1990, J. Virol., 64:5370-5276; Buchschalcher et al., 1992, J. Virol., 66:2731-2739).
The vector can be a viral vector. Suitable vectors include retrovirus vectors, orthopox vectors, avipox vectors, fowlpox vectors, capripox vectors, suipox vectors, adenoviral vectors, herpes virus vectors, alpha virus vectors, baculovirus vectors, Sindbis virus vectors, vaccinia virus vectors, lentivirus vectors and poliovirus vectors. Specific exemplary vectors are poxvirus vectors, such as vaccinia virus, fowlpox virus and a highly attenuated vaccinia virus (MVA), adenovirus, baculovirus, yeast, and the like. The vector can be a retroviral vector, a lentiviral vector, an adenovirus vector. Adeno-associated virus vectors (AAV) are disclosed in additional detail below, and are of use in the disclosed methods. The AAV vector can be AAV6.
Defective viruses, that entirely or almost entirely lack viral genes, can be used. Use of defective viral vectors allows for administration to specific cells without concern that the vector can infect other cells. In some examples, the vector is an attenuated adenovirus vector, such as the vector described by Stratford- Perricaudet et aL (J. Clin. Invest., 90:626-630 1992; La Salle et al., Science 259:988-990, 1993); and a defective adeno-associated virus vector (Samulski et al., J. Virol., 61:3096-3101, 1987; Samulski et al., J. Virol., 63:3822-3828, 1989; Lebkowski et al., Mol. Cell. Biol., 8:3988-3996, 1988).
In some aspects, the vector includes a disclosed promoter operably linked to a nucleic acid encoding PDX1 and MafA, and optionally MafB. In more aspects, the vector does not include a nucleic acid encoding Ngn3. The vector can be, for example, a lentivirus vector, an adenovirus vector or an adeno-associated virus (AAV) vector. In some non-limiting examples, the AAV vector is an AAV6 vector. In some non-limiting examples, the AAV vector is an AAVDJ vector. In some non-limiting examples, the AAV vector is an AAKP1 vector.
In some aspects, a vector of use includes a gene encoding a selectable marker, which includes, but are not limited to, a protein whose expression can be readily detected such as a fluorescent or luminescent protein or an enzyme that acts on a substrate to produce a colored, fluorescent, or luminescent substance ("detectable markers"). There are other genes of use, such as genes that encode drug resistance or provide a function that can be used to purify cells. Selectable markers include neomycin resistance gene (neo), puromycin resistance gene (puro), guanine phosphoribosyl transferase (gpt), dihydrofolate reductase (DHFR), adenosine deaminase (ada), puromycin-N-acetyltransferase (PAC), hygromycin resistance gene (hyg), multidrug resistance gene (mdr), thymidine kinase (TK), hypoxanthine-guanine phosphoribosyltransferase (HPRT), and hisD gene. Detectable markers include green fluorescent protein (GFP) blue, sapphire, yellow, red, orange, and cyan fluorescent proteins and variants of any of these. Luminescent proteins such as luciferase (e.g., firefly or Renilla luciferase) are also selectable makers.
The disclosed MafB promoter, operably linked to a heterologous nucleic acid molecule, can be included in adenoviral vectors and/or adeno-associated viral vectors. AAV belongs to the family Parvoviridae and the genus Dependovirus. AAV is a small, non-enveloped virus that packages a linear, single- stranded DNA genome. Both sense and antisense strands of AAV DNA are packaged into AAV capsids with equal frequency. In some aspects the AAV includes a disclosed promoter operably linked to a nucleic acid encoding PDX1 and MafA, but does not include a nucleic acid encoding Ngn3. Further provided are recombinant vectors, such as recombinant adenovirus vectors and recombinant adeno- associated virus vectors comprising a nucleic acid molecule disclosed herein. In some aspects, the AAV is AAV6. In other aspects, the AAV is A A VKP1. In more aspects, the AAV is AAVDJ. However, the AAV serotype can be any other suitable AAV serotype, such as AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9, AAV 10, AAV11 or AAV12, or a hybrid of two or more AAV serotypes (such as, but not limited to AAV2/1, AAN n, AAV2/8 or AAV2/9). In more aspects, the AAV vector can transduce one or more pancreatic cell types, see Published U.S. Patent Application No. US-2022-0354969-A1. The AAV vector can be AAV9, AAVDJ. AAVKP1 , AAV2.7m8, AAV 10, AAVshlO. AAV1, AAV7, AAV218, AAVPHPeB, AAVhu3?, AAVAnc80, AAVrhS, AAVLK03, AAVKP3, AAV2G9. or AAVPHPB.
The AAV genome is characterized by two inverted terminal repeats (ITRs) that flank two open reading frames (ORFs). In the AAV2 genome, for example, the first 125 nucleotides of the ITR are a palindrome, which folds upon itself to maximize base pairing and forms a T-shaped hairpin structure. The other 20 bases of the ITR, called the D sequence, remain unpaired. The ITRs are cis-acting sequences important for AAV DNA replication; the ITR is the origin of replication and serves as a primer for second- strand synthesis by DNA polymerase. The double-stranded DNA formed during this synthesis, which is called replicating-form monomer, is used for a second round of self -priming replication and forms a replicating-form dimer. These double- stranded intermediates are processed via a strand displacement mechanism, resulting in single-stranded DNA used for packaging and double-stranded DNA used for transcription. Located within the ITR are the Rep binding elements and a terminal resolution site (TRS). These features are used by the viral regulatory protein Rep during AAV replication to process the doublestranded intermediates. In addition to their role in AAV replication, the ITR is also essential for AAV genome packaging, transcription, negative regulation under non-permissive conditions, and site-specific integration (Daya and Berns, Clin Microbiol Rev 21(4):583-593, 2008). In some aspects, these elements are included in the AAV vector.
The left ORF of AAV contains the Rep gene, which encodes four proteins - Rep78, Rep 68, Rep52 and Rep40. The right ORF contains the Cap gene, which produces three viral capsid proteins (VP1, VP2 and VP3). The AAV capsid contains 60 viral capsid proteins arranged into an icosahedral symmetry. VP1, VP2 and VP3 are present in a 1: 1:10 molar ratio (Daya and Berns, Clin Microbiol Rev 21(4):583-593, 2008). In some aspects, these elements are included in the AAV vector.
AAV vectors can be used for gene therapy. Exemplary AAV of use are AAV2, AAV5, AAV6, AAV8 and AAV9. In some aspects, the AAV is an AAV6 vector. In other aspects, the AAV is AAVKP1. In more aspects, the AAV is AAVDJ. However, the AAV serotype can be any other suitable AAV serotype, such as AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9, AAV10, AAV11 or AAV12, or a hybrid of two or more AAV serotypes (such as, but not limited to AAV2/1, PJWin, AAV2/8 or AAV2/9). In more aspects, the AAV vector can transduce one or more pancreatic cell types, see Published U.S. Patent Application No. US-2022-0354969-A1. The AAV vector can be AAV9, AAVDJ, AAVK.P1, AAV2.7m8, AAV 10, AAVshlO. AAV1, AAV7, AAV218, AAVPHPeB, AAVhu37, A VAnc80, AAVrhS, AAVLK03, AAVKP3, AAV2G9, or AAVPHPB.
Adenovirus, AAV2 and AAV8 are capable of transducing cells in the pancreas. Thus, any of an AAV2 or AAV8 vector can be used in the methods disclosed herein. However, AAV6 and AAV9 vectors are also of use. In one non-limiting example, the AAV vector is an AAV6 vector.
Although AAV infects humans and some other primate species, it is not known to cause disease and elicits a very mild immune response. Gene therapy vectors that utilize AAV can infect both dividing and quiescent cells and persist in an extrachromosomal state without integrating into the genome of the host cell. Because of the advantageous features of AAV, the present disclosure contemplates the use of an AAV vector including a disclosed promoter for the methods disclosed herein.
AAV possesses several additional desirable features for a gene therapy vector, including the ability to bind and enter target cells, enter the nucleus, the ability to be expressed in the nucleus for a prolonged period of time, and low toxicity. AAV can be used to transfect cells, and suitable vector are known in the art, see for example, U.S. Published Patent Application No. 2014/0037585, incorporated herein by reference. Methods for producing AAV vectors suitable for gene therapy are well known in the art (see, for example, U.S. Published Patent Application Nos. 2012/0100606; 2012/0135515; 2011/0229971; and 2013/0072548; and Ghosh et al., Gene Ther 13(4):321-329, 2006), and can be utilized with the methods disclosed herein.
AAV8 vectors are disclosed, for example, in U.S. Patent No. 8,692,332, which is incorporated by reference herein. An exemplary AAV8 nucleic acid sequence is shown in Fig. 1 and SEQ ID NO: 1 of U.S. Patent No. 8,692,332. It is disclosed that AAV nucleic acid sequence can be greater than about 90%, 95%, 98% or 99% identical to this nucleic acid sequence. The location and sequence of the capsid, rep 68/78, rep 40/52, VP1, VP2 and VP3 are disclosed in this U.S. Patent No. 8,692,332. The location and hypervariable regions of AAV8 are also provided.
The vectors of use in the methods disclosed herein can contain nucleic acid sequences encoding an intact AAV capsid which may be from a single AAV serotype (e.g., AAV2, AAV, 6, AAV8 or AAV9, such as AAV6). As disclosed in U.S. Patent No. 8,692,332, vectors of use also can be recombinant, and thus can contain sequences encoding artificial capsids which contain one or more fragments of the AAV6 capsid fused to heterologous AAV or non-AAV capsid proteins (or fragments thereof). These artificial capsid proteins are selected from non-contiguous portions of the AAV2, AAV8 or AAV9 capsid or from capsids of other AAV serotypes. For example, an AAV vector may have a capsid protein comprising one or more of the AAV8 capsid regions selected from the VP2 and/or VP3, or from VP1, or fragments thereof selected from amino acids 1 to 184, amino acids 199 to 259; amino acids 274 to 446; amino acids 603 to 659; amino acids 670 to 706; amino acids 724 to 738 of the AAV8 capsid, see SEQ ID NO: 2 of U.S. Patent No. 8,692,332. In another example, it may be desirable to alter the start codon of the VP3 protein to GTG. Alternatively, the AAV may contain one or more of the AAV serotype 8 capsid protein hypervariable regions, for example aa 185- 198; aa 260-273; aa447-477; aa495-602; aa660-669; and aa707-723 of the AAV8 capsid set forth in SEQ ID NO: 2 of U.S. Patent No. 8,692,332.
In some aspects, an AAV is generated having an AAV serotype 6 capsid. To produce the vector, a host cell which can be cultured that contains a nucleic acid sequence encoding an adeno-associated virus (AAV) serotype 6 capsid protein, or fragment thereof, as defined herein; a functional rep gene; a minigene composed of, at a minimum, AAV inverted terminal repeats (ITRs) and a transgene, such as a transgene encoding PDX1 and MafA; and sufficient helper functions to permit packaging in the AAV6 capsid protein. The components required to be cultured in the host cell to package an AAV minigene in an AAV capsid may be provided to the host cell in trans. Alternatively, any one or more of the required components (e.g., minigene, rep sequences, cap sequences, and/or helper functions) may be provided by a stable host cell which has been engineered to contain one or more of the required components using methods known to those of skill in the art. In some aspects, a stable host cell will contain the required component(s) under the control of an inducible promoter. However, the required component(s) can be under the control of a constitutive promoter. Examples of suitable inducible and constitutive promoters are provided below. Similar methods can be used to generate an AAV2, AAV8 or AAV9 vector and/or virions.
In still another alternative, a selected stable host cell may contain selected component(s) under the control of a constitutive promoter and other selected component(s) under the control of one or more inducible promoters. For example, a stable host cell may be generated which is derived from 293 cells (which contain El helper functions under the control of a constitutive promoter), but which contains the rep and/or cap proteins under the control of inducible promoters. Still other stable host cells may be generated by one of skill in the art.
The minigene, rep sequences, cap sequences, and helper functions required for producing an AAV can be delivered to the packaging host cell in the form of any genetic element which transfer the sequences carried thereon. The selected genetic element may be delivered by any suitable method, including those described herein. The methods used to construct vectors are known to those with skill in nucleic acid manipulation and include genetic engineering, recombinant engineering, and synthetic techniques. See, e.g., Sambrook et al. (Molecular Cloning: A Laboratory Manual, 4th ed, Cold Spring Harbor, New York, 2012. Similarly, methods of generating AAV virions are known, and the selection of a suitable method is not a limitation on the present invention. See, e.g., K. Fisher et al, J. Virol., 70:520-532 (1993) and U.S. Pat. No. 5,478,745. In some aspects, selected AAV components can be readily isolated using techniques available to those of skill in the art from an AAV serotype, including AAV8. Such AAV may be isolated or obtained from academic, commercial, or public sources (e.g., the American Type Culture Collection, Manassas, Va.). Alternatively, the AAV sequences may be obtained through synthetic or other suitable means by reference to published sequences such as are available in the literature or in databases such as, e.g., GENBANK®.
In some aspects, the adenovirus and AAV vectors disclosed herein include a disclosed promoter operably linked to a nucleic acid encoding PDX1 and MafA. In some aspects, the adenovirus and AAV vectors do not include a nucleic acid encoding Ngn3.
In some aspects, the promoter consists essentially of, or consists of, SEQ ID NO: 1. Thus, in specific examples, the vector includes a heterologous nucleic acid molecule encoding PDX1 and/or MafA, operably linked to this promoter. In other specific examples, the vector includes a nucleic acid molecule encoding MafB, operably linked to this promoter. In additional examples, the vector includes a heterologous nucleotide acid molecule encoding PDX1 and/or MafA, and optionally MafB, operably linked to the promoter, but does not encode Ngn3. In some aspects, the promoter functions such that both PDX1 and/or MafA transcripts are produced. In more aspects, the promoter functions such that both PDX1 and/or MafA transcripts, and MafB transcripts are produced.
In specific non-limiting examples, the promoter functions such that all of PDX1, MafA and MafB transcripts are produced. In more specific non-limiting examples, the promoter is operably linked to a nucleic acid encoding PDX1 and a nucleic acid encoding MafA, but is not operably linked to a nucleic acid encoding Ngn3. In specific non-limiting examples, the promoter is operably linked to a nucleic acid encoding PDX1 and a nucleic acid encoding MafA, and a nucleic acid molecule encoding MafB, but is not operably linked to a nucleic acid encoding Ngn3. A nucleic acid molecule encoding Ngn3 can be absent from the vector, and may not be administered with the vector. In other aspects, a vector including a disclosed MafB promoter and the heterologous nucleic acid, such as an AAV vector, for example and AAV6 vector, does not include a nucleic acid encoding Ngn3.
In some aspects, host cells can be produced that are transformed with these recombinant nucleic acid molecules. These host cells can be mammalian host cells, such as non-human primate, mouse, and human host cells. The host cells can be alpha cells. Methods are disclosed for producing a protein in a host cell, that include transforming the host cell with an effective amount of the vector, thereby producing the protein in the host cell.
The method can include isolating the protein. The heterologous protein can be isolated and purified by standard methods including, but not limited to, chromatography (e.g., ion exchange, affinity, size exclusion, and hydroxyapatite chromatography), gel filtration, centrifugation, or differential solubility, ethanol precipitation, immunoaffinity purification, or by any other available technique for the purification of proteins (See, e.g., Bose K (Ed.), Textbook on Cloning, Expression and Purification of Recombinant Proteins, Sprinter Press, Singapore, 2022; Simpson et al. (Eds.), Basic methods in Protein Purification and Analysis: A Laboratory Manual, New York: Cold Spring Harbor Laboratory Press, 2009, Higgins, S. J. and Hames, B. D. (eds.). Protein Expression: A Practical Approach, Oxford Univ Press, 1999; and Deutscher, M. P., and Rugess, R.R. (eds.), Guide to Protein Purification: Methods in Enzymology (Methods in Enzymology Series, Vol 182), Academic Press, 2009, all incorporated herein by reference).
For immunoaffinity chromatography, the protein can be isolated by binding it to an affinity column comprising a specific binding agent, such as antibodies that were raised against that protein and were affixed to a stationary support. Alternatively, affinity tags such as an influenza coat sequence, poly-histidine, or glutathionc-S-transfcrasc can be attached to the protein by standard recombinant techniques to allow for easy purification by passage over the appropriate affinity column. Protease inhibitors such as phenyl methyl sulfonyl fluoride (PMSF), leupeptin, pepstatin or aprotinin can be added at any or all stages in order to reduce or eliminate degradation of the polypeptide or protein during the purification process. Protease inhibitors can be used when cells must be lysed in order to isolate and purify the expressed polypeptide or protein. One of ordinary skill in the art will appreciate that the exact purification technique will vary depending on the character of the protein to be purified, the type of the cells from which the polypeptide or protein is expressed, and the composition of the medium in which the cells were grown.
Pharmaceutical Compositions and Administration to the Pancreatic Duct
Methods are provided for producing pancreatic beta cells in a subject. The disclosed MafB promoter provides a high level of expression in alpha cells, providing increased transdifferentiation to beta cells. In some aspects, using the MafB promoter, PDX1 and MafA expression can be increased in alpha cells, and optionally MafB, providing increased transdifferentiation into beta cells. In a non-limiting example, as a result of using a vector including the MafB promoter, PDX1 and MafA expression are increased in alpha cells. In another non-limiting example, as a result of using a vector including the MafB promoter, PDX1, MafA, and MafB expression can be increased in alpha cells.
In some aspects, the methods include administering to the subject a vector, such as an adenovirus vector or an AAV vector, encoding heterologous PDX1 and MafA, and optionally MafB. In some aspects, the vector does not include a nucleic acid encoding Ngn3. In further aspects, the subject is not administered any other nucleic acid encoding Ngn3. Exemplary methods are disclosed, for example, in U.S. Patent No. 10,071,172, incorporated herein by reference.
For in vivo delivery, a vector, such as an adenovirus or an AAV vector can be formulated into a pharmaceutical composition and will generally be administered locally or systemically. In some aspects, the vector is administered directly to the pancreas. In other aspects, intraductally into a pancreatic duct of the subject. In other aspects, the subject has diabetes, such as Type 1 diabetes.
In some aspects, methods are provided for producing pancreatic beta cells from pancreatic alpha cells in a subject. These methods include administering to the subject a vector comprising a disclosed MafB promoter operably linked to a nucleic acid molecule encoding PDX1 and MafA, and optionally MafB. In some aspects, the vector does not encode Ngn3 and the subject is not administered any other nucleic acid encoding Ngn3. In more aspects, the vector is administered intraductally into a pancreatic duct of the subject. In further aspects, the subject has diabetes or pre-diabetes.
In additional aspects, methods are provided for treating diabetes Type 1 or pre-diabetes in a subject. The subject can be any mammalian subject, including human and veterinary subjects. The subject can be a child or an adult. The method can include selecting a subject of interest, such as a subject with diabetes. The subject can also be administered insulin. The method can include measuring beta cell number.
In some examples, a subject with diabetes may be clinically diagnosed by a fasting plasma glucose (FPG) concentration of greater than or equal to 7.0 millimole per liter (mmol/L) (126 milligram per deciliter (mg/dL)), or a plasma glucose concentration of greater than or equal to 11.1 mmol/L (200 mg/dL) at about two hours after an oral glucose tolerance test (OGTT) with a 75 gram (g) load, or in a patient with classic symptoms of hyperglycemia or hyperglycemic crisis, a random plasma glucose concentration of greater than or equal to 11.1 mmol/L (200 mg/dL), or HbAlc levels of greater than or equal to 6.5%. In other examples, a subject with pre-diabetes may be diagnosed by impaired glucose tolerance (IGT). An OGTT two-hour plasma glucose of greater than or equal to 140 mg/dL and less than 200 mg/dL (7.8-11.0 mM), or a fasting plasma glucose (FPG) concentration of greater than or equal to 100 mg/dE and less than 125 mg/dE (5.6-6.9 mmol/L), or HbAlc levels of greater than or equal to 5.7% and less than 6.4% (5.7-6.4%) is considered to be IGT, and indicates that a subject has pre-diabetes. Additional information can be found in Standards of Medical Care in Diabetes — 2010 (American Diabetes Association, Diabetes Care 33:S 11 -61, 2010, incorporated herein by reference).
The disclosed methods produce pancreatic beta cells in a subject. Generally, these cells produce insulin. In some aspects, the subject is a subject with Type 1 diabetes and the pancreatic beta cells produced by the disclosed methods are not recognized by the immune system of the subject. In some aspects, T cell and/or B cells do not produce an immune response to the pancreatic beta cells produced by the disclosed methods. Thus, in some aspects, the subject does not mount an autoimmune response to the pancreatic beta cells produced by the disclosed methods. In specific non-limiting examples, the subject does not have immune destruction of the pancreatic beta cells, and does not exhibit an increased lymphocyte invasion of the islets. In some aspects, the disclosed method transdifferentiate alpha cells into beta cells.
Appropriate doses of a disclosed vector depend on the subject being treated (e.g., human or nonhuman primate or other mammal), age and general condition of the subject to be treated, the severity of the condition being treated, the mode of administration of the AAV vector/virion, among other factors. An appropriate effective amount can be readily determined by one of skill in the art. Thus, a "therapeutically effective amount" will fall in a relatively broad range that can be determined through clinical trials. The method can include measuring an outcome, such as insulin production, improvement in a fasting plasma glucose tolerance test, or pancreatic beta cell number. The method can include administering other therapeutic agents, such as insulin. The method can also include having the subject make lifestyle modifications.
For example, for in vivo injection, a therapeutically effective dose can be on the order of from about 105 to 1016 of the AAV virions, such as 108 to 1014 AAV virions, such as AAV6 virions. The dose, of course, depends on the efficiency of transduction, the disclosed promoter that is selected, the stability of the message and the protein encoded thereby, and clinical factors. Effective dosages can be readily established by one of ordinary skill in the art through routine trials establishing dose response curves.
Dosage treatment may be a single dose schedule or a multiple dose schedule to ultimately deliver the amount specified above. Moreover, the subject may be administered as many doses as appropriate. Thus, the subject may be given, e.g., 10 5 to 1016 AAV virions in a single dose, or two, four, five, six or more doses that collectively result in delivery of, e.g., 105 to 1016 AAV virions. One of skill in the art can readily determine an appropriate number of doses to administer.
In some aspects, the AAV is administered at a dose of about 1 x 10” to about 1 x 1014 viral particles (vp)/kg. In some examples, the AAV is administered at a dose of about 1 x 1012 to about 8 x 1013 vp/kg. In other examples, the AAV is administered at a dose of about 1 x 1013 to about 6 x 1013 vp/kg. In specific non-limiting examples, the AAV is administered at a dose of at least about 1 x 1011, at least about 5 x 1011 , at least about 1 x 1012, at least about 5 x 1012, at least about 1 x 1013, at least about 5 x 1013, or at least about 1 x 1014 vp/kg. In other non-limiting examples, the AAV is administered at a dose of no more than about 5 x 1011, no more than about 1 x 1012, no more than about 5 x 1012, no more than about 1 x 1013, no more than about 5 x 1013, or no more than about 1 x 1014 vp/kg. In one non-limiting example, the AAV is administered at a dose of about 1 x 1012 vp/kg. The AAV can be administered in a single dose, or in multiple doses (such as 2, 3, 4, 5, 6, 7, 8, 9 or 10 doses) as needed for the desired therapeutic results, such as the production of 0 cells and/or treatment of Type 1 diabetes. The AAV can be an AAV6.
Pharmaceutical compositions include sufficient genetic material to produce a therapeutically effective amount of PDX1 and MafA. In some aspects, AAV virions will be present in the subject compositions in an amount sufficient to provide a therapeutic effect, such as the production of pancreatic beta cells and/or the treatment of diabetes when given in one or more doses.
AAV virions can be provided as lyophilized preparations and diluted in a stabilizing compositions for immediate or future use. Alternatively, the AAV virions can be provided immediately after production and stored for future use.
The pharmaceutical compositions can contain the vector, such as the AAV vector, and/or virions, and a pharmaceutically acceptable excipient. Such excipients include any pharmaceutical agent that does not itself induce the production of antibodies harmful to the individual receiving the composition, and which may be administered without undue toxicity. Pharmaceutically acceptable excipients include, but are not limited to, liquids such as water, saline, glycerol and ethanol. The excipient can be pluronic acid (PF-68, also called Polaxamer 188). Pharmaceutically acceptable salts can be included therein, for example, mineral acid salts such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of organic acids such as acetates, propionates, malonates, benzoates, and the like. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. A thorough discussion of pharmaceutically acceptable excipients is available in Remington: The Science and Practice of Pharmacy, 22nd ed., London, UK: Pharmaceutical Press, 2013.
In some aspects, the excipients confer a protective effect on the AAV virion such that loss of AAV virions, as well as transduceability resulting from formulation procedures, packaging, storage, transport, and the like, is minimized. These excipient compositions are therefore considered "virion-stabilizing" in the sense that they provide higher AAV virion titers and higher transduceability levels than their non-protected counterparts, as measured using standard assays, see, for example, Published U.S. Application No. 2012/0219528, incorporated herein by reference. These Compositions therefore demonstrate "enhanced transduccability levels" as compared to compositions lacking the particular excipients described herein, and are therefore more stable than their non-protected counterparts.
Exemplary excipients that can used to protect the AAV virion from activity degradative conditions include, but are not limited to, detergents, proteins, e.g., ovalbumin and bovine serum albumin, amino acids, e.g., glycine, polyhydric and dihydric alcohols, such as but not limited to polyethylene glycols (PEG) of varying molecular weights, such as PEG-200, PEG-400, PEG-600, PEG-1000, PEG-1450, PEG-3350, PEG- 6000, PEG-8000 and any molecular weights in between these values, with molecular weights of 1500 to 6000 preferred, propylene glycols (PG), sugar alcohols, such as a carbohydrate, preferably, sorbitol. The detergent, when present, can be an anionic, a cationic, a zwitterionic or a nonionic detergent. An exemplary detergent is a nonionic detergent. One suitable type of nonionic detergent is a sorbitan ester, e.g., polyoxyethylenesorbitan monolaurate (TWEENO-20) polyoxyethylenesorbitan monopalmitate (TWEEN®- 40), polyoxyethylenesorbitan monostearate (TWEEN®-60), polyoxyethylenesorbitan tristearate (TWEEN®- 65), polyoxyethylenesorbitan monooleate (TWEEN®-80), polyoxyethylenesorbitan trioleate (TWEEN®- 85), such as TWEEN®-20 and/or TWEEN®-80. These excipients are commercially available from a number of vendors, such as Sigma, St. Louis, Mo. The amount of the various excipients present in any of the disclosed compositions varies and is readily determined by one of skill in the art. For example, a protein excipient, such as BSA, if present, will can be present at a concentration of between 1.0 weight (wt.) % to about 20 wt. %, preferably 10 wt. %. If an amino acid such as glycine is used in the formulations, it can be present at a concentration of about 1 wt. % to about 5 wt. %. A carbohydrate, such as sorbitol, if present, can be present at a concentration of about 0.1 wt % to about 10 wt. %, such as between about 0.5 wt. % to about 15 wt. %, or about 1 wt. % to about 5 wt. %. If polyethylene glycol is present, it can generally be present on the order of about 2 wt. % to about 40 wt. %, such as about 10 wt. % top about 25 wt. %. If propylene glycol is used in the subject formulations, it will typically be present at a concentration of about 2 wt. % to about 60 wt. %, such as about 5 wt. % to about 30 wt. %. If a detergent such as a sorbitan ester (TWEEN®) is present, it can be present at a concentration of about 0.05 wt. % to about 5 wt. %, such as between about 0.1 wt. % and about 1 wt %, see U.S. Published Patent Application No. 2012/0219528, which is incorporated herein by reference. In one example, an aqueous virion-stabilizing formulation comprises a carbohydrate, such as sorbitol, at a concentration of between 0.1 wt. % to about 10 wt. %, such as between about 1 wt. % to about 5 wt. %, and a detergent, such as a sorbitan ester (TWEEN®) at a concentration of between about 0.05 wt. % and about 5 wt. %, such as between about 0.1 wt. % and about 1 wt. %. Virions are generally present in the composition in an amount sufficient to provide a therapeutic effect when given in one or more doses, as defined above.
The pharmaceutical compositions can include a contrast dye, for administration in addition to the viral vector, such an adenoviral vector, including a disclosed MafB promoter operably linked to a heterologous nucleic acid molecule encoding PDX1 and MafA, and optionally MafB. The contrast dye can be a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye. In specific non-limiting examples the dye is lopromid, loglicinate, or loxaglinate. Thus, provided herein is a pharmaceutical composition including a) an adcno-associatcd virus vector, such as AAV6, comprising a disclosed MafB promoter operably linked to a nucleic acid molecule encoding PDX1 and a nucleic acid encoding MafA, and optionally MafB, and further optionally wherein the vector does not encode Ngn3; b) a buffer; and c) a contrast dye for endoscopic retrograde cholangiopancreatography. In some aspects, the pharmaceutical composition does not include a nucleic acid encoding Ngn3. Any of the AAV vectors disclosed herein, including any of the disclosed promoters, can be included in this composition. The AAV vector can be encapsulated in a virion. The AAV vector can be an AAV6 vector. The composition can be formulated for administration to the pancreatic duct.
The disclosed pharmaceutical compositions including a viral vector, such an adenoviral vector or AAV vector, can be delivered to humans or other animals by any means, including orally, intravenously, intramuscularly, intraperitoneally, intranasally, intradermally, intrathecally, subcutaneously, via inhalation or via suppository. In one non-limiting example, the composition is administered into the pancreatic duct of a subject in vivo.
One exemplary method for intraductal administration is Endoscopic Retrograde Cholangiopancreatography (ERCP). ERCP is an endoscopic technique that involves the placement of a side- viewing instrument (generally either an endoscope or duodenoscope) within the descending duodenum. The procedure eliminates the need for invasive surgical procedures for administration to the pancreatic duct.
In an ERCP procedure, the patient will generally lie on their side on an examining table. The patient will then be given medication to help numb the back of the patient's throat, and a sedative to help the patient relax during the examination. The patient then swallows the endoscope. The thin, flexible endoscope is passed carefully through the alimentary canal of the patient. The physician guides the endoscope through the patient's esophagus, stomach, and the first part of the small intestine known as the duodenum. Because of the endoscope's relatively small diameter, most patients can tolerate the unusualness of having the endoscope advanced through the opening of their mouth.
The physician stops the advancement of the endoscope when the endoscope reaches the junction where the ducts of the biliary tree and pancreas open into the duodenum. This location is called the papilla of Vater, or also commonly referred to as the ampulla of Vater. The papilla of Vater is a small mound of tissue looking and acting similarly to a nipple. The papilla of Vater emits a substance known as bile into the small intestine, as well as pancreatic secretions that contain digestive enzymes. Bile is a combination of chemicals made in the liver and is necessary in the act of digestion. Bile is stored and concentrated in the gallbladder between meals. When digestive indicators stimulate the gallbladder, however, the gallbladder squeezes the bile through the common bile duct and subsequently through the papilla of Vater. The pancreas secretes enzymes in response to a meal, and the enzymes help digest carbohydrates, fats, and proteins.
The patient will be instructed (or manually maneuvered) to lie flat on their stomach once the endoscope reaches the papilla of Vater. For visualization or treatment within the biliary tree, the distal end of the endoscope is positioned proximate the papilla of Vater. A catheter is then advanced through the endoscope until the distal tip of the catheter emerges from the opening at the endoscope's distal end. The distal end of the catheter is guided through the endoscope's orifice to the papilla of Vater (located between the sphincter of Oddi) leading to the common bile duct and the pancreatic duct. In the case of pancreasspecific delivery of reagents, the pancreatic duct proper can be entered.
ERCP catheters can be constructed from Teflon, polyurethane and polyaminde. ERCP catheters also can also be constructed from resin comprised of nylon and PEBA (see U.S. Patent No. 5,843,028), and can be construed for use by a single operator (see U.S. Patent No. 7,179,252). At times, a spring wire guide may be placed in the lumen of the catheter to assist in cannulation of the ducts. A stylet, used to stiffen the catheter, must first be removed prior to spring wire guide insertion.
A dual or multi-lumen ERCP catheter in which one lumen could be utilized to accommodate the spring wire guide or diagnostic or therapeutic device, and in which a second lumen could be utilized for contrast media and/or dye infusion and or for administration of a pharmaceutical composition including a viral vector, such an adenoviral vector, that includes a disclosed promoter, such as operably linked to a heterologous nucleic acid molecule encoding PDX1 and MafA. In some aspects, a contrast dye is administered to the subject.
A contrast dye can be included in a pharmaceutical composition. The contrast dye can be a low- osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye. In specific non-limiting examples the dye is lopromid, loglicinate, or loxaglinate. Endoscopes have been designed for the delivery of more than one liquid solution, such as a first liquid composition including a viral vector, such an adenoviral vector, and a second liquid composition including dye, see U. S. Patent No. 7,597,662, which is incorporated herein by reference. Thus, the pharmaceutical composition including a viral vector, such an adenoviral vector, and the dye can be delivered in the same or separate liquid compositions. Methods and devices for using biliary catheters for accessing the biliary tree for ERCP procedures are disclosed in U.S. Patent No. 5,843,028, U.S. Patent No. 5,397,302 U.S. Pat. No. 5,320,602, which are incorporated by reference herein.
In additional examples, the vector is administered using a viral infusion technique into a pancreatic duct. Suitable methods are disclosed, for example, in Guo et al. Laboratory Invest. 93: 1241-1253, 2013, incorporated by reference herein.
The subject can be administered an effective amount of another agent, such as an immunosuppressive agent. The immunosuppressive agent can be a steroid. The immunosuppressive agent can be a non-steroidal anti-inflammatory agent. The immunosuppressive agent can be an antibody. The immunosuppressive agent can be a calcineurin inhibitor. The immunosuppressive agent can be a mammalian target inhibitor (mTOR inhibiter). The immunosuppressive agent can be a glucocorticoid. The immunosuppressive agent can be an alkylating agent. Immunosuppressive agents are disclosed, for example, in Hussain and Khan, Encyclopedia of Infection and Immunity 2022: 726-740, doi: 10.1016/B978-0-12- 818731-9.00068-9, April 8, 2022. Combinations of immunosuppressive agents are also of use.
Suitable immunosuppressive agents include, but are not limited to, mycophenolate mofetil (MMF), Rituximab, anti-Thymocyte Globulin (ATG), rapamycin. Methylprednisolone, and prednisolone. Combinations of these agents arc also of use. In one non-limiting example, the subject is administered Rituximab. In another non-limiting example, the subject is administered MMF. In a further non-limiting example, the subject is administered Rituximab and MMF. The immunosuppressive agent can be cortisol, cyclosporine-A, sirolimus, everolimus, mizoribine, lefunomide, azathioprine, methotrexate, or tacrolimus.
EXAMPLES
A gene therapy construct was engineered in which a shortened form of the MafB promoter drives expression of PDX1 and MafA, with or without additionally driving MafB production. This construct was able to convert human and mouse alpha cell lines into insulin-producing cells. The advantage over using the glucagon promoter is that when alpha cells are being converted into beta cells, the glucagon promoter will shut off, which could prevent full conversion. However, since in human and non-human primates, MafB is expressed in both alpha and beta cells, the MafB promoter does not turn off during conversion of an alpha cell into a beta cell. It was found that the conversion of human alpha cell lines into insulin-producing cells was much more durable than using either the glucagon promoter or the CMV promoter. It was also found that MafB driving PDX1 and MafA packaged in an AAV and infused into the pancreatic duct could induce formation of insulin-producing cells in non-human primates.
Example 1 Materials and Methods
Generation of rAAV vector and virus purification: A human MafB promoter was selected as a 348 bp sequence upstream of the human MafB open reading frame. The AAV vector, AAV-hMafBp-hPM, (Tina-059, see FIG. 2) contains human PDX1 and MafA open reading frames connected by a P2A linker under control of this human MafB promoter sequence plus a chimeric intron. AAV serotype 6, DJ (from Cell Biolabs), and/or KPI (Stanford University) were tested in non-human primate studies. The recombinant AAV vectors used for in vivo and in vitro infection (rAAV) were generated by transfection of human embryonic kidney 293T cells using polyethylenimine (PEI) max at 3:1 ratio to total DNA amount. rAAV vectors were harvested 3-4 days after transfection. rAAV vectors were released from cells by three frcczc-thaw cycles, and non-cncapsidatcd DNA was removed by digestion with Bcnzonasc. rAAV vectors were first purified by chloroform extraction and PEG-aqueous partition. Then rAAV vectors were further purified by 2 cycles of iodixanol ultracentrifugation. rAAV were collected and dialyzed in PBS-0.001%PF- 68 buffer, then they were concentrated with a protein concentrator 50,000 MWCO tube and sterile-filtered. The viral titer was determined with gene of interest specific Taqman probe set.
Cell culture conditions: Human Alpha Cells (Celprogen, Cat#35002-05) were cultured in human alpha cell complete media with serum (Celprogen, Cat#M35002-05S). The cell culture expansion extracellular matrix (Celprogen, Cat#E35002-05) were used for all human alpha cell culture experiments.
Virus infusion solution for non-human primates: Virus infusion solution was calculated with 1.7 mL/kg NHP body weight. The rAAV (1-5E+13 vg) was diluted with sterile saline-0.001% PF-68 and 2-4 mL indigo carmine solution (10 mg/mL). The virus solution was sterile-filtered before use. Example 2
Results
The MafB regulatory elements was investigated for driving the expression of PDX1 and MafA. In mouse, immature 0 cells express MafB, but then the expression ends when the cells mature, whereas both immature and mature a cells express MafB. However, in single cell RNAseq data from NHP islets, MafB was expressed equally in both mature a cells and mature 0 cells. This expression in both types of cells can be a distinct advantage over glucagon regulatory elements when driving PDX1 and MafA because the expression construct will not be shut off when transitioning into 0 cells or 0-like cells.
To create a proof of principle, a plasmid construct (FIG. 1) was made in which the MafB promoter, along with an intron sequence, which helps increase expression, drove luciferase and GFP to demonstrate activity of the promoter. When a human alpha cell line (CELPROGEN®) was transfected, it was found that the GFP (left panels) and luciferase (graph) signals were comparable to the CMV promoter. The CMV promoter is adequate to drive conversion of alpha cells into beta cells. Thus, these results imply that using the MafB promoter should be strong enough to convert alpha cells into beta cells.
A similar construct was created, with different combinations of PDX1, MafA, and MafB coding regions (FIG. 2). The plasmids were designed to allow a direct comparison of the MafB promoter with the CMV promoter in transforming human alpha cells into insulin-secreting cells. As shown in FIG. 3, the MafB promoter induced the production and secretion of insulin in human alpha cells more strongly, when compared with the CMV promoter (Tina-036-1, CMV, compared with Tina-055, MafB, and then Tina-007, CMV, compared with Tina-059, MafB). Based on these results, one of the plasmids (Tina-055) was converted into an AAV virus and surgery was performed to infuse the virus into the pancreatic duct of a diabetic non-human primate. Shown in FIG. 4 the blood glucose trends and insulin requirement trends. There was a clear improvement in blood glucoses and a decrease in insulin requirement over time, showing efficacy of the MafB promoter in converting alpha cells to beta cells. Furthermore, as shown in FIG. 5, the glucose tolerance curves showed a dramatic improvement at two months, with a near normal fasting blood glucose (time zero) and a much more rapid decline in glucose levels after glucose challenge. Notably, the animals did not receive insulin for 24 hours prior to the glucose tolerance test, so there is essentially no residual exogenous insulin circulating. Lastly, as shown in FIG. 6, insulin levels were measured in the blood 30 minutes after glucose challenge. Consistent with the glucose tolerance test shown in FIG. 5, the circulating (endogenous) levels were substantially higher after 1 month compared with the presurgical diabetic state.
FIG. 1 shows that the MafB promoter drives luciferase in alpha cells equivalent to the CMV promoter.
FIG. 2 provides a schematic diagram of MafB promoter constructs that were generated, Tina-055 and Tina-059, driving PDX1 and MafA, plus or minus driving MafB.
FIG. 3 provides evidence that human alpha cells transfected with Tina-055 make more insulin than cells transfected with a similar construct that utilizes the CMV promoter.
FIG. 4 provide results from a monkey that received an AAV with the MafB promoter driving pdxl and mafA infusion (surgery) into the pancreas. The insulin requirement and blood glucose improved.
FIG. 5 provides evidenced that the fasting glucose and glucose tolerance improved in this monkey, to such an extent that the fasting glucose was normal.
FIG. 6 shows the circulating monkey insulin levels during a glucose tolerance test, with time zero being fasting, for this animal.
Example 3
Additional Constructs and Results
The plasmids pTina-055 and pTina-059 are shown in FIG. 7 and FIG. 8, respectively. pTina-055 encodes human MafA without a STOP codon, includes the connector T2A, and encodes human mafB sequence. pTina-059 encodes human MafA with a STOP codon, but does not include the connector T2A, and does not encode a human MafB sequence. The following nucleic acid sequences were utilized in pTina- 055. human MAFB promoter: GGACCTGCAGGGTACGGGGGTGGGCCCAGGCTGGCCGTCCCACTCCGCTCCAGCGCTCCGCCA GGCCTCCGCCGCTCCGGGGCCGCTCGGGCCGCCAGTCAGCTGACGCGGGGGGCGGGGGAGCTG TCAGGCGCGCCCCGCCCTGCGCCGCTGGGCCGCGGAGGCCGTGCAGCTATTGGCCCGCGCACC GGGCCGCCCGGGCCCCCGCACCCCAGTGACATCAGGAGGCGATAAAAGGCTGCGGCGCCGCC GGATCCAGCACAGCTGCACCGCCGAGCTGCGAGCGGCTGCGAGCGAGAGAGCGTAAGAGCAA GAGAGCTAGAGAGCGAGCAACGGGCACTCGCCCCA (SEQ ID NO: 1) chimeric intron:
GTAAGTATCAAGGTTACAAGACAGGTTTAAGGAGACCAATAGAAACTGGGCTTGTCGAGACAG AGAAGACTCTTGCGTTTCTGATAGGCACCTATTGGTCTTACTGACATCCACTTTGCCTTTCTCTC CACAG (SEQ ID NO: 13)
Kozak:
GCCACCATGA (SEQ ID NO: 14) human PDX1 (no STOP codon) ATGAACGGCGAGGAGCAGTACTACGCGGCCACGCAGCTTTACAAGGACCCATGCGCGTTCCAG CGAGGCCCGGCGCCGGAGTTCAGCGCCAGCCCCCCTGCGTGCCTGTACATGGGCCGCCAGCCC CCGCCGCCGCCGCCGCACCCGTTCCCTGGCGCCCTGGGCGCGCTGGAGCAGGGCAGCCCCCCG GACATCTCCCCGTACGAGGTGCCCCCCCTCGCCGACGACCCCGCGGTGGCGCACCTTCACCACC
ACCTCCCGGCTCAGCTCGCGCTCCCCCACCCGCCCGCCGGGCCCTTCCCGGAGGGAGCCGAGC
CGGGCGTCCTGGAGGAGCCCAACCGCGTCCAGCTGCCTTTCCCATGGATGAAGTCTACCAAAG
CTCACGCGTGGAAAGGCCAGTGGGCAGGCGGCGCCTACGCTGCGGAGCCGGAGGAGAACAAG
CGGACGCGCACGGCCTACACGCGCGCACAGCTGCTAGAGCTGGAGAAGGAGTTCCTATTCAAC
AAGTACATCTCACGGCCGCGCCGGGTGGAGCTGGCTGTCATGTTGAACTTGACCGAGAGACAC
ATCAAGATCTGGTTCCAAAACCGCCGCATGAAGTGGAAAAAGGAGGAGGACAAGAAGCGCGG
CGGCGGGACAGCTGTCGGGGGTGGCGGGGTCGCGGAGCCTGAGCAGGACTGCGCCGTGACCTC
CGGCGAGGAGCTTCTGGCGCTGCCGCCGCCGCCGCCCCCCGGAGGTGCTGTGCCGCCCGCTGC
CCCCGTTGCCGCCCGAGAGGGCCGCCTGCCGCCTGGCCTTAGCGCGTCGCCACAGCCCTCCAGC
GTCGCGCCTCGGCGGCCGCAGGAACCACGA (SEQ ID NO: 15)
P2A
GGATCCGGAGCTACTAACTTCAGCCTGCTGAAGCAGGCTGGAGACGTGGAGGAGAACCCCGGG
CCT (SEQ ID NO: 16) human MafA (no STOP codon)
ATGGCCGCGGAGCTGGCGATGGGCGCCGAGCTGCCCAGCAGCCCGCTGGCCATCGAGTACGTC
AACGACTTCGACCTGATGAAGTTCGAGGTGAAGAAGGAGCCTCCCGAGGCCGAGCGCTTCTGC
CACCGCCTGCCGCCAGGCTCGCTGTCCTCGACGCCGCTCAGCACGCCCTGCTCCTCCGTGCCCT
CCTCGCCCAGCTTCTGCGCGCCCAGCCCGGGCACCGGCGGCGGCGGCGGCGCGGGGGGCGGCG
GCGGCTCGTCTCAGGCCGGGGGCGCCCCCGGGCCGCCGAGCGGGGGCCCCGGCGCCGTCGGGG
GCACCTCGGGGAAGCCGGCGCTGGAGGATCTGTACTGGATGAGCGGCTACCAGCATCACCTCA
ACCCCGAGGCGCTCAACCTGACGCCCGAGGACGCGGTGGAGGCGCTCATCGGCAGCGGCCACC
ACGGCGCGCACCACGGCGCGCACCACCCGGCGGCCGCCGCAGCCTACGAGGCTTTCCGCGGCC
CGGGCTTCGCGGGCGGCGGCGGAGCGGACGACATGGGCGCCGGCCACCACCACGGCGCGCAC
CACGCCGCCCACCATCACCACGCCGCCCACCACCACCACCACCACCACCACCACCATGGCGGC
GCGGGACACGGCGGTGGCGCGGGCCACCACGTGCGCCTGGAGGAGCGCTTCTCCGACGACCAG
CTGGTGTCCATGTCGGTGCGCGAGCTGAACCGGCAGCTCCGCGGCTTCAGCAAGGAGGAGGTC
ATCCGGCTCAAGCAGAAGCGGCGCACGCTCAAGAACCGCGGCTACGCGCAGTCCTGCCGCTTC
AAGCGGGTGCAGCAGCGGCACATTCTGGAGAGCGAGAAGTGCCAACTCCAGAGCCAGGTGGA
GCAGCTGAAGCTGGAGGTGGGGCGCCTGGCCAAAGAGCGGGACCTGTACAAGGAGAAATACG
AGAAGCTGGCGGGCCGGGGCGGCCCCGGGAGCGCGGGCGGGGCCGGTTTCCCGCGGGAGCCT
TCGCCGCCGCAGGCCGGTCCCGGCGGGGCCAAGGGCACGGCCGACTTCTTCCTG (SEQ ID NO: 17) T2A
GGAAGCGGAGAGGGCAGAGGAAGTCTTCTAACATGCGGTGACGTGGAGGAGAATCCCGGCCC
T (SEQ ID NO: 18) human MAFB (with STOP codon)
ATGGCCGCGGAGCTGAGCATGGGGCCAGAGCTGCCCACCAGCCCGCTGGCCATGGAGTATGTC
AACGACTTCGACCTGCTCAAGTTCGACGTGAAGAAGGAGCCACTGGGGCGCGCGGAGCGTCCG
GGCAGGCCCTGCACACGCCTGCAGCCAGCCGGCTCGGTGTCCTCCACACCGCTCAGCACTCCGT
GTAGCTCCGTGCCCTCGTCGCCCAGCTTCAGCCCGACCGAACAGAAGACACACCTCGAGGATC
TGTACTGGATGGCGAGCAACTACCAGCAGATGAACCCCGAGGCGCTCAACCTGACGCCCGAGG
ACGCGGTGGAAGCGCTCATCGGCTCGCACCCAGTGCCACAGCCGCTGCAAAGCTTCGACAGCT
TTCGCGGCGCTCACCACCACCACCATCACCACCACCCTCACCCGCACCACGCGTACCCGGGCGC
CGGCGTGGCCCACGACGAGCTGGGCCCGCACGCTCACCCGCACCATCACCATCATCACCAAGC
GTCGCCGCCGCCGTCCAGCGCCGCTAGCCCGGCGCAACAGCTGCCCACTAGCCACCCCGGGCC
CGGGCCGCACGCGACGGCCTCGGCGACGGCGGCGGGCGGCAACGGCAGCGTGGAGGACCGCT
TCTCCGACGACCAGCTCGTGTCCATGTCCGTGCGCGAGCTGAACCGCCACCTGCGGGGCTTCAC
CAAGGACGAGGTGATCCGCCTGAAGCAGAAGCGGCGGACCCTGAAGAACCGGGGCTACGCCC
AGTCTTGCAGGTATAAACGCGTCCAGCAGAAGCACCACCTGGAGAATGAGAAGACGCAGCTCA
TTCAGCAGGTGGAGCAGCTTAAGCAGGAGGTGTCCCGGCTGGCCCGCGAGAGAGACGCCTACA
AGGTCAAGTGCGAGAAACTCGCCAACTCCGGCTTCAGGGAGGCGGGCTCCACCAGCGACAGCC
CCTCCTCTCCCGAGTTCTTTCTGTAA (SEQ ID NO: 19)
WPRE
TCAACCTCTGGATTACAAAATTTGTGAAAGATTGACTGGTATTCTTAACTATGTTGCTCCTTTTA
CGCTATGTGGATACGCTGCTTTAATGCCTTTGTATCATGCTATTGCTTCCCGTATGGCTTTCATT
TTCTCCTCCTTGTATAAATCCTGGTTGCTGTCTCTTTATGAGGAGTTGTGGCCCGTTGTCAGGCA
ACGTGGCGTGGTGTGCACTGTGTTTGCTGACGCAACCCCCACTGGTTGGGGCATTGCCACCACC
TGTCAGCTCCTTTCCGGGACTTTCGCTTTCCCCCTCCCTATTGCCACGGCGGAACTCATCGCCGC
CTGCCTTGCCCGCTGCTGGACAGGGGCTCGGCTGTTGGGCACTGACAATTCCGTGGTGTTGTCG
GGGAAATCATCGTCCTTTCCTTGGCTGCTCGCCTGTGTTGCCACCTGGATTCTGCGCGGGACGT
CCTTCTGCTACGTCCCTTCGGCCCTCAATCCAGCGGACCTTCCTTCCCGCGGCCTGCTGCCGGCT
CTGCGGCCTCTTCCGCGTCTTCGCCTTCGCCCTCAGACGAGTCGGATCTCCCTTTGGGCCGCCTC
CCCGCA (SEQ ID NO: 20) poly(A)
CTGTGCCTTCTAGTTGCCAGCCATCTGTTGTTTGCCCCTCCCCCGTGCCTTCCTTGACCCTGGAA GGTGCCACTCCCACTGTCCTTTCCTAATAAAATGAGGAAATTGCATCGCATTGTCTGAGTAGGT GTCATTCTATTCTGGGGGGTGGGGTGGGGCAGGACAGCAAGGGGGAGGATTGGGAAGACAAT AGCAGGCATGCTGGGGATGCGGTGGGCTCTATGG (SEQ ID NO: 21)
FIGS. 9 and 10 provide glucose tolerance test (GTT) results from two monkeys that received a pancreatic duct infusion of AAV6 carrying Tina-059.
In view of the many possible aspects to which the principles of our invention may be applied, it should be recognized that illustrated aspects are only examples of the invention and should not be considered a limitation on the scope of the invention. Rather, the scope of the invention is defined by the following claims. We therefore claim as our invention all that comes within the scope and spirit of these claims.

Claims

We claim:
1. A recombinant nucleic acid molecule comprising a Musculoaponeurotic fibrosarcoma oncogene homolog (Maf)B promoter operably linked to a nucleic acid molecule encoding a heterologous protein, wherein the MafB promoter consists essentially of: a) a nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter; or b) the nucleotide sequence of SEQ ID NO: 1.
2. The recombinant nucleic acid molecule of claim 1 , wherein the MafB promoter consists of: a) the nucleotide sequence at least 95% identical to SEQ ID NO: 1, that functions as a promoter; or b) the nucleotide sequence of SEQ ID NO: 1.
3. The recombinant nucleic acid molecule of claim 1 or claim 2, wherein the MafB promoter consists of the nucleotide sequence of SEQ ID NO: 1.
4. A vector comprising the nucleic acid molecule of any one of claims 1-3.
5. The vector of claim 4, wherein the vector is a viral vector.
6. The vector of claim 5, wherein the viral vector is a lentivirus vector, an adenovirus vector or an adeno-associated virus (AAV) vector.
7. The vector of claim 6, wherein the vector is an AAV6, AAVDJ or AAVKP1 vector.
8. The vector of any one of claims 4-7, wherein the MafB promoter is operably linked to a nucleic acid molecule encoding MafA and/or a nucleic acid molecule encoding PDX1.
9. The vector of claim 8, further comprising a nucleic acid molecule encoding MafB operably linked to the MafB promoter.
10. The vector of claim 8 or claim 9, comprising the nucleic acid molecule encoding PDX1 and the nucleic acid molecule encoding MafA, and comprising a connector between the nucleic acid molecule encoding PDX1 and the nucleic acid molecule encoding MafA.
11. The vector of claim 10, wherein the connector is a 2 A connector.
12. A host cell transformed with the recombinant nucleic acid molecule of any one of claims 1- 3, or the vector of any one of claim 4-11.
13. The host cell of claim 12, wherein the host cell is a mammalian host cell.
14. The host cell of claim 13, wherein the mammalian host cell is a human host cell.
15. The host cell of claim 13, wherein the mammalian host cell is a non-human primate host cell.
16. The host cell of any one of claims 13-15, wherein the host cell is an alpha cell.
17. A method of producing a heterologous protein in a host cell, comprising: transforming the host cell with an effective amount of the vector of any one of claims 4-11, thereby producing the heterologous protein in the host cell.
18. The method of claim 17, wherein the host cell is a mammalian host cell.
19. The method of claim 18, wherein the mammalian host cell is a human host cell or a non- human primate host cell.
20. A composition comprising an effective amount of the recombinant nucleic acid molecule of any one of claims 1-3 or the vector of any one of claims 4-11, and a pharmaceutically acceptable carrier.
21. The composition of claim 20, further comprising a contrast dye for endoscopic retrograde chol angiop ancre atography .
22. The composition of claim 21, wherein the contrast dye is a low-osmolar low-viscosity nonionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye.
23. The composition of claim 21 or 22, wherein the contrast dye is lopromid, loglicinate, or loxaglinate.
24. A method of producing pancreatic beta cells in a subject, comprising administering to the subject a composition comprising an effective amount of the vector of any one of claims 9-12, wherein the MafB promoter is operably linked to the nucleic acid molecule encoding PDX1 and the nucleic acid molecule encoding MafA, wherein the vector does not encode Neurogenin 3 (Ngn3), wherein the subject is not administered any other nucleic acid encoding Ngn3, wherein the vector is administered intraductally into a pancreatic duct of the subject, thereby inducing alpha cells to transdifferentiate into pancreatic beta cells in the subject.
25. The method of claim 24, wherein the MafB promoter is further operably linked to a nucleic acid molecule encoding MafB.
26. The method of claim 24 or claim 25, wherein administered intraductally comprises the use of endoscopic retrograde cholangiopancreatography (ERCP).
27. The method of any one of claims 24-26, wherein the subject is not administered an immunosuppressive agent.
28. The method of any one of claims 24-26, wherein the subject is administered an immunosuppressive agent.
29. The method of any one of claims 24-28, wherein the subject is a human.
30. The method of any one of claims 24-29, wherein the subject has Type 1 diabetes.
31. The method of any one of claims 24-30, wherein the composition further comprises a contrast dye for endoscopic retrograde cholangiopancreatography.
32. The method of claim 31, wherein the contrast dye is a low-osmolar low-viscosity non-ionic dye, a low-viscosity high-osmolar dye, or a dissociable high- viscosity dye.
33. The method of claim 31 or 32, wherein the contrast dye is lopromid, loglicinate, or loxaglinate.
34. A composition comprising an effective amount of the vector of any one of claims 4-11, wherein the MafB promoter is operably linked to the nucleic acid molecule encoding PDX1 and the nucleic acid molecule encoding MafA, for use in treating diabetes in a subject.
35. The composition of claim 34, for use in the method of any one of claims 24-32.
PCT/US2024/016536 2023-02-21 2024-02-20 Mafb promoter for gene therapy for diabetes Ceased WO2024178007A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202363486202P 2023-02-21 2023-02-21
US63/486,202 2023-02-21

Publications (1)

Publication Number Publication Date
WO2024178007A1 true WO2024178007A1 (en) 2024-08-29

Family

ID=92501472

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2024/016536 Ceased WO2024178007A1 (en) 2023-02-21 2024-02-20 Mafb promoter for gene therapy for diabetes

Country Status (1)

Country Link
WO (1) WO2024178007A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008013737A2 (en) * 2006-07-19 2008-01-31 University Of Florida Research Foundation, Inc. Compositions for reprogramming a cell and uses therefor
WO2015148832A1 (en) * 2014-03-27 2015-10-01 Salk Institute For Biological Studies Compositions and methods for treating type 1 and type 2 diabetes and related disorders
WO2022003357A1 (en) * 2020-06-30 2022-01-06 Syncona Investment Management Limited Vector

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008013737A2 (en) * 2006-07-19 2008-01-31 University Of Florida Research Foundation, Inc. Compositions for reprogramming a cell and uses therefor
WO2015148832A1 (en) * 2014-03-27 2015-10-01 Salk Institute For Biological Studies Compositions and methods for treating type 1 and type 2 diabetes and related disorders
WO2022003357A1 (en) * 2020-06-30 2022-01-06 Syncona Investment Management Limited Vector

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
MENÉNDEZ-GUTIÉRREZ MARÍA P., RŐSZER TAMÁS, FUENTES LUCÍA, NÚÑEZ VANESSA, ESCOLANO AMELIA, REDONDO JUAN MIGUEL, DE CLERCK NORA, MET: "Retinoid X receptors orchestrate osteoclast differentiation and postnatal bone remodeling", JOURNAL OF CLINICAL INVESTIGATION, AMERICAN SOCIETY FOR CLINICAL INVESTIGATION, vol. 125, no. 2, 2 February 2015 (2015-02-02), pages 809 - 823, XP093203219, ISSN: 0021-9738, DOI: 10.1172/JCI77186 *
SALEH MOHAMED, GITTES GEORGE K., PRASADAN KRISHNA: "Alpha-to-beta cell trans-differentiation for treatment of diabetes", BIOCHEMICAL SOCIETY TRANSACTIONS, PORTLAND PRESS LTD, GB, vol. 49, no. 6, 17 December 2021 (2021-12-17), GB , pages 2539 - 2548, XP093024467, ISSN: 0300-5127, DOI: 10.1042/BST20210244 *
SZABAT, MARTA: "Imaging Maturation in Single Adult ?-Cells using Fluorescent Reporters for Ins1, Pdx1, MafA and MafB Promoter Activities.", CANADIAN JOURNAL OF DIABETES, vol. ABS. 57, 1 October 2012 (2012-10-01), pages S17 - S18, XP093203224, ISSN: 1499-2671, DOI: 10.1016/j.jcjd.2012.07.070 *
YIN QINGLEI, NI QICHENG, WANG YICHEN, ZHANG HONGLI, LI WENYI, NIE AIFANG, WANG SHU, GU YANYUN, WANG QIDI, NING GUANG: "Raptor determines β-cell identity and plasticity independent of hyperglycemia in mice", NATURE COMMUNICATIONS, NATURE PUBLISHING GROUP, UK, vol. 11, no. 1, 1 January 2020 (2020-01-01), UK, XP093203218, ISSN: 2041-1723, DOI: 10.1038/s41467-020-15935-0 *

Similar Documents

Publication Publication Date Title
EP3313991B1 (en) Modified factor ix, and compositions, methods and uses for gene transfer to cells, organs and tissues
JP6348064B2 (en) Viral vectors for efficient transgene delivery
CA2919103C (en) Variant aav and compositions, methods and uses for gene transfer to cells, organs and tissues
CN120505370A (en) Means and methods for preparing viral vectors and uses thereof
US6967018B2 (en) Adiponectin gene therapy
CN111718947B (en) Adeno-associated virus vector for treating type IIIA or IIIB mucopolysaccharidosis and use thereof
EP2709653B1 (en) Aav mediated exendin-4 gene transfer to salivary glands to protect subjects from diabetes or obesity
US10071172B2 (en) Endogenous neogenesis of beta cells
EP4432836A1 (en) Adeno-associated virus compositions having preferred heart and skeletal muscle enrichment
US20250066444A1 (en) Glucagon promoter for diabetes gene therapy
US20230058380A1 (en) Intrapancreatic m2 polarization of macrophages to treat type 1 diabetes
WO2024178007A1 (en) Mafb promoter for gene therapy for diabetes
US20220354969A1 (en) Enhancing viral vector-based gene therapy
US20250369011A1 (en) Insulin promoter for gene therapy for type 2 diabetes mellitus
KR20240005950A (en) vector system
CN116194587A (en) A new muscle-specific promoter
US20240191226A1 (en) Smad2 inhibition in beta cells for type 2 diabetes therapy
HK40085426A (en) Modified nucleic acids encoding aspartoacylase (aspa) and vector for gene therapy
BR112017027956B1 (en) RECOMBINANT ADENOASSOCIATED VIRUS VECTORS, THEIR USE AND PRODUCTION METHOD, AND PHARMACEUTICAL COMPOSITION
HK1252351B (en) Modified factor ix, and compositions, methods and uses for gene transfer to cells, organs and tissues

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24760864

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2024760864

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2024760864

Country of ref document: EP

Effective date: 20250922