WO2024231820A1 - Traitement de la maladie de pompe - Google Patents
Traitement de la maladie de pompe Download PDFInfo
- Publication number
- WO2024231820A1 WO2024231820A1 PCT/IB2024/054397 IB2024054397W WO2024231820A1 WO 2024231820 A1 WO2024231820 A1 WO 2024231820A1 IB 2024054397 W IB2024054397 W IB 2024054397W WO 2024231820 A1 WO2024231820 A1 WO 2024231820A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- gaa
- seq
- sequence
- polypeptide
- nucleic acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/86—Viral vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2408—Glucanases acting on alpha -1,4-glucosidic bonds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/0102—Alpha-glucosidase (3.2.1.20)
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/075—Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
- A01K2267/0306—Animal model for genetic diseases
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2750/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
- C12N2750/00011—Details
- C12N2750/14011—Parvoviridae
- C12N2750/14111—Dependovirus, e.g. adenoassociated viruses
- C12N2750/14141—Use of virus, viral particle or viral elements as a vector
- C12N2750/14143—Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/008—Vector systems having a special element relevant for transcription cell type or tissue specific enhancer/promoter combination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2451—Glucanases acting on alpha-1,6-glucosidic bonds
Definitions
- the disclosure relates to acid alpha-glucosidase (GAA) gene therapy.
- GAA acid alpha-glucosidase
- Acid alpha-glucosidase is an enzyme that is responsible for the critical degradation of glycogen in lysosomes of cells. Loss of its activity leads to progressive intralysosomal accumulation of undegraded glycogen and lysosomal distention.
- Pompe disease is caused by mutations and reduced activity of the GAA gene (gaa).
- PD can be broadly classified into infantile-onset (IOPD) or late-onset (LOPD) PD.
- IOPD patients have under 1% GAA activity, develop cardiomegaly, muscle weakness with hypotonia, hepatomegaly, breathing problems and die within the first year of life if left not treated.
- LOPD patients have at least 1% GAA activity and manifest a less severe phenotype but present with progressive limb muscle weakness and respiratory insufficiency.
- ERT enzyme replacement therapy
- Lumizyme® marketed as Myozyme® outside of the United States; Sanofi Genzyme
- Pompe disease remains a devastating illness.
- ERT enzyme replacement therapy
- LOPD LOPD is heterogenous and the severity falls along a spectrum, many patients still lose independent mobility and/or require ventilator support as their symptoms progress. Patients reach a clinical plateau within 2-3 years of treatment and some show a decline over time (Harfouche (2020) J. Patient Rep. Outcomes 4(1): 83).
- ERT therapy The primary deficiencies of ERT therapy are: (1) detrimental immune responses including neutralizing antibodies against recombinant GAA enzyme, especially in cross-reactive immune- material (CRIM) negative patients; (2) poor uptake of the GAA enzyme by muscle cells from circulation; (3) limited availability of the GAA enzyme in circulation (85% taken up by the liver); (4) reduced stability of the GAA enzyme at neutral pH; and (4) progressive endosomal dysfunction reducing efficacy of the endogenous enzyme delivery to lysosomes.
- Other complications associated with ERT include infusion site reactions and the requirement of biweekly or even weekly (in severe cases) infusions. These deficiencies of ERT translate to a poor quality of life, indicating a sustained unmet need for these patients.
- ERTs are being developed to address some of these problems and to improve the current standard of care (SOC).
- SOC standard of care
- Such strategies are focused on improving uptake and bioavailability of GAA into muscles and include: (1) development of a GAA with high mannose 6-phosphate (M6P) content to improve uptake from circulation; (2) chimeric GAA variants with synthetic uptake domains; (3) administering beta-2 agonists to upregulate the expression of the cation-independent M6P receptor (CI-MPR) to improve cellular uptake (Farah et al. (2014) FASEB J. 28(5):2272-2280); and (4) combining ERT with pharmacological chaperones to improve GAA enzyme stability in plasma (Okumiya et al.
- M6P mannose 6-phosphate
- CI-MPR cation-independent M6P receptor
- glycogen accumulates in virtually all tissues of PD patients, the clinical manifestations are predominantly observed in the skeletal, cardiac, and respiratory muscles.
- the major unmet needs in PD are due to limited availability of GAA enzyme in the respiratory and deep skeletal muscles that is a consequence of not only low levels of circulating GAA enzyme and poor GAA enzyme uptake into muscle cells, but also of reduced enzyme stability and exaggerated immune responses.
- GAA gene therapy has the potential to have a lasting therapeutic effect on patients suffering from PD by delivering continuous, high exposure of the missing enzyme to the affected tissues.
- GT GAA gene therapy
- ACT-CS101 C Identifier: NCT03533673
- Audentes Therapeutics, Inc.'s FORTIS study sponsored by Audentes’ acquirer Astellas Gene Therapies, is in phase 1/2 for LOPD. While the Audentes GT candidate, AT845, is utilizing a muscle-directed promoter, the AAV8 capsid being used does not have high muscle tropism in primates and a high dose of virus of 1x1014 vg/kg was required for efficient transduction and skeletal muscle glycogen normalization in Pompe mouse models. Also, in trials for a different rare muscle disease, the Audentes AAV8-delivered ATI 32 candidate has seen three clinical fatalities using doses greater than 1x1014 vg/kg.
- the disclosure relates to a gene therapy that addresses the shortcomings of ERTs and competitor gene therapy (GT) candidates, thereby providing a transformative therapy for Pompe disease (PD) patients.
- GT competitor gene therapy
- the present therapy targets hard to treat muscle tissues by combining (1) an engineered AAV capsid that efficiently delivers the genetic payload to muscles and can be dosed into environmentally seropositive patients (2) transcriptional elements that drive maximal expression of GAA in muscles, and (3) an engineered GAA protein variant that has enhanced stability, uptake, catalytic activity, and a reduced immunogenic profile.
- the present GAA GT efficiently delivers, and provides a high, continuous exposure, of missing GAA enzyme to GAA- deficient muscle tissues by optimizing tissue-specific delivery, expression and function of the GAA enzyme and as a result will significantly improve patient quality of life and extend the lifespan of Pompe disease patients.
- the present disclosure concerns methods and compositions to alleviate the primary unmet need of Pompe patients that leads to morbidity and mortality by achieving therapeutic GAA protein levels in the lysosomes of diaphragm, cardiac, skeletal, and smooth muscle cells.
- the muscle-tropic AAV9-based or chimeric capsid influences delivery (i.e., transduction) of the recombinant genome comprising the engineered GAA transgene to the targeted muscle tissue.
- the muscle-specific promoter and enhancer elements drive strong expression of the GAA transgene in muscle cells, as well as non-muscle cells.
- the engineered GAA protein is processed and trafficked to cellular lysosomes where it can act to break down glycogen.
- a portion of the engineered GAA that is expressed in transduced cells is secreted and taken up by surrounding non-transduced cells. This local cross-correction within muscle allows for better treatment of the difficult-to-reach deep skeletal muscle tissues. Importantly, the therapeutic mechanism of action does not require transport into the serum and passive uptake by muscle cells, a major limitation of liver-directed GT approaches. Further, as the average lifespan of skeletal myocytes is 15 years (Spalding (2005) Cell 122(1): 133-43) and >50% cardiomyocytes persist throughout an adult’s lifetime (Lazar (2017) Eur. Heart J.
- the disclosure provides a nucleic acid encoding an acid alphaglucosidase (GAA) protein, the nucleic acid comprising a first polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to CO3-MP- 6-dNA (SEQ ID NO:36).
- GAA acid alphaglucosidase
- the disclosure provides a nucleic acid encoding an acid alphaglucosidase (GAA) protein, the nucleic acid comprising a first polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to CO3-MP- WT-NA (SEQ ID NO: 34).
- GAA acid alphaglucosidase
- the disclosure provides an expression cassette comprising a GAA nucleic acid disclosed herein and at least one regulatory nucleic acid sequence operably linked to the sequence encoding the GAA protein.
- the disclosure provides a mammalian expression vector comprising an expression cassette described herein.
- the disclosure provides a recombinant acid alpha-glucosidase (GAA) variant protein, wherein the GAA variant protein comprises an amino acid substitution selected from the group consisting of T151I, L650G, L650S, L650T, L650E, L650Y, L650F, S676D, L678H, and L868F, numbered relative to the full-length wild type GAA protein sequence of FL- WT-AA (SEQ ID NO:2).
- GAA acid alpha-glucosidase
- the recombinant GAA variant protein of claim 59 comprising an amino acid substitution selected from the group consisting of T1511, L650G, S676D, and L678H, numbered relative to the full-length wild type GAA protein sequence of FL- WT-AA (SEQ ID NO:2).
- the disclosure provides a recombinant acid alpha-glucosidase (GAA) variant protein, wherein the GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to MP-6-AA (SEQ ID NO:37) and wherein the GAA variant comprises one or more variant amino acids selected from the group consisting of T151I, L650G, S676D, and L678H.
- GAA acid alpha-glucosidase
- the disclosure provides a method for treating Pompe disease in a subject in need thereof, the method comprising administering to the subject a therapeutically effective amount of a composition described herein.
- Figure 1A, IB, 1C, ID, IE, IF, 1G, and 1H collectively illustrate (A) a vector comprising a representative GAA transgene driven by a muscle specific promoter and enhancer packaged within an AAV9 capsid.
- AAV9.Dph-CRE04.SPc512.WT hGAA GAA protein expression measured using GAA activity in the (B) heart, (C) diaphragm, (D) quadriceps and (E) triceps of GAA-/- mice dosed with 3x1013 vg/kg AAV9 capsid vectors comprising constructs containing various enhancer elements (SKSH4, CSK-SH5 and/or Dph-CRE04) and/or promoters (CBA or SPc512 or Desmin) and codon optimized human GAA (CO3 GAA) or WT GAA using a 4-MUG assay.
- SKSH4, CSK-SH5 and/or Dph-CRE04 promoters
- CBA or SPc512 or Desmin codon optimized human GAA
- gaa+/+ wild type (WT) vehicle-treated mouse
- gaa-/- gaa knock-out (KO) vehicle-treated mouse.
- F collectively illustrate the GAA activity in heart tissue lysates, quadriceps tissue lysates, and diaphragm tissue lysates of GAA KO mice that received either vehicle or a vector expressing WT hGAA.
- G PAS staining of heart tissue from GAA KO mice after dosing with a 3x1013 vg/kg rAAV9.GAA construct containing SKSH4.Desmin.WT human GAA compared to control gaa+/+ and gaa-/- mice and
- H western blot analysis of heart tissue protein from GAA KO mice that received 3x1013 vg/kg AAV9 capsid vector comprising constructs containing WT hGAA , with or without an enhancer, or a buffer group showing trafficking and lysosomal processing of human GAA (lane 1 — WT hGAA; Lanes 2 and 10 — molecular weight markers; lanes 3 and 4 — heart protein from GAA KO (gaa-/-) mice treated with vectors comprising AAV9.
- FIGS 2A, 2B, and 2C collectively illustrate (A) GAA activity and (B) reduction in glycogen levels in heart, quadriceps, triceps, and diaphragm of GAA KO mice after dosing with 3x10 13 vg/kg AAV9 capsid vectors comprising constructs with or without enhancer elements using a 4-MUG assay and (C) tissue section images of (i) GAA protein by immunohistochemical staining (IHC); (ii) glycogen by Periodic Acid Schiff (PAS) staining; (iii) lysosomal-associated membrane protein 1 (LAMP-1) levels by IHC; and (iv) tissue morphology by haematoxylin and eosin (H&E) staining, in representative tissue sections of quadriceps of control and vector- treated mice dosed with 3x10 13 vg/kg AAV9 capsid vectors comprising constructs with or without an enhancer element.
- IHC immunohistochemical sta
- Figure 3 illustrates codon optimized GAA constructs tested, containing 5’ and 3’ inverted terminal repeats (ITRs) from AAV2, the Sk-SH4 enhancer, the human desmin promoter, the Minute Virus of Mice (MVM) intron, an SV40 polyadenylation (poly A) signal (SV40pA), and a DNA ID tag, for expressing WT human GAA (AAV9.Sk-SH4.desmin.WT GAA), CO1 GAA (AAV9.Sk-SH4. desmin. CO 1 GAA), CO2GAA (AAV9.Sk-SH4. desmin. CO2GAA), and CO3 GAA AAV9.Sk-SH4. desmin. CO3 GAA)
- FIGS 4A and 4B collectively illustrate (A) GAA activity levels or (B) reduction in glycogen levels in heart, quadriceps, and diaphragm in GAA KO mice after dosing with 3x10 13 vg/kg AAV9 capsid vectors comprising constructs containing a codon optimized human GAA (CO1, CO2, or CO3) using a 4-MUG assay.
- Figure 5 illustrates the activity of GAA variants (Var 6, Var 7, Var 8, Var 9, Var 10, Var 11, Var 12, and Var 13) as compared to controls (WT hGAA, control plasmid, and no plasmid) in C2C12 gaa 1 ' mouse muscle cells following transfection of the cells with plasmids and activity measured in cell extracts using the 4-MUG assay.
- Figures 6A and 6B collectively illustrate (A) kinetic activity using 4-MUG as the substrate and (B) activity on glycogen of GAA variant 6 (Var 6) compared to WT hGAA.
- Figures 11 A, 11B, 11C, 11D, HE, 11F, HG, 11H, and HI collectively illustrate SEQ ID NOs: 1, 2, and 13-30, representing numerous human GAA polynucleotides and protein variants.
- Figures 12A, 12B, and 12C collectively illustrate SEQ ID NOs: 31, 60, and 61, representing codon-altered human GAA polynucleotides and protein variants.
- Figures 13A and 13B collectively illustrate SEQ ID NOs: 34-45, representing polynucleotides and polypeptides associated with human GAA protein variants.
- Figure 14 illustrates SEQ ID NOs: 46-49, representing polynucleotides and polypeptides of promoters, enhancers, and mammalian expression vectors.
- Figures 15A, 15B, and 15C illustrate SEQ ID NO: 60; representing the polynucleotide associated with a human GAA protein variant (CO3-FL-6-dNA).
- Figure 16 illustrates the list of IUPAC degenerate nucleotide codes.
- Figures 17A, 17B, and 17C illustrate SEQ ID NO: 36; representing the polynucleotide associated with a human GAA protein variant (CO3-MP-6-dNA).
- ERT regimens including: (1) detrimental immune responses including neutralizing antibodies against recombinant GAA enzyme, especially in cross-reactive immune-material (CRIM) negative patients; (2) poor uptake of the GAA enzyme by muscle cells from circulation; (3) limited availability of the GAA enzyme in circulation (85% taken up by the liver); (4) reduced stability of the GAA enzyme at neutral pH; and (4) progressive endosomal dysfunction reducing efficacy of the endogenous enzyme delivery to lysosomes.
- AAV therapy for Pompe disease while promising, have to this point encountered similar difficulties in the case of liver-directed therapies (e.g., low serum stability, poor uptake and inefficient deep tissue distribution).
- AAV8 therapies with muscle-specific expression have been forced to rely upon dangerously high doses.
- Alpha-glucosidase and “GAA” are used interchangeably and refer a protein with glucosidase activity for hydrolyzing terminal, non-reducing (1— >4)- linked a-D-glucose residues in polysaccharides with release of D-glucose (e.g., active GAA, also referred to herein as “GAA mature polypeptide,” “GAA MP,” or simply “MP”) or a protein precursor thereof (e.g., a pro-protein or a pre-pro-protein, often referred to as pGAA and ppGAA), e.g., as measured by quantification of glucose release from glycogen following incubation with the GAA polypeptide.
- active GAA also referred to herein as “GAA mature polypeptide,” “GAA MP,” or simply “MP”
- MP a protein precursor thereof
- pGAA and ppGAA e.g., as measured by quantification of glucose release from glycogen following incubation with the GAA polypeptide
- GAA is translated as an inactive, single-chain polypeptide that includes a signal peptide and a propeptide, often referred to as a GAA pre-pro-protein.
- the GAA pre-pro-protein undergoes post-translational processing to form an active GAA protein. This processing includes removal (e.g., by cleavage) of the signal peptide, followed by removal (e.g., by cleavage) of the propeptide, to form a mature GAA polypeptide.
- polynucleotides encoding the wild-type human GAA encode for an inactive single-chain polypeptide (e.g., a pre-pro-protein; amino acids 1-952 of GAA-FL-WT-AA (SEQ ID NO:2)) that undergoes post-translational processing to form an active GAA protein.
- a pre-pro-protein e.g., amino acids 1-952 of GAA-FL-WT-AA (SEQ ID NO:2)
- the GAA pre-pro-protein is first cleaved with a signal peptidase to release the encoded signal peptide (amino acids 1-27 of GAA-FL-WT-AA (SEQ ID NO:2)), forming a GAA precursor (amino acids 28-952 of SEQ ID NO:2; 110 kDa).
- the GAA precursor is cleaved by additional proteases to release a first associated polypeptide of 19.4 kD (amino acids 792-952 of SEQ ID NO:2), a second associated polypeptide of 3.9 kD (amino acids 78-113 of SEQ ID NO:2), and a third associated polypeptide of 10.4 kDa (amino acids 122-200 of SEQ ID NO:2), forming a mature GAA (amino acids 203-782 of SEQ ID NO:2; 70 kDa).
- the “MP” designation refers to a precursor polypeptide that includes the first associated polypeptide, the second associated polypeptide, the third associated polypeptide, and the mature GAA polypeptide.
- GAA polypeptide refers to a polypeptide having GAA glucosidase activity under particular conditions, e.g., as measured by quantification of glucose release from glycogen following incubation with the GAA polypeptide.
- GAA polypeptides include precursor polypeptides (e.g., GAA pre-pro-polypeptides and pro-polypeptides) which, when activated by the post-translational processing described above, become active GAA polypeptides with GAA glucosidase activity, as well as the active GAA polypeptides (e.g., GAA-MP) themselves.
- a human GAA polypeptide refers to a polypeptide that includes an amino acid sequence with high sequence identity (e.g., at least 85%, 90%, 95%, 99%, or more) to the portion of the wild-type human GAA polypeptide that includes the mature GAA polypeptide, GAA-MP-AA (SEQ ID NO:35) or to the portions of the disclosed variant GAA polypeptides (variants 6-13, shown in Figures 1 IB-1 II.
- GAA polypeptides Specifically included in the definition of GAA polypeptides are GAA polypeptides with one or more of the amino acid substitutions T151I, L650G, L650S, L650T, L650E, L650Y, L650F, S676D, and L678H, relative to the wild-type human GAA polypeptide, present in variants 6-13 described herein.
- Non-limiting examples of wild type GAA polypeptides include human GAA polypeptides (e.g., GenBank accession nos. NP_000143.2 (GAA-FL-WT-AA (SEQ ID NO:2)) and UniProt accession no. P10253), and natural variants thereof; bovine GAA (e.g., UniProt accession no. Q9MYM4); murine GAA (e.g., UniProt accession no. P70699); rat GAA (e.g., UniProt accession no. Q6P7A9), and natural variants thereof; and other mammalian GAA homologues (e.g., chimpanzee, ape, hamster, guinea pig, etc.).
- human GAA polypeptides e.g., GenBank accession nos. NP_000143.2 (GAA-FL-WT-AA (SEQ ID NO:2)
- UniProt accession no. P10253 UniProt accession no
- GAA polypeptide includes natural variants and artificial constructs.
- GAA encompasses any natural variants, alternative sequences, isoforms, or mutant proteins that retain some basal GAA glucosidase activity (e.g., at least 5%, 10%, 25%, 50%, 75%, or more of the corresponding wild type activity as assayed), including one or more variant amino acids found in the human population, such as S46P, C103G, C103R, C127F, R190H, Y191C, L208P, P217L, G219R, R224P, R224Q, R224W, T234K, T234R, A237, S251L, S254L, E262K, P266S, P285R, P285S, L291F, L291P, Y292C, G293R, L299R, H308L, H308P, G309R
- GAA protein i.e., as translated with a signal peptide and propeptide
- GAA protein can include one or more variants, with the Variant 6 finding particular use in some embodiments.
- This is referred to as “GAA-FL-Var6-AA” (SEQ ID NO: 14) with the nucleic acid sequence being referred to herein as "GAA-FL-Var6-NA.”
- codon- optimized sequences CO1-FL-WT-AA, CO2-FL-WT-AA, and CO3-FL-WT-AA exemplified herein, also encode the full-length GAA protein.
- specifically included in the definition of GAA is all such variants exemplified herein.
- GAA amino acids refers to the corresponding amino acid in the full-length, wild-type human GAA pre-pro-polypeptide sequence (GAA-FL-WT-AA), presented as SEQ ID NO: 2 in Figure 11 A.
- the recited amino acid number refers to the analogous (e.g., structurally or functionally equivalent) and/or homologous (e.g., evolutionarily conserved in the primary amino acid sequence) amino acid in the full-length, wild-type GAA pre-pro-polypeptide sequence.
- a T151I amino acid substitution refers to an threonine to isoleucine substitution at position 151 of the full-length, wild-type human GAA pre-pro-peptide sequence (GAA-FL-WT-AA (SEQ ID NO:2)), as well as a T to I substitution at position 151 of the mature, wild-type GAA single-chain polypeptide (GAA-MP-WT-AA (SEQ ID NO:35)). Both of these nomenclatures describe the same T to I amino acid substitution, in different GAA polypeptides.
- GAA polynucleotide refers to a polynucleotide encoding a GAA polypeptide having GAA glucosidase activity under particular conditions, e.g., as measured by quantification of glucose release from glycogen following incubation with the GAA polypeptide.
- GAA polynucleotides include polynucleotides encoding GAA precursor polypeptides, including GAA pre-pro-polypeptides, GAA pro-polypeptides, and mature, singlechain GAA polypeptides.
- GAA polynucleotides Specifically included in the definition of GAA polynucleotides are polynucleotides encoding a GAA polypeptide that includes one or more of the amino acid substitutions T151I, L650G, L650S, L650T, L650E, L650Y, L650F, S676D, L678H, S676D, and L678H, relative to the wild-type human GAA polypeptide.
- a human GAA polynucleotide refers to a polynucleotide that encodes a polypeptide that includes an amino acid sequence with high sequence identity (e.g., at least 85%, 90%, 95%, 99%, or more) to the portion of the wild-type human GAA polypeptide that includes the mature GAA polypeptide, GAA-MP-AA (SEQ ID NO:35) or to the portions of the disclosed variant GAA polypeptides (variants 6-13), shown in Figures 1 IB- 1 II.
- an amino acid sequence with high sequence identity e.g., at least 85%, 90%, 95%, 99%, or more
- GAA polynucleotides can include regulatory elements, such as promoters, enhancers, terminators, polyadenylation sequences, and introns, as well viral packaging elements, such as inverted terminal repeats (“ITRs”), and/or other elements that support replication of the polynucleotide in a non-viral host cell, e.g., a replicon supporting propagation of the polynucleotide, e.g., in a bacterial, yeast, or mammalian host cell.
- regulatory elements such as promoters, enhancers, terminators, polyadenylation sequences, and introns
- viral packaging elements such as inverted terminal repeats (“ITRs”), and/or other elements that support replication of the polynucleotide in a non-viral host cell, e.g., a replicon supporting propagation of the polynucleotide, e.g., in a bacterial, yeast, or mammalian host cell.
- codon-altered GAA polynucleotides are codon-altered GAA polynucleotides.
- the codon-altered GAA polynucleotides provide increased expression of transgenic GAA in vivo, as compared to the level of GAA expression provided by a natively- coded GAA construct (e.g., a polynucleotide encoding the same GAA amino acid sequence using the wild-type human codons).
- the term “increased expression” refers to an increased level of transgenic GAA protein in a tissue (e.g., a muscular tissue) of an animal administered the codon-altered polynucleotide encoding GAA, as compared to the level of transgenic GAA protein in the same tissue of an animal administered a natively-coded GAA construct. Increased expression of the protein leads to an increase in GAA activity; thus, increased expression leads to increased activity.
- increased expression refers to at least 25% greater transgenic GAA polypeptide in a tissue of an animal administered the codon-altered GAA polynucleotide, as compared to the level of transgenic GAA polypeptide in the same tissue of an animal administered a natively-coded GAA polynucleotide.
- increased expression refers to an effect generated by the alteration of the codon sequence, rather than hyperactivity caused by an underlying amino acid substitution. That is, the expression level obtained from a codon-optimized sequence encoding a GAA variant described herein is compared relative to the expression level obtained from a natively-coded GAA variant protein.
- increased expression refers to at least 50% greater, at least 75% greater, at least 100% greater, at least 3 -fold greater, at least 4-fold greater, at least 5-fold greater, at least 6- fold greater, at least 7-fold greater, at least 8-fold greater, at least 9-fold greater, at least 10-fold greater, at least 15-fold greater, at least 20-fold greater, at least 25-fold greater, at least 30-fold greater, at least 40-fold greater, at least 50-fold greater, at least 60-fold greater, at least 70-fold greater, at least 80-fold greater, at least 90-fold greater, at least 100-fold greater, at least 125-fold greater, at least 150-fold greater, at least 175-fold greater, at least 200-fold greater, at least 225- fold greater, or at least 250-fold greater transgenic GAA polypeptide in a tissue of an animal administered the codon-altered GAA polynucleotide, as compared to the level of transgenic GAA polypeptide in the same tissue of an animal administered a
- GAA activity or “GAA glucosidase activity” herein is meant the ability to hydrolyzing terminal, non-reducing (1 — >4)-linked a-D-glucose residues in polysaccharides with release of D-glucose.
- the activity levels can be measured using any GAA activity known in the art.
- An exemplary assay for determining GAA activity is quantification of glucose release from glycogen following incubation with the GAA polypeptide.
- the therapeutic potential of a GAA polynucleotide composition is evaluated by the increase in GAA activity in a tissue of an animal administered a GAA polynucleotide, e.g., instead of or in addition to increased GAA expression in the tissue.
- increased GAA activity refers to a greater increase in GAA activity in a tissue of an animal administered a codon-altered GAA polynucleotide, relative to a baseline GAA activity in the tissue of the animal prior to administration of the codon-altered GAA polynucleotide, as compared to the increase in GAA activity in the same tissue of an animal administered a natively-coded GAA polynucleotide, relative to a baseline GAA activity in the tissue of the animal prior to administration of the natively- coded GAA polynucleotide.
- increased GAA activity refers to at least a 25% greater increase in GAA activity in a tissue of an animal administered the codon-altered GAA polynucleotide, relative to a baseline level of GAA activity in the tissue of the animal prior to administration of the codon- altered GAA polynucleotide, as compared to the increase in the level GAA activity in the blood of an animal administered a natively-coded GAA polynucleotide, relative to the baseline level of GAA activity in the animal prior to administration of the natively-coded GAA polynucleotide.
- increased GAA activity refers to at least 50% greater, at least 75% greater, at least 100% greater, at least 3-fold greater, at least 4-fold greater, at least 5-fold greater, at least 6-fold greater, at least 7-fold greater, at least 8-fold greater, at least 9-fold greater, at least 10-fold greater, at least 15-fold greater, at least 20-fold greater, at least 25-fold greater, at least 30-fold greater, at least 40-fold greater, at least 50-fold greater, at least 60-fold greater, at least 70-fold greater, at least 80-fold greater, at least 90-fold greater, at least 100-fold greater, at least 125-fold greater, at least 150-fold greater, at least 175-fold greater, at least 200- fold greater, at least 225-fold greater, or at least 250-fold greater increase in GAA activity in a tissue of an animal administered the codon-altered GAA polynucleotide, relative to a baseline level of GAA activity in the tissue of the animal prior to administration of the codon-altered
- the GAA amino acid numbering system is dependent on whether the GAA pre-pro-peptide (e.g., amino acids 1-69 of the full-length, wild-type human GAA sequence, inclusive of the signal peptide and pro-peptide) is included. Where the pre-pro- peptide is included, the numbering is referred to as “pre-pro-peptide inclusive” or “PPI”. Where the pre-pro-peptide is not included, the numbering is referred to as “pre-pro-peptide exclusive” or “PPE.” For example, LI 17D is PPI numbering for the same amino acid substitution as L48D, in PPE numbering.
- GAA amino acid numbering is also dependent upon the size of the of the signal peptide and/or propeptide in the particular GAA polypeptide.
- GAA gene therapy includes any therapeutic approach of providing an exogenous nucleic acid encoding GAA to a patient to relieve, diminish, or prevent the reoccurrence of one or more symptoms (e.g., clinical factors) associated with a GAA deficiency (e.g., Pompe disease).
- the term encompasses administering any compound, drug, procedure, or regimen comprising a nucleic acid encoding a GAA molecule, including any modified form of GAA (e.g., a GAA variant 6), for maintaining or improving the health of an individual with a GAA deficiency (e.g., Pompe disease).
- a GAA deficiency e.g., Pompe disease
- One skilled in the art will appreciate that either the course of GAA gene therapy or the dose of a GAA gene therapy therapeutic agent can be changed, e.g., based upon the results obtained in accordance with the present disclosure.
- a therapeutically effective amount or dose or “therapeutically sufficient amount or dose” or “effective or sufficient amount or dose” refer to a dose that produces therapeutic effects for which it is administered.
- a therapeutically effective amount of a drug useful for treating Pompe disease can be the amount that is capable of preventing or relieving one or more symptoms associated with Pompe disease.
- a therapeutically effective treatment results in a decrease in the severity of musculoskeletal ailments (e.g., limb-girdle muscle weakness (LGMW)) in a subject.
- LGMW limb-girdle muscle weakness
- the term “gene” refers to the segment of a DNA molecule that codes for a polypeptide chain (e.g., the coding region).
- a gene is positioned by regions immediately preceding, following, and/or intervening the coding region that are involved in producing the polypeptide chain (e.g., regulatory elements such as a promoter, enhancer, polyadenylation sequence, 5' -untranslated region, 3 ' -untranslated region, or intron).
- regulatory elements refers to nucleotide sequences, such as promoters, enhancers, terminators, polyadenylation sequences, introns, etc., that provide for the expression of a coding sequence in a cell.
- promoter element refers to a nucleotide sequence that assists with controlling expression of a coding sequence. Generally, promoter elements are located 5' of the translation start site of a gene. However, in certain embodiments, a promoter element may be located within an intron sequence, or 3' of the coding sequence.
- a promoter useful for a gene therapy vector is derived from the native gene of the target protein (e.g., a GAA promoter). In some embodiments, a promoter useful for a gene therapy vector is specific for expression in a particular cell or tissue of the target organism (e.g., a muscle-specific promoter).
- one of a plurality of well characterized promoter elements is used in a gene therapy vector described herein.
- well-characterized promoter elements include the CMV early promoter, the (3 -actin promoter, and the methyl CpG binding protein 2 (MeCP2) promoter.
- the promoter is a constitutive promoter, which drives substantially constant expression of the target protein.
- the promoter is an inducible promoter, which drives expression of the target protein in response to a particular stimulus (e.g., exposure to a particular treatment or agent).
- an “MVM intron” refers to an intron sequence derived from minute virus of mice having high sequence identity to SEQ ID NO: 50.
- MVM intron For further information on the MVM intron itself, see Haut and Pintel, J Virol. 72(3): 1834-43 (1998), and use of the MVM intron in AAV gene therapy vectors, see Wu Z et al., Mol Then, 16(2):280-9 (2008), both of which are hereby incorporated by reference.
- operably linked refers to the relationship between a first reference nucleotide sequence (e.g., a gene) and a second nucleotide sequence (e.g., a regulatory control element) that allows the second nucleotide sequence to affect one or more properties associated with the first reference nucleotide sequence (e.g., a transcription rate).
- a regulatory control element is operably linked to a GAA transgene when the regulatory element is positioned within a gene therapy vector such that it exerts an effect (e.g., a promotive or tissue selective affect) on transcription of the GAA transgene.
- a vector refers to any nucleic acid construct used to transfer a GAA nucleic acid into a host cell.
- a vector includes a replicon, which functions to replicate the nucleic acid construct.
- Non-limiting examples of vectors useful for gene therapy include plasmids, phages, cosmids, artificial chromosomes, and viruses, which function as autonomous units of replication in vivo.
- a vector is a viral vector for introducing a GAA nucleic acid into the host cell.
- Many modified eukaryotic viruses useful for gene therapy are known in the art. For example, adeno-associated viruses (AAVs) are particularly well suited for use in human gene therapy because humans are a natural host for the virus, the native viruses are not known to contribute to any diseases, and the viruses illicit a mild immune response.
- AAVs adeno-associated viruses
- GAA viral vector refers to a recombinant virus comprising a GAA polynucleotide, encoding a GAA polypeptide, which is sufficient for expression of the GAA polypeptide when introduced into a suitable animal host (e.g., a human).
- a suitable animal host e.g., a human
- recombinant viruses in which a codon- altered GAA polynucleotide, which encodes a GAA polypeptide, has been inserted into the genome of the virus.
- GAA viral vectors are recombinant viruses in which the native genome of the virus has been replaced with a GAA polynucleotide, which encodes a GAA polypeptide. Included within the definition of GAA viral vectors are recombinant viruses comprising a GAA polynucleotide which encodes a GAA polypeptide with one or more of the amino acid substitutions T151I, L650G, L650S, L650T, L650E, L650Y, L650F, S676D, L678H, S676D, and L678H, relative to the wild-type human GAA polypeptide.
- GAA viral particle refers to a viral particle encapsidating a GAA polynucleotide, encoding a GAA polypeptide, which is specific for expression of the GAA polypeptide when introduced into a suitable animal host (e.g., a human).
- a suitable animal host e.g., a human
- recombinant viral particles encapsidating a genome in which a codon-altered GAA polynucleotide, which encodes a GAA polypeptide, has been inserted.
- GAA viral particles are recombinant viral particles encapsidating a GAA polynucleotide, which encodes a GAA polypeptide, which replaces the native genome of the virus. Included within the definition of GAA viral particles are recombinant viral particles encapsidating a GAA polynucleotide which encodes a GAA polypeptide with one or more of the amino acid substitutions T151I, L650G, L650S, L650T, L650E, L650Y, L650F, S676D, and L678H, relative to the wild-type human GAA polypeptide, present in variant 6 described herein.
- AAV adeno-associated virus
- ssDNA linear single-stranded DNA
- kb kilobases
- AAVs are not currently known to cause disease and cause only a very mild immune response.
- Gene therapy vectors using AAVs can “infect” or transduce both dividing and quiescent cells and persist in an extrachromosomal state without integrating into the genome of the host cell or integrating the genome at a low frequency.
- the wt genome comprises inverted terminal repeats (ITRs) at both ends of the DNA strand, and two open reading frames (ORFs): rep and cap.
- the former is composed of four overlapping genes encoding Rep proteins required for the AAV life cycle, and the latter contains overlapping nucleotide sequences of capsid proteins: VP1, VP2 and VP3, which interact to form a capsid with icosahedral symmetry.
- ITRs seem to be the only sequences required in cis next to the therapeutic gene: structural (cap) and packaging (rep) proteins can be delivered in trans. With this assumption many methods were established for efficient production of recombinant AAV (rAAV), or engineered AAV, vectors containing a heterologous sequence, e.g., a reporter or nucleic acid encoding a therapeutic gene product.
- AAV type 1 AAV1
- AAV type 2 AAV2
- AAV type 3 AAV3
- AAV type 4 AAV4
- AAV type 5 AAV5
- AAV type 6 AAV6
- AAV type 7 AAV7
- AAV8 AAV8
- AAV type 9 AAV9 viruses
- viruses e.g., encapsidating a GAA polynucleotide
- viruses formed by one or more variant AAV capsid proteins e.g., encapsidating a GAA polynucleotide.
- viruses formed using a non-naturally occurring, engineered capsid protein e.g., recombinant viruses and viral particles formed using a non-naturally occurring, engineered capsid protein.
- capsid protein refers to an expression product of a cap nucleic acid from an AAV serotype that forms a protein shell for an AAV virus, such as a wt capsid protein from serotypes 1, 6, 8, or 9; or a protein that shares at least 50% (alternatively at least 75, 80, 85, 90, 95, 96, 97, 98, 99%, or 99.5%) amino acid sequence identity with a wt capsid protein and displays a functional activity of a wt capsid protein.
- a “functional activity” of a protein is any activity associated with the physiological function of the protein, whether in vitro, ex vivo, or in vivo.
- functional activities of an AAV capsid protein may include its ability to form a capsid, evade host antibodies, recognize, and enter a cell, deliver DNA genome to the nucleus and transcription of its DNA genome.
- the capsid protein is a variant of the wt capsid protein with an altered functional activity such as tissue transduction or tissue tropism, e.g., into or to muscle, respectively.
- tissue transduction or tissue tropism e.g., into or to muscle, respectively.
- encapsidates or “packaged” means encloses or surrounds a gene or virus in a protein shell or capsid.
- the capsid polypeptides described herein refer to the VP1 form of the capsid polypeptide. However, it will be appreciated that these VP1 sequences also define the sequences of the VP2 form and the VP3 form.
- CpG refers to a cytosine-guanine dinucleotide along a single strand of DNA, with the “p” representing the phosphate linkage between the two.
- CpG island refers to a region within a polynucleotide having a statistically elevated density of CpG dinucleotides.
- a region of a polynucleotide e.g., a polynucleotide encoding a codon-altered GAA protein
- a region of a polynucleotide is a CpG island if, over a 200- base pair window: (i) the region has GC content of greater than 50%, and (ii) the ratio of observed CpG dinucleotides per expected CpG dinucleotides is at least 0.6, as defined by the relationship:
- nucleic acid refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form and complements thereof.
- the term encompasses nucleic acids containing known nucleotide analogs or modified backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides.
- Examples of such analogs include, without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, and peptide-nucleic acids (PNAs).
- PNAs peptide-nucleic acids
- nucleic acid compositions herein is meant any molecule or formulation of a molecule that includes a GAA polynucleotide, encoding a GAA polynucleotide. Included within the definition of nucleic acid compositions are GAA polynucleotides, aqueous solutions of GAA polynucleotides, viral particles encapsidating a GAA polynucleotide, and aqueous formulations of viral particles encapsidating a GAA polynucleotide.
- a nucleic acid composition, as disclosed herein, includes a codon-altered GAA gene, that encodes a GAA polypeptide.
- amino acid refers to naturally occurring and non-natural amino acids, including amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.
- Naturally occurring amino acids include those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- carboxyglutamate, and O-phosphoserine.
- Naturally occurring amino acids can include, e.g., D- and L-amino acids.
- amino acid sequences one of ordinary skill in the art will recognize that individual substitutions, deletions or additions to a nucleic acid or peptide sequence that alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the disclosure.
- nucleic acids or peptide sequences refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same (i.e., about 60% identity, preferably 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters described below, or by manual alignment and visual inspection.
- sequence identity and/or similarity is determined using standard techniques known in the art, including, but not limited to, the local sequence identity algorithm of Smith & Waterman, Adv. Appl. Math., 2:482 (1981), by the sequence identity alignment algorithm of Needleman & Wunsch, J. Mol. Biol., 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Natl. Acad. Sci.
- PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pair wise alignments. It may also plot a tree showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351- 360 (1987); the method is similar to that described by Higgins & Sharp CABIOS 5:151-153 (1989), both incorporated by reference.
- Useful PILEUP parameters including a default gap weight of 3.00, a default gap length weight of 0.10, and weighted end gaps.
- Another example of a useful algorithm is the BLAST algorithm, described in: Altschul et al., J. Mol. Biol. 215, 403-410, (1990); Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997); and Karlin et al., Proc. Natl. Acad. Sci. U.S.A. 90:5873-5787 (1993), both incorporated by reference.
- a particularly useful BLAST program is the WU-BLAST-2 program which was obtained from Altschul et al., Methods in Enzymology, 266:460-480 (1996); http://blast.wustl/edu/blast/ README.html].
- WU-BLAST-2 uses several search parameters, most of which are set to the default values.
- the HSP S and HSP S2 parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched; however, the values may be adjusted to increase sensitivity.
- Gapped BLAST uses BLOSUM-62 substitution scores; threshold T parameter set to 9; the two-hit method to trigger ungapped extensions; charges gap lengths of k a cost of 10+k; Xu set to 16, and Xg set to 40 for database search stage and to 67 for the output stage of the algorithms. Gapped alignments are triggered by a score corresponding to ⁇ 22 bits.
- a % amino acid sequence identity value is determined by the number of matching identical residues divided by the total number of residues of the “longer” sequence in the aligned region.
- the “longer” sequence is the one having the most actual residues in the aligned region (gaps introduced by WU-Blast-2 to maximize the alignment score are ignored).
- “percent (%) nucleic acid sequence identity” with respect to the coding sequence of the polypeptides identified is defined as the percentage of nucleotide residues in a candidate sequence that are identical with the nucleotide residues in the coding sequence of the cell cycle protein.
- a preferred method utilizes the BLASTN module of WU-BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 0.125, respectively.
- the alignment may include the introduction of gaps in the sequences to be aligned.
- sequences which contain either more or fewer amino acids than the protein encoded by the wild-type GAA sequence of (SEQ ID NO:2) it is understood that in one embodiment, the percentage of sequence identity will be determined based on the number of identical amino acids or nucleotides in relation to the total number of amino acids or nucleotides.
- sequence identity of sequences shorter than SEQ ID NO:2 will be determined using the number of nucleotides in the shorter sequence, in one embodiment. In percent identity calculations relative weight is not assigned to various manifestations of sequence variation, such as, insertions, deletions, substitutions, etc.
- identity is scored positively (+1) and all forms of sequence variation including gaps are assigned a value of “0”, which obviates the need for a weighted scale or parameters as described below for sequence similarity calculations. Percent sequence identity may be calculated, for example, by dividing the number of matching identical residues by the total number of residues of the “shorter” sequence in the aligned region and multiplying by 100. The “longer” sequence is the one having the most actual residues in the aligned region.
- allelic variants refers to polymorphic forms of a gene at a particular genetic locus, as well as cDNAs derived from mRNA transcripts of the genes, and the polypeptides encoded by them.
- the term “preferred mammalian codon” refers a subset of codons from among the set of codons encoding an amino acid that are most frequently used in proteins expressed in mammalian cells as chosen from the following list: Gly (GGC, GGG); Glu (GAG); Asp (GAC); Vai (GTG, GTC); Ala (GCC, GCT); Ser (AGC, TCC); Lys (AAG); Asn (AAC); Met (ATG); He (ATC); Thr (ACC); Trp (TGG); Cys (TGC); Tyr (TAT, TAC); Leu (CTG); Phe (TTC); Arg (CGC, AGG, AGA); Gin (CAG); His (CAC); and Pro (CCC).
- the term “codon-altered” or “codon-optimized” refers to a polynucleotide sequence encoding a polypeptide (e.g., a GAA protein), where at least one codon of the native polynucleotide encoding the polypeptide has been changed to improve a property of the polynucleotide sequence.
- the improved property promotes increased transcription of mRNA coding for the polypeptide, increased stability of the mRNA (e.g., improved mRNA half-life), increased translation of the polypeptide, and/or increased packaging of the polynucleotide within the vector.
- Non-limiting examples of alterations that can be used to achieve the improved properties include changing the usage and/or distribution of codons for particular amino acids, adjusting global and/or local GC content, removing AT-rich sequences, removing repeated sequence elements, adjusting global and/or local CpG dinucleotide content, removing cryptic regulatory elements (e.g., TATA box and CCAAT box elements), removing of intron/exon splice sites, improving regulatory sequences (e.g., introduction of a Kozak consensus sequence), and removing sequence elements capable of forming secondary structure (e.g., stemloops) in the transcribed mRNA.
- cryptic regulatory elements e.g., TATA box and CCAAT box elements
- intron/exon splice sites e.g., introduction of a Kozak consensus sequence
- improving regulatory sequences e.g., introduction of a Kozak consensus sequence
- sequence elements capable of forming secondary structure e.g., stemloops
- CO-number refers to codon altered polynucleotides encoding GAA polypeptides and/or the encoded polypeptides, including variants.
- C03-FL refers to the Full Length codon altered CO3 polynucleotide sequence or amino acid sequence (sometimes referred to herein as “CO3-WT-FL-AA” for the Amino Acid sequence and “CO3-FL-NA” for the Nucleic Acid sequence) encoded by the CO3 polynucleotide sequence.
- amino acid sequences will be identical, as the amino acid sequences are not altered by the codon optimization.
- sequence constructs of the disclosure include, but are not limited to, C01-FL-WT-NA, C01-FL-6-NA, COl-FL-6-dNA, CO2-FL-6-NA, CO2-FL-6-dNA, CO3-FL-6-NA, CO3-FL-6-dNA, C01-FL-MP-NA, C01-MP- 6-NA, C01-MP-6-dNA, CO2-MP-6-NA, CO2-MP-6-dNA, CO3-MP-6-dNA, and CO3-MP-6- dNA.
- all “CO” constructs herein encode or contain the GAA amino acid sequence, although included within the definition of CO constructs are those that encode or contain the human wild type GAA amino acid sequence.
- muscle-specific expression refers to the preferential or predominant in vivo expression of a particular gene (e.g., a codon-altered, transgenic GAA gene) in musculoskeletal tissue, as compared to in other tissues.
- musclespecific expression means that at least 50% of all expression of the particular gene occurs within hepatic tissues of a subject.
- muscle-specific expression means that at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% of all expression of the particular gene occurs within musculoskeletal tissues of a subject.
- a musclespecific regulatory element is a regulatory element that drives muscle-specific expression of a gene in musculoskeletal tissue.
- the terms “about” and “approximately” include being within a statistically meaningful range of a value. Such a range can be within an order of magnitude, e.g., within 50%, within 20%, within 10%, and within 5% of a given value or range. The allowable variation encompassed by the term “about” or “approximately” depends on the particular system under study, and can be readily appreciated by one of ordinary skill in the art.
- the present disclosure provides codon-altered polynucleotides encoding a GAA polypeptide, e.g., a wild- type or variant GAA polypeptide. These codon- altered polynucleotides provide markedly improved expression of GAA glucosidase activity in vivo, as demonstrated in Example 3. Specifically, Applicants have achieved these advantages through the discovery of several codon-altered polynucleotide schemas, referred to herein as CO1, CO2, and CO3, for encoding a GAA polypeptide.
- a codon-altered polynucleotide provided herein has a nucleotide sequence with high sequence identity to C01-FL-WT-NA (SEQ ID NO: 60), CO2-FL-WT-NA (SEQ ID NO: 62), or CO3-FL- WT-NA (SEQ ID NO:31) encoding a human GAA pre-pro-polypeptide.
- the wild-type human GAA gene encodes a pre-pro-polypeptide having a 27 amino acid signal peptide (1-27 of SEQ ID NO:2) and an 42 amino acid pro-peptide (aa 28-69 of SEQ ID NO:2), which are cleaved from the encoded polypeptide prior to activation of GAA.
- signal peptides and/or pro-peptides may be mutated, replaced by signal peptides and/or pro-peptides from other genes or other organisms, or completely removed, without affecting the sequence of the mature polypeptide left after the signal and pro-peptide are removed by cellular processing.
- a codon-altered polynucleotide provided herein has a nucleotide sequence with high sequence identity to CO1- FL-MP-NA (SEQ ID NO: 60), CO2-FL-MP-NA (SEQ ID NO: 62), or CO3-FL-MP-NA (SEQ ID NO:31), e.g., where the wild-type human GAA signal peptide and/or propeptide has been modified or replaced with an alternative signal peptide and/or propeptide.
- the improved expression of GAA glucosidase activity provided by the CO1, CO2, and CO3 polynucleotide sequences are further improved when placed in operable communication with a muscle-specific regulatory control element, such as Dph-CRE04_NA (SEQ ID NO: 48) or sk-SH4-NA (SEQ ID NO: 49).
- a muscle-specific regulatory control element such as Dph-CRE04_NA (SEQ ID NO: 48) or sk-SH4-NA (SEQ ID NO: 49).
- the disclosure provides polynucleotides having a codon-altered GAA polynucleotide that is operably linked to a muscle-specific regulatory control element.
- the disclosure provides codon-altered polynucleotides encoding a GAA polypeptide containing one or more known amino acid substitution.
- GAA variant polypeptides having advantageous properties, e.g., improved specific activity, improved thermostability, and/or reduced immunogenicity, have been discovered, e.g., GAA variants 6-13.
- the disclosure provides codon-altered polynucleotides encoding a GAA polypeptide containing one or more amino acid substitutions present in any one of GAA variants 6-13: T151I, L650G, L650S, L650T, L650E, L650Y, L650F, S676D, L678H, L678T, T700G, A719H, A758P, A820E, Q838K, L868F, L879E, R891H, Q902G, V921R, and S940A.
- the codon-altered polynucleotide encodes a GAA polypeptide having any combination of the amino acid substitutions present in the GAA variant 6: T151I, L650G, S676D, and L678H.
- GC content of human genes varies widely, from less than 25% to greater than 90%. However, in general, human genes with higher GC contents are expressed at higher levels. For example, Kudla et al. (PLoS Biol., 4(6):80 (2006)) demonstrate that increasing a gene’s GC content increases expression of the encoded polypeptide, primarily by increasing transcription and effecting a higher steady state level of the mRNA transcript. Generally, the desired GC content of a codon-optimized gene construct is thought to be equal or greater than 60%. However, native AAV genomes have GC contents of around 56%.
- the codon-altered polynucleotides provided herein have a CG content that more closely matches the GC content of native AAV virions (e.g., around 56% GC), which is lower than the preferred CG contents of polynucleotides that are conventionally codon-optimized for expression in mammalian cells (e.g., at or above 60% GC).
- CO1-FL-WT-NA (SEQ ID NO:60) has a GC content of about 63.7%
- CO2-FL- WT-NA (SEQ ID NO:62) has a GC content of about 59.1%
- CO3-FL-WT-NA (SEQ ID NO:31) has a GC content of about 57.5%.
- the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is no more than 60%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is no more than 59%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is no more than 58%.
- the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is no more than 57%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is no more than 56%. In some embodiments, the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is no more than 55%. [0092] In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 55% to 60%.
- the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 56% to 60%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 57% to 60%. In some embodiments, the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is from 58% to 60%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 59% to 60%.
- the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is from 55% to 59%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 56% to 59%. In some embodiments, the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is from 57% to 59%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 58% to 59%.
- the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is from 55% to 58%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 56% to 58%. In some embodiments, the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is from 57% to 58%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is from 55% to 57%. In some embodiments, the overall GC content of a codon- altered polynucleotide encoding a GAA polypeptide is from 56% to 57%.
- the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is 57.5 ⁇ 0.5%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is 57.5 ⁇ 0.4%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is 57.5 ⁇ 0.3%. In some embodiments, the overall GC content of a codon-altered polynucleotide en-coding a GAA polypeptide is 57.5 ⁇ 0.2%.
- the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is 57.5 ⁇ 0.1%. In some embodiments, the overall GC content of a codon-altered polynucleotide encoding a GAA polypeptide is 57.5%.
- CpG dinucleotides i.e., a cytosine nucleotide followed by a guanine nucleotide
- CpG-depleted AAV vectors evade immune detection in mice, under certain circumstances (Faust et al., J. Clin. Invest. 2013; 123, 2994-3001).
- the wild type GAA coding sequence contains over 120 CpG dinucleotides.
- the codon-altered polynucleotides provided herein are codon-altered to reduce the number of CpG dinucleotides in the GAA coding sequence.
- CO3-FL-WT-NA (SEQ ID NO:31) has no CpG dinucleotides
- CO1-FL-WT-NA SEQ ID NO:60
- CO2-FL-WT-NA (SEQ ID NO:62) has no CpG dinucleotides.
- a sequence of a codon-altered polynucleotide encoding a GAA polypeptide has less than 20 CpG dinucleotides.
- a sequence of a codon-altered polynucleotide encoding a GAA polypeptide has less than 15 CpG dinucleotides.
- a sequence of a codon-altered polynucleotide encoding a GAA polypeptide has less than 12 CpG dinucleotides.
- a sequence of a codon-altered polynucleotide encoding a GAA polypeptide has less than 10 CpG dinucleotides. In some embodiments, a sequence of a codon-altered polynucleotide encoding a GAA polypeptide has less than 5 CpG dinucleotides. In some embodiments, a sequence of a codon- altered polynucleotide encoding a GAA polypeptide has less than 3 CpG dinucleotides. In some embodiments, a sequence of a codon-altered polynucleotide encoding a GAA polypeptide has no CpG dinucleotides.
- sequence of a codon-altered polynucleotide encoding a GAA polypeptide has no more than 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, or no CpG dinucleotides.
- a nucleic acid composition provided herein includes a GAA polynucleotide (e.g., a codon-altered polynucleotide) encoding a GAA polypeptide, where the GAA polynucleotide includes a nucleotide sequence having high sequence identity to all or a portion of the CO1 codon-optimized sequence.
- GAA polynucleotide e.g., a codon-altered polynucleotide
- the GAA polynucleotide includes a sequence having high sequence identity to the portion of the CO1 codon- optimized sequence that encodes for the mature GAA polypeptide. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide has at least 95% identity to C01-MP-WT-NA (SEQ ID NO:63). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to C01-MP-WT-NA (SEQ ID NO:63). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 97% identity to C01-MP-WT-NA (SEQ ID NO:63).
- the sequence of the codon-altered polynucleotide has at least 98% identity to C01-MP-WT-NA (SEQ ID NO:63). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 99% identity to C01-MP-WT-NA (SEQ ID NO:63). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to C01-MP-WT-NA (SEQ ID NO:63). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.9% identity to C01-MP-WT-NA (SEQ ID NO:63).
- the sequence of the codon-altered polynucleotide is CO1-MP- WT-NA (SEQ ID NO:63).
- SEQ ID NO:63 When determining the sequence identity between a GAA polypeptide and the portion of the CO1 codon-optimized sequence that encodes for the mature GAA polypeptide, only the portions of the sequence encoding the mature polypeptide should be considered. That is, the GAA polynucleotide may also encode for a signal peptide, a propeptide, and/or a purification/detection tag, but the sequence comparison should not include these sequences.
- a GAA polynucleotide having high sequence identity to CO1-MP- WT-NA further includes a polynucleotide sequence encoding a GAA signal peptide having the amino acid sequence of SP-WT-AA (SEQ ID NO:43).
- the GAA signal polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, or 100% identical to CO1-SP-WT-NA (SEQ ID NO: 70).
- a GAA polynucleotide having high sequence identity to CO1-MP- WT-NA further includes a polynucleotide sequence encoding a GAA pro-peptide having the amino acid sequence of PP-WT-AA (SEQ ID NO:39).
- the GAA pro-peptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO1-PP-WT-NA (SEQ ID NO: 71).
- the GAA polynucleotide includes a sequence having high sequence identity to the entirety of the CO1 codon-optimized sequence, encoding for the GAA pre-pro-polypeptide. Accordingly, in some embodiments, the sequence of the codon-altered polynucleotide has at least 95% identity to CO1-FL-WT-NA (SEQ ID NO:60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to CO1- FL-WT-NA (SEQ ID NO: 60).
- the sequence of the codon-altered polynucleotide has at least 97% identity to CO1-FL-WT-NA (SEQ ID NO:60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 98% identity to CO1- FL-WT-NA (SEQ ID NO: 60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99% identity to CO1-FL-WT-NA (SEQ ID NO:60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to CO1-FL-WT-NA (SEQ ID NO:60).
- sequence of the codon-altered polynucleotide has at least 99.9% identity to CO1-FL-WT-NA (SEQ ID NO:60). In another specific embodiment, the sequence of the codon-altered polynucleotide is CO1-FL-WT-NA (SEQ ID NO: 60).
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human wild-type mature GAA polypeptide (MP-WT-AA; SEQ ID NO:35). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to MP-WT-AA (SEQ ID NO:35).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to MP-WT-AA (SEQ ID NO: 35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to MP-WT-AA (SEQ ID NO:35).
- the encoded GAA polypeptide has a sequence that is at least 99.5% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence identical to MP-WT-AA (SEQ ID NO:35).
- the GAA polypeptide may also include a signal peptide, a pro-peptide, and/or a purification/ detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human wild-type GAA pre-pro-polypeptide (FL-WT-AA; SEQ ID NO:2). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to FL-WT-AA (SEQ ID NO:2).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to FL-WT-AA (SEQ ID NO: 2).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence identical to FL-WT-AA (SEQ ID NO:2).
- the GAA polypeptide may also include a purification/detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes one or more known amino acid substitutions, e.g., one or more amino acid substitutions described in U.S. Patent Application Publication No. 2021/0189365, the content of which is incorporated herein by reference in its entirety.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes one or more amino acid substitutions present in one of GAA variants 6-13 described herein.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes one or more amino acid substitutions present in GAA variant 6: T151I, L650G, S676D, and L678H. In some embodiments, the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes all of the amino acid substitutions present in GAA variant 6 described herein.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human variant 6 mature GAA polypeptide (MP- 6-AA; SEQ ID NO:37). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 96% identical to MP-6-AA (SEQ ID NO: 37).
- the encoded GAA polypeptide has a sequence that is at least 97% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to MP- 6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-6-AA (SEQ ID NO: 37).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence identical to MP-6-AA (SEQ ID NO:37).
- the GAA polypeptide may also include a signal peptide, a pro-peptide, and/or a purification/ detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human variant 6 GAA pre-pro-polypeptide (FL- 6-AA; SEQ ID NO: 14). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to FL-6-AA (SEQ ID NO: 14).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to FL- 6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to FL-6-AA (SEQ ID NO: 14).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence identical to FL-6-AA (SEQ ID NO: 14). When determining the sequence identity between an encoded GAA polypeptide and the GAA pre-pro-polypeptide, only the portions of the sequence corresponding to the pre-pro- polypeptide should be considered. That is, the GAA polypeptide may also include a purification/detection tag, but the sequence comparison should not include these sequences.
- the nucleotide sequence of the GAA polynucleotide having high sequence identity to a CO1 codon-optimized sequence (e.g., SEQ ID NO:60 or 63) has a reduced GC content, as compared to the wild-type GAA coding sequence SEQ ID NO: 1, as described above. Accordingly, in some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has a GC content of no more than 66%. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has a GC content of no more than 63.5%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has a GC content of no more than 65%, no more than 64%, no more than 63%, no more than 62%, or no more than 61%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of from 61% to 66%. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of from 62% to 66%, from 63% to 66%, from 64% to 66%, from 65% to 66%, from 61% to 65%, from 62% to 65%, from 63% to 65%, from 64% to 65%, from 61% to 64%, from 62% to 64%, from 63% to 64%, from 61% to 63%, from 62% to 63%, or from 61% to 62%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of 63.5% ⁇ 1.0. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of 63.5% ⁇ 0.8. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has a GC content of 63.5% ⁇ 0.6.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of 63.5% ⁇ 0.5. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has a GC content of 63.5% ⁇ 0.4. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of 63.5% ⁇ 0.3.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of 63.5% ⁇ 0.2. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has a GC content of 63.5% ⁇ 0.1. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has a GC content of 63.5%.
- the nucleotide sequence of the GAA polynucleotide having high sequence identity to a CO1 codon-optimized sequence (e.g., SEQ ID NO:60 or 63) has a reduced number of CpG dinucleotides, as compared to the wild-type GAA coding sequence SEQ ID NO: 1, as described above. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide having high sequence identity to a CO1 codon-optimized sequence has no more than 15 CpG dinucleotides.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no more than 10 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no more than 5 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no more than 4 CpG dinucleotides.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no more than 3 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no more than 2 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no more than 1 CpG dinucleotide. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO1 codon- optimized sequence has no CpG dinucleotides.
- a nucleic acid composition provided herein includes a GAA polynucleotide (e.g., a codon-altered polynucleotide) encoding a GAA polypeptide, where the GAA polynucleotide includes a nucleotide sequence having high sequence identity to all or a portion of the CO2 codon-optimized sequence.
- GAA polynucleotide e.g., a codon-altered polynucleotide
- the GAA polynucleotide includes a sequence having high sequence identity to the portion of the CO2 codon- optimized sequence that encodes for the mature GAA polypeptide. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide has at least 95% identity to CO2-MP-WT-NA (SEQ ID NO: 64). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to CO2-MP-WT-NA (SEQ ID NO:64). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 97% identity to CO2-MP-WT-NA (SEQ ID NO:64).
- the sequence of the codon-altered polynucleotide has at least 98% identity to CO2-MP-WT-NA (SEQ ID NO:64). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 99% identity to CO2-MP-WT-NA (SEQ ID NO:64). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to C02-MP-WT-NA (SEQ ID NO: 64). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.9% identity to C02-MP-WT-NA (SEQ ID NO:64).
- the sequence of the codon-altered polynucleotide is C02-MP- WT-NA (SEQ ID NO: 64).
- SEQ ID NO: 64 the sequence identity between a GAA polypeptide and the portion of the CO2 codon-optimized sequence that encodes for the mature GAA polypeptide. That is, the GAA polynucleotide may also encode for a signal peptide, a propeptide, and/or a purification/detection tag, but the sequence comparison should not include these sequences.
- a GAA polynucleotide having high sequence identity to CO2-MP- WT-NA further includes a polynucleotide sequence encoding a GAA signal peptide having the amino acid sequence of SP-WT-AA (SEQ ID NO:43).
- the GAA signal polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, or 100% identical to CO2-SP-WT-NA (SEQ ID NO: 73).
- a GAA polynucleotide having high sequence identity to CO2-MP- WT-NA further includes a polynucleotide sequence encoding a GAA pro-peptide having the amino acid sequence of PP-WT-AA (SEQ ID NO:39).
- the GAA pro-peptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO2-PP-WT-NA (SEQ ID NO:74).
- the GAA propeptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO2-PP-46-NA (SEQ ID NO:75).
- the GAA polynucleotide includes a sequence having high sequence identity to the entirety of the CO2 codon-optimized sequence, encoding for the GAA pre-pro-polypeptide. Accordingly, in some embodiments, the sequence of the codon-altered polynucleotide has at least 95% identity to CO2-FL-WT-NA (SEQ ID NO:62). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to CO2- FL-WT-NA (SEQ ID NO: 62).
- the sequence of the codon-altered polynucleotide has at least 97% identity to CO2-FL-WT-NA (SEQ ID NO:62). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 98% identity to CO2- FL-WT-NA (SEQ ID NO: 62). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99% identity to C02-FL-WT-NA (SEQ ID NO:62). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to C02-FL-WT-NA (SEQ ID NO:62).
- sequence of the codon-altered polynucleotide has at least 99.9% identity to C02-FL-WT-NA (SEQ ID NO:62). In another specific embodiment, the sequence of the codon-altered polynucleotide is C02-FL-WT-NA (SEQ ID NO: 62).
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human wild-type mature GAA polypeptide (MP-WT-AA; SEQ ID NO:35). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to MP-WT-AA (SEQ ID NO:35).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to MP-WT-AA (SEQ ID NO: 35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to MP-WT-AA (SEQ ID NO:35).
- the encoded GAA polypeptide has a sequence that is at least 99.5% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence identical to MP-WT-AA (SEQ ID NO:35).
- the GAA polypeptide may also include a signal peptide, a pro-peptide, and/or a purifi cation/ detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human wild-type GAA pre-pro-polypeptide (FL-WT-AA; SEQ ID N0:2). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to FL-WT-AA (SEQ ID NO:2).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to FL-WT-AA (SEQ ID NO: 2).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence identical to FL-WT-AA (SEQ ID NO:2).
- the GAA polypeptide may also include a purification/detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes one or more known amino acid substitutions, e.g., one or more amino acid substitutions described in U.S. Patent Application Publication No. 2021/0189365, the content of which is incorporated herein by reference in its entirety.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes one or more amino acid substitutions present in one of GAA variants 1-5 described herein.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes one or more amino acid substitutions present in one of GAA variants 6-13 described herein.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes one or more amino acid substitutions present in GAA variant 6: T151I, L650G, S676D, and L678H.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO1 codon-optimized sequence includes all of the amino acid substitutions present in GAA variant 6 described herein.
- a GAA polynucleotide having high sequence identity to CO2-MP- 46-NA (SEQ ID NO: 68) further includes a polynucleotide sequence encoding a GAA pro-peptide having the amino acid sequence of PP-WT-AA (SEQ ID NO:39).
- the GAA pro-peptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO2-PP-WT-NA (SEQ ID NO: 74).
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human variant 6 mature GAA polypeptide (MP- 6-AA; SEQ ID NO:37). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 96% identical to MP-6-AA (SEQ ID NO: 37).
- the encoded GAA polypeptide has a sequence that is at least 97% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to MP- 6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-6-AA (SEQ ID NO: 37).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence identical to MP-6-AA (SEQ ID NO:37).
- the GAA polypeptide may also include a signal peptide, a pro-peptide, and/or a purification/ detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO2 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human variant 6 GAA pre-pro-polypeptide (FL- 6-AA; SEQ ID NO: 14). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to FL-6-AA (SEQ ID NO: 14).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to FL- 6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to FL-6-AA (SEQ ID NO: 14).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence identical to FL-6-AA (SEQ ID NO: 14). When determining the sequence identity between an encoded GAA polypeptide and the GAA pre-pro-polypeptide, only the portions of the sequence corresponding to the pre-pro- polypeptide should be considered. That is, the GAA polypeptide may also include a purification/detection tag, but the sequence comparison should not include these sequences.
- the nucleotide sequence of the GAA polynucleotide having high sequence identity to a CO2 codon-optimized sequence (e.g., SEQ ID NO:62 or 64) has a reduced GC content, as compared to the wild-type GAA coding sequence SEQ ID NO: 1, as described above. Accordingly, in some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of no more than 61.5%. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of no more than 59%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of no more than 60.5%, no more than 59.5%, no more than 58.5%, no more than 57.5%, or no more than 56.5%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of from 56.5% to 61.5%. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of from 57.5% to 61.5%, from 58.5% to 61.5%, from 59.5% to 61.5%, from 60.5% to 61.5%, from 56.5% to 60.5%, from 57.5% to 60.5%, from 58.5% to 60.5%, from 59.5% to 60.5%, from 56.5% to 59.5%, from 57.5% to 59.5%, from 58.5% to 59.5%, from 56.5% to 58.5%, from 57.5% to 58.5%, from 57.5% to 58.5%, or from 56.5% to 57.5%.5%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of 59% ⁇ 1.0. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of 59% ⁇ 0.8. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of 59% ⁇ 0.6.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of 59% ⁇ 0.5. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of 59% ⁇ 0.4. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of 59% ⁇ 0.3. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of 59% ⁇ 0.2.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has a GC content of 59% ⁇ 0.1. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has a GC content of 59%.
- the nucleotide sequence of the GAA polynucleotide having high sequence identity to a CO2 codon-optimized sequence (e.g., SEQ ID NO:62 or 64) has a reduced number of CpG dinucleotides, as compared to the wild-type GAA coding sequence SEQ ID NO: 1, as described above. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide having high sequence identity to a CO2 codon-optimized sequence has no more than 15 CpG dinucleotides.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no more than 10 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no more than 5 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no more than 4 CpG dinucleotides.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no more than 3 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no more than 2 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no more than 1 CpG dinucleotide. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO2 codon- optimized sequence has no CpG dinucleotides.
- a nucleic acid composition provided herein includes a GAA polynucleotide (e.g., a codon-altered polynucleotide) encoding a GAA polypeptide, where the GAA polynucleotide includes a nucleotide sequence having high sequence identity to all or a portion of the CO3 codon-optimized sequence.
- GAA polynucleotide e.g., a codon-altered polynucleotide
- the GAA polynucleotide includes a sequence having high sequence identity to the portion of the CO3 codon- optimized sequence that encodes for the mature GAA polypeptide. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide has at least 95% identity to CO3-MP-WT-NA (SEQ ID NO:34). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to CO3-MP-WT-NA (SEQ ID NO:34). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 97% identity to CO3-MP-WT-NA (SEQ ID NO:34).
- the sequence of the codon-altered polynucleotide has at least 98% identity to CO3-MP-WT-NA (SEQ ID NO:34). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 99% identity to C03-MP-WT-NA (SEQ ID NO:34). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to C03-MP-WT-NA (SEQ ID NO:34). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.9% identity to C03-MP-WT-NA (SEQ ID NO:34).
- the sequence of the codon-altered polynucleotide is C03-MP- WT-NA (SEQ ID NO: 34).
- SEQ ID NO: 34 When determining the sequence identity between a GAA polypeptide and the portion of the CO3 codon-optimized sequence that encodes for the mature GAA polypeptide, only the portions of the sequence encoding the mature polypeptide should be considered. That is, the GAA polynucleotide may also encode for a signal peptide, a propeptide, and/or a purification/detection tag, but the sequence comparison should not include these sequences.
- a GAA polynucleotide having high sequence identity to CO3-MP- WT-NA further includes a polynucleotide sequence encoding a GAA signal peptide having the amino acid sequence of SP-WT-AA (SEQ ID NO:43).
- the GAA signal polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, or 100% identical to CO3-SP-WT-NA (SEQ ID NO:42).
- a GAA polynucleotide having high sequence identity to CO3-MP- WT-NA further includes a polynucleotide sequence encoding a GAA pro-peptide having the amino acid sequence of PP-WT-AA (SEQ ID NO:39).
- the GAA pro-peptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO3-PP-WT-NA (SEQ ID NO:38).
- the GAA propeptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO3-PP-46-NA (SEQ ID NO:40).
- the GAA polynucleotide includes a sequence having high sequence identity to the entirety of the CO3 codon-optimized sequence, encoding for the GAA pre-pro-polypeptide. Accordingly, in some embodiments, the sequence of the codon-altered polynucleotide has at least 95% identity to CO3-FL-WT-NA (SEQ ID NO:31). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to COS- FL- WT-NA (SEQ ID NO: 31).
- the sequence of the codon-altered polynucleotide has at least 97% identity to CO3-FL-WT-NA (SEQ ID NO:31). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 98% identity to CO3- FL-WT-NA (SEQ ID NO: 31). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99% identity to C03-FL-WT-NA (SEQ ID NO:31). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to C03-FL-WT-NA (SEQ ID NO:31).
- sequence of the codon-altered polynucleotide has at least 99.9% identity to C03-FL-WT-NA (SEQ ID NO:31). In another specific embodiment, the sequence of the codon-altered polynucleotide is CO3 -FL-WT-NA (SEQ ID NO:31).
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human wild-type mature GAA polypeptide (MP-WT-AA; SEQ ID NO:35). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to MP-WT-AA (SEQ ID NO:35).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to MP-WT-AA (SEQ ID NO: 35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to MP-WT-AA (SEQ ID NO:35).
- the encoded GAA polypeptide has a sequence that is at least 99.5% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-WT-AA (SEQ ID NO:35). In some embodiments, the encoded GAA polypeptide has a sequence identical to MP-WT-AA (SEQ ID NO:35).
- the GAA polypeptide may also include a signal peptide, a pro-peptide, and/or a purifi cation/ detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human wild-type GAA pre-pro-polypeptide (FL-WT-AA; SEQ ID N0:2). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to FL-WT-AA (SEQ ID NO:2).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to FL-WT-AA (SEQ ID NO: 2).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-WT-AA (SEQ ID NO:2). In some embodiments, the encoded GAA polypeptide has a sequence identical to FL-WT-AA (SEQ ID NO:2).
- the GAA polypeptide may also include a purification/detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes one or more known amino acid substitutions, e.g., one or more amino acid substitutions described in U.S. Patent Application Publication No. 2021/0189365, the content of which is incorporated herein by reference in its entirety.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes one or more amino acid substitutions present in one of GAA variants 1-5 described herein.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes one or more amino acid substitutions present in one of GAA variants 6-13 described herein. [00136] In some embodiments, the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes one or more amino acid substitutions present in GAA variant 6: T151I, L650G, S676D, and L678H. In some embodiments, the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes all of the amino acid substitutions present in GAA variant 6 described herein.
- the sequence of the codon-altered polynucleotide has at least 95% identity to the portion of a codon- optimized GAA polynucleotide encoding the mature polypeptide of the GAA variant 6 (CO3-MP-6-dNA; SEQ ID NO:36).
- the sequence of the codon-altered polynucleotide has at least 96% identity to CO3- MP-6-dNA (SEQ ID NO: 36).
- the sequence of the codon-altered polynucleotide has at least 97% identity to CO3-MP-6-dNA (SEQ ID NO:36).
- the sequence of the codon-altered polynucleotide has at least 98% identity to CO3- MP-6-dNA (SEQ ID NO: 36). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99% identity to CO3-MP-6-dNA (SEQ ID NO:36). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to CO3-MP-6-dNA (SEQ ID NO:36). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.9% identity to CO3-MP-6-dNA (SEQ ID NO:36).
- the sequence of the codon-altered polynucleotide is CO3-MP-6-dNA (SEQ ID NO: 36).
- SEQ ID NO: 36 When determining the sequence identity between a GAA polypeptide and the portion of the CO3 codon- optimized sequence that encodes for the mature GAA polypeptide, only the portions of the sequence encoding the mature polypeptide should be considered. That is, the GAA polypeptide may also encode for a signal peptide, a pro-peptide, and/or a purification/detection tag, but the sequence comparison should not include these sequences.
- a GAA polynucleotide having high sequence identity to CO3-MP- 6-dNA (SEQ ID NO: 36) further includes a polynucleotide sequence encoding a GAA signal peptide having the amino acid sequence of SP-WT-AA (SEQ ID NO:43).
- the GAA signal polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, or 100% identical to CO3-SP-WT-NA (SEQ ID NO:42).
- a GAA polynucleotide having high sequence identity to CO3-MP- 6-dNA (SEQ ID NO: 36) further includes a polynucleotide sequence encoding a GAA pro-peptide having the amino acid sequence of PP-WT-AA (SEQ ID NO:39).
- the GAA pro-peptide polynucleotide has a nucleic acid sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to CO3-PP-WT-NA (SEQ ID NO:38).
- the GAA polynucleotide includes a sequence having high sequence identity to the entirety of a codon-optimized GAA polynucleotide encoding the variant 6 GAA pre-pro-polypeptide. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide has at least 95% identity to CO3-FL-6-dNA (SEQ ID NO:60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 96% identity to CO3-FL-6-dNA (SEQ ID NO: 60).
- the sequence of the codon- altered polynucleotide has at least 97% identity to CO3-FL-6-dNA (SEQ ID NO: 60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 98% identity to CO3-FL-6-dNA (SEQ ID NO: 60). In a specific embodiment, the sequence of the codon- altered polynucleotide has at least 99% identity to CO3-FL-6-dNA (SEQ ID NO: 60). In a specific embodiment, the sequence of the codon-altered polynucleotide has at least 99.5% identity to CO3-FL-6-dNA (SEQ ID NO: 60).
- sequence of the codon-altered polynucleotide has at least 99.9% identity to CO3-FL-6-dNA (SEQ ID NO: 60). In another specific embodiment, the sequence of the codon-altered polynucleotide is CO3-FL-6- dNA (SEQ ID NO: 60).
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human variant 6 mature GAA polypeptide (MP- 6-AA; SEQ ID NO:37). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 96% identical to MP-6-AA (SEQ ID NO: 37).
- the encoded GAA polypeptide has a sequence that is at least 97% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to MP- 6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.5% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-6-AA (SEQ ID NO: 37).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to MP-6-AA (SEQ ID NO: 37). In some embodiments, the encoded GAA polypeptide has a sequence identical to MP-6-AA (SEQ ID NO:37).
- the GAA polypeptide may also include a signal peptide, a pro-peptide, and/or a purification/ detection tag, but the sequence comparison should not include these sequences.
- the GAA polypeptide encoded by a GAA polynucleotide having high sequence identity to the CO3 codon-optimized sequence includes an amino acid sequencing having high sequence identity to the human variant 6 GAA pre-pro-polypeptide (FL- 6-AA; SEQ ID NO: 14). Accordingly, in some embodiments, the encoded GAA polypeptide has a sequence that is at least 90% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 95% identical to FL-6-AA (SEQ ID NO: 14).
- the encoded GAA polypeptide has a sequence that is at least 96% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 97% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 98% identical to FL- 6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least 99% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence that is at least FL-6-AA (SEQ ID NO: 14).
- the encoded GAA polypeptide has a sequence that is at least 99.8% identical to FL-6-AA (SEQ ID NO: 14). In some embodiments, the encoded GAA polypeptide has a sequence identical to FL-6-AA (SEQ ID NO: 14).
- the GAA polypeptide may also include a purification/detection tag, but the sequence comparison should not include these sequences.
- the nucleotide sequence of the GAA polynucleotide having high sequence identity to a CO3 codon-optimized sequence (e.g., SEQ ID NO:31 or 34) has a reduced GC content, as compared to the wild-type GAA coding sequence SEQ ID NO: 1, as described above. Accordingly, in some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has a GC content of no more than 60%. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has a GC content of no more than 57.5%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has a GC content of no more than 59%, no more than 58%, no more than 57%, no more than 56%, or no more than 55%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of from 55% to 60%. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of from 56% to 60%, from 57% to 60%, from 58% to 60%, from 59% to 60%, from 55% to 59%, from 56% to 59%, from 57% to 59%, from 58% to 59%, from 55% to 58%, from 56% to 58%, from 57% to 58%, from 55% to 57%, from 56% to 57%, or from 55% to 56%.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of 57.5% ⁇ 1.0. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of 57.5% ⁇ 0.8. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has a GC content of 57.5% ⁇ 0.6.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of 57.5% ⁇ 0.5. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has a GC content of 57.5% ⁇ 0.4. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of 57.5% ⁇ 0.3.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of 57.5% ⁇ 0.2. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a GC content of 57.5% ⁇ 0.1. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has a GC content of 57.5%.
- the nucleotide sequence of the GAA polynucleotide having high sequence identity to a CO3 codon-optimized sequence has a reduced number of CpG dinucleotides, as compared to the wild-type GAA coding sequence SEQ ID NO: 1, as described above. Accordingly, in some embodiments, the sequence of the codon- altered polynucleotide having high sequence identity to a CO3 codon-optimized sequence has no more than 15 CpG dinucleotides.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no more than 10 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no more than 5 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no more than 4 CpG dinucleotides.
- the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no more than 3 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no more than 2 CpG dinucleotides. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no more than 1 CpG dinucleotide. In some embodiments, the sequence of the codon-altered polynucleotide having high sequence identity to a CO3 codon- optimized sequence has no CpG dinucleotides.
- an expression cassette for expressing a GAA polynucleotide as disclosed herein, e.g., a codon-altered GAA polynucleotide.
- an expression cassette comprises one or more nucleic acids encoding a GAA protein and at least one regulatory nucleic acid sequence operably linked to the sequence encoding the GAA protein.
- the at least one regulatory nucleic acid sequence is selected from the group consisting of a promoter, an enhancer, an intron, a post- transcriptional regulatory element, an inverted terminal repeat (ITR), a polyadenylation (poly A) sequence, and a combination thereof.
- the at least one regulatory nucleic acid sequence comprises a promoter.
- the promoter is a muscle-specific promoter.
- the muscle-specific promoter comprises a polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SPc512_NA (SEQ ID NO:46).
- the muscle-specific promoter comprises the polynucleotide sequence of SPc512_NA (SEQ ID NO:46).
- the muscle-specific promoter comprises a polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to HsDesmin NA (SEQ ID NO:47). In some embodiments, the muscle-specific promoter comprises the polynucleotide sequence of HsDesmin NA (SEQ ID NO:47).
- the at least one regulatory nucleic acid sequence comprises an enhancer.
- the enhancer is a muscle-specific enhancer.
- the muscle-specific enhancer comprises a polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to Dph-CRE04_NA (SEQ ID NO:48).
- the muscle-specific enhancer comprises the polynucleotide sequence of Dph-CRE04_NA (SEQ ID NO:48).
- the muscle-specific enhancer comprises a polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to sk-SH4_NA (SEQ ID NO:49). In some embodiments, the muscle-specific enhancer comprises the polynucleotide sequence of sk-SH4_NA (SEQ ID NO:49).
- the at least one regulatory nucleic acid sequence comprises an intron.
- the intron comprises a polynucleotide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to MVM_NA (SEQ ID NO: 50).
- the intron comprises the polynucleotide sequence of MVM NA (SEQ ID NO: 50).
- the codon-altered polynucleotides and associated expression cassettes described herein are integrated into expression vectors.
- expression vectors include viral vectors (e.g., vectors suitable for gene therapy), plasmid vectors, bacteriophage vectors, cosmids, phagemids, artificial chromosomes, and the like.
- the gene therapy vector is an adeno-associated virus (AAV) based gene therapy vector.
- AAV systems have been described previously and are generally well known in the art (Kelleher and Vos, Biotechniques, 17(6): 1110-17 (1994); Cotten et al., P.N.A.S. U.S.A., 89(13):6094-98 (1992); Curiel, Nat lmmun, 13(2-3): 141-64 (1994); Muzyczka, Curr Top Microbiol Immunol, 158:97-129 (1992); and Asokan A, et al., Mol.
- the expression cassette is a mammalian expression vector.
- the mammalian expression vector comprises an adeno-associated virus (AAV) vector.
- the AAV vector comprises an AAV8 or AAV9 capsid polypeptide encapsidating the expression cassette.
- the AAV vector comprises an engineered capsid polypeptide encapsidating the expression cassette.
- the codon-altered polynucleotides described herein are integrated into a viral gene therapy vector.
- viral vectors include: retrovirus, e.g., Moloney murine leukemia virus (MMLV), Harvey murine sarcoma virus, murine mammary tumor virus, and Rous sarcoma virus; adenoviruses, adeno-associated viruses; SV40- type viruses; polyomaviruses; Epstein-Barr viruses; papilloma viruses; herpes viruses; vaccinia viruses; and polio viruses.
- retrovirus e.g., Moloney murine leukemia virus (MMLV), Harvey murine sarcoma virus, murine mammary tumor virus, and Rous sarcoma virus
- adenoviruses adeno-associated viruses
- SV40- type viruses polyomaviruses
- Epstein-Barr viruses Epstein-Barr viruses
- papilloma viruses herpes viruses
- the gene therapy vector is a retrovirus, and particularly a replication-deficient retrovirus.
- Protocols for the production of replication-deficient retroviruses are known in the art. For review, see Kriegler, M., Gene Transfer and Expression, A Laboratory Manual, W.H. Freeman Co., New York (1990) and Murry, E. J., Methods in Molecular Biology, Vol. 7, Humana Press, Inc., Cliffton, N.J. (1991).
- the codon-altered polynucleotides described herein are integrated into a retroviral expression vector.
- These systems have been described previously, and are generally well known in the art (Mann et al., Cell, 33: 153-159, 1983; Nicolas and Rubinstein, In: Vectors: A survey of molecular cloning vectors and their uses, Rodriguez and Denhardt, eds., Stoneham: Butterworth, pp. 494-513, 1988; Temin, In: Gene Transfer, Kucherlapati (ed.), New York: Plenum Press, pp. 149-188, 1986).
- the retroviral vector is a lentiviral vector (see, for example, Naldini et al., Science, 272(5259): 263- 267, 1996; Zufferey etal., Nat Biotechnol, 15(9):871-875, 1997; Blomer etal., J Virol., 71(9): 6641-6649, 1997; U.S. Pat. Nos. 6,013,516 and 5,994,136).
- the codon-altered polynucleotides described herein can be administered to a subject by a non-viral method.
- naked DNA can be administered into a cell by electroporation, sonoporation, particle bombarment, or hydrodyamic delivery.
- DNA can also be encapsulated or coupled with polymers, e.g., liposomes, polysomes, polypleses, dendrimers, and administered to the subject as a complex.
- DNA can be coupled to inorganic nanoparticles, e.g., gold, silica, iron oxide, or calcium phosphate particles, or attached to cell-penetrating peptides for delivery to cells in vivo.
- Codon-altered GAA coding polynucleotides can also be incorporated into artificial chromosomes, such as Artificial Chromosome Expression (ACEs) (see, e.g., Lindenbaum et al., Nucleic Acids Res., 32(21):el72 (2004)) and mammalian artificial chromosomes (MACs).
- ACEs Artificial Chromosome Expression
- MACs mammalian artificial chromosomes
- a wide variety of vectors can be used for the expression of a GAA polypeptide from a codon-altered polypeptide in cell culture, including eukaryotic and prokaryotic expression vectors.
- a plasmid vector is contemplated for use in expressing a GAA polypeptide in cell culture.
- plasmid vectors containing replicon and control sequences which are derived from species compatible with the host cell are used in connection with these hosts.
- the vector can carry a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells.
- the plasmid will include the codon-altered polynucleotide encoding the GAA polypeptide, operably linked to one or more control sequences, for example, a promoter.
- Non-limiting examples of vectors for prokaryotic expression include plasmids such as pRSET, pET, pBAD, etc., wherein the promoters used in prokaryotic expression vectors include lac, trc, trp, recA, araBAD, etc.
- vectors for eukaryotic expression include: (i) for expression in yeast, vectors such as pAO, pPIC, pYES, pMET, using promoters such as A0X1, GAP, GALI, AUG1, etc; (ii) for expression in insect cells, vectors such as pMT, pAc5, pIB, pMIB, pBAC, etc., using promoters such as PH, plO, MT, Ac5, OpIE2, gp64, polh, etc., and (iii) for expression in mammalian cells, vectors such as pSVL, pCMV, pRc/RSV, pcDNA3, pBPV, etc., and vectors derived from viral systems such as vaccinia virus, adeno-associated viruses, herpes viruses, retroviruses, etc., using promoters such as CMV, SV40, EF-1, UbC, RSV, ADV, BPV, and
- the disclosure provides an AAV gene therapy vector that includes a codon-altered GAA polynucleotide, as described herein, internal terminal repeat (ITR) sequences on the 5’ and 3’ ends of the vector, one or more promoter and/or enhancer sequences operably linked to the GAA polynucleotide, and a poly-adenylation signal following the 3 ’ end of the GAA polynucleotide sequence.
- the one or more promoter and/or enhancer sequences include one or more copies of a muscle-specific regulatory control element.
- the codon-altered GAA polynucleotides and viral vectors described herein are produced according to conventional methods for nucleic acid amplification and vector production.
- Two predominant platforms have developed for large-scale production of recombinant AAV vectors. The first platform is based on replication in mammalian cells, while the second is based on replication in invertebrate cells.
- the first platform is based on replication in mammalian cells, while the second is based on replication in invertebrate cells.
- the disclosure provides methods for producing an adeno-associated virus (AAV) particle.
- the methods include introducing a codon-altered GAA polynucleotide construct having high nucleotide sequence identity (e.g., at least 95%, 96%, 97%, 98%, 99%, 99.5%, 99.9%, or 100%) to one of a CO1, CO2, or CO3 sequence, as described herein, into a host cell where the polynucleotide construct is competent for replication in the host cell.
- the host cell is a mammalian host cell e.g., an HEK, CHO, or BHK cell. In a specific embodiment, the host cell is an HEK 293 cell. In some embodiments, the host cell is an invertebrate cell, e.g., an insect cell. In a specific embodiment, the host cell is an SF9 cell.
- the present disclosure provides expression constructs such as helper plasmids (e.g., non- AAV expression constructs) comprising a nucleic acid that encodes one or more of the AAV capsid polypeptides described herein.
- Such plasmids are useful as expression constructs for producing AAV capsid polypeptides or proteins or to transfect cells (e.g., as part of a triple transfection) in the preparation of engineered AAV vectors.
- AAV vectors could be produced using herpes virus, baculovirus, stable genetically engineered cell lines, or any other method known in the art (Dobrowsky et al. (2021) Curr. Opinion Biomed. Engin. 20: 100353, the disclosure of which is hereby incorporated herein by reference in its entirety).
- the capsid helper plasmid may comprise one or more nucleic acid sequences to regulate expression of the AAV capsid polypeptide.
- the sequences include but are not limited to, a promoter, an enhancer, an intron, a post-transcriptional regulatory sequence, a polyadenylation (poly A) signal, or any combination thereof, which are operably linked to the nucleic acid sequences that encode the AAV capsid polypeptide.
- the promoter may be a heterologous promoter, a tissue-specific promoter, a cellspecific promoter, a constitutive promoter, an inducible promoter, a hybrid promoter, or any combination thereof.
- the capsid helper plasmid of the present disclosure comprises at least one promoter capable of expressing, or directed to primarily express, the nucleic acid segment in a suitable host cell (e.g., a muscle cell) into which the engineered capsid helper plasmid can be transfected.
- Exemplary promoters include, but are not limited to, a ubiquitous promoter, a CMV promoter, a 0-actin promoter, a muscle-specific promoter, a Desmin promoter, an SPc5-12 promoter, an MCK-based promoter an insulin promoter, an enolase promoter, a BDNF promoter, an NGF promoter, an EGF promoter, a growth factor promoter, an axon-specific promoter, a dendrite-specific promoter, a brain-specific promoter, a hippocampal-specific promoter, a kidney-specific promoter, a retinal- specific promoter, an elafin promoter, a cytokine promoter, an interferon promoter, a growth factor promoter, an al- antitrypsin promoter, a brain cell-specific promoter, a neural cell- specific promoter, a central nervous system cell-specific promoter, a peripheral nervous system cell-specific promoter, an interleukin promoter, a
- Exemplary enhancer sequences include, but are not limited to, one or more selected from the group consisting of a CMV enhancer, a muscle-specific enhancer, a synthetic enhancer, a liver-specific enhancer, a vascular-specific enhancer, a brain-specific enhancer, a neural cellspecific enhancer, a lung-specific enhancer, a kidney-specific enhancer, a pancreas-specific enhancer, retinal-specific enhancer, and an islet cell-specific enhancer.
- Exemplary post-transcriptional regulatory sequences include a woodchuck hepatitis post-transcription regulatory element (WPRE)), one or more ribosome entry sites (IRES), one or more polyadenylation (poly A) signal sequences, or any combination thereof.
- WPRE woodchuck hepatitis post-transcription regulatory element
- IVS ribosome entry sites
- poly A polyadenylation
- a polyA signal may be an artificial polyA.
- suitable polyA sequences include, e.g., bovine growth hormone, SV40, rabbit beta globin, and TK polyA, amongst others.
- the capsid helper plasmid described herein may contain other appropriate transcription initiation, termination, and efficient RNA processing signals.
- Such sequences include splicing, inducible expression control elements, regulatory elements that enhance expression, sequences that stabilize cytoplasmic mRNA, sequences that enhance translation efficiency (e.g., Kozak consensus sequence), sequences that enhance protein stability, and when desired, sequences that enhance secretion of the encoded product.
- a Kozak sequence is included.
- the present disclosure provides GAA polypeptide variants that have advantageous properties relative to wild type GAA polypeptides.
- GAA variants 6-13 a series of variant GAA polypeptides (GAA variants 6-13) were identified, several of which demonstrated significantly increased catalytic activity relative to the human wild-type GAA polypeptide.
- GAA variant 6 demonstrated approximately 5.5-fold higher activity than the human wild-type GAA polypeptide, as shown in Figure 5, and improved kinetic parameters, as shown in Figure 6A.
- GAA variants 9 and 11-13 demonstrated higher catalytic activity than the human wild-type GAA polypeptide.
- the disclosure provides variant GAA polypeptides having high sequence identity to the variant 6 GAA pre-pro-polypeptide (GAA-FL-6-AA; SEQ ID NO: 14) and/or the variant 6 GAA mature polypeptide (GAA-MP-6-AA; SEQ ID NO:37).
- the GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97% or at least 98% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the first polypeptide sequence is at least 99% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the first polypeptide sequence is at least 99.5% identical to MP-6-AA (SEQ ID NO:37). In some embodiments, the first polypeptide sequence is MP-6-AA (SEQ ID NO: 37).
- the GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97% or at least 98% identical to MP-6-AA (SEQ ID NO: 37) and comprises one or more variant amino acids selected from the group consisting of T1511, L650G, S676D, and L678H.
- the first polypeptide sequence is at least 99% identical to MP-6-AA (SEQ ID NO: 37).
- the first polypeptide sequence is at least 99.5% identical to MP-6-AA (SEQ ID NO: 37).
- the GAA variant protein further comprises a second polypeptide sequence that is at least 95% identical to PP-WT-AA (SEQ ID NO:39). In some embodiments, the second polypeptide sequence is at least 97% identical to PP-WT-AA (SEQ ID NO:39). In some embodiments, the second polypeptide sequence is PP-WT-AA (SEQ ID NO:39).
- the recombinant GAA variant protein further comprises a third polypeptide sequence that is at least 95% identical to SP-WT-AA (SEQ ID NO:43). In some embodiments, the third polypeptide sequence is SP-WT-AA (SEQ ID NO:43). In some embodiments, the recombinant GAA variant protein comprises the polypeptide sequence of FL- 6-AA (SEQ ID NO: 33).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-6-AA (SEQ ID NO: 14).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-7-AA (SEQ ID NO: 16).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-8-AA (SEQ ID NO: 18).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-9-AA (SEQ ID NO:20).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-10- AA (SEQ ID NO:22).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-11-AA (SEQ ID NO:24).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-12-AA (SEQ ID NO:26).
- the recombinant GAA variant protein comprises a first polypeptide sequence that is at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.5%, or 100% identical to amino acid residues 70-952 of FL-13-AA (SEQ ID NO:28).
- the recombinant GAA variant protein further comprises a second polypeptide sequence that is at least 95%, at least 97%, or 100% identical to PP-WT-AA (SEQ ID NO: 39).
- Vectors and Components Figure 14 provides the nucleotide sequences of various components of the vectors tested herein.
- GAA KO a suitable animal model of Pompe disease. This model was generated by insertion of a neomycin cassette into exon 6 of the mouse gaa gene, thereby creating a functional knockout (KO) of the gaa genes. GAA KO recapitulates critical features of both the infantile and the adult forms of PD at a pace suitable for the evaluation of gene therapy (GT).
- GT gene therapy
- Glycogen accumulation in cardiac and skeletal muscles can be detected as early as 3 weeks of age, resembling IOPD, and reduction in the number of myofibrils and signs of damaged muscle structure, impaired autophagic flux in skeletal muscle, mild cardiac defects and muscle weakness leading to locomotor defects which develops by 8-9 months resembles LOPD (reviewed in Geel et al. (2007) Mol. Genet. Metab. 92(4):299-307).
- LOPD Reviewed in Geel et al. (2007) Mol. Genet. Metab. 92(4):299-307.
- these secondary tissues will be characterized and monitored in the mice experiments to determine if this disease phenotype is replicated in the mouse model.
- AAV vector preparations comprising test AAV transgene constructs or controls were prepared for injection by dilution in vehicle (1.5 mM KH2PO4, 2.7mM KCI, 8.1 mM Na2HPO4, 136.9mM NaCl, 0.001% Pluronic F-68), and doses were administered through a single intravenous administration of vector or buffer only into the tail vein at 3x10 12 vg/kg, 1x10 13 vg/kg or 3x10 13 vg/kg, as indicated. Clinical and mortality observations were conducted daily post dosing until the end of each study. At four or twelve weeks post dosing, mice were anesthetized with isoflurane, euthanized, and necropsied.
- DNA was extracted and purified from tissue homogenates using a MagMAX kit (Thermofisher, Los Angeles, California) according to manufacturer instructions.
- Vector copy number was quantified using a digital polymerase chain reaction (dPCR) quantification assay with primers and probes designed on a proprietary DNA sequence, using a linearized vector plasmid as the reference standard.
- dPCR digital polymerase chain reaction
- Each 12 ml dPCR reaction contained 2-200 ng sample genomic DNA (gDNA) that was run using a Qiacuity 4 instrument (Qiagen, Germantown MD).
- Vector genome copy numbers (VGCNs) were normalized to microgram of DNA used in the dPCR reaction.
- the reaction mixture was incubated at 37°C for 1 hour and then stopped by adding 150 mL of stop buffer (133 mM Glycine, 83 mM Sodium Carbonate, pH 10).
- stop buffer 133 mM Glycine, 83 mM Sodium Carbonate, pH 10.
- a standard curve (0-4.25 nmol/mL) was used to measure released fluorescent 4-methylumbelliferone (4-MU) from the individual reaction mixture using the Spectramax M3 reader (PerkinElmer, Waltham, MA) at 460 nm (emission) and 360 nm (excitation).
- the protein concentration of the clarified supernatant was quantified using a BradfordUltra assay (AbCam, Waltham, MA).
- the released 4-MU concentration was divided by the sample protein concentration, and activity was reported as nanomoles per hour per milligram protein.
- GAA Activity Measurements using Glycogen Substrate [00191] Supernatant from tissue homogenates was transferred to a 1.5 ml tube, placed in boiling water for 10 minutes, cooled and then centrifuged. The supernatant was transferred to prelabeled cryovials for glycogen analysis following enzyme hydrolysis using amyloglucosidase from Aspergillus niger (Sigma- Aldrich, Burlington, MA). Glucose, the glycogen cleavage product, was measured using an Amplex red kit (Invitrogen, Carlsbad, California).
- H&E haematoxylin and eosin
- Muscle cryosections were cut (4-5 Im), placed onto slides, and air-dried for 20 minutes.
- H&E and Periodic Acid Schiff (PAS) stains were performed according to established protocols using kits from Biovision (Waltham, MA).
- IHC GAA immunohistochemistry
- samples were incubated overnight at 4°C with an anti-GAA rabbit antibody (Sigma/HPA029126, St.
- MAPPs Major Histocompatibility Associated Peptide Proteomics
- PBMCs Human peripheral blood mononuclear cells
- monocyte- derived dendritic cells To prepare monocyte- derived dendritic cells (MoDCs), fresh PBMC from 20 healthy donors were used and CD 14+ cells (monocytes) were isolated using RoboSepTM negative human monocyte isolation kits and a RoboSepTM cell isolation instrument (StemCell Technologies, Cambridge, UK) according to the manufacturer’s instructions. Monocytes were re-suspended in MoDC culture medium (RPMI 1640 supplemented with 10% FBS, 50gm 2-ME, 2mM L- Glutamine (all from ThermoFisher Scientific, Loughborough, UK), IL-4 (Peprotech, London, UK), and GM-CSF (Peprotech) and plated in tissue culture flasks.
- MoDC culture medium RPMI 1640 supplemented with 10% FBS, 50gm 2-ME, 2mM L- Glutamine (all from ThermoFisher Scientific, Loughborough, UK), IL-4 (Peprotech, London
- the MoDCs were thawed at RT and subsequently lysed using a hypotonic buffer solution (20 mM Tris, 5 mM MgCh; ThermoFisher Scientific, Waltham, MA), 0.1% Triton X-100 and protease inhibitors (Sigma Aldrich, St. Louis, MO), pH 7.8, for 1 hour at 4 °C.
- HLA-DR/peptide complexes were purified from the cell lysate by immunoprecipitation using magnetic beads (Promega, Southampton, UK) coated with anti-HLA-DR antibody (BioLegend, London, UK) overnight at 4 °C.
- Peptides bound to HLA-DR were eluted under acidic conditions (3% MeCN, 0.2% TFA; ThermoFisher Scientific Waltham, MA) and purified by solid phase extraction using Oasis® HLB pElution plates (Waters, Ellsmere Port, UK). Peptides were freeze-dried using a 5301 vacuum concentrator (Eppendorf, Stevenage, UK) and stored at -80 °C until analyzed by mass spectrometry (MS).
- Peptides were identified using the Sequest algorithm, built in the Proteome Discoverer software v2.1 (ThermoFisher Scientific, Waltham, MA) against a proprietary database and the sequences of the test samples determined. Once the final list of identified peptides was completed, the sequence heatmaps were generated using MATLAB (MathWorks®, Cambridge, UK).
- C2C12 myoblast cells were seeded into 24-well tissue culture plates (Corning,
- Lysate GAA activity of duplicate wells was measured with the GAA 4-methylumbelliferyl-a-D- glucopyranoside (4-MUG) assay (described above) and normalized to lysate protein concentration as determined using the bicinchoninic acid protein assay (Pierce, Appleton, Wisconsin).
- Thermal protein unfolding was monitored using a Prometheus NT.48 instrument (NanoTemper Technologies, Miinchen, Germany). For each condition, 50 gl of a l mg/ml protein solution was prepared, and 20 gl of sample was filled into 3 low volume differential scanning fluorimetry (nanoDSF) Grade Standard Capillaries (NanoTemper Technologies, Miinchen, Germany), respectively, and loaded into the instrument. Thermal unfolding of the proteins was monitored in a 1 °C/minute thermal ramp from 25 °C to 95 °C. Tm values were determined automatically by the PR control software.
- An 8M stock of GuHCl was prepared by mixing 7.64 g of GuHCl with 4.21 ml assay buffer (50 mM sodium phosphate buffer, pH 7.5, 150 mM NaCl) and the pH adjusted to pH 7.5 using 1 M Tris, pH 8.0.
- a 9 M stock of urea was prepared freshly by mixing 5.41 g urea with 5.9 nil lx assay buffer.
- One microliter of concentrated protein stock (final concentration 0.5 mg/ml) was added to 30 pl of a series of denaturant concentrations (0.25-6M) and the mixture was incubated for 1 hour and 16 hours at 25°C.
- Transthoracic echocardiography was performed on mice that were anaesthetized with isoflurane.
- Parasternal long axis and short axis images were obtained using an MX550S probe attached to Vevo3100 (FujiFilms, Visulsonics, Ontario, Canada). Images were acquired when the rectal temperature was between 36-38 °C and respiratory rate was 40-120 breaths per minute. Images were analyzed by Vevo Lab 5.5.0 (). An average of three heart beats were used for analysis.
- Rotarod [00199] An accelerating rotarod assay was used to determine neuro-motor coordination of the Pompe GAA KO mice. The assay was performed on a rotarod apparatus (Model 47650; Ugo Basile, Italy) that was set to accelerate from 5-40 rpm over 5 minutes. Latency and the speed at fall were recorded from a total of 6 trials (3 trials/day). Animals were habituated and trained before the test trials.
- GAA KO mice were dosed once with 3x10 13 vg/kg of a rAAV9.GAA vector only differing in their enhancer-promoter elements (i.e., all vectors comprised an AAV9 capsid and encoded the same codon optimized wild type (WT) human GAA transgene CO3 - see Example 3). After a 5 week incubation, the animals were sacrificed, their muscle tissues harvested, and GAA enzymatic activity determined according to the 4-MUG assay described in Example 1.
- WT codon optimized wild type
- mice dosed with vectors comprising the Dph-CRE04 enhancer or both the CSK-SH5 and the Dph-CRE04 enhancer showed an increase in GAA activity over the use of the SPc512 promoter alone but not CSK- SH5 alone ( Figure IE).
- the addition of the enhancer element SKSH4 to the desmin promoter resulted in 2-3 fold increase in GAA activity was observed in quadriceps and triceps of GAA KO mice treated with vectors that included the enhancer elements as compared to animals that were dosed with vectors that did not have any enhancer elements (Figure IF - Figure 1H).
- Figure 2 is a comparison of GAA protein activity in key muscle tissues in mice after treatment with engineered AAV9 vectors comprising a GAA transgene in the presence of either the muscle-specific SPc512 promoter alone or in the presence of the Dph-CRE04 musclespecific enhancer and compared to GAA WT and GAA KO mice ( Figure 2A).
- GAA KO mice were dosed once either with buffer alone or with 3x10 13 vg/kg of a rAAV9.GAA vector only differing in the enhancer-promoter elements (i.e., all vectors were AAV9 and encoded same codon optimized wild type (WT) human GAA transgene CO3).
- FIG. 2A shows GAA enzymatic activity using 4-MUG as a substrate and Figure 2B shows reduction in glycogen levels in the respective tissues.
- Figure 2B shows reduction in glycogen levels in the respective tissues.
- This study shows that the vectors comprising the GAA constructs that contain the muscle specific enhancer Dph-CRE04 when combined with the muscle specific promoter SPc512 results in enhanced GAA proteins levels and catalytic activity as measured by the increased release of fluorescent 4-MU ( Figure 2A) and the enhanced reduction in glycogen (Figure 2B) in all four muscle types compared to the GAA KO mice dosed with vector comprising only the muscle specific SPc512 promoter.
- Example 3 describes the identification of a preferred codon-optimized human GAA that provided enhanced gene expression.
- Example 4 describes a campaign 2 to identify amino acid substitutions that improved the catalytic activity of GAA.
- the human WT GAA nucleotide sequence was codon optimized in order to improve expression in human muscle cells, while reducing the immuno- stimulatory CpG content.
- Three codon variants with reduced CpGs (named CO1, CO2, and CO3) were inserted into an expression cassette comprising a muscle-specific Sk-SH4 enhancer and the muscle specific desmin promoter [AAV9.Sk-SH4.desmin.COGAA] ( Figure 3) and tested for expression at a dose of 3x10 13 vg/kg in GAA KO mice.
- the CO3 codon optimized variant demonstrated increased GAA expression and activity at or slightly higher than the WT hGAA sequence and resulted in a similar or more efficient reduction in the levels of glycogen in heart, diaphragm, and quadriceps muscle ( Figure 4A and 4B, respectively).
- Significant clearance of glycogen in the heart muscle of GAA KO mice dosed with vectors comprising CO3 was also observed in tissue sections using PAS staining ( Figure 4C).
- a 76 kDa mature form of GAA was detected in tissue lysates confirming correct GAA processing in the lysosomes ( Figure 4D). While these results were encouraging, insufficient clearance of glycogen observed in the skeletal and respiratory muscles suggested that further improvements to the GAA protein were required to increase its efficacy.
- the GAA protein was engineered to improve its specific activity (2x-10x) for its natural substate glycogen.
- 13 different libraries ranging over 100,000 clones were screened for GAA variants with increased activity over the natural hGAA.
- the initial screening was performed in HEK cells followed by validation promising positive hits in C2C12 cells.
- Activity was initially screened using the 4-MUG assay while GAA’s natural substrate glycogen was used for final selection of the GAA variants that were then further assessed both in vitro and in vivo. Only GAA variants demonstrating an increase in activity on glycogen (2X-8X compared to WT hGAA) were selected (Figure 5). Table 5 provides the amino acid sequence of each of these proteins.
- GAA variants had approximately 3-4 amino acid differences relative to WT hGAA.
- GAA variant 6 (Var 6) was further evaluated in vitro for catalytic activity using kinetic assays using either the synthetic substrate 4-MUG (Figure 6A) or the natural substrate glycogen ( Figure 6B). A 3.5x fold improvement in the activity on lOmg/ml or lOOmg/ml glycogen substrate was observed for GAA Var 6 compared to the WT hGAA ( Figures 6B).
- Example 6 Production and purification viral vectors expressing a transgene
- a recombinant adeno-associated virus 9 was developed to express wild type human cx-GAL or cx-GAL variants (e.g., amino acid sequences shown in Table 1) under the control of a ubiquitous promoter, in a viral vector.
- a WPRE element was linked to the 3’ end of the GLA transgene to increases transgene expression to improve mRNA stability,
- a bovine growth hormone poly A tail was appended to the 3’ end of the WPRE element.
- the DNA construct of promoter-GLA-WPRE-BGHpA was integrated between the inverted terminal repeats of a circular plasmid vector.
- Figure 1 shows an exemplary r AAV9 vector construct.
- rAAV vectors were encapsulated using the AAV2 inverted terminal repeats and rep sequences using methods in the art.
- the rAAV9 stocks were produced using HEK-293T cells by the adenovirus free, triple-plasmid co-transfection method and purified using cesium chloride ultracentrifugation. Titers of v.g. particle number were determined by quantitative PCR.
- rAAV9 virus suspension were diluted in the formulation buffer consisting of 1.5 mM KH2PO4 (Potassium dihydrogen phosphate), 2.7 mM KC1 (Potassium chloride), 8.1 mM Na2HPO4 (Di-sodium hydrogen phosphate), 136.9 mM NaCl (Sodium chloride) and 0.001% Pluronic F-68. Null vector with rAAV9 capsid (rAAV9-null) were used as controls.
- Variant 1 comprising a codon optimized nucleic acid sequence of SEQ ID NO: 58.
- Variant 2 comprising a codon optimized nucleic acid sequence of SEQ ID NO: 59.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Pharmacology & Pharmacy (AREA)
- Epidemiology (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
L'invention concerne des polypeptides d'alpha-glucosidase acide (GAA) variants, des polynucléotides à codons optimisés codant pour GAA, et des procédés et des constructions de thérapie génique GAA.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202363500524P | 2023-05-05 | 2023-05-05 | |
| US63/500,524 | 2023-05-05 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024231820A1 true WO2024231820A1 (fr) | 2024-11-14 |
Family
ID=91129971
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/IB2024/054397 Pending WO2024231820A1 (fr) | 2023-05-05 | 2024-05-06 | Traitement de la maladie de pompe |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2024231820A1 (fr) |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4797368A (en) | 1985-03-15 | 1989-01-10 | The United States Of America As Represented By The Department Of Health And Human Services | Adeno-associated virus as eukaryotic expression vector |
| US5139941A (en) | 1985-10-31 | 1992-08-18 | University Of Florida Research Foundation, Inc. | AAV transduction vectors |
| US5994136A (en) | 1997-12-12 | 1999-11-30 | Cell Genesys, Inc. | Method and means for producing high titer, safe, recombinant lentivirus vectors |
| US6013516A (en) | 1995-10-06 | 2000-01-11 | The Salk Institute For Biological Studies | Vector and method of use for nucleic acid delivery to non-dividing cells |
| WO2019222411A1 (fr) * | 2018-05-16 | 2019-11-21 | Spark Therapeutics, Inc. | Cassettes d'expression d'alpha-glucosidase acide optimisées par des codons et leurs méthodes d'utilisation |
| US20210189365A1 (en) | 2019-12-20 | 2021-06-24 | Codexis, Inc. | Engineered acid alpha-glucosidase variants |
| WO2023028567A2 (fr) * | 2021-08-25 | 2023-03-02 | Canbridge Pharmaceuticals, Inc. | Particules d'aav comprenant une protéine capsidique tropique du foie et une alpha-glucosidase acide (gaa) et leur utilisation pour traiter la maladie de pompe |
-
2024
- 2024-05-06 WO PCT/IB2024/054397 patent/WO2024231820A1/fr active Pending
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4797368A (en) | 1985-03-15 | 1989-01-10 | The United States Of America As Represented By The Department Of Health And Human Services | Adeno-associated virus as eukaryotic expression vector |
| US5139941A (en) | 1985-10-31 | 1992-08-18 | University Of Florida Research Foundation, Inc. | AAV transduction vectors |
| US6013516A (en) | 1995-10-06 | 2000-01-11 | The Salk Institute For Biological Studies | Vector and method of use for nucleic acid delivery to non-dividing cells |
| US5994136A (en) | 1997-12-12 | 1999-11-30 | Cell Genesys, Inc. | Method and means for producing high titer, safe, recombinant lentivirus vectors |
| WO2019222411A1 (fr) * | 2018-05-16 | 2019-11-21 | Spark Therapeutics, Inc. | Cassettes d'expression d'alpha-glucosidase acide optimisées par des codons et leurs méthodes d'utilisation |
| US20210189365A1 (en) | 2019-12-20 | 2021-06-24 | Codexis, Inc. | Engineered acid alpha-glucosidase variants |
| WO2021127457A1 (fr) * | 2019-12-20 | 2021-06-24 | Codexis, Inc. | Variants d'alpha-glucosidase acide modifiés |
| WO2023028567A2 (fr) * | 2021-08-25 | 2023-03-02 | Canbridge Pharmaceuticals, Inc. | Particules d'aav comprenant une protéine capsidique tropique du foie et une alpha-glucosidase acide (gaa) et leur utilisation pour traiter la maladie de pompe |
Non-Patent Citations (54)
| Title |
|---|
| "CLONING AND EXPRESSION VECTORS FOR GENE FUNCTION ANALYSIS", 2001, BIOTECHNIQUES PRESS |
| "CURRENT PROTOCOLS IN MOLECULAR BIOLOGY", 1993, JOHN WILEY & SONS |
| "MOLECULAR CLONING: A LABORATORY MANUAL", 1989, COLD SPRING HARBOR LABORATORY PRESS |
| "PERFORMANCE", 1984, WILEY & SONS |
| "Selected Methods and Applications", 1988, ALAN R. LISS, INC, article "Current Methods in Sequence Comparison and Analysis,'' Macromolecule Sequencing and Synthesis", pages: 127 - 149 |
| "Temin, In: Gene Transfer", 1986, PLENUM PRESS, pages: 149 - 188 |
| ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 410 |
| ALTSCHUL ET AL., METHODS IN ENZYMOLOGY, vol. 266, 1996, pages 460 - 480 |
| ALTSCHUL ET AL., NUCL. ACIDS RES, vol. 25, pages 3389 - 3402 |
| ALTSCHUL ET AL., NUCLEIC ACIDS RES., vol. 25, 1997, pages 3389 - 3402 |
| ASOKAN A ET AL., MOL. THER., vol. 20, no. 4, 2012, pages 699 - 708 |
| BLOMER ET AL., J VIROL., vol. 71, no. 9, 1997, pages 6641 - 6649 |
| COTTEN ET AL., P.N.A.S. U.S.A., vol. 89, no. 13, 1992, pages 6094 - 98 |
| CURIEL, NAT IMMUN, vol. 13, no. 2-3, 1994, pages 141 - 64 |
| DAYABERNS, CLIN. MICROBIAL. REV, vol. 21, no. 4, 2008, pages 583 - 593 |
| DEVEREUX ET AL., NUCL. ACID RES., vol. 12, 1984, pages 387 - 395 |
| DIAZ-MANERA, J. ET AL., THE LANCET. NEUROLOGY, vol. 20, no. 12, 2021, pages 1027 - 1037 |
| DOBROWSKY ET AL., CURR. OPINION BIOMED. ENGIN., vol. 20, 2021, pages 100353 |
| FARAH ET AL., FASEB J., vol. 28, no. 5, 2014, pages 2272 - 2280 |
| FAUST ET AL., J. CLIN. INVEST., vol. 123, 2013, pages 2994 - 3001 |
| FENGDOOLITTLE, J. MOL. EVOL., vol. 35, 1987, pages 351 - 360 |
| GARDINER-GARDEN M. ET AL., J MOL BIOL., vol. 196, no. 2, 1987, pages 261 - 82 |
| GIEGE ET AL.: "CRYSTALLIZATION OF NUCLEIC ACIDS AND PROTEINS, a Practical Approach", 1999, OXFORD UNIVERSITY PRESS |
| GRAY ET AL., HUMAN GENE THERAPY, vol. 22, 2011, pages 1143 - 53 |
| HAMMERLING ET AL.: "MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS", 1981, ELSEVIER |
| HARFOUCHE, J. PATIENT REP. OUTCOMES, vol. 4, no. 1, 2020, pages 83 |
| HAUTPINTEL, J VIROL., vol. 72, no. 3, 1998, pages 1834 - 43 |
| HIGGINSSHARP, CABIOS, vol. 5, 1989, pages 151 - 153 |
| KABAT ET AL.: "FROM GENES TO CLONES: INTRODUCTION TO GENE TECHNOLOGY", 1987, NATIONAL INSTITUTES OF HEALTH |
| KABAT ET AL.: "SEQUENCES OF PROTEINS OF IMMUNOLOGICAL INTEREST", vol. 7, 1991, U.S. DEPARTMENT OF HEALTH AND HUMAN SERVICES, pages: 91 - 3242 |
| KARLIN ET AL., PROC. NATL. ACAD. SCI. U.S.A., vol. 90, 1993, pages 5873 - 5787 |
| KELLEHERVOS, BIOTECHNIQUES, vol. 17, no. 6, 1994, pages 1110 - 17 |
| KISHNANI, P. S. ET AL., GENETICS IN MEDICINE, vol. 25, no. 2, 2023, pages 100328 |
| KRIEGLER, M: "GENE TRANSFER AND EXPRESSION, A LABORATORY MANUAL", 1990, W.H. FREEMAN CO |
| KUDLA ET AL., PLOS BIOL., vol. 4, no. 6, 2006, pages 80 |
| LAZAR, EUR. HEART J., vol. 38, no. 30, 2017, pages 2333 - 2342 |
| LINDENBAUM ET AL., NUCLEIC ACIDS RES., vol. 32, no. 21, 2004, pages e172 |
| MANN ET AL., CELL, vol. 33, 1983, pages 153 - 159 |
| MCCALL ET AL., J. SMOOTH MUSCLE RES., vol. 54, no. 0, 2018, pages 100 - 118 |
| MORELAND ET AL.: "Species-specific differences in the processing of acid α-glucosidase are due to the amino acid identity at position 201", GENE, vol. 491, 2012, pages 25 - 30, XP055028473, DOI: 10.1016/j.gene.2011.09.011 |
| MUZYCZKA, CURR TOP MICROBIOL IMMUNOL, vol. 158, 1992, pages 97 - 129 |
| NALDINI ET AL., SCIENCE, vol. 272, no. 5259, 1996, pages 263 - 267 |
| NEEDLEMANWUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 |
| NICOLASRUBINSTEIN: "In: Vectors: A survey of molecular cloning vectors and their uses", 1988, COLD SPRING HARBOR LABORATORY PRESS, pages: 494 - 513 |
| OKUMIYA ET AL., MOL. GENET. METAB., vol. 92, no. 4, 2007, pages 49057 - 307 |
| OLDPRIMROSE: "PRINCIPLES OF GENE MANIPULATION: AN INTRODUCTION TO GENETIC ENGINEERING", 1985, BLACKWELL SCIENTIFIC PUBLICATIONS |
| PEARSONLIPMAN, PROC. NATL. ACAD. SCI. U.S.A., vol. 85, 1988, pages 2444 |
| PÉREZ-LUZDIAZ-NIDO, J BIOMED BIOTECHNOL., 2010, pages 642804 |
| ROIG-ZAMBONI V ET AL.: "Structure of human lysosomal acid a-glucosidase-a guide for the treatment of Pompe disease", NAT COMMUN., vol. 8, no. 1, 2017, pages 1111, XP055915132, DOI: 10.1038/s41467-017-01263-3 |
| SMITHWATERMAN, ADV. APPL. MATH., vol. 2, 1981, pages 482 |
| SPALDING, CELL, vol. 122, no. 1, 2005, pages 133 - 43 |
| WU Z ET AL., MOL THER., vol. 16, no. 2, 2008, pages 280 - 9 |
| XU ET AL., JCI INSIGHT, vol. 4, no. 5, 2019, pages e125358 |
| ZUFFEREY ET AL., NAT BIOTECHNOL, vol. 15, no. 9, 1997, pages 871 - 875 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20220396813A1 (en) | Recombinase compositions and methods of use | |
| US20250313859A1 (en) | Compositions useful in treatment of metachromatic leukodystrophy | |
| EP3519569B1 (fr) | Vecteurs viraux recombinés adéno-associés pour le traitement de la mucopolysaccharidose | |
| CN117321213A (zh) | 具有优选表达水平的腺相关病毒组合物 | |
| CN119613504A (zh) | 腺相关病毒变异衣壳和其使用方法 | |
| US20250049955A1 (en) | Compositons and methods for the treatment of neurological disorders related to glucosylceramidase beta deficiency | |
| KR20220004696A (ko) | 폼페병의 치료에 유용한 조성물 | |
| US20150045416A1 (en) | Methods and Compositions for Gene Delivery | |
| WO2023092002A2 (fr) | Compositions et méthodes de traitement de la sclérose latérale amyotrophique et de troubles associés à la moelle épinière | |
| CN111718947B (zh) | 用于治疗ⅲa或ⅲb型粘多糖贮积症的腺相关病毒载体及用途 | |
| CA3193833A1 (fr) | Compositions et methodes de traitement de la maladie de fabry | |
| TW202449167A (zh) | 用於治療肌肉萎縮性脊髓側索硬化症之組成物及方法 | |
| CN120265647A (zh) | 具有优选脑富集和低肝富集的腺相关病毒组合物 | |
| JP2025530726A (ja) | 聴力障害を治療するためのデュアルベクターシステム及びその使用 | |
| CN119213013A (zh) | 具有增加的心脏富集的腺相关病毒组合物 | |
| WO2024231820A1 (fr) | Traitement de la maladie de pompe | |
| JP2023526923A (ja) | ポンペ病の治療に有用な組成物 | |
| WO2025004002A2 (fr) | Traitement de la maladie de pompe | |
| WO2021080975A1 (fr) | Compositions et procédés pour abaisser les niveaux de cholestérol | |
| US20250161493A1 (en) | Compositions and methods for in vivo nuclease-mediated treatment of ornithine transcarbamylase (otc) deficiency | |
| CN114507692B (zh) | 用于治疗法布里病的腺相关病毒载体及其用途 | |
| US20250099618A1 (en) | Recombinant tert-encoding viral genomes and vectors | |
| US20250041445A1 (en) | Gene therapy for treatment of mucopolysaccharidosis iiia | |
| KR20250156211A (ko) | 글루코실세라미다제 베타 1 결핍증과 관련된 신경 장애의 치료를 위한 조성물 및 방법 | |
| US20230175014A1 (en) | Compositions and methods for reducing nuclease expression and off-target activity using a promoter with low transcriptional activity |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24726744 Country of ref document: EP Kind code of ref document: A1 |