[go: up one dir, main page]

WO2025162267A1 - Dual-vector system for expressing myo7a protein, and use thereof - Google Patents

Dual-vector system for expressing myo7a protein, and use thereof

Info

Publication number
WO2025162267A1
WO2025162267A1 PCT/CN2025/074724 CN2025074724W WO2025162267A1 WO 2025162267 A1 WO2025162267 A1 WO 2025162267A1 CN 2025074724 W CN2025074724 W CN 2025074724W WO 2025162267 A1 WO2025162267 A1 WO 2025162267A1
Authority
WO
WIPO (PCT)
Prior art keywords
myo7a
intein
seq
coding sequence
terminal coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2025/074724
Other languages
French (fr)
Chinese (zh)
Inventor
柴人杰
谈方志
齐洁玉
余朝荣
张李燕
陈�田
张善中
谭畅
陆玲
丁泽阳
崔志平
宋怀恩
孙思睫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Otovia Therapeutics
Original Assignee
Otovia Therapeutics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Otovia Therapeutics filed Critical Otovia Therapeutics
Publication of WO2025162267A1 publication Critical patent/WO2025162267A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/17Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/17Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • A61K38/1703Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • A61K38/1709Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • A61K38/1719Muscle proteins, e.g. myosin or actin
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/0008Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'non-active' part of the composition delivered, e.g. wherein such 'non-active' part is not delivered simultaneously with the 'active' part of the composition
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • A61K48/005Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P27/00Drugs for disorders of the senses
    • A61P27/16Otologicals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • C07K14/4701Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
    • C07K14/4716Muscle proteins, e.g. myosin, actin
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/67General methods for enhancing the expression
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • C12N15/864Parvoviral vectors, e.g. parvovirus, densovirus
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N7/00Viruses; Bacteriophages; Compositions thereof; Preparation or purification thereof
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14121Viruses as such, e.g. new isolates, mutants or their genomic sequences
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14141Use of virus, viral particle or viral elements as a vector
    • C12N2750/14143Use of virus, viral particle or viral elements as a vector viral genome or elements thereof as genetic vector
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2750/00MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssDNA viruses
    • C12N2750/00011Details
    • C12N2750/14011Parvoviridae
    • C12N2750/14111Dependovirus, e.g. adenoassociated viruses
    • C12N2750/14151Methods of production or purification of viral material
    • C12N2750/14152Methods of production or purification of viral material relating to complementing cells and packaging systems for producing virus or viral particles
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2800/00Nucleic acids vectors
    • C12N2800/10Plasmid DNA
    • C12N2800/106Plasmid DNA for vertebrates
    • C12N2800/107Plasmid DNA for vertebrates for mammalian
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/42Vector systems having a special element relevant for transcription being an intron or intervening sequence for splicing and/or stability of RNA
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/48Vector systems having a special element relevant for transcription regulating transport or export of RNA, e.g. RRE, PRE, WPRE, CTE
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N2830/00Vector systems having a special element relevant for transcription
    • C12N2830/50Vector systems having a special element relevant for transcription regulating RNA stability, not being an intron, e.g. poly A signal
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12RINDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
    • C12R2001/00Microorganisms ; Processes using microorganisms
    • C12R2001/91Cell lines ; Processes using cell lines

Definitions

  • the present invention relates to a dual-vector system for expressing MYO7A protein and its application in gene therapy, in particular to the treatment of hearing loss.
  • the Myo7a gene encodes the MYO7A protein (also known as "myosin 7A"). Since the Myo7a gene was first reported in 1996, more than 500 mutation sites have been confirmed to be associated with hereditary deafness.
  • the Myo7a gene is located on chromosome 11q13.5 and encodes an unconventional myosin VIIA composed of 2215 amino acids. It is a motor molecule widely expressed in the retinal epithelium and inner ear hair cells.
  • Rapidly evolving gene therapy strategies offer a potential treatment for hearing loss caused by Myo7a mutations.
  • the cochlea's highly compartmentalized structure and blood-cochlear barrier (BCB) isolate it from other organs, significantly reducing medication dosage and the risk of drug penetration.
  • the cells in the cochlea are highly specialized; hair cells and supporting cells no longer divide, resulting in a stable cellular structure that favors sustained transgene expression via non-integrating viral vectors (such as AAV).
  • successful gene therapy has been demonstrated in the eye, also considered “immune-privileged," and AAV has been used as a viral vector to treat OTOF mutations in the cochlea.
  • the hearing loss caused by the Myo7a gene mutation is due to the loss of a single gene protein. Therefore, in theory, as long as the exogenous Myo7a gene is transduced into the inner ear hair cells through AAV to express a sufficient amount of MYO7A protein, the abnormal morphology of the stereocilia in the inner ear hair cells can be repaired, the normal survival of the hair cells and neurons can be maintained, and the hearing function can be protected.
  • AAV AAV vector strategy has been reported in the prior art. In this strategy, dual AAV achieves mRNA expression of long-fragment genes through mRNA trans-splicing after each mRNA is transcribed. This delivery scheme first divides the coding region of the target gene into two parts and constructs them into different AAV vector plasmids.
  • Two different AAV vectors transcribe the N-terminal and C-terminal mRNA sequences of the target gene in the cell, and complete the splicing of the N-terminal and C-terminal mRNA of the target gene by splicing the donor sequence-splicing acceptor sequence (abbreviated as SD-SA sequence), thereby forming a complete target gene mRNA template to express the complete protein.
  • SD-SA sequence donor sequence-splicing acceptor sequence
  • the dual-vector system developed by the present invention and the MYO7A protein produced by intein splicing can replenish the MYO7A protein of hair cells in the early stage, repair the morphology and function of hair cell stereocilia, prevent hair cell death, and thus prevent further deterioration of hearing loss.
  • the dual-vector system provided by the present invention can effectively express sufficient MYO7A protein in cochlear hair cells, thereby repairing the morphology and function of hair cell stereocilia, maintaining hair cell survival and protecting hearing. It can be used as a candidate drug for the future clinical treatment of hearing loss.
  • the AAV dual-vector system of the present invention provides a MYO7A cleavage site, a MYO7A protein cleavage site sequence, and a screening method thereof that can be used to treat USH1B, autosomal recessive hereditary deafness DFNB2, and autosomal dominant hereditary deafness DFNA11.
  • This dual-vector system can be used in the clinical treatment of USH1B.
  • the present invention provides a dual vector system for expressing MYO7A protein, comprising a first nucleic acid vector and a second nucleic acid vector, wherein
  • the first nucleic acid vector comprises a first nucleotide sequence
  • the second nucleic acid vector comprises a second nucleotide sequence
  • the first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences
  • the second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A and polyA;
  • a MYO7A cleavage site is provided in the MYO7A amino acid sequence, for example, the MYO7A amino acid sequence is as shown in SEQ ID NO: 2 or a functional fragment thereof, for example, the functional fragment is an amino acid sequence having at least 80% sequence identity with SEQ ID NO: 2;
  • the N-terminal coding sequence of MYO7A is a nucleotide coding sequence from the N-terminus of the MYO7A amino acid sequence to the MYO7A cleavage site;
  • the C-terminal coding sequence of MYO7A is a nucleotide coding sequence from the amino acid after the MYO7A cleavage site to the C-terminus of the MYO7A amino acid sequence.
  • the MYO7A cleavage site is located at the amino acid preceding serine, threonine, or cysteine in the MYO7A amino acid sequence.
  • the promoter of the expression cassette of the first nucleotide sequence or the second nucleotide sequence is selected from CAG promoter, CMV promoter, CBA promoter, UbC promoter, SFFV promoter, EF1 ⁇ promoter, PGK promoter, or promoters of genes encoding Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, ⁇ 9AchR, ⁇ 10AchR, OTOF and STRC;
  • the polyA of the expression cassette of the first nucleotide sequence or the second nucleotide sequence comprises AATAAA (SEQ ID NO: 33) and variants of AATAAA; the variants of AATAAA comprise ATTAAA (SEQ ID NO: 34), AGTAAA (SEQ ID NO: 35), CATAAA (SEQ ID NO: 36), TATAAA (SEQ ID NO: 37), GATAAA (SEQ ID NO: 38), ACTAAA (SEQ ID NO: 39), AATATA (SEQ ID NO: 40), AAGAAA (SEQ ID NO: 41), AATAAT (SEQ ID NO: 42), AAAAAA (SEQ ID NO: 43), AATGAA (SEQ ID NO: 44), NO: 44), AATCAA (SEQ ID NO: 45), AACAAA (SEQ ID NO: 46), AATCAA (SEQ ID NO: 47), AATAAC (SEQ ID NO: 48), AATAAA (SEQ ID NO: 33) and variants of AATAAA; the variants of AATAAA comprise
  • the expression cassette of the first nucleotide sequence or the second nucleotide sequence further comprises an expression regulatory element and/or a tag element
  • the expression regulatory element is a woodchuck hepatitis posttranscriptional regulatory element (WPRE) or a variant thereof, preferably a WPRE truncated variant, for example, a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence shown in SEQ ID NO: 28, for example, the nucleotide sequence shown in SEQ ID NO: 31; for example, the tag element is HA.
  • WPRE woodchuck hepatitis posttranscriptional regulatory element
  • the intein in the dual-vector system for expressing MYO7A protein of the present invention, is derived from MxeGyrA, pabPolIII, MjaKlbA, SspDnaB, SceVMA, SspDnaE, NpuDnaE, AvaDnaE, CraDnaE, CspDnaE, CwaDnaE, MchtDnaE, OliDnaE, TerDnaE, gp41-1, gp41-8, IMPDH-1 or RmaDnaB, for example, the intein is derived from RmaDnaB, for example, the N-terminus of the intein is the N-terminus of the RmaDnaB intein as shown in SEQ ID NO:23, and the C-terminus of the intein is the C-terminus of the RmaDnaB intein as shown in SEQ ID NO:24.
  • the intein is derived from NpuDnaE, for example, the N-terminus of the intein is the N-terminus of the NpuDnaE intein as shown in SEQ ID NO:52, and the C-terminus of the intein is the C-terminus of the NpuDnaE intein as shown in SEQ ID NO:54.
  • the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences
  • the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences
  • the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.
  • the MYO7A cleavage site is as shown in Table 1.
  • amino acid 1043 of the MYO7A amino acid sequence shown in SEQ ID NO: 2 is used as the MYO7A cleavage site, and the RmaDnaB intein is used; the N-terminal coding sequence of MYO7A is connected and fused to the N-terminal coding sequence of the RmaDnaB intein, and then a first nucleotide sequence is constructed, using the pAAV-CMV plasmid as a vector; the C-terminal coding sequence of the RmaDnaB intein is connected and fused to the C-terminal coding sequence of MYO7A, and then a second nucleotide sequence is constructed, using the pAAV-CMV plasmid as a vector;
  • the 1058th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1061st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1064th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1071st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1076th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1104th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1105th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1114th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1119th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the 1126th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;
  • the N-terminal coding sequence of the RmaDnaB intein encodes the N-terminal portion of RmaDnaB shown in SEQ ID NO:23
  • the C-terminal coding sequence of the RmaDnaB intein encodes the C-terminal portion of RmaDnaB shown in SEQ ID NO:24.
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 4, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 6, and polyA;
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 8, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 10, and polyA;
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 12, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 14, and polyA;
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 16, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, an C-terminal coding sequence of MYO7A as shown in SEQ ID No: 18, and polyA; or
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 20, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, an C-terminal coding sequence of MYO7A as shown in SEQ ID No: 22, and polyA.
  • the expression cassette of the first nucleotide sequence and the expression cassette of the second nucleotide sequence each contain a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence at the N-terminus of the 3’ITR sequence, for example, a nucleotide sequence shown in SEQ ID NO: 27 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 27; or a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence, for example, a nucleotide sequence shown in SEQ ID NO: 30 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 30.
  • the present invention provides an adeno-associated virus packaging vector system, which comprises the dual-vector system for expressing MYO7A protein described in the first aspect of the present invention, a vector carrying AAV rep and cap genes, and a helper virus vector, which are packaged into an AAV vector.
  • adeno-associated virus packaging vector system which comprises the dual-vector system for expressing MYO7A protein described in the first aspect of the present invention, a vector carrying AAV rep and cap genes, and a helper virus vector, which are packaged into an AAV vector.
  • the amino acid sequence of the MYO7A protein is as shown in SEQ ID NO: 2.
  • the vector carrying the AAV rep and cap genes is selected from AAV1, AAV2, AAV5, AAV8, AAV9, Anc80, PHP.eB, AAV-DJ and AAVrh.10 vectors; and the helper virus vector is a pHelper plasmid.
  • the present invention provides a method for packaging an adeno-associated virus, wherein the adeno-associated virus packaging vector system described in the second aspect of the present invention is transferred into a host cell for packaging.
  • the host cell is selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells, and Sf9 cells.
  • the present invention provides a dual adeno-associated virus vector, which is obtained by the packaging method according to the second aspect of the present invention.
  • the present invention provides the use of the dual vector system for expressing MYO7A protein described in the first aspect of the present invention or the dual adeno-associated virus vector described in the fourth aspect of the present invention for preparing a drug or preparation for treating deafness, hearing loss or hearing dysfunction.
  • the present invention provides a medicine or preparation for treating deafness, hearing loss, or hearing dysfunction, which is prepared by the dual-vector system for expressing MYO7A protein described in the first aspect of the present invention or the adeno-associated virus described in the fourth aspect of the present invention, wherein the adeno-associated virus is obtained by transferring the adeno-associated virus packaging vector system into a host cell for packaging, and the adeno-associated virus packaging vector system includes a dual-vector system for expressing MYO7A protein, a vector carrying AAVrep and cap genes, and a helper virus vector.
  • the drug or formulation of the present invention further comprises a neutral salt buffer, an acidic salt buffer, an alkaline salt buffer, glucose, mannose, mannitol, proteins, polypeptides, amino acids, antibiotics, chelating agents, adjuvants, preservatives, nanoparticles, liposomes and positive lipid particles.
  • the drug or preparation of the present invention is administered by injection through the round window, oval window, semicircular canal, or common canal of the cochlea; and is administered once or multiple times throughout life, with a total dose of 1 ⁇ 10 9 -1 ⁇ 10 13 viral genomes.
  • the present invention provides a method for screening MYO7A protein cleavage sites for intein splicing, comprising:
  • the first nucleic acid vector and the second nucleic acid vector according to the first aspect of the present invention, wherein, optionally, the first residue at the N-terminus of the intein is Cys; the 3' terminal sequence at the C-terminus of the intein contains His and Asn, and the first amino acid residue at the 5' terminal of the C-terminus of MYO7A is Cys, Ser, or Thr;
  • the 3' terminal residue of the N-terminal portion of the MYO7A protein is used as the MYO7A protein cleavage site for intein splicing.
  • Figure 1 shows a schematic diagram of intein-mediated full-length extein expression.
  • N-extein represents the N-terminal extein
  • C-extein represents the C-terminal extein
  • intein N represents the N-terminal intein
  • intein C represents the C-terminal intein.
  • Figure 2 shows a schematic diagram of the screening process for Myo7a cleavage sites via intein splicing.
  • the intein splicing method involves protein trans-splicing, where the N-terminal and C-terminal coding sequences (CDS) of Myo7a are constructed and expressed separately in two separate plasmids, and the full-length Myo7a protein is assembled via intein splicing.
  • CDS C-terminal coding sequences
  • the N-terminal plasmid contains an intein N-terminal fragment sequence (e.g., the RmaDnaB intein N-terminal fragment (Rmintein N ) sequence, also referred to herein as the "Rm-N-intein" sequence, where the first residue of the Rm-N-intein sequence is Cys) at the 3' end of the Myo7a N-terminal CDS sequence.
  • an intein N-terminal fragment sequence e.g., the RmaDnaB intein N-terminal fragment (Rmintein N ) sequence, also referred to herein as the "Rm-N-intein" sequence, where the first residue of the Rm-N-intein sequence is Cys
  • the C-terminal plasmid adds an intein C-terminal fragment (Rmintein C ) sequence before the 5' start of the C-terminal CDS of Myo7a (for example, the RmaDnaB intein C-terminal fragment sequence, also referred to as the "Rm-C-intein" sequence in this article, the 3' end sequence of the Rm-C-intein sequence contains His and Asn).
  • the first amino acid residue 5' of the C-terminal CDS of Myo7a is Cys, Ser, and Thr for intein splicing.
  • FIG. 3 illustrates a schematic diagram of the AAV binary vector plasmid elements.
  • the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone sequence can be as shown in SEQ ID NO: 26, wherein the 5' ITR sequence is located between 1 bp and 141 bp, the CMV promoter sequence is located between 169 bp and 752 bp, the Kozak sequence is located between 792 bp and 797 bp, the EGFP sequence is located between 801 bp and 1517 bp, the WPRE sequence is located between 1536 bp and 2124 bp, the SV40 PolyA signal sequence is located between 2131 bp and 2252 bp, the 3' ITR sequence is located between 2290 bp and 2430 bp, the f1 Ori sequence is located between 2505 bp and 2960 bp, the kana resistance sequence is located between 3242 bp and 4156 bp, and the Ori sequence
  • Figure 4 shows the N-terminal protein map of the MYO7A cleavage site S3.
  • Figure 5 shows the C-terminal protein map of the MYO7A cleavage site S3.
  • Figure 6 shows the N-terminal protein map of the MYO7A cleavage site S4.
  • Figure 7 shows the C-terminal protein map of the MYO7A cleavage site S4.
  • Figure 8 shows the N-terminal protein map of the MYO7A cleavage site S8.
  • Figure 9 shows the C-terminal protein plasmid map of the MYO7A cleavage site S8.
  • Figure 10 shows the N-terminal protein map of the MYO7A cleavage site S9.
  • Figure 11 shows the C-terminal protein map of the MYO7A cleavage site S9.
  • FIG12 shows the N-terminal protein map of the MYO7A cleavage site S11.
  • FIG13 shows a protein map of the C-terminal region of the MYO7A cleavage site S11.
  • Figure 14 shows the full-length protein expression levels of each MYO7A cleavage site plasmid pair. As shown in Figure 14, MYO7A cleavage sites S3, S4, S8, S9, and S11 produce significantly higher full-length protein expression levels.
  • Figure 15 shows the results of Western blot analysis of MYO7A protein expression in 293T cells for the MYO7A cleavage site S3, using vector plasmids containing the WPRE-SV40 element (the WPRE -SV40 element is the name for the combination of the WPR E element and the SV40 poly(A) signal sequence, and its sequence is shown in SEQ ID NO:27) or the W3SL element (the W3SL element is the name for the combination of the WPR E element and the SV40 late poly(A) signal sequence, and its sequence is shown in SEQ ID NO:30), as well as plasmids that do not contain the WPRE-SV40 element or the W3SL element.
  • "HA” indicates the results of detecting HA-tagged MYO7A protein expression using an antibody against the HA tag.
  • Figure 16 shows the immunofluorescence results of MYO7A protein expression in mouse cochlea.
  • the left panel of Figure 16 shows the results of parvalbumin staining; the middle panel shows the results of Myo7a staining; and the right panel is an overlay of parvalbumin and Myo7a staining.
  • FIG17 shows the test results of treating hearing function in Myo7a knockout mice using Myo7a genes corresponding to various candidate cleavage sites of MYO7A.
  • the term "about" when used in conjunction with a numerical value is intended to encompass numerical values within a range having a lower limit that is 5% less than the specified numerical value and an upper limit that is 5% greater than the specified numerical value.
  • the term is also intended to encompass values within ⁇ 1%, ⁇ 0.5%, or ⁇ 0.1% of the specified number.
  • the term “comprising” or “including” means including the recited elements, integers, steps, or groups of elements, integers, or steps, but does not exclude any other elements, integers, or steps, or other groups of elements, integers, or steps.
  • the term “comprising” or “including” also encompasses the situation consisting of the recited elements, integers, or steps. For example, when referring to a polynucleotide "comprising" a particular sequence, it is intended to encompass a polynucleotide consisting of that particular sequence.
  • intein or protein intein (also known as intein) described herein is a polypeptide chain within an immature precursor protein. Through a series of self-catalytic reactions, such as rearrangement, transesterification, and cyclization, it can be excised from the precursor protein and its two end polypeptide segments (exteins) connected by a natural peptide bond. This is to say, protein self-splicing achieves a rearrangement of the protein structure.
  • a split intein is a structural type of intein. Structurally, its N-terminal and C-terminal regions are separated from each other. When the two fragments containing the N-terminal and C-terminal regions of the intein are connected, the exteins at both ends can be spliced together according to the standard intein splicing pathway.
  • inteins consist of terminal splicing regions and a central endonuclease domain or linker domain.
  • Inteins can be divided into three types: canonical inteins, miniinteins, and split inteins. Both canonical and miniinteins contain terminal splicing domains and a central region. The difference between them is that the central region of a canonical intein is an endonuclease domain, while that of a miniintein is a linker domain. The lengths of the linker domains vary between miniinteins.
  • a split intein The central region of a split intein is disconnected at a specific site, forming an N-terminal fragment and a C-terminal fragment, respectively, located on two genes distant from each other in the genome. During the translation and maturation of the precursor protein, these two intein fragments recognize each other and restore endonuclease activity, mediating protein trans-splicing.
  • a dual AAV vector system can be used to deliver a nucleic acid containing a nucleic acid encoding a split intein.
  • inteins are composed of 10 modules, starting from the N-terminus: A, N2, B, N4, C, D, E, H, F, and G.
  • A, N2, B, and N4 are the N-terminal splicing regions
  • F and G are the C-terminal splicing regions
  • C, D, E, and H are the homing endonuclease active regions or linker domains.
  • the motifs involved in intein splicing within the A, B, F, and G modules contain highly conserved amino acid residues at the splice sites, essential for the affinity displacement reaction during intein splicing.
  • the motifs within the A module of the intein typically contain amino acids with hydroxyl or sulfhydryl groups, such as Ser and Cys.
  • the motif within the B module contains the highly conserved amino acid sequence Thr-X-X-His, which is also found in serine proteases.
  • the conserved amino acid residues within the motif involved in the splicing reaction within the G module are Asn, Ser, Cys, Thr, and His.
  • the conserved sites in the motif of the A module (such as Ser, Cys) can be replaced by Ala, Gln or Pro in some inteins, and the same is true for the motif of the G module.
  • hearing loss refers to hearing below the normal hearing threshold level determined by audiometry, including mild, moderate, severe and profound hearing loss, and deafness.
  • Hearing loss can be described by a percentage of hearing loss, for example, 30%, 60%, 80% or even 100% hearing loss, or by a grading description of hearing loss.
  • the hearing loss can be hearing loss caused by or associated with a gene defect, such as congenital deafness and pre-lingual deafness caused by genetic factors, or hearing loss associated with genetic factors, induced by environmental factors (for example, aging, noise, drugs or infection-induced).
  • Hearing loss can be asymptomatic (that is, there is no associated visible outer ear or other organ abnormality) or symptomatic.
  • hearing loss is sensorineural hearing loss.
  • a “hearing loss-associated gene” refers to a gene whose variation can cause hearing loss or susceptibility to hearing loss by altering the ability of the inner ear to function normally. In this article, such genes are also referred to as “hearing loss genes.” More than 100 genes have been identified as being associated with hearing loss (see Hereditary Hearing Loss Homepage, https://hereditaryhearingloss.org/, which lists the gene locations and identification data for currently known single-gene asymptomatic hearing loss). In the case of causing susceptibility to hearing loss, individuals carrying the hearing loss gene variation may show a greater susceptibility to hearing loss due to environmental factors, such as aging, noise, drugs, or infections, relative to healthy individuals.
  • cochlear inner hair cells refer to isolated or in vitro cochlear inner hair cells, cell lines, or cell populations derived from a mammal, or inner hair cells in the cochlea of a mammal.
  • cochlear outer hair cells refer to cochlear outer hair cells, cell lines, or cell populations isolated from or in vitro of a mammal, or outer hair cells in the cochlea of a mammal.
  • an "isolated" nucleic acid refers to a nucleic acid molecule that has been artificially synthesized or separated from at least some components of its natural environment.
  • an isolated nucleic acid can be part of a larger nucleic acid, or part of a vector or composition of matter, or can be contained within a cell and still be “isolated” provided that the larger nucleic acid, vector, composition of matter, or specific cell is not the natural environment of the nucleic acid.
  • operably linked also referred to as “effectively linked” or “functionally linked,” means that two or more polynucleotide (e.g., DNA) segments are in a relationship that allows them to function in the intended manner.
  • a promoter sequence is operably linked to a coding sequence if it stimulates or regulates the transcription of the coding sequence in a suitable host cell or other expression system.
  • promoters that are operably linked to a transcribable sequence are contiguous with the transcribable sequence, i.e., they are cis-acting.
  • some transcriptional regulatory sequences e.g., enhancers
  • full-length MYO7A protein refers to a MYO7A protein produced by operatively linking the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein expressed in the dual vector system of the present invention.
  • the full-length MYO7A protein is a wild-type or functional human MYO7A protein.
  • the amino acid sequences of wild-type and functional hMYO7A proteins and the polynucleotide sequences encoding them are known in the art (see, for example, GenBank accession numbers NP_000251 and U39226.1).
  • the full-length MYO7A protein is the full-length MYO7A protein shown in SEQ ID NO. 2, or a functional derivative or functional fragment thereof.
  • AAV adeno-associated virus
  • the first AAV virus isolated was serotype 2 (AAV2).
  • AAV2 serotype 2
  • the AAV2 genome is approximately 4.7 kb long, flanked by 145-bp inverted terminal repeats (ITRs) at each end, forming a palindromic hairpin structure.
  • the genome contains two large open reading frames (ORFs), encoding the rep and cap genes, respectively.
  • ITRs are cis-acting elements of the AAV vector genome, playing a crucial role in AAV integration, rescue, replication, and genome packaging.
  • the ITR sequence contains the Rep binding site (RBS) and the terminal resolution site (TRs), which are recognized by the Rep protein and produce a cleavage at the TRs.
  • the ITR sequence also forms a unique "T"-shaped secondary structure, which plays a crucial role in the AAV life cycle.
  • the rest of the AAV2 genome can be divided into two functional regions, the rep gene region and the cap gene region.
  • the rep gene region encodes four Rep proteins: Rep78, Rep68, Rep52, and Rep40.
  • Rep proteins play an important role in the replication, integration, rescue, and packaging of AAV viruses.
  • Rep78 and Rep68 specifically bind to the terminal melting sites trs and GAGY repeat motifs in the ITR, initiating the replication of the AAV genome from single-stranded to double-stranded.
  • the trs and GAGC repeat motifs and/or GAGY repeat motifs in the ITR are the center of AAV genome replication. Therefore, although the ITR sequences are different in various serotypes of AAV viruses, they can all form a hairpin structure and contain Rep binding sites.
  • Rep52 and Rep40 have ATP-dependent DNA helicase activity but do not have the function of binding to DNA.
  • the cap gene encodes the AAV capsid proteins VP1, VP2, and VP3.
  • VP3 has the smallest molecular weight but is the most abundant. In mature AAV particles, the ratio of VP1, VP2, and VP3 is approximately 1:1:10.
  • VP1 is essential for the formation of infectious AAV; VP2 facilitates VP3 entry into the cell nucleus; and VP3 is the primary protein in AAV particles.
  • AAV vector refers to an efficient exogenous gene transfer tool, i.e., an AAV vector, that has been transformed from wild-type AAV virus as people gain a better understanding of the AAV virus life cycle and its related molecular biological mechanisms.
  • the modified AAV vector genome only contains the ITR sequence of the AAV virus and the exogenous sequence to be transferred.
  • the Rep and Cap proteins required for AAV virus packaging are provided in trans by other exogenous plasmids, thereby reducing the possible harm caused by packaging the rep and cap genes into the AAV vector.
  • the AAV virus itself is not pathogenic, which makes the AAV vector recognized as one of the safest viral vectors.
  • AAV virus serotypes There are many AAV virus serotypes, and different serotypes have different tissue infection tropisms. Therefore, the use of AAV vectors can transport exogenous genes to specific organs and tissues.
  • the existing technology has a relatively mature packaging system for AAV vectors, which facilitates the large-scale production of AAV vectors.
  • vector genome refers to the nucleic acid sequence that is packaged within the rAAV capsid to form the rAAV vector.
  • mammals include, but are not limited to, humans, non-human primates (e.g., cynomolgus monkeys, rhesus monkeys), rodents, and other mammals, such as cattle, pigs, horses, and dogs.
  • mammals include individuals at all stages of development, including embryonic and fetal stages.
  • treatment refers to clinical intervention intended to alter the natural course of a disease in the individual being treated. Desired therapeutic effects include, but are not limited to, preventing the onset or recurrence of the disease, alleviating symptoms, reducing any direct or indirect pathological consequences of the disease, preventing metastasis, reducing the rate of disease progression, ameliorating or palliating the disease state, and alleviating or improving prognosis.
  • treatment also encompasses modification or improvement of at least one physical parameter, including physical parameters that may not be discernible by the patient.
  • prevention refers to preventing or delaying the onset or development or progression of a disease or condition.
  • prevention generally refers to hospital intervention performed before at least one symptom of a disease occurs.
  • the present invention utilizes protein trans-splicing, that is, the N-terminal and C-terminal CDS of MYO7A are constructed into two different plasmids for expression, and the complete full-length protein is assembled through intein splicing.
  • the N-terminal plasmid adds the N-terminal coding sequence of the intein at the 3' end of the MYO7A N-terminal CDS sequence, such as the nucleic acid sequence encoding the Rm-N-intein sequence (SEQ ID NO: 23), and the first amino acid residue of the Rm-N-intein sequence contains Cys.
  • the C-terminal plasmid adds the C-terminal coding sequence of the intein at the 5' end of the MYO7A C-terminal CDS sequence, such as the nucleic acid sequence encoding the Rm-C-intein (SEQ ID NO: 24), and the terminal sequence of the Rm-C-intein contains His and Asn, and the first amino acid residue of the MYO7A C-terminal CDS sequence is Cys, Ser, and Thr for intein splicing.
  • the present invention provides a dual vector system for expressing MYO7A protein, which comprises a first nucleic acid vector and a second nucleic acid vector, wherein
  • the first nucleic acid vector comprises a first nucleotide sequence
  • the second nucleic acid vector comprises a second nucleotide sequence
  • the first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences
  • the second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A, an N-terminal coding sequence of an intein, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A and polyA.
  • Inteins can splice proteins and exert their effects by covalently linking two different proteins after or during protein translation.
  • the earliest inteins were discovered in fungi. Comparison and analysis of intein sequences predicts that there are over 600 intein genes present in viruses, bacteria, archaea, and eukaryotic microorganisms. Most inteins are complete proteins, but a small number of inteins have separate N- and C-termini. Inteins are linked to a portion of a protein at each end, and then reassemble after translation to produce the complete protein through nucleophilic chemical reactions and conformational changes.
  • the intein is separated from the N-terminus and the C-terminus.
  • the intein can be derived from MxeGyrA, pabPolIII, MjaKlbA, SspDnaB, SceVMA, SspDnaE, NpuDnaE, AvaDnaE, CraDnaE, CspDnaE, CwaDnaE, MchtDnaE, OliDnaE, TerDnaE, gp41-1, gp41-8, IMPDH-1 or RmaDnaB.
  • the intein is an RmaDnaB intein, e.g., having an N-terminal portion of the RmaDnaB intein set forth in SEQ ID NO: 23 and a C-terminal portion of the RmaDnaB intein set forth in SEQ ID NO: 24.
  • the intein is an NpuDnaE intein, e.g., having an N-terminal portion of the NpuDnaE intein set forth in SEQ ID NO: 52 and a C-terminal portion of the NpuDnaE intein set forth in SEQ ID NO: 54.
  • the MYO7A protein comprises or consists of the amino acid sequence of SEQ ID No 2.
  • a cleavage site is provided in the amino acid sequence of the MYO7A protein, dividing the MYO7A protein into the N-terminal portion of the MYO7A protein (also referred to herein as the "N-terminus of MYO7A") and the C-terminal portion of the MYO7A protein (also referred to herein as the "C-terminus of MYO7A").
  • the N-terminus of MYO7A is the sequence from the N-terminus of the MYO7A amino acid sequence to the cleavage site
  • the C-terminus of MYO7A is the sequence from the amino acid residue immediately adjacent to the cleavage site to the C-terminus of the MYO7A amino acid sequence.
  • the N-terminus of MYO7A is linked and fused to the N-terminus of the intein
  • the C-terminus of the intein is linked and fused to the C-terminus of MYO7A.
  • Table 1 lists the locations of some of these cleavage sites on the MYO7A protein and the corresponding N-terminal and C-terminal portions of MYO7A.
  • the MYO7A protein is cleaved at one or more of the following groups of amino acid residues to form the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein: 1043; 1058; 1061; 1064; 1071; 1076; 1081; 1104; 1105; 1114; 1119; 1122; or 1126, wherein the amino acid position is relative to the position of SEQ ID No: 2.
  • the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein and a C-terminal portion of the MYO7A protein selected from any one of the following groups:
  • the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1061 aa and a C-terminal portion of the MYO7A protein of 1062-2215 aa.
  • the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 4 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 6.
  • the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein ranging from 1 to 1064 aa and a C-terminal portion of the MYO7A protein ranging from 1065 to 2215 aa.
  • the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 8 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 10.
  • the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1104 aa and a C-terminal portion of the MYO7A protein of 1105-2215 aa.
  • the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 12 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 14.
  • the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1105 aa and a C-terminal portion of the MYO7A protein of 1115-2215 aa.
  • the N-terminal portion of the MYO7A protein is SEQ ID No: 16 and the C-terminal portion of the MYO7A protein is SEQ ID No: 18.
  • the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1119 aa and a C-terminal portion of the MYO7A protein of 1120-2215 aa.
  • the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 20 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 22.
  • the vector plasmid of the present invention can be any plasmid that can replicate in a host cell and express a corresponding polypeptide.
  • the vector plasmid comprises two ITR sequences, namely a 5' inverted terminal repeat (5' ITR) sequence and a 3' inverted terminal repeat (3' ITR) sequence.
  • the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences
  • the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences
  • the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.
  • the present invention provides a dual vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:
  • the first nucleic acid vector comprises, in 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a nucleic acid sequence encoding the N-terminal portion of the MYO7A protein, a nucleic acid sequence encoding the N-terminal portion of the intein, and a 3' inverted terminal repeat (3'ITR) sequence;
  • 5'ITR 5' inverted terminal repeat
  • 3'ITR 3' inverted terminal repeat
  • the second nucleic acid vector comprises, in the 5'-3' direction: a 5'ITR sequence, a nucleic acid sequence encoding the C-terminal portion of an intein, a nucleic acid sequence encoding the C-terminal portion of a MYO7A protein, and a 3'ITR sequence, and
  • the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein are operably linked to produce the MYO7A protein.
  • the present invention provides a two-vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:
  • the first nucleic acid vector comprises, in the 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a nucleic acid sequence encoding the N-terminal portion of the MYO7A protein, a nucleic acid sequence encoding the N-terminal portion of the intein, and a 3' inverted terminal repeat (3'ITR) sequence;
  • 5'ITR 5' inverted terminal repeat
  • 3'ITR 3' inverted terminal repeat
  • the second nucleic acid vector comprises, in 5'-3' direction: a 5' ITR sequence, a nucleic acid sequence encoding the C-terminal portion of an intein, a nucleic acid sequence encoding the C-terminal portion of a MYO7A protein, and a 3' ITR sequence, and
  • the MYO7A protein has a MYO7A cleavage site in its amino acid sequence, for example, the MYO7A protein has an amino acid sequence as shown in SEQ ID NO: 2 or a functional fragment thereof, for example, an amino acid sequence having at least 80% sequence identity with SEQ ID NO: 2;
  • the N-terminal portion of the MYO7A protein is the sequence from the N-terminus of the MYO7A amino acid sequence to the MYO7A cleavage site;
  • the C-terminal portion of the MYO7A protein is a sequence from the amino acid after the MYO7A cleavage site to the C-terminus of the MYO7A amino acid sequence;
  • the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein are operably linked to produce a full-length MYO7A protein.
  • the nucleotide sequences of the ITRs in the dual vector system are derived from the same AAV serotype or different AAV serotypes, for example, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9 serotypes.
  • the 5'-ITR and 3'-ITR of the first nucleic acid vector and the 5'-ITR and 3'-ITR of the second nucleic acid vector are derived from the same AAV serotype.
  • the 5'-ITR and 3'-ITR of the first nucleic acid vector and the 5'-ITR and 3'-ITR of the second nucleic acid vector are derived from different AAV serotypes.
  • tissue-specific promoter is used in a two-vector system, for example, a promoter that mediates expression in the ear, such as the synapsin promoter or the GFAP promoter.
  • any one of the following promoters is used in the binary vector system: cytomegalovirus (CMV) promoter, SV40 promoter, Rous sarcoma virus (RSV) promoter, CAG promoter, chimeric CMV/chicken beta actin (CBA) promoter, truncated CBA (smCBA) promoter, UbC promoter, SFFV promoter, EF1 ⁇ promoter, PGK promoter, or promoters of Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, ⁇ 9AchR, ⁇ 10AchR, OTOF and STRC encoding genes.
  • the promoter is a CMV promoter.
  • the dual vector system of the present invention may also include one or more additional regulatory sequences that can function before or after transcription.
  • the regulatory sequences may be part of the native transgenic locus or may be heterologous regulatory sequences.
  • a portion of the 5'UTR or 3'UTR of the native transgenic transcript may be included in the dual vector system of the present invention.
  • the regulatory sequence may be any sequence that promotes transgene expression, i.e., serves to increase transcript expression, improve nuclear export of mRNA, or enhance its stability.
  • Such regulatory sequences include, for example, enhancer elements, post-transcriptional regulatory elements, and polyadenylation sequences.
  • Enhancers are cis-regulatory elements that affect the transcription of genes on the same molecule of DNA. Enhancers can be located upstream, downstream, within introns, or even relatively far from the genes they regulate.
  • the preferred post-transcriptional regulatory element used in the dual vector system of the present invention is the woodchuck hepatitis post-transcriptional regulatory element (WPRE) or a variant thereof.
  • WPRE woodchuck hepatitis post-transcriptional regulatory element
  • an AAV vector containing WPRE or a variant thereof increases the expression of MYO7A protein.
  • the dual vector system of the present invention comprises a WPRE nucleotide sequence as set forth in SEQ ID NO:28.
  • the dual vector system of the present invention comprises a post-transcriptional regulatory element having a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to the WPRE nucleotide sequence as set forth in SEQ ID NO:28, wherein the nucleotide sequence substantially retains the functional activity of the post-transcriptional regulatory element as set forth in SEQ ID NO:28, for example, a truncated variant of WPRE.
  • the truncated variant of WPRE has the WPRE3 nucleotide sequence as set forth in SEQ ID NO:31.
  • the dual vector system of the present invention comprises a polyadenylation sequence, for example, a bovine growth hormone polyadenylation sequence, an SV40 polyadenylation sequence, and/or an SV40 late polyadenylation sequence.
  • the dual vector system of the present invention comprises an SV40 polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 29.
  • the dual vector system of the present invention comprises an SV40 late polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 32.
  • the dual vector system of the present invention comprises a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence.
  • the dual vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 27, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 27.
  • the combination of the WPRE nucleotide sequence and the SV40 polyadenylation sequence allows for high-level expression of a transgene.
  • the dual-vector system of the present invention comprises a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence.
  • the dual-vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 30, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 30.
  • the combination of the WPRE3 nucleotide sequence and the SV40 late polyadenylation sequence can efficiently express larger exogenous genes while occupying less AAV packaging capacity.
  • the present invention uses the dual-vector system to deliver the MYO7A protein gene in two parts to inner ear cells, inner hair cells, or outer hair cells, where the N-terminal and C-terminal parts of the MYO7A protein expressed undergo trans-splicing to form the full-length MYO7A protein.
  • the present invention demonstrates that the dual-vector system for expressing the MYO7A protein can effectively transduce the targeted inner ear cells, inner hair cells, or outer hair cells, producing the MYO7A protein in these cells and durably restoring hearing loss caused by MYO7A gene knockout.
  • the dual vector system of the present invention allows for the expression of homologous polypeptides having an amino acid sequence that is at least 70% identical and/or similar to SEQ ID NO: 2. More preferably, the homologous sequence is at least 75%, even more preferably at least 80%, or at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 99%, or at least 99% identical and/or similar to SEQ ID NO: 2. When the homologous polypeptide is much shorter than SEQ ID NO: 2, a local alignment may be considered.
  • the dual vector system of the present invention can allow for the expression of functional fragments of the MYO7A protein polypeptide.
  • functional fragment herein refers to any fragment that retains at least one biological function of the target MYO7A protein polypeptide.
  • the full-length MYO7A protein can be obtained by transforming host cells using the dual vector system of the present invention.
  • the host cells are selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells, and Sf9 cells.
  • the dual-vector system of the present invention is used to administer to patients suffering from Myo7a mutation-induced hearing loss.
  • Patient suffering from Myo7a mutation-induced hearing loss refers to a patient, particularly a human patient, who is believed to have (or has been diagnosed with) a mutation in the constitutive Myo7a gene that triggers abnormal expression, abnormal function, or both of the MYO7A protein.
  • the Myo7a mutation-induced hearing loss is USH1B, autosomal recessive hearing loss DFNB2, or autosomal dominant hearing loss DFNA11.
  • the dual vector system of the present invention is a dual AAV vector system.
  • the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having a capsid of the same or different AAV origin, for example, the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV-Anc80 capsid or an AAV vector with a chimeric capsid, in particular an AAV vector with an AAV-Anc80 capsid.
  • the synthetic adeno-associated virus vector Anc80L65 is used, which has been shown to have the highest transduction efficiency of inner ear hair cells reported to date (Suzuki et al., Sci. Rep. 7: 45524 (2017)).
  • the dual-vector system of the present invention can trigger the expression of the full-length MYO7A protein polypeptide, or a functional fragment thereof, in inner ear cells, inner hair cells, or outer hair cells.
  • the patients to whom the dual vector system of the present invention is administered are preferably newborn human infants, usually less than 6 months old, or even less than 3 months old (if they were diagnosed with DFNB16 deafness in childhood). These human infants are more preferably between 3 months and 1 year old.
  • the two-vector system of the present invention can also be administered to, for example, infants (2-6 years), children (6-12 years), adolescents (12-18 years), or adults (18 years and older).
  • the term “treating” is intended to mean administering a therapeutically effective amount of the dual-vector system of the present invention to a patient suffering from DFNB16 deafness to partially or completely restore the patient's hearing. Restoration can be assessed by testing auditory brainstem responses (ABRs) using electrophysiological equipment. “Treatment of Myo7a mutation-induced hearing loss” specifically refers to complete restoration of hearing function. The term “preventing” refers to reducing or delaying hearing loss within the auditory frequency range.
  • Example 1 Selecting intein cleavage sites in the amino acid sequence of MYO7A protein
  • Figure 1 shows a schematic diagram of intein-mediated full-length extein expression.
  • An intein cleavage site is set within the amino acid sequence of the MYO7A protein.
  • the N-terminal coding sequence of MYO7A is the nucleotide coding sequence from the N-terminus of the MYO7A amino acid sequence to the cleavage site
  • the C-terminal coding sequence of MYO7A is the nucleotide coding sequence from the amino acid immediately following the cleavage site to the C-terminus of the MYO7A amino acid sequence.
  • the N-terminal sequence of MYO7A is fused to the N-terminal sequence of the intein, and the C-terminal sequence of the intein is fused to the C-terminal sequence of MYO7A.
  • Numerous cleavage sites can be selected within the amino acid sequence of the MYO7A protein.
  • Table 2 lists 13 cleavage site schemes and the corresponding MYO7A N-terminal and C-terminal amino acid sequences. The goal is to identify cleavage sites with improved efficacy for Myo7A gene therapy.
  • Figure 2 shows a schematic diagram of the screening process for Myo7A cleavage sites through intein splicing.
  • the 1043rd amino acid residue is used as the cleavage site (S1).
  • the 1058th amino acid residue is used as the cleavage site (S2).
  • amino acid residue 1061 is used as the cleavage site (S3).
  • S3 and SEQ ID NO:4 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively
  • SEQ ID NO:5 and SEQ ID NO:6 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.
  • amino acid residue 1064 is used as the cleavage site (S4).
  • SEQ ID NO:7 and SEQ ID NO:8 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively
  • SEQ ID NO:9 and SEQ ID NO:10 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.
  • the 1071st amino acid residue is used as the cleavage site (S5).
  • the 1076th amino acid residue is used as the cleavage site (S6).
  • the 1081st amino acid residue is used as the cleavage site (S7).
  • amino acid residue 1104 is used as the cleavage site (S8).
  • SEQ ID NO:11 and SEQ ID NO:12 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively
  • SEQ ID NO:13 and SEQ ID NO:14 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.
  • amino acid residue 1105 is used as the cleavage site (S9).
  • SEQ ID NO:15 and SEQ ID NO:16 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively
  • SEQ ID NO:17 and SEQ ID NO:18 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.
  • the 1114th amino acid residue is used as the cleavage site (S10).
  • amino acid residue 1119 is used as the cleavage site (S11).
  • SEQ ID NO:19 and SEQ ID NO:20 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively
  • SEQ ID NO:21 and SEQ ID NO:22 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.
  • the 1122nd amino acid residue is used as the cleavage site (S12).
  • the 1126th amino acid residue is used as the cleavage site (S13).
  • Example 2 Construction of a dual-vector system for expressing MYO7A protein using a plasmid containing ITR sequences
  • a first vector plasmid expressing the N-terminus of the MYO7A protein and a second vector plasmid expressing the C-terminus of the MYO7A protein were constructed.
  • the constructed plasmid elements are shown in the upper and lower panels of Figure 2, respectively.
  • the second vector plasmid was connected to the end of the coding sequence (CDS) expressing the C-terminus of the MYO7A protein (HA tag sequence YPYDVPDYA (SEQ ID NO. 25) for verifying in vitro expression, thereby obtaining a dual-vector system.
  • the first vector plasmid and the second vector plasmid in the dual-vector system are obtained by modifying the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone shown in Figure 3. After expression, the first vector plasmid and the second vector plasmid achieve expression of the MYO7A full-length protein by protein trans-splicing.
  • the first vector plasmid is connected to the sequence encoding the N-terminal fragment of the intein (Rm-N, SEQ ID NO: 23) at the 3' end of the sequence encoding the Myo7a-N sequence (i.e., the sequence of the N-terminal part of Myo7a), and the second vector plasmid is connected to the sequence encoding the Myo7a-C sequence (i.e., the sequence of the C-terminal part of Myo7a) at the 3' end of the sequence encoding the C-terminal fragment of the intein (Rm-C sequence, SEQ ID NO: 24), and the end of the Myo7a-C sequence is connected to an HA tag.
  • the EGFP reporter gene sequence and other sequences in the pAAV-CMV-EGFP-WPRE-SV40 plasmid were replaced with a sequence encoding the Myo7a-N sequence and the N-terminal fragment of the intein (Rm-N, SEQ ID NO: 23) using EcoRI and EcoRV enzymes to obtain the first vector plasmid.
  • the EGFP reporter gene sequence in the pAAV-CMV-EGFP-WPRE-SV40 plasmid was replaced with a sequence encoding the C-terminal fragment of the intein (Rm-C sequence, SEQ ID NO: 24) and the Myo7a-C sequence using EcoRI and EcoRV enzymes to obtain the second vector plasmid.
  • the synthesis of these sequences and the construction of these vectors were commissioned to Nanjing GenScript Biotechnology Co., Ltd.
  • the plasmid transcripts all contain a covalently linked MYO7A portion and an Rm intein portion.
  • the plasmid names and transcripts are shown in Table 3 below.
  • Example 2 The 13 pairs of plasmids shown in Table 3 obtained in Example 2 were transfected into HEK-293T cells (cells purchased from ATCC) in pairs, and the expression of full-length MYO7A protein was analyzed by Western blotting 48 hours after transfection.
  • the specific experimental method is as follows.
  • HEK-293T cells Human Embryonic Kidney 293T cells, hereinafter referred to as "293T cells"
  • 293T cells Human Embryonic Kidney 293T cells
  • Prepare Tube A 125 ⁇ L serum-free DMEM medium + 8 ⁇ L Lipofectamine 3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly.
  • Prepare Tube B 125 ⁇ L serum-free DMEM medium + 2 ⁇ g first vector plasmid + 2 ⁇ g second vector plasmid + 8 ⁇ L P3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly.
  • Tube B contains a mixture of culture medium and Lipofectamine 3000 transfection reagent
  • Tube B contains a mixture of culture medium, vector plasmid DNA, and transfection enhancer P3000.
  • the obtained DNA-liposome complex was added to 293T cells for transfection, and the cells were incubated at 37°C in 95% air and 5% CO2 . After 48 hours of transfection, the cells were harvested by centrifugation.
  • the cell pellet harvested by centrifugation was fully resuspended in an appropriate amount of RIPA lysis buffer (Thermo Fisher Scientific, catalog number: 89900) (supplemented with 1% protease inhibitor cocktail (Thermo Fisher Scientific, catalog number: 87786), 1% PMSF), and lysed on ice for 30 minutes. Vortex every 10 minutes during this period to fully resuspend the cell pellet in the lysis buffer.
  • the incubated PVDF membrane was removed and rinsed three times with 1X TBST for 5 minutes each time.
  • the membrane was incubated with the secondary antibody (HRP-conjugated Affinipure Goat Anti-Mouse IgG (H+L), Proteintech, catalog number: SA00001-1) at room temperature for 1 hour and rinsed three times with 1X TBST for 5 minutes each time.
  • the chemiluminescent reagent (ECL) was then added for development in a dark room.
  • Lane “1” indicates the protein expression level of MYO7A after co-transfection of 293T cells with pAAV-CMV-MYO7A-N-S1-Rm-N-intein plasmid and pAAV-CMV-Rm-C-intein-MYO7A-C-S1 plasmid;
  • Lane “2" represents the protein expression level of MYO7A after co-transfection of 293T cells with pAAV-CMV-MYO7A-N-S2-Rm-N-intein plasmid and pAAV-CMV-Rm-C-intein-MYO7A-C-S2 plasmid; ...
  • Lane “13” indicates the protein expression level of MYO7A after co-transfection of pAAV-CMV-MYO7A-N-S13-Rm-N-intein plasmid and pAAV-CMV-Rm-C-intein-MYO7A-C-S13 plasmid into 293T cells.
  • the intein cleavage site S3 of the MYO7A protein was selected, and a dual-vector system containing a truncated WPRE was constructed, wherein the WPRE+SV40 poly(A) (717bp) nucleotide sequence shown in SEQ ID NO:27 in the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone shown in Figure 3 was replaced with the WPRE3-SV40 late poly(A) (432bp) nucleotide sequence shown in SEQ ID NO:30, and a first vector plasmid and a second vector plasmid containing a truncated WPRE were constructed.
  • the first vector plasmid and the second vector plasmid containing the truncated WPRE were co-transfected into HEK-293T cells, and the expression of the full-length MYO7A protein was analyzed by Western blotting 48 hours after transfection.
  • the results were compared with the Western blotting results of HEK-293T cells co-transfected with the first vector plasmid and the second vector plasmid (containing the full-length WPRE) using the intein cleavage site S3 of the MYO7A protein in Example 3.1, and the Western blotting results of HEK-293T cells co-transfected with the first vector plasmid and the second vector plasmid that did not contain the full-length WPRE or its truncated variants or the poly A sequence.
  • the 10 plasmids corresponding to the intein cleavage sites S3, S4, S8, S9, and S11 of the MYO7A protein constructed in Example 2 were mixed with pHelper plasmid (synthesized by Nanjing GenScript) and pRC plasmid (containing the PHP.B VP1 gene, partial CDS sequence ID: KU056473.1) at a molar ratio of 1:1:1, and co-transfected into HEK-293T cells using PEI transfection reagent (a total of 1 ⁇ g of the three plasmids was added per approximately one million cells).
  • the cells were cultured in DMEM medium containing 10% fetal bovine serum in a 5% carbon dioxide incubator at 37°C for 3 days, washed once with PBS buffer, collected, and repeatedly frozen and thawed five times. NaCl was added to a final concentration of 500 mM, and 10,0 The cells were centrifuged at 00g for half an hour, and the supernatant was filtered with a 0.45 ⁇ m filter membrane. The filtrate was purified and concentrated to obtain an adeno-associated virus with a virus titer of 4.19E+13GC/ml. The adeno-associated virus was named according to the name of the plasmid.
  • the adenovirus packaged with pAAV-CMV-MYO7A-N-S3-Rm-N-intein, pHelper plasmid and pRC plasmid was named pAAV-CMV-MYO7A-N-S3-Rm-N-intein adeno-associated virus (AAV);
  • the adenovirus packaged with pAAV-CMV-Rm-C-intein-MYO7A-C-S3, pHelper plasmid and pRC plasmid was named pAAV-CMV-Rm-C-intein-MYO7A-C-S3 AAV.
  • Example 4 To verify whether the candidate intein cleavage site can express MYO7A protein in the inner ear hair cells, the following paired adeno-associated viruses constructed in Example 4 were respectively injected into the ears of three P3 (i.e., 3 days after birth) Myo7a (p.Q720X) point mutation KO mice (the mice were commissioned by Saiye Biotechnology Co., Ltd. to construct using the CRISPR/Cas9 method.
  • the Myo7a (p.Q720X) point mutation will lead to the loss of Myo7A protein expression in the inner ear hair cells, thereby leading to hair cell death and hearing loss) through the round window administration for in vivo expression verification.
  • the virus was injected. Mice were anesthetized hypothermically on ice for 1-2 minutes. After anesthesia, a postauricular incision was made to expose the round window to the visual field. A glass micropipette was used to inject a paired adeno-associated virus from (i) to (v) totaling 2 ⁇ 10 10 viral genomes (i.e., 1 ⁇ 10 10 viral genomes for each AAV). After injection, the skin wound was sealed with 3M Vethod tissue glue. Two weeks later, the mice were sacrificed, and the cochlea was removed and fixed with 4% paraformaldehyde (PFA). Then, the cochlea was decalcified with EDTA solution until ready for use.
  • PFA paraformaldehyde
  • the immunofluorescence results in Figure 16 show that the dual AAV system achieves the expression of MYO7A protein in the mouse cochlea.
  • WT is a wild-type mouse
  • blue is parvalbumin-labeled hair cells
  • green is Myo7a protein.
  • Myo7a (p.Q720X) mice do not express Myo7a protein when they are not injected with paired AAV viruses.
  • Myo7a (p.Q720X) + AAV” is Myo7a (p.Q720X) mice injected with paired AAV viruses. After virus injection, Myo7a (p.Q720X) mice expressed Myo7a protein in the inner ear hair cells, and the expression level was similar to that of WT mice.
  • Myo7a knockout mice are Myo7a (p.Q720X) point mutation KO mice, which were commissioned to Saiye Biotechnology Co., Ltd. to construct.
  • the paired adeno-associated viruses of Example 5 were injected into the inner ear of P3 mice through the round window for in vivo expression verification. First, the virus was injected. The mice were anesthetized in ice for 1-2 minutes. After anesthesia, a post-auricular incision was made to expose the round window to the field of view, and the virus was injected with a glass micropipette. After the injection, the skin wound was sealed with 3M Vethod tissue glue.
  • ABR auditory brainstem response
  • mice's hearing thresholds were used as stimuli to measure the mice's hearing thresholds. This analysis of the mice's hearing sensitivity allowed them to assess overall hearing function, from hair cells to the cerebral cortex. Higher ABR thresholds indicate more severe hearing loss in Myo7a knockout mice. Conversely, lower ABR thresholds indicate greater efficacy of gene therapy.
  • amino acid sequence of the N-terminal portion of the RmaDnaB intein is SEQ ID NO: 23
  • amino acid sequence of the C-terminal portion of the RmaDnaB intein is SEQ ID NO: 24
  • nucleotide sequence of the vector pAAV-CMV-EGFP-WPRE-SV40 plasmid shown in Figure 3 is shown in SEQ ID NO:26.
  • Nucleotide sequence of SV40 poly(A) signal (122 bp): SEQ ID NO: 29
  • Nucleotide sequence of SV40 late poly(A) signal SEQ ID NO: 32
  • amino acid sequence of the N-terminal portion of the NpuDnaE intein is SEQ ID NO: 52
  • nucleotide sequence of the N-terminal portion of the NpuDnaE intein is SEQ ID NO: 53
  • amino acid sequence of the C-terminal portion of the NpuDnaE intein is SEQ ID NO: 54
  • nucleotide sequence of the C-terminal portion of the NpuDnaE intein is SEQ ID NO: 55

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Medicinal Chemistry (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Veterinary Medicine (AREA)
  • Public Health (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Biochemistry (AREA)
  • Animal Behavior & Ethology (AREA)
  • Epidemiology (AREA)
  • Microbiology (AREA)
  • Biophysics (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Virology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Immunology (AREA)
  • Plant Pathology (AREA)
  • Physics & Mathematics (AREA)
  • General Chemical & Material Sciences (AREA)
  • Toxicology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Marine Sciences & Fisheries (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The present invention relates to a dual-vector system for expressing an MYO7A protein. The dual-vector system comprises a first nucleic acid vector and a second nucleic acid vector, wherein the first nucleic acid vector comprises a first nucleotide sequence, and the second nucleic acid vector comprises a second nucleotide sequence. The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences, and the second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences. The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A, an N-terminal coding sequence of intein, and polyA; and the expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of intein, a C-terminal coding sequence of MYO7A, and polyA. The present invention also relates to a packaging vector system of an adeno-associated virus, a method for packaging an adeno-associated virus, and an adeno-associated virus obtained thereby. The dual-vector system for expressing the MYO7A protein, or the adeno-associated virus can be used for gene therapy, especially for the treatment of hearing loss, such as for the treatment of MYO7A-mutation-related USH1B, autosomal recessive hereditary deafness DFNB2 and autosomal dominant hereditary deafness DFNA11.

Description

表达MYO7A蛋白的双载体系统及其用途Dual vector system for expressing MYO7A protein and its use 技术领域Technical Field

本发明涉及表达MYO7A蛋白的双载体系统、及其在基因治疗中的应用,尤其是在治疗听力损失中的应用。The present invention relates to a dual-vector system for expressing MYO7A protein and its application in gene therapy, in particular to the treatment of hearing loss.

背景技术Background Art

Myo7a基因编码MYO7A蛋白(也称为“肌球蛋白7A”)。Myo7a基因自1996年报道至今,已被证实有500多个突变位点与遗传性耳聋相关。Myo7a基因位于染色体11q13.5,编码由2215个氨基酸组成的非常规的肌球蛋白ⅦA,是一种马达分子,广泛表达在视网膜上皮和内耳毛细胞中。Myo7a基因突变导致内耳毛细胞静纤毛形态异常,以及毛细胞和神经元死亡,与Usher综合征(Usher Syndrome;缩写为USH)(又称遗传性耳聋-色素性视网膜炎综合征)1B型、常染色体隐性遗传性聋DFNB2、常染色体显性遗传性聋DFNA11相关。因此不论是助听器还是人工耳蜗植入都不是相关听力损伤的完美有效的治疗方案。The Myo7a gene encodes the MYO7A protein (also known as "myosin 7A"). Since the Myo7a gene was first reported in 1996, more than 500 mutation sites have been confirmed to be associated with hereditary deafness. The Myo7a gene is located on chromosome 11q13.5 and encodes an unconventional myosin VIIA composed of 2215 amino acids. It is a motor molecule widely expressed in the retinal epithelium and inner ear hair cells. Mutations in the Myo7a gene lead to abnormal morphology of inner ear hair cell stereocilia, as well as hair cell and neuronal death, and are associated with Usher syndrome (USH) (also known as hereditary deafness-retinitis pigmentosa syndrome) type 1B, autosomal recessive deafness DFNB2, and autosomal dominant deafness DFNA11. Therefore, neither hearing aids nor cochlear implants are perfect and effective treatment options for related hearing loss.

快速发展的基因治疗策略为Myo7a基因突变诱发的听力损伤提供了可能的治疗方案。首先,耳蜗的高度格室化结构以及血液-耳蜗屏障(BCB)使其与身体其他器官隔离,能极大的减少用药量并降低药物渗透带来的风险。其次,耳蜗中的细胞高度特化,毛细胞和支持细胞不再分裂,细胞结构稳定,有利于非整合型病毒载体(例如AAV)中的转基因持续表达。此外,同样是被认为是“免疫豁免”的眼睛已经取得人体基因治疗的成功先例,而且AAV已作为病毒载体应用到耳蜗基因OTOF基因突变的治疗中。该病毒已经应用到3例研究者发起的OTOF基因突变的临床研究(Investigator-initiated clinical trials,IIT)中(鼎新公司2例,苏州星奥拓维生物技术有限公司1例),目前没有任何证据表明AAV载体在耳蜗基因治疗会引起任何不良反应。AAV作为载体递送基因治疗药物的安全性得到了重要的保障。Rapidly evolving gene therapy strategies offer a potential treatment for hearing loss caused by Myo7a mutations. First, the cochlea's highly compartmentalized structure and blood-cochlear barrier (BCB) isolate it from other organs, significantly reducing medication dosage and the risk of drug penetration. Second, the cells in the cochlea are highly specialized; hair cells and supporting cells no longer divide, resulting in a stable cellular structure that favors sustained transgene expression via non-integrating viral vectors (such as AAV). Furthermore, successful gene therapy has been demonstrated in the eye, also considered "immune-privileged," and AAV has been used as a viral vector to treat OTOF mutations in the cochlea. This virus has been used in three investigator-initiated clinical trials (IITs) targeting OTOF mutations (two at Dingxin Pharmaceuticals and one at Suzhou Xingaotowei Biotechnology Co., Ltd.). Currently, there is no evidence that AAV vectors cause any adverse reactions in cochlear gene therapy. This significantly enhances the safety of AAV as a vector for delivering gene therapy drugs.

最重要的是,Myo7a基因突变导致的听力损伤是由于单基因蛋白缺失造成的,所以理论上只要将外源的Myo7a基因通过AAV转导在内耳毛细胞内表达出足量的MYO7A蛋白就能修复内耳毛细胞异常形态的静纤毛,维持毛细胞和神经元正常存活,从而保护听觉功能。Most importantly, the hearing loss caused by the Myo7a gene mutation is due to the loss of a single gene protein. Therefore, in theory, as long as the exogenous Myo7a gene is transduced into the inner ear hair cells through AAV to express a sufficient amount of MYO7A protein, the abnormal morphology of the stereocilia in the inner ear hair cells can be repaired, the normal survival of the hair cells and neurons can be maintained, and the hearing function can be protected.

小尺寸的AAV虽然有转导优势但是其装载量有限。由于Myo7a的编码区DNA长度为6.65kb,超过了一个AAV的包装容量(4.5kb)。使得AAV载体不能直接携带完整的Myo7a编码序列(CDS)从而在毛细胞中表达出完整的Myo7a。现有技术中报导了一种双AAV载体策略。在该策略中,双AAV在各mRNA转录后通过mRNA反式剪接实现长片段基因的mRNA表达。该递送方案先将目的基因的编码区分成两部分构建到不同的AAV载体质粒中。两种不同的AAV载体在细胞中转录出目的基因N端和C端mRNA序列,通过剪接供体序列-剪接受体序列(简写为SD-SA序列)完成目的基因N端和C端mRNA拼接,从而形成完整的目的基因mRNA模板从而表达出完整的蛋白。2019年Sebastian Kuegler和Ellen Reisinger(EMBO Mol Med.2019 Jan;11(1):e9396.doi:10.15252/emmm.201809396)以及Omar Akil(Proc Natl Acad Sci U S A.2019 Mar 5;116(10):4496-4501)先后发表文章表明,将分别含鼠源otoferlin cDNA N段和C段的两个AAV同时注射入耳蜗能够高效转染内毛细胞并恢复了OTOF-/-小鼠的部分ABR阈值。该突破性的研究进展为双AAV基因治疗策略走上临床治疗奠定了基础。根据佛罗里达州大学研究基因会报道,通过该方式已在体外筛选出exon21/22和exon23/24两个切分位点可以表达出全长的Myo7a蛋白。Although the small size of AAV has transduction advantages, its loading capacity is limited. Since the DNA length of the coding region of Myo7a is 6.65kb, it exceeds the packaging capacity of an AAV (4.5kb). This makes it impossible for the AAV vector to directly carry the complete Myo7a coding sequence (CDS) to express the complete Myo7a in hair cells. A dual AAV vector strategy has been reported in the prior art. In this strategy, dual AAV achieves mRNA expression of long-fragment genes through mRNA trans-splicing after each mRNA is transcribed. This delivery scheme first divides the coding region of the target gene into two parts and constructs them into different AAV vector plasmids. Two different AAV vectors transcribe the N-terminal and C-terminal mRNA sequences of the target gene in the cell, and complete the splicing of the N-terminal and C-terminal mRNA of the target gene by splicing the donor sequence-splicing acceptor sequence (abbreviated as SD-SA sequence), thereby forming a complete target gene mRNA template to express the complete protein. In 2019, Sebastian Kuegler and Ellen Reisinger (EMBO Mol Med. 2019 Jan; 11(1): e9396.doi: 10.15252/emmm.201809396) and Omar Akil (Proc Natl Acad Sci U S A. 2019 Mar 5; 116(10): 4496-4501) published articles showing that the simultaneous injection of two AAVs containing the N and C segments of mouse otoferlin cDNA into the cochlea can efficiently transfect inner hair cells and restore some ABR thresholds in OTOF -/- mice. This groundbreaking research progress has laid the foundation for the dual AAV gene therapy strategy to enter clinical treatment. According to the University of Florida Gene Research Group, two cleavage sites, exon21/22 and exon23/24, have been screened in vitro through this method to express the full-length Myo7a protein.

目前没有任何药物或手术可以有效治疗Myo7a突变导致的听力损伤,该听力损伤是不可逆转的。不论是在婴儿时期通过佩戴助听器来刺激残余听力,还是人工耳蜗植入都无法恢复甚至是维持残余的听力。双AAV载体的基因治疗策略有可能在毛细胞中表达出MYO7A蛋白,从而恢复毛细胞静纤毛形态,维持毛细胞存活,保护听力。但现有技术中通过mRNA反式剪接产生的全长蛋白效率相对比较低,有可能不足以维持毛细胞长期存活,且尚未应用到听力治疗中,其可靠性尚不得而知。Currently, there are no drugs or surgeries that can effectively treat hearing loss caused by Myo7a mutations, and this hearing loss is irreversible. Neither stimulating residual hearing through hearing aids during infancy nor cochlear implants can restore or even maintain residual hearing. The gene therapy strategy of dual AAV vectors has the potential to express MYO7A protein in hair cells, thereby restoring the morphology of hair cell stereocilia, maintaining hair cell survival, and protecting hearing. However, the efficiency of full-length protein produced by mRNA trans-splicing in existing technologies is relatively low, and may not be sufficient to maintain long-term survival of hair cells. It has not yet been applied to hearing treatment, and its reliability is still unknown.

本发明开发的双载体系统和通过内含肽剪接产生的MYO7A蛋白可以在早期补足毛细胞的MYO7A蛋白,修复毛细胞静纤毛形态和功能,阻止毛细胞死亡,从而防止听力损伤进一步恶化。The dual-vector system developed by the present invention and the MYO7A protein produced by intein splicing can replenish the MYO7A protein of hair cells in the early stage, repair the morphology and function of hair cell stereocilia, prevent hair cell death, and thus prevent further deterioration of hearing loss.

发明概述SUMMARY OF THE INVENTION

现有技术中针对Myo7a基因突变引起的USH1B、常染色体隐性遗传性聋DFNB2、常染色体显性遗传性聋DFNA11没有有效的药物和手术治疗方案,本发明提供的双载体系统可以有效地在耳蜗毛细胞中表达出足量的表达MYO7A蛋白从而修复毛细胞静纤毛形态和功能,维持毛细胞存活并保护听力,可作为将来临床治疗听力损伤的候选药物。In the existing technology, there are no effective drugs or surgical treatments for USH1B, autosomal recessive hearing loss DFNB2, and autosomal dominant hearing loss DFNA11 caused by Myo7a gene mutations. The dual-vector system provided by the present invention can effectively express sufficient MYO7A protein in cochlear hair cells, thereby repairing the morphology and function of hair cell stereocilia, maintaining hair cell survival and protecting hearing. It can be used as a candidate drug for the future clinical treatment of hearing loss.

本发明的AAV双载体系统提供了可以用于治疗USH1B、常染色体隐性遗传性聋DFNB2、常染色体显性遗传性聋DFNA11的MYO7A切分位点、MYO7A蛋白的切分位点序列及其筛选方法。该双载体系统能够用在USH1B的临床治疗中。The AAV dual-vector system of the present invention provides a MYO7A cleavage site, a MYO7A protein cleavage site sequence, and a screening method thereof that can be used to treat USH1B, autosomal recessive hereditary deafness DFNB2, and autosomal dominant hereditary deafness DFNA11. This dual-vector system can be used in the clinical treatment of USH1B.

在第一方面,本发明提供了一种用于表达MYO7A蛋白的双载体系统,其包含第一核酸载体和第二核酸载体,其中In a first aspect, the present invention provides a dual vector system for expressing MYO7A protein, comprising a first nucleic acid vector and a second nucleic acid vector, wherein

第一核酸载体包含第一核苷酸序列;且第二核酸载体包含第二核苷酸序列;The first nucleic acid vector comprises a first nucleotide sequence; and the second nucleic acid vector comprises a second nucleotide sequence;

所述第一核苷酸序列包含插入在两个第一ITR序列之间的表达盒;The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences;

所述第二核苷酸序列包含插入在两个第二ITR序列之间的表达盒;The second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences;

所述第一核苷酸序列的表达盒包含启动子、MYO7A的N端编码序列、内含肽的N端编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A, an N-terminal coding sequence of an intein, and polyA;

所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的C端编码序列和polyA;和The expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A and polyA; and

在MYO7A氨基酸序列中设置有MYO7A切分位点,例如,所述MYO7A氨基酸序列如SEQ ID NO:2所示或其功能片段,例如,所述功能片段是与SEQ ID NO:2具有至少80%序列同一性的氨基酸序列;A MYO7A cleavage site is provided in the MYO7A amino acid sequence, for example, the MYO7A amino acid sequence is as shown in SEQ ID NO: 2 or a functional fragment thereof, for example, the functional fragment is an amino acid sequence having at least 80% sequence identity with SEQ ID NO: 2;

MYO7A的N端编码序列是从MYO7A氨基酸序列的N端至MYO7A切分位点的核苷酸编码序列;MYO7A的C端编码序列是从MYO7A切分位点后一位的氨基酸至MYO7A氨基酸序列的C端的核苷酸编码序列。The N-terminal coding sequence of MYO7A is a nucleotide coding sequence from the N-terminus of the MYO7A amino acid sequence to the MYO7A cleavage site; the C-terminal coding sequence of MYO7A is a nucleotide coding sequence from the amino acid after the MYO7A cleavage site to the C-terminus of the MYO7A amino acid sequence.

在一些实施方案中,所述MYO7A切分位点位于MYO7A氨基酸序列中丝氨酸、苏氨酸或半胱氨酸的前一位氨基酸。In some embodiments, the MYO7A cleavage site is located at the amino acid preceding serine, threonine, or cysteine in the MYO7A amino acid sequence.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒的启动子选自CAG启动子、CMV启动子、CBA启动子、UbC启动子、SFFV启动子、EF1α启动子、PGK启动子、或Myo7A、Myo15、Atoh1、POU4F3、Lhx3、Myo6、α9AchR、α10AchR、OTOF和STRC编码基因的启动子;In some embodiments, in the binary vector system for expressing MYO7A protein of the present invention, the promoter of the expression cassette of the first nucleotide sequence or the second nucleotide sequence is selected from CAG promoter, CMV promoter, CBA promoter, UbC promoter, SFFV promoter, EF1α promoter, PGK promoter, or promoters of genes encoding Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, α9AchR, α10AchR, OTOF and STRC;

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,所述第一核苷酸序列或第二核苷酸序列的表达盒的polyA包含AATAAA(SEQ ID NO:33)和AATAAA的变体;所述AATAAA的变体包含ATTAAA(SEQ ID NO:34)、AGTAAA(SEQ ID NO:35)、CATAAA(SEQ ID NO:36)、TATAAA(SEQ ID NO:37)、GATAAA(SEQ ID NO:38)、ACTAAA(SEQ ID NO:39)、AATATA(SEQ ID NO:40)、AAGAAA(SEQ ID NO:41)、AATAAT(SEQ ID NO:42)、AAAAAA(SEQ ID NO:43),AATGAA(SEQ ID NO:44)、AATCAA(SEQ ID NO:45)、AACAAA(SEQ ID NO:46)、AATCAA(SEQ ID NO:47)、AATAAC(SEQ ID NO:48)、AATAGA(SEQ ID NO:49)、AATTAA(SEQ ID NO:50)或AATAAG(SEQ ID NO:51);例如,所述polyA是与SEQ ID NO:29或SEQ ID NO:32所示的多聚A信号序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;和所述两个第一ITR序列和所述两个第二ITR序列中的每个ITR序列源自AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8或AAV9。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the polyA of the expression cassette of the first nucleotide sequence or the second nucleotide sequence comprises AATAAA (SEQ ID NO: 33) and variants of AATAAA; the variants of AATAAA comprise ATTAAA (SEQ ID NO: 34), AGTAAA (SEQ ID NO: 35), CATAAA (SEQ ID NO: 36), TATAAA (SEQ ID NO: 37), GATAAA (SEQ ID NO: 38), ACTAAA (SEQ ID NO: 39), AATATA (SEQ ID NO: 40), AAGAAA (SEQ ID NO: 41), AATAAT (SEQ ID NO: 42), AAAAAA (SEQ ID NO: 43), AATGAA (SEQ ID NO: 44), NO: 44), AATCAA (SEQ ID NO: 45), AACAAA (SEQ ID NO: 46), AATCAA (SEQ ID NO: 47), AATAAC (SEQ ID NO: 48), AATAGA (SEQ ID NO: 49), AATTAA (SEQ ID NO: 50) or AATAAG (SEQ ID NO: 51); for example, the polyA is a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the poly A signal sequence shown in SEQ ID NO: 29 or SEQ ID NO: 32; and each of the two first ITR sequences and the two second ITR sequences is derived from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 or AAV9.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒还包含表达调控元件和/或标签元件,例如,所述表达调控元件为土拨鼠肝炎转录后调控元件(WPRE)或其变体,优选地,WPRE截短变体,例如,与SEQ ID NO:28所示的核苷酸序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列,例如,SEQ ID NO:31所示的核苷酸序列;例如,所述标签元件为HA。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the expression cassette of the first nucleotide sequence or the second nucleotide sequence further comprises an expression regulatory element and/or a tag element, for example, the expression regulatory element is a woodchuck hepatitis posttranscriptional regulatory element (WPRE) or a variant thereof, preferably a WPRE truncated variant, for example, a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence shown in SEQ ID NO: 28, for example, the nucleotide sequence shown in SEQ ID NO: 31; for example, the tag element is HA.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,内含肽源自MxeGyrA、pabPolIII、MjaKlbA、SspDnaB、SceVMA、SspDnaE、NpuDnaE、AvaDnaE、CraDnaE、CspDnaE、CwaDnaE、MchtDnaE、OliDnaE、TerDnaE、gp41-1、gp41-8、IMPDH-1或RmaDnaB,例如,内含肽源自RmaDnaB,例如,内含肽的N端为如SEQ ID NO:23所示的RmaDnaB内含肽N端,内含肽的C端为如SEQ ID NO:24所示的RmaDnaB内含肽C端。在一些实施方案中,内含肽源自NpuDnaE,例如,内含肽的N端为如SEQ ID NO:52所示的NpuDnaE内含肽N端,内含肽的C端为如SEQ ID NO:54所示的NpuDnaE内含肽C端。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the intein is derived from MxeGyrA, pabPolIII, MjaKlbA, SspDnaB, SceVMA, SspDnaE, NpuDnaE, AvaDnaE, CraDnaE, CspDnaE, CwaDnaE, MchtDnaE, OliDnaE, TerDnaE, gp41-1, gp41-8, IMPDH-1 or RmaDnaB, for example, the intein is derived from RmaDnaB, for example, the N-terminus of the intein is the N-terminus of the RmaDnaB intein as shown in SEQ ID NO:23, and the C-terminus of the intein is the C-terminus of the RmaDnaB intein as shown in SEQ ID NO:24. In some embodiments, the intein is derived from NpuDnaE, for example, the N-terminus of the intein is the N-terminus of the NpuDnaE intein as shown in SEQ ID NO:52, and the C-terminus of the intein is the C-terminus of the NpuDnaE intein as shown in SEQ ID NO:54.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,将所述第一核苷酸序列插入在包含两个第一ITR序列的质粒中,将所述第二核苷酸序列插入在包含两个第二ITR序列的质粒中,例如,所述包含两个第一ITR序列的质粒和所述包含两个第二ITR序列的质粒相同或不同,例如,所述质粒是pAAV、pAAV-CMV、pX601、pX551或pAAV-MCS质粒。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences, and the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences, for example, the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,MYO7A切分位点如表1所示。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the MYO7A cleavage site is as shown in Table 1.

在一些具体实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1043位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;In some specific embodiments, in the dual-vector system for expressing the MYO7A protein of the present invention, amino acid 1043 of the MYO7A amino acid sequence shown in SEQ ID NO: 2 is used as the MYO7A cleavage site, and the RmaDnaB intein is used; the N-terminal coding sequence of MYO7A is connected and fused to the N-terminal coding sequence of the RmaDnaB intein, and then a first nucleotide sequence is constructed, using the pAAV-CMV plasmid as a vector; the C-terminal coding sequence of the RmaDnaB intein is connected and fused to the C-terminal coding sequence of MYO7A, and then a second nucleotide sequence is constructed, using the pAAV-CMV plasmid as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1058位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1058th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1061位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1061st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1064位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1064th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1071位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1071st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1076位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1076th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1081位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1081st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1104位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1104th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1105位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1105th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1114位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1114th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1119位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1119th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1122位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;或者Using amino acid 1122 of the MYO7A amino acid sequence shown in SEQ ID NO: 2 as the MYO7A cleavage site and using the RmaDnaB intein; connecting and fusing the N-terminal coding sequence of MYO7A with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, using the pAAV-CMV plasmid as a vector; connecting and fusing the C-terminal coding sequence of the RmaDnaB intein with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, using the pAAV-CMV plasmid as a vector; or

使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1126位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1126th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector;

例如,RmaDnaB内含肽的N端编码序列编码SEQ ID NO:23所示的RmaDnaB N端部分,RmaDnaB内含肽的C端编码序列编码SEQ ID NO:24所示的RmaDnaB C端部分。For example, the N-terminal coding sequence of the RmaDnaB intein encodes the N-terminal portion of RmaDnaB shown in SEQ ID NO:23, and the C-terminal coding sequence of the RmaDnaB intein encodes the C-terminal portion of RmaDnaB shown in SEQ ID NO:24.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:4所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:6所示C端的编码序列和polyA;In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 4, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 6, and polyA;

所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:8所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:10所示C端的编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 8, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 10, and polyA;

所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:12所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:14所示C端的编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 12, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 14, and polyA;

所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:16所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:18所示C端的编码序列和polyA;或The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 16, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, an C-terminal coding sequence of MYO7A as shown in SEQ ID No: 18, and polyA; or

所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:20所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:22所示C端的编码序列和polyA。The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 20, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, an C-terminal coding sequence of MYO7A as shown in SEQ ID No: 22, and polyA.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,所述第一核苷酸序列的表达盒和所述第二核苷酸序列的表达盒各自在3’ITR序列的N端包含WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:27所示的核苷酸序列或与SEQ ID NO:27具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;或者包含WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:30所示的核苷酸序列或与SEQ ID NO:30具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the expression cassette of the first nucleotide sequence and the expression cassette of the second nucleotide sequence each contain a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence at the N-terminus of the 3’ITR sequence, for example, a nucleotide sequence shown in SEQ ID NO: 27 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 27; or a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence, for example, a nucleotide sequence shown in SEQ ID NO: 30 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 30.

在第二方面,本发明提供了一种腺相关病毒的包装载体系统,所述包装载体系统包含本发明第一方面所述的用于表达MYO7A蛋白的双载体系统、携带AAV rep和cap基因的载体和辅助病毒载体,经包装成为AAV载体,优选地,其中MYO7A蛋白的氨基酸序列如SEQ ID NO:2所示。In a second aspect, the present invention provides an adeno-associated virus packaging vector system, which comprises the dual-vector system for expressing MYO7A protein described in the first aspect of the present invention, a vector carrying AAV rep and cap genes, and a helper virus vector, which are packaged into an AAV vector. Preferably, the amino acid sequence of the MYO7A protein is as shown in SEQ ID NO: 2.

在一些实施方案中,在本发明的腺相关病毒的包装载体系统中,所述携带AAV rep和cap基因的载体选自AAV1、AAV2、AAV5、AAV8、AAV9、Anc80、PHP.eB、AAV-DJ和AAVrh.10载体;所述辅助病毒载体为pHelper质粒。In some embodiments, in the adeno-associated virus packaging vector system of the present invention, the vector carrying the AAV rep and cap genes is selected from AAV1, AAV2, AAV5, AAV8, AAV9, Anc80, PHP.eB, AAV-DJ and AAVrh.10 vectors; and the helper virus vector is a pHelper plasmid.

在第三方面,本发明提供了一种腺相关病毒的包装方法,其中,将本发明第二方面所述的腺相关病毒的包装载体系统转入宿主细胞中进行包装。In a third aspect, the present invention provides a method for packaging an adeno-associated virus, wherein the adeno-associated virus packaging vector system described in the second aspect of the present invention is transferred into a host cell for packaging.

在一些实施方案中,所述宿主细胞选自Hela-S3细胞、HEK-293细胞、HEK-293T细胞、HEK-293FT细胞、A549细胞和Sf9细胞。In some embodiments, the host cell is selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells, and Sf9 cells.

在第四方面,本发明提供了双腺相关病毒载体,其是通过根据本发明第二方面所述的包装方法获得的。In a fourth aspect, the present invention provides a dual adeno-associated virus vector, which is obtained by the packaging method according to the second aspect of the present invention.

在第五方面,本发明提供了本发明第一方面所述的用于表达MYO7A蛋白的双载体系统或本发明第四方面所述的双腺相关病毒载体的用途,用于制备治疗耳聋性疾病或听力损伤或听力功能障碍的药物或制剂。In a fifth aspect, the present invention provides the use of the dual vector system for expressing MYO7A protein described in the first aspect of the present invention or the dual adeno-associated virus vector described in the fourth aspect of the present invention for preparing a drug or preparation for treating deafness, hearing loss or hearing dysfunction.

在第六方面,本发明提供了一种用于治疗耳聋疾病或听力损伤或听力功能障碍的药物或制剂,其由本发明第一方面所述的用于表达MYO7A蛋白的双载体系统或本发明第四方面所述的腺相关病毒制备,其中,所述腺相关病毒是将腺相关病毒的包装载体系统转入宿主细胞进行包装得到的,所述腺相关病毒的包装载体系统包括表达MYO7A蛋白的双载体系统、携带AAVrep和cap基因的载体,以及辅助病毒载体。In a sixth aspect, the present invention provides a medicine or preparation for treating deafness, hearing loss, or hearing dysfunction, which is prepared by the dual-vector system for expressing MYO7A protein described in the first aspect of the present invention or the adeno-associated virus described in the fourth aspect of the present invention, wherein the adeno-associated virus is obtained by transferring the adeno-associated virus packaging vector system into a host cell for packaging, and the adeno-associated virus packaging vector system includes a dual-vector system for expressing MYO7A protein, a vector carrying AAVrep and cap genes, and a helper virus vector.

在一些实施方案中,本发明的所述药物或制剂还包含中性盐缓冲剂、酸性盐缓冲剂、碱性盐缓冲剂、葡萄糖、甘露糖、甘露糖醇、蛋白质、多肽、氨基酸、抗生素、螯合剂、佐剂、防腐剂、纳米颗粒、脂质体和阳性脂质颗粒。In some embodiments, the drug or formulation of the present invention further comprises a neutral salt buffer, an acidic salt buffer, an alkaline salt buffer, glucose, mannose, mannitol, proteins, polypeptides, amino acids, antibiotics, chelating agents, adjuvants, preservatives, nanoparticles, liposomes and positive lipid particles.

在一些实施方案中,将本发明的所述药物或制剂通过耳蜗的圆窗、卵圆窗、半规管、总管进行注射给药;以及终生单次给药或多次给药,总剂量为1×109-1×1013病毒基因组。In some embodiments, the drug or preparation of the present invention is administered by injection through the round window, oval window, semicircular canal, or common canal of the cochlea; and is administered once or multiple times throughout life, with a total dose of 1×10 9 -1×10 13 viral genomes.

在第七方面,本发明提供了筛选用于内含肽拼接的MYO7A蛋白切分位点的方法,包括In a seventh aspect, the present invention provides a method for screening MYO7A protein cleavage sites for intein splicing, comprising:

a)构建本发明第一方面所述的第一核酸载体和第二核酸载体,任选地,所述内含肽的N端的首个残基为Cys;所述内含肽的C端的3’末端序列含有His和Asn,和MYO7A的C端的5’首个氨基酸残基为Cys、Ser、Thr;a) constructing the first nucleic acid vector and the second nucleic acid vector according to the first aspect of the present invention, wherein, optionally, the first residue at the N-terminus of the intein is Cys; the 3' terminal sequence at the C-terminus of the intein contains His and Asn, and the first amino acid residue at the 5' terminal of the C-terminus of MYO7A is Cys, Ser, or Thr;

b)将构建的第一核酸载体和第二核酸载体共转染哺乳动物细胞;b) co-transfecting the constructed first nucleic acid vector and the second nucleic acid vector into mammalian cells;

c)当能够高表达MYO7A蛋白时,将MYO7A蛋白N端部分3’末端残基作为用于内含肽拼接的MYO7A蛋白切分位点。c) When the MYO7A protein can be highly expressed, the 3' terminal residue of the N-terminal portion of the MYO7A protein is used as the MYO7A protein cleavage site for intein splicing.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1显示了内含肽(intein)介导的全长外显肽(extein)表达示意图。图中,“N-extein”表示N端外显肽,“C-extein”表示C端外显肽,“inteinN”表示N端内含肽,“inteinC”表示C端内含肽。Figure 1 shows a schematic diagram of intein-mediated full-length extein expression. In the figure, "N-extein" represents the N-terminal extein, "C-extein" represents the C-terminal extein, "intein N " represents the N-terminal intein, and "intein C " represents the C-terminal intein.

图2显示了通过内含肽拼接筛选Myo7a的切分位点的筛选示意图。通过内含肽拼接的方法筛选Myo7a切分位点涉及蛋白反式剪接法,即,将Myo7a的N端和C端的编码区序列(CDS)分别构建到两个不同的质粒中并表达,通过内含肽剪接拼成完整的全长Myo7a蛋白。N端质粒在Myo7a的N端CDS序列3’末端添加了内含肽N端片段序列(例如,RmaDnaB内含肽N端片段(RminteinN)序列,文中也简称为“Rm-N-内含肽”序列,Rm-N-内含肽序列首个残基为Cys)。C端质粒在Myo7a的C端CDS 5’起始端前添加了内含肽C端片段(RminteinC)序列(例如,RmaDnaB内含肽C端片段序列,文中也简称为“Rm-C-内含肽”序列,Rm-C-内含肽序列的3’末端序列含有His和Asn),Myo7a的C端CDS 5’首个氨基酸残基为Cys、Ser、Thr用于内含肽剪接。Figure 2 shows a schematic diagram of the screening process for Myo7a cleavage sites via intein splicing. The intein splicing method involves protein trans-splicing, where the N-terminal and C-terminal coding sequences (CDS) of Myo7a are constructed and expressed separately in two separate plasmids, and the full-length Myo7a protein is assembled via intein splicing. The N-terminal plasmid contains an intein N-terminal fragment sequence (e.g., the RmaDnaB intein N-terminal fragment (Rmintein N ) sequence, also referred to herein as the "Rm-N-intein" sequence, where the first residue of the Rm-N-intein sequence is Cys) at the 3' end of the Myo7a N-terminal CDS sequence. The C-terminal plasmid adds an intein C-terminal fragment (Rmintein C ) sequence before the 5' start of the C-terminal CDS of Myo7a (for example, the RmaDnaB intein C-terminal fragment sequence, also referred to as the "Rm-C-intein" sequence in this article, the 3' end sequence of the Rm-C-intein sequence contains His and Asn). The first amino acid residue 5' of the C-terminal CDS of Myo7a is Cys, Ser, and Thr for intein splicing.

图3例示了AAV双载体质粒元件的示意图。pAAV-CMV-EGFP-WPRE-SV40质粒骨架序列可以如SEQ ID NO:26所示,其中5’ITR序列位于1bp-141bp,CMV启动子序列位于169bp-752bp,Kozak序列位于792bp-797bp,EGFP序列位于801bp-1517bp,WPRE序列位于1536bp-2124bp,SV40 PolyA信号序列位于2131bp-2252bp,3’ITR序列位于2290bp-2430bp,f1 Ori序列位于2505bp-2960bp,kana抗性序列位于3242bp-4156bp,Ori序列位于4327bp-4515bp。Figure 3 illustrates a schematic diagram of the AAV binary vector plasmid elements. The pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone sequence can be as shown in SEQ ID NO: 26, wherein the 5' ITR sequence is located between 1 bp and 141 bp, the CMV promoter sequence is located between 169 bp and 752 bp, the Kozak sequence is located between 792 bp and 797 bp, the EGFP sequence is located between 801 bp and 1517 bp, the WPRE sequence is located between 1536 bp and 2124 bp, the SV40 PolyA signal sequence is located between 2131 bp and 2252 bp, the 3' ITR sequence is located between 2290 bp and 2430 bp, the f1 Ori sequence is located between 2505 bp and 2960 bp, the kana resistance sequence is located between 3242 bp and 4156 bp, and the Ori sequence is located between 4327 bp and 4515 bp.

图4显示了MYO7A切分位点S3的N端蛋白质粒图谱。Figure 4 shows the N-terminal protein map of the MYO7A cleavage site S3.

图5显示了MYO7A切分位点S3的C端蛋白质粒图谱。Figure 5 shows the C-terminal protein map of the MYO7A cleavage site S3.

图6显示了MYO7A切分位点S4的N端蛋白质粒图谱。Figure 6 shows the N-terminal protein map of the MYO7A cleavage site S4.

图7显示了MYO7A切分位点S4的C端蛋白质粒图谱。Figure 7 shows the C-terminal protein map of the MYO7A cleavage site S4.

图8显示了MYO7A切分位点S8的N端蛋白质粒图谱。Figure 8 shows the N-terminal protein map of the MYO7A cleavage site S8.

图9显示了MYO7A切分位点S8的C端蛋白质粒图谱。Figure 9 shows the C-terminal protein plasmid map of the MYO7A cleavage site S8.

图10显示了MYO7A切分位点S9的N端蛋白质粒图谱。Figure 10 shows the N-terminal protein map of the MYO7A cleavage site S9.

图11显示了MYO7A切分位点S9的C端蛋白质粒图谱。Figure 11 shows the C-terminal protein map of the MYO7A cleavage site S9.

图12显示了MYO7A切分位点S11的N端蛋白质粒图谱。FIG12 shows the N-terminal protein map of the MYO7A cleavage site S11.

图13显示了MYO7A切分位点S11的C端蛋白质粒图谱。FIG13 shows a protein map of the C-terminal region of the MYO7A cleavage site S11.

图14显示了MYO7A各切分位点质粒对的全长蛋白表达量。由图14可见,MYO7A切分位点S3、S4、S8、S9和S11产生显著更多的全长蛋白表达量。Figure 14 shows the full-length protein expression levels of each MYO7A cleavage site plasmid pair. As shown in Figure 14, MYO7A cleavage sites S3, S4, S8, S9, and S11 produce significantly higher full-length protein expression levels.

图15显示了对于MYO7A切分位点S3,当载体质粒含WPRE-SV40元件(WPRE-SV40元件是WPRE元件+SV40 poly(A)信号序列的组合名称,其序列如SEQ ID NO:27所示)或W3SL元件(W3SL元件是WPRE3元件+SV40 late poly(A)信号序列的组合名称,其序列如SEQ ID NO:30所示),以及不包含WPRE-SV40元件或W3SL元件的质粒在293T细胞中的MYO7A蛋白表达的Western印迹结果。图中,“HA”表示使用针对HA标签的抗体检测HA标签标记的MYO7A蛋白表达的结果。Figure 15 shows the results of Western blot analysis of MYO7A protein expression in 293T cells for the MYO7A cleavage site S3, using vector plasmids containing the WPRE-SV40 element (the WPRE -SV40 element is the name for the combination of the WPR E element and the SV40 poly(A) signal sequence, and its sequence is shown in SEQ ID NO:27) or the W3SL element (the W3SL element is the name for the combination of the WPR E element and the SV40 late poly(A) signal sequence, and its sequence is shown in SEQ ID NO:30), as well as plasmids that do not contain the WPRE-SV40 element or the W3SL element. In the figure, "HA" indicates the results of detecting HA-tagged MYO7A protein expression using an antibody against the HA tag.

图16显示了小鼠耳蜗内MYO7A蛋白表达的免疫荧光结果。图16的左栏小图表示染色parvalbumin的结果;中栏小图表示染色Myo7a的结果;右栏小图是染色parvalbumin和Myo7a的叠图。Figure 16 shows the immunofluorescence results of MYO7A protein expression in mouse cochlea. The left panel of Figure 16 shows the results of parvalbumin staining; the middle panel shows the results of Myo7a staining; and the right panel is an overlay of parvalbumin and Myo7a staining.

图17显示了对应于MYO7A各候选切分位点的Myo7a基因在Myo7a敲除小鼠中治疗听觉功能的检测结果。FIG17 shows the test results of treating hearing function in Myo7a knockout mice using Myo7a genes corresponding to various candidate cleavage sites of MYO7A.

发明详述Detailed Description of the Invention

除非下文中另外定义,否则本说明书中所用的全部技术与科学术语具有如本发明所属领域的普通技术人员通常理解的相同含义。本文所提及的全部出版物、专利申请、专利和其他参考文献通过引用的方式完整地并入本文作为参考。此外,本文中所述的材料、方法和例子仅是说明性的并且不意在是限制性的。本发明的其他特征、目的和优点将从本说明书及附图并且从后附的权利要求书中显而易见。Unless otherwise defined hereinafter, all technical and scientific terms used in this specification have the same meaning as those of ordinary skill in the art to which the present invention pertains. All publications, patent applications, patents and other references mentioned herein are incorporated herein by reference in their entirety. In addition, the materials, methods and examples described herein are merely illustrative and are not intended to be restrictive. Other features, objects and advantages of the present invention will become apparent from this specification and the accompanying drawings and from the appended claims.

I.定义I. Definition

在本文中,术语“约”在与数字数值联合使用时意为涵盖具有比指定数字数值小5%的下限和比指定数字数值大5%的上限的范围内的数字数值。该术语也旨在涵盖在指定数字±1%、±0.5%或±0.1%范围内的数值。As used herein, the term "about" when used in conjunction with a numerical value is intended to encompass numerical values within a range having a lower limit that is 5% less than the specified numerical value and an upper limit that is 5% greater than the specified numerical value. The term is also intended to encompass values within ±1%, ±0.5%, or ±0.1% of the specified number.

在本文中,术语“包含”或“包括”意指包括所述及的要素、整数或步骤、或要素组、整数组或步骤组,但是不排除任何其他的要素、整数或步骤、或其他要素组、整数组或步骤组。在本文中,当使用术语“包含”或“包括”时,除非另有指明,否则也涵盖由所述及的要素、整数或步骤组成的情形。例如,当提及“包含”某个具体序列的多核苷酸时,也旨在涵盖由该具体序列组成的多核苷酸。As used herein, the term "comprising" or "including" means including the recited elements, integers, steps, or groups of elements, integers, or steps, but does not exclude any other elements, integers, or steps, or other groups of elements, integers, or steps. As used herein, unless otherwise indicated, the term "comprising" or "including" also encompasses the situation consisting of the recited elements, integers, or steps. For example, when referring to a polynucleotide "comprising" a particular sequence, it is intended to encompass a polynucleotide consisting of that particular sequence.

在本文中,表述“和/或”在与两个或两个以上项目联合使用时,意在表示所列相关项目中的任何一个、或所列相关项目中的任何多个或所有的可能组合。Herein, the expression "and/or," when used in conjunction with two or more items, is intended to mean any one of the associated listed items, or any multiple or all possible combinations of the associated listed items.

本文所述的内含肽或蛋白质内含肽(又称蛋白质内含子,Intein)是前体未成熟蛋白质中的一段多肽链,它通过一系列重排、转酯、环化等自我催化的反应过程,可以从前体蛋白中切除并将其两端的蛋白多肽段(蛋白质外显肽,Extein)通过一个天然的肽键连接,即,通过蛋白质自我剪接作用实现蛋白质结构的重新布局。断裂内含肽是内含肽的一种结构类型。在结构上,它的N端区域和C端区域相互分离,而当内含肽N端区域和C端区域所在的两个片段连接后可按照标准内含肽剪接途径完成其两端外显肽的拼接。The intein or protein intein (also known as intein) described herein is a polypeptide chain within an immature precursor protein. Through a series of self-catalytic reactions, such as rearrangement, transesterification, and cyclization, it can be excised from the precursor protein and its two end polypeptide segments (exteins) connected by a natural peptide bond. This is to say, protein self-splicing achieves a rearrangement of the protein structure. A split intein is a structural type of intein. Structurally, its N-terminal and C-terminal regions are separated from each other. When the two fragments containing the N-terminal and C-terminal regions of the intein are connected, the exteins at both ends can be spliced together according to the standard intein splicing pathway.

大部分内含肽由两端的剪接区域(terminal splicing region)和中间的核酸内切酶结构域(homing endonuclease domain)或连接结构域(linker domain)组成。内含肽可分为3种:经典内含肽(canonical intein)、微小内含肽(miniintein)和断裂内含肽(split intein)。所述经典内含肽和微小内含肽均包含两端的剪接结构域和中间区域,它们之间的区别是,经典内含肽中间区域为核酸内切酶结构域,而微小内含肽中间区域为连接结构域,不同微小内含肽连接结构域的长度并不相同。而断裂内含肽的中间区域在特定位点断开,形成N端片段和C端片段,并且分别位于基因组上相距较远的2个基因上。在前体蛋白翻译成熟过程中,这2个内含肽片段相互识别并恢复核酸内切酶活性,介导蛋白质反式剪接(protein trans splicing)。在本文中,双AAV载体系统可用于递送含有编码断裂内含肽的核酸的核酸。Most inteins consist of terminal splicing regions and a central endonuclease domain or linker domain. Inteins can be divided into three types: canonical inteins, miniinteins, and split inteins. Both canonical and miniinteins contain terminal splicing domains and a central region. The difference between them is that the central region of a canonical intein is an endonuclease domain, while that of a miniintein is a linker domain. The lengths of the linker domains vary between miniinteins. The central region of a split intein is disconnected at a specific site, forming an N-terminal fragment and a C-terminal fragment, respectively, located on two genes distant from each other in the genome. During the translation and maturation of the precursor protein, these two intein fragments recognize each other and restore endonuclease activity, mediating protein trans-splicing. Herein, a dual AAV vector system can be used to deliver a nucleic acid containing a nucleic acid encoding a split intein.

通常,内含肽由10个模块组成,从内含肽N端开始依次为A、N2、B、N4、C、D、E、H、F和G,其中A、N2、B、N4为N端剪接区域,F、G为C端剪接区域,C、D、E、H为自导引核酸内切酶活性区域或连接结构域。A、B、F、G模块中参与内含肽剪接的基序在剪接位点处的氨基酸残基高度保守,为内含肽剪接过程中的亲和置换反应所必需。内含肽A模块中的基序通常都含有带有羟基或巯基的氨基酸,如Ser、Cys。B模块中的基序则含Thr-X-X-His高度保守的氨基酸序列,该序列也存在于丝氨酸蛋白酶中。G模块中参与剪接反应的基序的保守氨基酸残基为Asn、Ser、Cys、Thr、His。另外,A模块的基序中的保守位点(如Ser、Cys)在某些内含肽中可以被Ala、Gln或Pro等替代,G模块的基序中也是如此。Typically, inteins are composed of 10 modules, starting from the N-terminus: A, N2, B, N4, C, D, E, H, F, and G. A, N2, B, and N4 are the N-terminal splicing regions, F and G are the C-terminal splicing regions, and C, D, E, and H are the homing endonuclease active regions or linker domains. The motifs involved in intein splicing within the A, B, F, and G modules contain highly conserved amino acid residues at the splice sites, essential for the affinity displacement reaction during intein splicing. The motifs within the A module of the intein typically contain amino acids with hydroxyl or sulfhydryl groups, such as Ser and Cys. The motif within the B module contains the highly conserved amino acid sequence Thr-X-X-His, which is also found in serine proteases. The conserved amino acid residues within the motif involved in the splicing reaction within the G module are Asn, Ser, Cys, Thr, and His. In addition, the conserved sites in the motif of the A module (such as Ser, Cys) can be replaced by Ala, Gln or Pro in some inteins, and the same is true for the motif of the G module.

在本文中,“听力损失”是指,通过听力测量确定的低于正常听力阈值水平的听力,包括轻度、中度、重度和极重度听力损失、以及耳聋。听力损失可以通过听力损失百分比描述,例如,30%、60%、80%或甚至100%听力损失,或通过听力损失分级描述。所述听力损失可以是由基因缺陷引起的或与之相关的听力损失,例如由遗传因素引起的先天性耳聋和学语前聋,或与遗传因素相关的、由环境因素诱发的(例如,衰老、噪音、药物或感染诱导的)听力损失。听力损失可以是无症状的(即,不存在相关的可见外耳或其他器官异常)或有症状的。在一些实施方案中,听力损失是感音神经性听力损失。In this article, "hearing loss" refers to hearing below the normal hearing threshold level determined by audiometry, including mild, moderate, severe and profound hearing loss, and deafness. Hearing loss can be described by a percentage of hearing loss, for example, 30%, 60%, 80% or even 100% hearing loss, or by a grading description of hearing loss. The hearing loss can be hearing loss caused by or associated with a gene defect, such as congenital deafness and pre-lingual deafness caused by genetic factors, or hearing loss associated with genetic factors, induced by environmental factors (for example, aging, noise, drugs or infection-induced). Hearing loss can be asymptomatic (that is, there is no associated visible outer ear or other organ abnormality) or symptomatic. In some embodiments, hearing loss is sensorineural hearing loss.

在本文中,“听力损失相关基因”是指,所述基因上的变异可通过改变内耳正常发挥功能的能力而引起听力损失或造成听力损失易感性。在本文中,这样的基因也称作“听损基因”。目前已经鉴定了100多个基因与听力损失相关(参见,Hereditary Hearing Loss Homepage,https://hereditaryhearingloss.org/,其中列出了目前已知的单基因无症状听力损失的基因位置和鉴定数据)。在引起听力损失易感性的情况下,相对于健康个体,携带所述听损基因变异的个体可表现出更容易因环境因素,例如衰老、噪音、药物或感染而引发听力损失。As used herein, a "hearing loss-associated gene" refers to a gene whose variation can cause hearing loss or susceptibility to hearing loss by altering the ability of the inner ear to function normally. In this article, such genes are also referred to as "hearing loss genes." More than 100 genes have been identified as being associated with hearing loss (see Hereditary Hearing Loss Homepage, https://hereditaryhearingloss.org/, which lists the gene locations and identification data for currently known single-gene asymptomatic hearing loss). In the case of causing susceptibility to hearing loss, individuals carrying the hearing loss gene variation may show a greater susceptibility to hearing loss due to environmental factors, such as aging, noise, drugs, or infections, relative to healthy individuals.

在本文中,“耳蜗内毛细胞”是指,来自哺乳动物的离体或体外的耳蜗内毛细胞或细胞系或细胞群体,或在哺乳动物耳蜗中的内毛细胞。As used herein, "cochlear inner hair cells" refer to isolated or in vitro cochlear inner hair cells, cell lines, or cell populations derived from a mammal, or inner hair cells in the cochlea of a mammal.

在本文中,“耳蜗外毛细胞”是指,来自哺乳动物的离体或体外的耳蜗外毛细胞或细胞系或细胞群体,或在哺乳动物耳蜗中的外毛细胞。As used herein, "cochlear outer hair cells" refer to cochlear outer hair cells, cell lines, or cell populations isolated from or in vitro of a mammal, or outer hair cells in the cochlea of a mammal.

在本文中,“分离的”核酸是指,人工合成的、或自包含其的天然环境的至少一些组分中分离出来的核酸分子。例如,分离的核酸可以是一个更大的核酸的一部分、或是载体或物质组合物的一部分,或者可以包含在细胞内,并且仍然是“分离的”,条件是该更大的核酸、载体、物质组合物或特定细胞不是所述核酸的天然环境。As used herein, an "isolated" nucleic acid refers to a nucleic acid molecule that has been artificially synthesized or separated from at least some components of its natural environment. For example, an isolated nucleic acid can be part of a larger nucleic acid, or part of a vector or composition of matter, or can be contained within a cell and still be "isolated" provided that the larger nucleic acid, vector, composition of matter, or specific cell is not the natural environment of the nucleic acid.

在本文中,术语“可操作连接”也称作“有效连接”或“功能性连接”,意指两个或更多个多核苷酸(例如DNA)区段处于一种允许它们以预期的方式起作用的关系中。举例而言,若启动子序列在合适的宿主细胞或其他表达系统中刺激或调节编码序列的转录,则该启动子序列有效连接到该编码序列。一般而言,有效连接到可转录序列的启动子,与该可转录序列是连续的,即它们是顺式作用(cis-acting)。然而,一些转录调控序列(如增强子)无须物理邻接于或紧密接近于其增强转录的编码序列。As used herein, the term "operably linked," also referred to as "effectively linked" or "functionally linked," means that two or more polynucleotide (e.g., DNA) segments are in a relationship that allows them to function in the intended manner. For example, a promoter sequence is operably linked to a coding sequence if it stimulates or regulates the transcription of the coding sequence in a suitable host cell or other expression system. Generally, promoters that are operably linked to a transcribable sequence are contiguous with the transcribable sequence, i.e., they are cis-acting. However, some transcriptional regulatory sequences (e.g., enhancers) do not need to be physically adjacent to or in close proximity to the coding sequence whose transcription they enhance.

术语“全长MYO7A蛋白”是指将本发明的双载体系统中表达的MYO7A蛋白N端部分和MYO7A蛋白C端部分有效连接产生的MYO7A蛋白。在一些实施方案中,全长MYO7A蛋白是野生型或功能性人MYO7A蛋白。野生型和功能性hMYO7A蛋白的氨基酸序列和编码它们的多核苷酸序列是本领域已知的(参见例如,GenBank登录号NP_000251和U39226.1)。在一些具体实施方案中,全长MYO7A蛋白是SEQ ID NO.2所示的全长MYO7A蛋白或其功能衍生物或其功能片段。The term "full-length MYO7A protein" refers to a MYO7A protein produced by operatively linking the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein expressed in the dual vector system of the present invention. In some embodiments, the full-length MYO7A protein is a wild-type or functional human MYO7A protein. The amino acid sequences of wild-type and functional hMYO7A proteins and the polynucleotide sequences encoding them are known in the art (see, for example, GenBank accession numbers NP_000251 and U39226.1). In some specific embodiments, the full-length MYO7A protein is the full-length MYO7A protein shown in SEQ ID NO. 2, or a functional derivative or functional fragment thereof.

如本文中所用,术语“腺相关病毒(Adeno-associated virus,AAV)”因在腺病毒制品中发现而得名。AAV是微小病毒科(Parvovirus)成员,包含多种血清型,其基因组为单链DNA。As used herein, the term "adeno-associated virus (AAV)" is named after its discovery in adenovirus products. AAV is a member of the Parvovirus family, which includes multiple serotypes and has a single-stranded DNA genome.

AAV是依赖性病毒,需要其它病毒如腺病毒、单纯疱疹病毒、人乳头瘤病毒、或辅助因素提供辅助功能蛋白才能复制。AAV is a dependent virus that requires other viruses such as adenovirus, herpes simplex virus, human papillomavirus, or auxiliary factors to provide auxiliary functional proteins for replication.

最早分离到的AAV病毒是血清型2型AAV(AAV2)。AAV2基因组长约4.7kb,基因组两端为长度145bp的“反向末端重复序列”(inverted terminal repeat,ITR),呈回文-发卡结构。基因组中有两个大开放阅读框(ORF),分别编码rep和cap基因。The first AAV virus isolated was serotype 2 (AAV2). The AAV2 genome is approximately 4.7 kb long, flanked by 145-bp inverted terminal repeats (ITRs) at each end, forming a palindromic hairpin structure. The genome contains two large open reading frames (ORFs), encoding the rep and cap genes, respectively.

ITR是AAV载体基因组的顺式作用元件,在AAV病毒的整合、拯救、复制和基因组包装中发挥重要作用。ITR序列中包含Rep蛋白结合位点(Rep binding site,RBS)和末端解链位点trs(terminal resolution site),能够被Rep蛋白结合识别并在trs处产生切口。ITR序列还可形成独特的“T”字母型二级结构,在AAV病毒的生活周期中发挥重要作用。ITRs are cis-acting elements of the AAV vector genome, playing a crucial role in AAV integration, rescue, replication, and genome packaging. The ITR sequence contains the Rep binding site (RBS) and the terminal resolution site (TRs), which are recognized by the Rep protein and produce a cleavage at the TRs. The ITR sequence also forms a unique "T"-shaped secondary structure, which plays a crucial role in the AAV life cycle.

AAV2基因组其余部分可分为2个功能区,rep基因区和cap基因区。The rest of the AAV2 genome can be divided into two functional regions, the rep gene region and the cap gene region.

rep基因区编码Rep78、Rep68、Rep52和Rep40四种Rep蛋白。Rep蛋白对于AAV病毒的复制、整合、拯救和包装都具有重要作用。其中Rep78和Rep68与ITR中的末端解链位点trs和GAGY重复基序特异性结合,启动AAV基因组由单链向双链的复制过程。ITR中trs和GAGC重复基序和/或GAGY重复基序是AAV基因组复制的中心,因此虽然在各种血清型的AAV病毒中ITR序列都不尽相同,但是都能形成发卡结构和存在Rep结合位点。在AAV2基因组图谱位置19处有p19启动子,启动分别表达Rep52和Rep40。Rep52和Rep40具有ATP依赖的DNA解旋酶活性,但没有结合DNA的功能。The rep gene region encodes four Rep proteins: Rep78, Rep68, Rep52, and Rep40. Rep proteins play an important role in the replication, integration, rescue, and packaging of AAV viruses. Rep78 and Rep68 specifically bind to the terminal melting sites trs and GAGY repeat motifs in the ITR, initiating the replication of the AAV genome from single-stranded to double-stranded. The trs and GAGC repeat motifs and/or GAGY repeat motifs in the ITR are the center of AAV genome replication. Therefore, although the ITR sequences are different in various serotypes of AAV viruses, they can all form a hairpin structure and contain Rep binding sites. There is a p19 promoter at position 19 on the AAV2 genome map, which initiates the expression of Rep52 and Rep40, respectively. Rep52 and Rep40 have ATP-dependent DNA helicase activity but do not have the function of binding to DNA.

cap基因编码AAV病毒的衣壳蛋白VP1、VP2和VP3。其中,VP3分子量最小,但数量最多,在成熟的AAV颗粒中VP1、VP2、VP3的比例大致为1:1:10。VP1是形成有感染性的AAV所必需的;VP2协助VP3进入细胞核;VP3是组成AAV颗粒的主要蛋白。The cap gene encodes the AAV capsid proteins VP1, VP2, and VP3. VP3 has the smallest molecular weight but is the most abundant. In mature AAV particles, the ratio of VP1, VP2, and VP3 is approximately 1:1:10. VP1 is essential for the formation of infectious AAV; VP2 facilitates VP3 entry into the cell nucleus; and VP3 is the primary protein in AAV particles.

如本文中所用,术语“AAV载体”是人们随着对AAV病毒生活周期及其相关分子生物学机制的了解,将野生型AAV病毒改造成的一种高效的外源基因转移工具,即AAV载体。改造后的AAV载体基因组中只包含AAV病毒的ITR序列和携带待转运的外源序列。AAV病毒包装需要的Rep和Cap蛋白通过其他外源质粒反式提供,由此降低了rep和cap基因包装入AAV载体可能带来的危害。进一步地,AAV病毒本身不具有致病性,这使得AAV载体成为公认的最安全的病毒载体之一。As used herein, the term "AAV vector" refers to an efficient exogenous gene transfer tool, i.e., an AAV vector, that has been transformed from wild-type AAV virus as people gain a better understanding of the AAV virus life cycle and its related molecular biological mechanisms. The modified AAV vector genome only contains the ITR sequence of the AAV virus and the exogenous sequence to be transferred. The Rep and Cap proteins required for AAV virus packaging are provided in trans by other exogenous plasmids, thereby reducing the possible harm caused by packaging the rep and cap genes into the AAV vector. Furthermore, the AAV virus itself is not pathogenic, which makes the AAV vector recognized as one of the safest viral vectors.

AAV病毒血清型众多,不同的血清型具有不同的组织感染嗜性,因此应用AAV载体能够将外源基因转运至特定的器官和组织。There are many AAV virus serotypes, and different serotypes have different tissue infection tropisms. Therefore, the use of AAV vectors can transport exogenous genes to specific organs and tissues.

现有技术中对AAV载体具有相对成熟的包装系统,这便于规模化生产AAV载体。The existing technology has a relatively mature packaging system for AAV vectors, which facilitates the large-scale production of AAV vectors.

术语“载体基因组(vg)”是指包装在rAAV衣壳内形成rAAV载体的核酸序列。The term "vector genome (vg)" refers to the nucleic acid sequence that is packaged within the rAAV capsid to form the rAAV vector.

在本文中,“个体”和“受试者”可互换地使用,是指哺乳动物。哺乳动物的实例包括,但不限于,人、非人灵长类动物(例如,食蟹猴、恒河猴)、啮齿类动物,以及其他哺乳动物,例如,牛、猪、马、狗。在本文中,哺乳动物包括处于所有发育期(包括胚胎和胎儿期)的个体。As used herein, "individual" and "subject" are used interchangeably to refer to mammals. Examples of mammals include, but are not limited to, humans, non-human primates (e.g., cynomolgus monkeys, rhesus monkeys), rodents, and other mammals, such as cattle, pigs, horses, and dogs. As used herein, mammals include individuals at all stages of development, including embryonic and fetal stages.

在本文中,术语“治疗”指意欲改变正在接受治疗的个体中疾病之天然过程的临床介入。想要的治疗效果包括但不限于防止疾病出现或复发、减轻症状、减小疾病的任何直接或间接病理学后果、防止转移、降低病情进展速率、改善或缓和疾病状态,以及缓解或改善预后。术语“治疗”也涵盖改良或改善至少一项身体参数,包括患者可能无法辨别的身体参数。As used herein, the term "treatment" refers to clinical intervention intended to alter the natural course of a disease in the individual being treated. Desired therapeutic effects include, but are not limited to, preventing the onset or recurrence of the disease, alleviating symptoms, reducing any direct or indirect pathological consequences of the disease, preventing metastasis, reducing the rate of disease progression, ameliorating or palliating the disease state, and alleviating or improving prognosis. The term "treatment" also encompasses modification or improvement of at least one physical parameter, including physical parameters that may not be discernible by the patient.

在本文中,术语“预防”是指预防或延缓疾病或病症的发作或发展或进程。在本文中,“预防”通常是指在疾病的至少一个症状发生前实施的医院干预。As used herein, the term "prevention" refers to preventing or delaying the onset or development or progression of a disease or condition. As used herein, "prevention" generally refers to hospital intervention performed before at least one symptom of a disease occurs.

以下对本发明的各个方面进行描述。Various aspects of the present invention are described below.

II.双载体系统II. Dual Vector System

本发明利用蛋白反式剪接法,即,将MYO7A的N端和C端的CDS分别构建到两个不同的质粒中表达,通过内含肽剪接拼成完整的全长蛋白。N端质粒在MYO7A N端CDS序列的3’端添加了内含肽的N端编码序列,例如编码Rm-N-内含肽序列(SEQ ID NO:23)的核酸序列,Rm-N-内含肽序列的首个氨基酸残基含有Cys。C端质粒在MYO7A C端CDS序列的5’端添加了内含肽的C端编码序列,例如编码Rm-C-内含肽(SEQ ID NO:24)的核酸序列,Rm-C-内含肽的末端序列含有His和Asn,MYO7A C端CDS序列的首个氨基酸残基为Cys、Ser、Thr用于内含肽剪接。The present invention utilizes protein trans-splicing, that is, the N-terminal and C-terminal CDS of MYO7A are constructed into two different plasmids for expression, and the complete full-length protein is assembled through intein splicing. The N-terminal plasmid adds the N-terminal coding sequence of the intein at the 3' end of the MYO7A N-terminal CDS sequence, such as the nucleic acid sequence encoding the Rm-N-intein sequence (SEQ ID NO: 23), and the first amino acid residue of the Rm-N-intein sequence contains Cys. The C-terminal plasmid adds the C-terminal coding sequence of the intein at the 5' end of the MYO7A C-terminal CDS sequence, such as the nucleic acid sequence encoding the Rm-C-intein (SEQ ID NO: 24), and the terminal sequence of the Rm-C-intein contains His and Asn, and the first amino acid residue of the MYO7A C-terminal CDS sequence is Cys, Ser, and Thr for intein splicing.

本发明提供了一种用于表达MYO7A蛋白的双载体系统,其包含第一核酸载体和第二核酸载体,其中The present invention provides a dual vector system for expressing MYO7A protein, which comprises a first nucleic acid vector and a second nucleic acid vector, wherein

第一核酸载体包含第一核苷酸序列;且第二核酸载体包含第二核苷酸序列;The first nucleic acid vector comprises a first nucleotide sequence; and the second nucleic acid vector comprises a second nucleotide sequence;

所述第一核苷酸序列包含插入在两个第一ITR序列之间的表达盒;The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences;

所述第二核苷酸序列包含插入在两个第二ITR序列之间的表达盒;The second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences;

所述第一核苷酸序列的表达盒包含启动子、MYO7A的N端编码序列、内含肽的N端编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A, an N-terminal coding sequence of an intein, and polyA;

所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的C端编码序列和polyA。The expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A and polyA.

内含肽Inteins

内含肽可以剪接蛋白质,并在蛋白质翻译后或在蛋白质翻译同时通过共价连接两种不同的蛋白质发挥作用。最早的内含肽是在真菌中发现的。通过对内含肽序列的比较和分析,预测病毒、细菌、古细菌和真核微生物中存在的内含肽基因的数量超过600个。大多数内含肽是完整的蛋白质,但有一小部分内含肽是N端和C端分离的,且内含肽N端和C端各连接到一个蛋白质的一个部分,并在该蛋白质翻译后进行重组,通过亲核化学反应和变构产生完整的蛋白质。Inteins can splice proteins and exert their effects by covalently linking two different proteins after or during protein translation. The earliest inteins were discovered in fungi. Comparison and analysis of intein sequences predicts that there are over 600 intein genes present in viruses, bacteria, archaea, and eukaryotic microorganisms. Most inteins are complete proteins, but a small number of inteins have separate N- and C-termini. Inteins are linked to a portion of a protein at each end, and then reassemble after translation to produce the complete protein through nucleophilic chemical reactions and conformational changes.

在本发明中,优选地,内含肽是N端和C端分离的。内含肽可以源自MxeGyrA、pabPolIII、MjaKlbA、SspDnaB、SceVMA、SspDnaE、NpuDnaE、AvaDnaE、CraDnaE、CspDnaE、CwaDnaE、MchtDnaE、OliDnaE、TerDnaE、gp41-1、gp41-8、IMPDH-1或RmaDnaB。In the present invention, preferably, the intein is separated from the N-terminus and the C-terminus. The intein can be derived from MxeGyrA, pabPolIII, MjaKlbA, SspDnaB, SceVMA, SspDnaE, NpuDnaE, AvaDnaE, CraDnaE, CspDnaE, CwaDnaE, MchtDnaE, OliDnaE, TerDnaE, gp41-1, gp41-8, IMPDH-1 or RmaDnaB.

在一些实施方案中,内含肽是RmaDnaB内含肽,例如,具有SEQ ID NO:23所示的RmaDnaB内含肽N端部分和SEQ ID NO:24所示的RmaDnaB内含肽C端部分。在一些实施方案中,内含肽是NpuDnaE内含肽,例如,具有SEQ ID NO:52所示的NpuDnaE内含肽N端部分和SEQ ID NO:54所示的NpuDnaE内含肽C端部分。In some embodiments, the intein is an RmaDnaB intein, e.g., having an N-terminal portion of the RmaDnaB intein set forth in SEQ ID NO: 23 and a C-terminal portion of the RmaDnaB intein set forth in SEQ ID NO: 24. In some embodiments, the intein is an NpuDnaE intein, e.g., having an N-terminal portion of the NpuDnaE intein set forth in SEQ ID NO: 52 and a C-terminal portion of the NpuDnaE intein set forth in SEQ ID NO: 54.

MYO7A蛋白MYO7A protein

在一些具体实施方案中,MYO7A蛋白包含SEQ ID No 2的氨基酸序列或由其组成。In some embodiments, the MYO7A protein comprises or consists of the amino acid sequence of SEQ ID No 2.

在一些实施方案中,在MYO7A蛋白的氨基酸序列上设置切分位点,将MYO7A蛋白切分为MYO7A蛋白N端部分(本文中也简称为“MYO7A的N端”)和MYO7A蛋白C端部分(本文中也简称为“MYO7A的C端”),MYO7A的N端为MYO7A氨基酸序列的N端至切分位点的序列,MYO7A的C端是从紧邻切分位点的氨基酸残基到MYO7A氨基酸序列的C端的序列。将MYO7A的N端与内含肽的N端连接并融合,将内含肽的C端与MYO7A的C端连接并融合。MYO7A的切分位点有多种选择,下表1中列出了其中一些切分位点在MYO7A蛋白上的位置以及相应的MYO7A的N端部分和MYO7A的C端部分。In some embodiments, a cleavage site is provided in the amino acid sequence of the MYO7A protein, dividing the MYO7A protein into the N-terminal portion of the MYO7A protein (also referred to herein as the "N-terminus of MYO7A") and the C-terminal portion of the MYO7A protein (also referred to herein as the "C-terminus of MYO7A"). The N-terminus of MYO7A is the sequence from the N-terminus of the MYO7A amino acid sequence to the cleavage site, and the C-terminus of MYO7A is the sequence from the amino acid residue immediately adjacent to the cleavage site to the C-terminus of the MYO7A amino acid sequence. The N-terminus of MYO7A is linked and fused to the N-terminus of the intein, and the C-terminus of the intein is linked and fused to the C-terminus of MYO7A. There are various options for the cleavage site of MYO7A. Table 1 below lists the locations of some of these cleavage sites on the MYO7A protein and the corresponding N-terminal and C-terminal portions of MYO7A.

表1.切分位点在MYO7A蛋白上的位置以及相应的MYO7A的N端部分和MYO7A的C端部分

Table 1. Location of cleavage sites on the MYO7A protein and the corresponding N-terminal and C-terminal portions of MYO7A

表1续

Table 1 continued

表1续

Table 1 continued

在一些实施方案中,MYO7A蛋白在一组或多组以下氨基酸残基处进行切分形成MYO7A蛋白N端部分和MYO7A蛋白C端部分:1043位;1058位;1061位;1064位;1071位;1076位;1081位;1104位;1105位;1114位;1119位;1122位;或1126位,其中氨基酸位置是相对于SEQ ID No:2的位置。In some embodiments, the MYO7A protein is cleaved at one or more of the following groups of amino acid residues to form the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein: 1043; 1058; 1061; 1064; 1071; 1076; 1081; 1104; 1105; 1114; 1119; 1122; or 1126, wherein the amino acid position is relative to the position of SEQ ID No: 2.

在一些实施方案中,MYO7A蛋白被切分为选自以下的任一组的MYO7A蛋白N端部分和MYO7A蛋白C端部分:In some embodiments, the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein and a C-terminal portion of the MYO7A protein selected from any one of the following groups:

(1)1-1043aa+1044-2215aa;(1)1-1043aa+1044-2215aa;

(2)1-1058aa+1059-2215aa;(2)1-1058aa+1059-2215aa;

(3)1-1061aa+1062-2215aa;(3)1-1061aa+1062-2215aa;

(4)1-1064aa+1065-2215aa;(4)1-1064aa+1065-2215aa;

(5)1-1071aa+1072-2215aa;(5)1-1071aa+1072-2215aa;

(6)1-1076aa+1077-2215aa;(6)1-1076aa+1077-2215aa;

(7)1-1081aa+1082-2215aa;(7)1-1081aa+1082-2215aa;

(8)1-1104aa+1105-2215aa;(8)1-1104aa+1105-2215aa;

(9)1-1105aa+1106-2215aa;(9)1-1105aa+1106-2215aa;

(10)1-1114aa+1115-2215aa;(10)1-1114aa+1115-2215aa;

(11)1-1119aa+1120-2215aa;(11)1-1119aa+1120-2215aa;

(12)1-1122aa+11123-2215aa;或(12)1-1122aa+11123-2215aa; or

(13)1-1126aa+1127-2215aa,任选地其中氨基酸位置相对于SEQ ID No:2的位置。(13)1-1126aa+1127-2215aa, optionally wherein the amino acid position is relative to the position of SEQ ID No:2.

在一些实施方案中,MYO7A蛋白被切分为1-1061aa的MYO7A蛋白N端部分和1062-2215aa的MYO7A蛋白C端部分。在一些实施方案中MYO7A蛋白N端部分为SEQ ID No:4和MYO7A蛋白C端部分为SEQ ID No:6。In some embodiments, the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1061 aa and a C-terminal portion of the MYO7A protein of 1062-2215 aa. In some embodiments, the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 4 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 6.

在一些实施方案中,MYO7A蛋白被切分为1-1064aa的MYO7A蛋白N端部分和1065-2215aa a的MYO7A蛋白C端部分。在一些实施方案中MYO7A蛋白N端部分为SEQ ID No:8和MYO7A蛋白C端部分为SEQ ID No:10。In some embodiments, the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein ranging from 1 to 1064 aa and a C-terminal portion of the MYO7A protein ranging from 1065 to 2215 aa. In some embodiments, the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 8 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 10.

在一些实施方案中,MYO7A蛋白被切分为1-1104aa的MYO7A蛋白N端部分和1105-2215aa的MYO7A蛋白C端部分。在一些实施方案中MYO7A蛋白N端部分为SEQ ID No:12和MYO7A蛋白C端部分为SEQ ID No:14。In some embodiments, the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1104 aa and a C-terminal portion of the MYO7A protein of 1105-2215 aa. In some embodiments, the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 12 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 14.

在一些实施方案中,MYO7A蛋白被切分为1-1105aa的MYO7A蛋白N端部分和1115-2215aa的MYO7A蛋白C端部分。在一些实施方案中MYO7A蛋白N端部分为SEQ ID No:16和MYO7A蛋白C端部分为SEQ ID No:18。In some embodiments, the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1105 aa and a C-terminal portion of the MYO7A protein of 1115-2215 aa. In some embodiments, the N-terminal portion of the MYO7A protein is SEQ ID No: 16 and the C-terminal portion of the MYO7A protein is SEQ ID No: 18.

在一些实施方案中,MYO7A蛋白被切分为1-1119aa的MYO7A蛋白N端部分和1120-2215aa的MYO7A蛋白C端部分。在一些实施方案中MYO7A蛋白N端部分为SEQ ID No:20和MYO7A蛋白C端部分为SEQ ID No:22。In some embodiments, the MYO7A protein is cleaved into an N-terminal portion of the MYO7A protein of 1-1119 aa and a C-terminal portion of the MYO7A protein of 1120-2215 aa. In some embodiments, the N-terminal portion of the MYO7A protein is represented by SEQ ID No: 20 and the C-terminal portion of the MYO7A protein is represented by SEQ ID No: 22.

载体质粒vector plasmid

本发明的载体质粒可以是任何能够在宿主细胞中复制并表达相应多肽的质粒。The vector plasmid of the present invention can be any plasmid that can replicate in a host cell and express a corresponding polypeptide.

在一些实施方案中,载体质粒包含两个ITR序列,分别为5’反向末端重复(5’ITR)序列和3’反向末端重复(3’ITR)序列。In some embodiments, the vector plasmid comprises two ITR sequences, namely a 5' inverted terminal repeat (5' ITR) sequence and a 3' inverted terminal repeat (3' ITR) sequence.

在一些实施方案中,在本发明的用于表达MYO7A蛋白的双载体系统中,将第一核苷酸序列插入在包含两个第一ITR序列的质粒中,将第二核苷酸序列插入在包含两个第二ITR序列的质粒中,例如,所述包含两个第一ITR序列的质粒和所述包含两个第二ITR序列的质粒相同或不同,例如,所述质粒是pAAV、pAAV-CMV、pX601、pX551或pAAV-MCS质粒。In some embodiments, in the dual-vector system for expressing MYO7A protein of the present invention, the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences, and the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences, for example, the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.

双载体系统Dual vector system

本发明提供了一种双载体系统,其包含第一核酸载体和第二核酸载体,其中:The present invention provides a dual vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:

第一核酸载体以5’-3’方向包含:5’反向末端重复(5’ITR)序列、编码MYO7A蛋白N端部分的核酸序列、编码内含肽N端部分的核酸序列、和3’反向末端重复(3’ITR)序列;The first nucleic acid vector comprises, in 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a nucleic acid sequence encoding the N-terminal portion of the MYO7A protein, a nucleic acid sequence encoding the N-terminal portion of the intein, and a 3' inverted terminal repeat (3'ITR) sequence;

第二核酸载体以5’-3’方向包含:5’ITR序列、编码内含肽C端部分的核酸序列、含有编码MYO7A蛋白C端部分的核酸序列和3’ITR序列,且The second nucleic acid vector comprises, in the 5'-3' direction: a 5'ITR sequence, a nucleic acid sequence encoding the C-terminal portion of an intein, a nucleic acid sequence encoding the C-terminal portion of a MYO7A protein, and a 3'ITR sequence, and

任选地在将所述第一核酸载体和所述第二核酸载体导入宿主细胞后,所述MYO7A蛋白N端部分和MYO7A蛋白C端部分有效连接产生MYO7A蛋白。Optionally, after the first nucleic acid vector and the second nucleic acid vector are introduced into a host cell, the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein are operably linked to produce the MYO7A protein.

在一些实施方案中本发明提供了一种双载体系统,其包含第一核酸载体和第二核酸载体,其中:In some embodiments, the present invention provides a two-vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:

第一核酸载体以5’-3’方向包含:5’反向末端重复(5’ITR)序列、编码MYO7A蛋白N端部分的核酸序列、编码内含肽N端部分的核酸序列和3’反向末端重复(3’ITR)序列;The first nucleic acid vector comprises, in the 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a nucleic acid sequence encoding the N-terminal portion of the MYO7A protein, a nucleic acid sequence encoding the N-terminal portion of the intein, and a 3' inverted terminal repeat (3'ITR) sequence;

第二核酸载体以5’-3’方向包含:5’ITR序列、编码内含肽C端部分的核酸序列、编码MYO7A蛋白C端部分的核酸序列和3’ITR序列,以及The second nucleic acid vector comprises, in 5'-3' direction: a 5' ITR sequence, a nucleic acid sequence encoding the C-terminal portion of an intein, a nucleic acid sequence encoding the C-terminal portion of a MYO7A protein, and a 3' ITR sequence, and

所述MYO7A蛋白的氨基酸序列中设置有MYO7A切分位点,例如,所述MYO7A蛋白的氨基酸序列如SEQ ID NO:2所示或其功能片段,例如,与SEQ ID NO:2具有至少80%序列同一性的氨基酸序列;The MYO7A protein has a MYO7A cleavage site in its amino acid sequence, for example, the MYO7A protein has an amino acid sequence as shown in SEQ ID NO: 2 or a functional fragment thereof, for example, an amino acid sequence having at least 80% sequence identity with SEQ ID NO: 2;

所述MYO7A蛋白N端部分是MYO7A氨基酸序列的N端至MYO7A切分位点的序列;The N-terminal portion of the MYO7A protein is the sequence from the N-terminus of the MYO7A amino acid sequence to the MYO7A cleavage site;

所述MYO7A蛋白C端部分是MYO7A切分位点后一位的氨基酸至MYO7A氨基酸序列的C端的序列;The C-terminal portion of the MYO7A protein is a sequence from the amino acid after the MYO7A cleavage site to the C-terminus of the MYO7A amino acid sequence;

任选地,在将所述第一核酸载体和所述第二核酸载体导入细胞后,所述MYO7A蛋白N端部分和MYO7A蛋白C端部分有效连接产生全长MYO7A蛋白。Optionally, after the first nucleic acid vector and the second nucleic acid vector are introduced into cells, the N-terminal portion of the MYO7A protein and the C-terminal portion of the MYO7A protein are operably linked to produce a full-length MYO7A protein.

在一些实施方案中,双载体系统中的所述ITR的核苷酸序列分别衍生自同一AAV血清型或不同AAV血清型,例如,AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8或AAV9血清型。在一些实施方案中,所述第一核酸载体的5’-ITR和3’-ITR和所述第二核酸载体的5’-ITR和3’-ITR衍生自同一AAV血清型。在一些实施方案中,所述第一核酸载体的5’-ITR和3’-ITR和所述第二核酸载体的5’-ITR和3’-ITR分别来自不同AAV血清型。In some embodiments, the nucleotide sequences of the ITRs in the dual vector system are derived from the same AAV serotype or different AAV serotypes, for example, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9 serotypes. In some embodiments, the 5'-ITR and 3'-ITR of the first nucleic acid vector and the 5'-ITR and 3'-ITR of the second nucleic acid vector are derived from the same AAV serotype. In some embodiments, the 5'-ITR and 3'-ITR of the first nucleic acid vector and the 5'-ITR and 3'-ITR of the second nucleic acid vector are derived from different AAV serotypes.

在一些实施方案中,双载体系统中使用了组织特异性启动子。例如,在耳中介导表达的启动子,例如,突触蛋白启动子或GFAP启动子。In some embodiments, a tissue-specific promoter is used in a two-vector system, for example, a promoter that mediates expression in the ear, such as the synapsin promoter or the GFAP promoter.

在一些实施方案中,双载体系统中使用了以下启动子中的任一种:巨细胞病毒(CMV)启动子、SV40启动子、劳斯肉瘤病毒(RSV)启动子、CAG启动子、嵌合CMV/鸡β肌动蛋白(CBA)启动子、截短形式的CBA(smCBA)启动子、UbC启动子、SFFV启动子、EF1α启动子、PGK启动子、或Myo7A、Myo15、Atoh1、POU4F3、Lhx3、Myo6、α9AchR、α10AchR、OTOF和STRC编码基因的启动子。在一些实施方案中,启动子是CMV启动子。In some embodiments, any one of the following promoters is used in the binary vector system: cytomegalovirus (CMV) promoter, SV40 promoter, Rous sarcoma virus (RSV) promoter, CAG promoter, chimeric CMV/chicken beta actin (CBA) promoter, truncated CBA (smCBA) promoter, UbC promoter, SFFV promoter, EF1α promoter, PGK promoter, or promoters of Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, α9AchR, α10AchR, OTOF and STRC encoding genes. In some embodiments, the promoter is a CMV promoter.

本发明的双载体系统还可以包含一个或多个可以在转录之前或转录之后起作用的其他调控序列。调控序列可以是天然转基因基因座的一部分或者可以是异源的调控序列。本发明的双载体系统中可以包含天然转基因转录物的5'UTR或3'UTR的一部分。The dual vector system of the present invention may also include one or more additional regulatory sequences that can function before or after transcription. The regulatory sequences may be part of the native transgenic locus or may be heterologous regulatory sequences. A portion of the 5'UTR or 3'UTR of the native transgenic transcript may be included in the dual vector system of the present invention.

调控序列可以是促进转基因表达的任何序列,即用于增加转录物表达,改善mRNA的核输出或增强其稳定性。这种调控序列包括例如增强子元件、转录后调控元件和多聚腺苷酸化序列。The regulatory sequence may be any sequence that promotes transgene expression, i.e., serves to increase transcript expression, improve nuclear export of mRNA, or enhance its stability. Such regulatory sequences include, for example, enhancer elements, post-transcriptional regulatory elements, and polyadenylation sequences.

增强子是影响同一分子DNA上基因转录的顺式调控元件。增强子可以位于它们调节的基因的上游、下游、内含子内,或甚至相对远离它们调节的基因。Enhancers are cis-regulatory elements that affect the transcription of genes on the same molecule of DNA. Enhancers can be located upstream, downstream, within introns, or even relatively far from the genes they regulate.

在本发明的双载体系统中使用的优选转录后调控元件是土拨鼠肝炎转录后调控元件(WPRE)或其变体。与无WPRE或其变体的AAV载体相比,包含WPRE或其变体的AAV载体增加MYO7A蛋白的表达。The preferred post-transcriptional regulatory element used in the dual vector system of the present invention is the woodchuck hepatitis post-transcriptional regulatory element (WPRE) or a variant thereof. Compared with an AAV vector without WPRE or a variant thereof, an AAV vector containing WPRE or a variant thereof increases the expression of MYO7A protein.

在一个实施方案中,本发明的双载体系统包含具有SEQ ID NO:28所示的WPRE核苷酸序列。在另一个实施方案中,本发明的双载体系统包含与SEQ ID NO:28所示的WPRE核苷酸序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列的转录后调控元件,其中该核苷酸序列基本上保留了如SEQ ID NO:28所示的转录后调控元件的功能性活性,例如,WPRE的截短变体。缩小AAV基因组的尺寸使得能够增加除了转基因以外在载体中引入其他调控元件的灵活性。在一个实施方案中,WPRE的截短变体具有SEQ ID NO:31所示的WPRE3核苷酸序列。In one embodiment, the dual vector system of the present invention comprises a WPRE nucleotide sequence as set forth in SEQ ID NO:28. In another embodiment, the dual vector system of the present invention comprises a post-transcriptional regulatory element having a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to the WPRE nucleotide sequence as set forth in SEQ ID NO:28, wherein the nucleotide sequence substantially retains the functional activity of the post-transcriptional regulatory element as set forth in SEQ ID NO:28, for example, a truncated variant of WPRE. Reducing the size of the AAV genome enables increased flexibility in introducing other regulatory elements into the vector in addition to transgenes. In one embodiment, the truncated variant of WPRE has the WPRE3 nucleotide sequence as set forth in SEQ ID NO:31.

在一个实施方案中,本发明的双载体系统包含多聚腺苷酸化序列,例如,牛生长激素多聚腺苷酸化序列、SV40多聚腺苷酸化序列和/或SV40晚期多聚腺苷酸化序列。在一个实施方案中,本发明的双载体系统包含与SEQ ID NO:29所示的核苷酸序列具有至少90%、95%、96%、97%、98%、99%或100%同一性的SV40多聚腺苷酸化序列。在一个实施方案中,本发明的双载体系统包含与SEQ ID NO:32所示的核苷酸序列具有至少90%、95%、96%、97%、98%、99%或100%同一性的SV40晚期多聚腺苷酸化序列。In one embodiment, the dual vector system of the present invention comprises a polyadenylation sequence, for example, a bovine growth hormone polyadenylation sequence, an SV40 polyadenylation sequence, and/or an SV40 late polyadenylation sequence. In one embodiment, the dual vector system of the present invention comprises an SV40 polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 29. In one embodiment, the dual vector system of the present invention comprises an SV40 late polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 32.

在一些实施方案中,本发明的双载体系统包含WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合,例如,本发明的双载体系统具有SEQ ID NO:27所示的核苷酸序列或与SEQ ID NO:27具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合使得可以高水平表达转基因。In some embodiments, the dual vector system of the present invention comprises a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence. For example, the dual vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 27, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 27. The combination of the WPRE nucleotide sequence and the SV40 polyadenylation sequence allows for high-level expression of a transgene.

在一些实施方案中,本发明的双载体系统包含WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合,例如,本发明的双载体系统具有SEQ ID NO:30所示的核苷酸序列或与SEQ ID NO:30具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合(也简称为“W3SL”)可以高效率表达更大的外源基因,且占用AAV更少的包装容量。In some embodiments, the dual-vector system of the present invention comprises a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence. For example, the dual-vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 30, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 30. The combination of the WPRE3 nucleotide sequence and the SV40 late polyadenylation sequence (also referred to as "W3SL") can efficiently express larger exogenous genes while occupying less AAV packaging capacity.

本发明通过使用所述双载体系统,将MYO7A蛋白基因分成两个部分提供至内耳细胞、内毛细胞或外毛细胞,在那里表达的MYO7A蛋白N端部分和MYO7A蛋白C端部分发生反式剪接并形成全长MYO7A蛋白。本发明证实了,本发明的用于表达MYO7A蛋白的双载体系统能够有效地转导所靶向的内耳细胞、内毛细胞或外毛细胞,在所述细胞中产生MYO7A蛋白,并以持久的方式恢复因MYO7A基因敲除而产生的听力损伤。The present invention uses the dual-vector system to deliver the MYO7A protein gene in two parts to inner ear cells, inner hair cells, or outer hair cells, where the N-terminal and C-terminal parts of the MYO7A protein expressed undergo trans-splicing to form the full-length MYO7A protein. The present invention demonstrates that the dual-vector system for expressing the MYO7A protein can effectively transduce the targeted inner ear cells, inner hair cells, or outer hair cells, producing the MYO7A protein in these cells and durably restoring hearing loss caused by MYO7A gene knockout.

在优选的实施方案中,本发明的双载体系统允许表达氨基酸序列与SEQ ID NO:2具有至少70%的同一性和/或相似性的同源多肽。所述同源序列更优选地与SEQ ID NO:2具有至少75%、甚至更优选至少80%、或至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少99%、至少99%的同一性和/或相似性。当同源多肽比SEQ ID NO:2短得多时,可以考虑局部比对。In a preferred embodiment, the dual vector system of the present invention allows for the expression of homologous polypeptides having an amino acid sequence that is at least 70% identical and/or similar to SEQ ID NO: 2. More preferably, the homologous sequence is at least 75%, even more preferably at least 80%, or at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 99%, or at least 99% identical and/or similar to SEQ ID NO: 2. When the homologous polypeptide is much shorter than SEQ ID NO: 2, a local alignment may be considered.

在另一个实施方案中,本发明的双载体系统可以允许表达MYO7A蛋白多肽的功能片段。术语“功能片段”本文是指保留目的MYO7A蛋白多肽的至少一种生物学功能的任何片段。In another embodiment, the dual vector system of the present invention can allow for the expression of functional fragments of the MYO7A protein polypeptide. The term "functional fragment" herein refers to any fragment that retains at least one biological function of the target MYO7A protein polypeptide.

可以使用本发明所述的双载体系统转化宿主细胞而获得全长MYO7A蛋白。在一些实施方案中,所述宿主细胞选自Hela-S3细胞、HEK-293细胞、HEK-293T细胞、HEK-293FT细胞、A549细胞和Sf9细胞。The full-length MYO7A protein can be obtained by transforming host cells using the dual vector system of the present invention. In some embodiments, the host cells are selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells, and Sf9 cells.

III.双载体系统的用途III. Uses of the Dual Vector System

本发明的双载体系统用于施与患有Myo7a突变诱发的听力损伤患者。“患有Myo7a突变诱发的听力损伤的患者”是指这样的患者、特别是人患者,其被认为具有(或已被诊断为具有)在组成型Myo7a基因中的突变,所述突变触发MYO7A蛋白的异常表达、异常功能或两者都有。在一些实施方案中,Myo7a突变诱发的听力损伤是USH1B、常染色体隐性遗传性聋DFNB2、常染色体显性遗传性聋DFNA11。The dual-vector system of the present invention is used to administer to patients suffering from Myo7a mutation-induced hearing loss. "Patient suffering from Myo7a mutation-induced hearing loss" refers to a patient, particularly a human patient, who is believed to have (or has been diagnosed with) a mutation in the constitutive Myo7a gene that triggers abnormal expression, abnormal function, or both of the MYO7A protein. In some embodiments, the Myo7a mutation-induced hearing loss is USH1B, autosomal recessive hearing loss DFNB2, or autosomal dominant hearing loss DFNA11.

在一些实施方案中,本发明的双载体系统为双AAV载体系统。在一些实施方案中,双AAV载体系统中的第一AAV载体和第二AAV载体为各自具有相同或不同的AAV来源的衣壳的载体,例如,双AAV载体系统中的第一AAV载体和第二AAV载体为各自具有AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV-Anc80衣壳的载体或具有嵌合衣壳的AAV载体,特别是具有AAV-Anc80衣壳的AAV载体。优选地,使用合成的腺相关病毒载体Anc80L65,其已被证明具有迄今为止报道的内耳毛细胞的最高转导效率(Suzuki等人,Sci.Rep.7:45524(2017))。In some embodiments, the dual vector system of the present invention is a dual AAV vector system. In some embodiments, the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having a capsid of the same or different AAV origin, for example, the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV-Anc80 capsid or an AAV vector with a chimeric capsid, in particular an AAV vector with an AAV-Anc80 capsid. Preferably, the synthetic adeno-associated virus vector Anc80L65 is used, which has been shown to have the highest transduction efficiency of inner ear hair cells reported to date (Suzuki et al., Sci. Rep. 7: 45524 (2017)).

本发明的双载体系统在施用后,可以触发在内耳细胞、内毛细胞或外毛细胞中全长MYO7A蛋白多肽、或其功能片段的表达。After administration, the dual-vector system of the present invention can trigger the expression of the full-length MYO7A protein polypeptide, or a functional fragment thereof, in inner ear cells, inner hair cells, or outer hair cells.

本发明的双载体系统施与的患者优选为新生人婴儿,通常小于6月龄,甚至小于3月龄(如果他们在幼时被诊断出来患DFNB16耳聋)。这些人婴儿更优选在3月龄和1岁之间。The patients to whom the dual vector system of the present invention is administered are preferably newborn human infants, usually less than 6 months old, or even less than 3 months old (if they were diagnosed with DFNB16 deafness in childhood). These human infants are more preferably between 3 months and 1 year old.

还可以将本发明的双载体系统施用于例如幼儿(2-6岁)、儿童(6-12岁)、青少年(12-18岁)或成人(18岁及以上)。The two-vector system of the present invention can also be administered to, for example, infants (2-6 years), children (6-12 years), adolescents (12-18 years), or adults (18 years and older).

如本文所用,术语“治疗”旨在表示将治疗有效量的本发明的双载体系统施用至患有DFNB16耳聋的患者,以便部分地或完全恢复所述患者的听力。通过使用电生理设备测试听觉脑干反应(ABR)可以评估所述恢复。“治疗Myo7a突变诱发的听力损伤”特别旨在指听力功能的完全恢复。术语“预防”指减少或延迟听觉频率范围内的听力丧失。As used herein, the term "treating" is intended to mean administering a therapeutically effective amount of the dual-vector system of the present invention to a patient suffering from DFNB16 deafness to partially or completely restore the patient's hearing. Restoration can be assessed by testing auditory brainstem responses (ABRs) using electrophysiological equipment. "Treatment of Myo7a mutation-induced hearing loss" specifically refers to complete restoration of hearing function. The term "preventing" refers to reducing or delaying hearing loss within the auditory frequency range.

实施例Example

实施例1在MYO7A蛋白的氨基酸序列中选择内含肽切分位点Example 1: Selecting intein cleavage sites in the amino acid sequence of MYO7A protein

图1显示了内含肽(intein)介导的全长外显肽(extein)表达示意图。在MYO7A蛋白的氨基酸序列中设置内含肽切分位点,MYO7A的N端编码序列为从MYO7A的氨基酸序列的N末端至切分位点的核苷酸编码序列,MYO7A的C端编码序列为从MYO7A的氨基酸序列的切分位点下一位氨基酸至C末端的核苷酸编码序列。MYO7A的N端序列与内含肽的N端序列融合,内含肽的C端序列与MYO7A的C端序列融合。在MYO7A蛋白的氨基酸序列中可以选择许多切分位点,其中13个切分位点方案及对应的MYO7A的N端和MYO7A的C端的氨基酸序列在表2中列出,旨在寻找疗效更佳的使用Myo7a基因治疗的切分位点。图2显示了通过内含肽拼接筛选Myo7a的切分位点的筛选示意图。Figure 1 shows a schematic diagram of intein-mediated full-length extein expression. An intein cleavage site is set within the amino acid sequence of the MYO7A protein. The N-terminal coding sequence of MYO7A is the nucleotide coding sequence from the N-terminus of the MYO7A amino acid sequence to the cleavage site, and the C-terminal coding sequence of MYO7A is the nucleotide coding sequence from the amino acid immediately following the cleavage site to the C-terminus of the MYO7A amino acid sequence. The N-terminal sequence of MYO7A is fused to the N-terminal sequence of the intein, and the C-terminal sequence of the intein is fused to the C-terminal sequence of MYO7A. Numerous cleavage sites can be selected within the amino acid sequence of the MYO7A protein. Table 2 lists 13 cleavage site schemes and the corresponding MYO7A N-terminal and C-terminal amino acid sequences. The goal is to identify cleavage sites with improved efficacy for Myo7A gene therapy. Figure 2 shows a schematic diagram of the screening process for Myo7A cleavage sites through intein splicing.

表2 SEQ ID NO:2所示的MYO7A蛋白的示例性切分位点

Table 2 Exemplary cleavage sites of the MYO7A protein shown in SEQ ID NO: 2

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1043位氨基酸残基用作切分位点(S1)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1043rd amino acid residue is used as the cleavage site (S1).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1058位氨基酸残基用作切分位点(S2)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1058th amino acid residue is used as the cleavage site (S2).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1061位氨基酸残基用作切分位点(S3)。在切分位点S3时,SEQ ID NO:3和SEQ ID NO:4分别是MYO7A的N端的核苷酸序列和氨基酸序列,SEQ ID NO:5和SEQ ID NO:6分别是MYO7A的C端的核苷酸序列和氨基酸序列。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, amino acid residue 1061 is used as the cleavage site (S3). At the cleavage site S3, SEQ ID NO:3 and SEQ ID NO:4 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively, and SEQ ID NO:5 and SEQ ID NO:6 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1064位氨基酸残基用作切分位点(S4)。在切分位点S4时,SEQ ID NO:7和SEQ ID NO:8分别是MYO7A的N端的核苷酸序列和氨基酸序列,SEQ ID NO:9和SEQ ID NO:10分别是MYO7A的C端的核苷酸序列和氨基酸序列。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, amino acid residue 1064 is used as the cleavage site (S4). At the cleavage site S4, SEQ ID NO:7 and SEQ ID NO:8 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively, and SEQ ID NO:9 and SEQ ID NO:10 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1071位氨基酸残基用作切分位点(S5)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1071st amino acid residue is used as the cleavage site (S5).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1076位氨基酸残基用作切分位点(S6)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1076th amino acid residue is used as the cleavage site (S6).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1081位氨基酸残基用作切分位点(S7)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1081st amino acid residue is used as the cleavage site (S7).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1104位氨基酸残基用作切分位点(S8)。在切分位点S8时,SEQ ID NO:11和SEQ ID NO:12分别是MYO7A的N端的核苷酸序列和氨基酸序列,SEQ ID NO:13和SEQ ID NO:14分别是MYO7A的C端的核苷酸序列和氨基酸序列。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, amino acid residue 1104 is used as the cleavage site (S8). At the cleavage site S8, SEQ ID NO:11 and SEQ ID NO:12 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively, and SEQ ID NO:13 and SEQ ID NO:14 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1105位氨基酸残基用作切分位点(S9)。在切分位点S9时,SEQ ID NO:15和SEQ ID NO:16分别是MYO7A的N端的核苷酸序列和氨基酸序列,SEQ ID NO:17和SEQ ID NO:18分别是MYO7A的C端的核苷酸序列和氨基酸序列。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, amino acid residue 1105 is used as the cleavage site (S9). At the cleavage site S9, SEQ ID NO:15 and SEQ ID NO:16 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively, and SEQ ID NO:17 and SEQ ID NO:18 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1114位氨基酸残基用作切分位点(S10)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1114th amino acid residue is used as the cleavage site (S10).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1119位氨基酸残基用作切分位点(S11)。在切分位点S11时,SEQ ID NO:19和SEQ ID NO:20分别是MYO7A的N端的核苷酸序列和氨基酸序列,SEQ ID NO:21和SEQ ID NO:22分别是MYO7A的C端的核苷酸序列和氨基酸序列。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, amino acid residue 1119 is used as the cleavage site (S11). At the cleavage site S11, SEQ ID NO:19 and SEQ ID NO:20 are the nucleotide sequence and amino acid sequence of the N-terminus of MYO7A, respectively, and SEQ ID NO:21 and SEQ ID NO:22 are the nucleotide sequence and amino acid sequence of the C-terminus of MYO7A, respectively.

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1122位氨基酸残基用作切分位点(S12)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1122nd amino acid residue is used as the cleavage site (S12).

在SEQ ID NO:2所示的MYO7A蛋白的氨基酸序列中,将第1126位氨基酸残基用作切分位点(S13)。In the amino acid sequence of the MYO7A protein shown in SEQ ID NO:2, the 1126th amino acid residue is used as the cleavage site (S13).

实施例2.使用含有ITR序列的质粒构建表达MYO7A蛋白的双载体系统Example 2. Construction of a dual-vector system for expressing MYO7A protein using a plasmid containing ITR sequences

使用pAAV-CMV-EGFP-WPRE-SV40质粒(南京金斯瑞公司合成,图3)作为质粒骨架,构建表达MYO7A蛋白N端的第一载体质粒和表达MYO7A蛋白C端的第二载体质粒,所构建的质粒元件分别如图2的上小图和下小图所示,其中第二载体质粒在表达MYO7A蛋白C端的编码序列(CDS)末端连接了HA标签(HA标签序列YPYDVPDYA(SEQ ID NO.25)用于验证体外表达,由此获得了双载体系统。Using the pAAV-CMV-EGFP-WPRE-SV40 plasmid (synthesized by Nanjing GenScript, Figure 3) as the plasmid backbone, a first vector plasmid expressing the N-terminus of the MYO7A protein and a second vector plasmid expressing the C-terminus of the MYO7A protein were constructed. The constructed plasmid elements are shown in the upper and lower panels of Figure 2, respectively. The second vector plasmid was connected to the end of the coding sequence (CDS) expressing the C-terminus of the MYO7A protein (HA tag sequence YPYDVPDYA (SEQ ID NO. 25) for verifying in vitro expression, thereby obtaining a dual-vector system.

具体地,该双载体系统中的第一载体质粒和第二载体质粒是在图3所示的pAAV-CMV-EGFP-WPRE-SV40质粒骨架上改造而获得的。第一载体质粒和第二载体质粒在表达后,通过蛋白反式剪接法实现MYO7A全长蛋白的表达。所述第一载体质粒在编码Myo7a-N序列(即Myo7a N端部分的序列)的3’端连接编码内含肽N端片段(Rm-N,SEQ ID NO:23)的序列,所述第二载体质粒在编码内含肽C端片段(Rm-C序列,SEQ ID NO:24)的序列的3’端连接编码Myo7a-C序列(即Myo7a C端部分的序列),且Myo7a-C序列末端连接有HA标签。Specifically, the first vector plasmid and the second vector plasmid in the dual-vector system are obtained by modifying the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone shown in Figure 3. After expression, the first vector plasmid and the second vector plasmid achieve expression of the MYO7A full-length protein by protein trans-splicing. The first vector plasmid is connected to the sequence encoding the N-terminal fragment of the intein (Rm-N, SEQ ID NO: 23) at the 3' end of the sequence encoding the Myo7a-N sequence (i.e., the sequence of the N-terminal part of Myo7a), and the second vector plasmid is connected to the sequence encoding the Myo7a-C sequence (i.e., the sequence of the C-terminal part of Myo7a) at the 3' end of the sequence encoding the C-terminal fragment of the intein (Rm-C sequence, SEQ ID NO: 24), and the end of the Myo7a-C sequence is connected to an HA tag.

使用EcoRI和EcoRV双酶切将pAAV-CMV-EGFP-WPRE-SV40质粒中的EGFP报告基因序列及其他序列替换为包含编码Myo7a-N序列和内含肽N端片段(Rm-N,SEQ ID NO:23)的序列,获得第一载体质粒。使用EcoRI和EcoRV双酶切将pAAV-CMV-EGFP-WPRE-SV40质粒中的EGFP报告基因序列替换为包含编码内含肽C端片段(Rm-C序列,SEQ ID NO:24)的序列和Myo7a-C序列,获得第二载体质粒。所述序列的合成及载体构建均委托南京金斯瑞生物科技有限公司完成。The EGFP reporter gene sequence and other sequences in the pAAV-CMV-EGFP-WPRE-SV40 plasmid were replaced with a sequence encoding the Myo7a-N sequence and the N-terminal fragment of the intein (Rm-N, SEQ ID NO: 23) using EcoRI and EcoRV enzymes to obtain the first vector plasmid. The EGFP reporter gene sequence in the pAAV-CMV-EGFP-WPRE-SV40 plasmid was replaced with a sequence encoding the C-terminal fragment of the intein (Rm-C sequence, SEQ ID NO: 24) and the Myo7a-C sequence using EcoRI and EcoRV enzymes to obtain the second vector plasmid. The synthesis of these sequences and the construction of these vectors were commissioned to Nanjing GenScript Biotechnology Co., Ltd.

获得了对应于表2中的各切分位点的13对质粒用于后续实施例,分别命名为Thirteen pairs of plasmids corresponding to the cleavage sites in Table 2 were obtained for subsequent examples and were named

(1)pAAV-CMV-MYO7A-N-S1-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S1质粒;(1) pAAV-CMV-MYO7A-N-S1-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S1 plasmid;

(2)pAAV-CMV-MYO7A-N-S2-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S2质粒;(2) pAAV-CMV-MYO7A-N-S2-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S2 plasmid;

(3)pAAV-CMV-MYO7A-N-S3-Rm-N-内含肽质粒(质粒图谱显示于图4);和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S3质粒(质粒图谱显示于图5);(3) pAAV-CMV-MYO7A-N-S3-Rm-N-intein plasmid (plasmid map is shown in FIG4 ); and pAAV-CMV-Rm-C-intein-MYO7A-C-S3 plasmid (plasmid map is shown in FIG5 );

(4)pAAV-CMV-MYO7A-N-S4-Rm-N-内含肽质粒(质粒图谱显示于图6);和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S4质粒(质粒图谱显示于图7);(4) pAAV-CMV-MYO7A-N-S4-Rm-N-intein plasmid (plasmid map is shown in FIG6 ); and pAAV-CMV-Rm-C-intein-MYO7A-C-S4 plasmid (plasmid map is shown in FIG7 );

(5)pAAV-CMV-MYO7A-N-S5-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S5质粒;(5) pAAV-CMV-MYO7A-N-S5-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S5 plasmid;

(6)pAAV-CMV-MYO7A-N-S6-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S6质粒;(6) pAAV-CMV-MYO7A-N-S6-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S6 plasmid;

(7)pAAV-CMV-MYO7A-N-S7-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S7质粒;(7) pAAV-CMV-MYO7A-N-S7-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S7 plasmid;

(8)pAAV-CMV-MYO7A-N-S8-Rm-N-内含肽质粒(质粒图谱显示于图8);和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S8质粒(质粒图谱显示于图9);(8) pAAV-CMV-MYO7A-N-S8-Rm-N-intein plasmid (plasmid map is shown in FIG8 ); and pAAV-CMV-Rm-C-intein-MYO7A-C-S8 plasmid (plasmid map is shown in FIG9 );

(9)pAAV-CMV-MYO7A-N-S9-Rm-N-内含肽质粒(质粒图谱显示于图10);和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S9质粒(质粒图谱显示于图11);(9) pAAV-CMV-MYO7A-N-S9-Rm-N-intein plasmid (plasmid map is shown in FIG10 ); and pAAV-CMV-Rm-C-intein-MYO7A-C-S9 plasmid (plasmid map is shown in FIG11 );

(10)pAAV-CMV-MYO7A-N-S10-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S10质粒;(10) pAAV-CMV-MYO7A-N-S10-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S10 plasmid;

(11)pAAV-CMV-MYO7A-N-S11-Rm-N-内含肽质粒(质粒图谱显示于图12);和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S11质粒(质粒图谱显示于图13);(11) pAAV-CMV-MYO7A-N-S11-Rm-N-intein plasmid (plasmid map is shown in FIG12 ); and pAAV-CMV-Rm-C-intein-MYO7A-C-S11 plasmid (plasmid map is shown in FIG13 );

(12)pAAV-CMV-MYO7A-N-S12-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S12质粒;(12) pAAV-CMV-MYO7A-N-S12-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S12 plasmid;

(13)pAAV-CMV-MYO7A-N-S13-Rm-N-内含肽质粒;和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S13质粒。(13) pAAV-CMV-MYO7A-N-S13-Rm-N-intein plasmid; and pAAV-CMV-Rm-C-intein-MYO7A-C-S13 plasmid.

上述质粒转录产物均含有共价连接的MYO7A部分和Rm内含肽部分。质粒名称和转录产物等示于下表3中。The plasmid transcripts all contain a covalently linked MYO7A portion and an Rm intein portion. The plasmid names and transcripts are shown in Table 3 below.

表3.示例性质粒名称以及各质粒的转录和翻译产物


Table 3. Exemplary plasmid names and transcription and translation products of each plasmid


实施例3.构建的质粒在细胞中的重组Example 3. Recombination of the constructed plasmid in cells

3.1包含全长WPRE的双载体系统在细胞中的重组3.1 Recombination of the binary vector system containing the full-length WPRE in cells

将实施例2获得的表3所示的13对质粒分别成对转染HEK-293T细胞(细胞购自ATCC),转染48小时后通过蛋白质印迹分析全长MYO7A蛋白的表达。具体实验方法如下。The 13 pairs of plasmids shown in Table 3 obtained in Example 2 were transfected into HEK-293T cells (cells purchased from ATCC) in pairs, and the expression of full-length MYO7A protein was analyzed by Western blotting 48 hours after transfection. The specific experimental method is as follows.

细胞转染:接种HEK-293T细胞(为人胚肾(Human Embryonic Kidney)293T细胞,下文中也缩写为“293T细胞”)至70-90%密度时准备转染。配制A管:125μL无血清DMEM培养基+8μL Lipofectamine 3000试剂(Invitrogen,目录号:L3000015),充分混匀。配制B管:125μL无血清DMEM培养基+2μg第一载体质粒+2μg第二载体质粒+8μL P3000试剂(Invitrogen,目录号:L3000015),充分混匀。将B管混合物加到A管中,轻轻充分混匀,室温静置10-15min。所述A管是培养基和Lipofectamine 3000转染试剂的混合物,所述B管是培养基和载体质粒DNA及转染加强剂P3000的混合物。向293T细胞添加所获得的DNA-脂质体复合物进行转染,37℃、95%空气和5%的CO2孵育,转染48小时后离心收获细胞。Cell transfection: Inoculate HEK-293T cells (Human Embryonic Kidney 293T cells, hereinafter referred to as "293T cells") to a density of 70-90% and prepare for transfection. Prepare Tube A: 125 μL serum-free DMEM medium + 8 μL Lipofectamine 3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly. Prepare Tube B: 125 μL serum-free DMEM medium + 2 μg first vector plasmid + 2 μg second vector plasmid + 8 μL P3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly. Add the mixture in Tube B to Tube A, mix gently and thoroughly, and let it stand at room temperature for 10-15 minutes. Tube A contains a mixture of culture medium and Lipofectamine 3000 transfection reagent, while Tube B contains a mixture of culture medium, vector plasmid DNA, and transfection enhancer P3000. The obtained DNA-liposome complex was added to 293T cells for transfection, and the cells were incubated at 37°C in 95% air and 5% CO2 . After 48 hours of transfection, the cells were harvested by centrifugation.

将细胞转染后离心收获的细胞沉淀在适量RIPA裂解液(Thermo Fisher Scientific,目录号:89900)(添加有1%蛋白酶抑制剂cocktail(Thermo Fisher Scientific,目录号:87786),1%PMSF)里充分重悬,冰上裂解30min,期间每10min涡旋一下,使细胞沉淀充分重悬于裂解液中。12000rpm离心15min收集上清(不要吸到沉淀),使用5X上样缓冲液(Loading Buffer)按比例加入样品,75℃煮15min,冰上冷却,离心沉降后,取上清进行Western印迹分析。After cell transfection, the cell pellet harvested by centrifugation was fully resuspended in an appropriate amount of RIPA lysis buffer (Thermo Fisher Scientific, catalog number: 89900) (supplemented with 1% protease inhibitor cocktail (Thermo Fisher Scientific, catalog number: 87786), 1% PMSF), and lysed on ice for 30 minutes. Vortex every 10 minutes during this period to fully resuspend the cell pellet in the lysis buffer. Centrifuge at 12000rpm for 15 minutes to collect the supernatant (do not aspirate the precipitate), use 5X loading buffer (Loading Buffer) to add the sample in proportion, boil at 75℃ for 15 minutes, cool on ice, and after centrifugation, take the supernatant for Western blotting analysis.

Western印迹检测MYO7A蛋白表达:洗净玻璃板,将玻璃板平整固定在架子上,凹面朝里夹紧,前后两块对称放置。配分离胶,用异丙醇液封,0.5h后将异丙醇倒掉,侧置用泵吸干。配浓缩胶,加到溢出为止,插梳子。45min后,将胶板取下,装到电泳槽的夹子上,从槽中心加入运行缓冲液,直到运行缓冲液溢出到槽体积的1/2处,用力小心拔出梳子。上样,100v恒压1h进行电泳和冰上300mA恒流转膜90min。Western blotting to detect MYO7A protein expression: Wash the glass plates and secure them flat on a rack, clamping them with the concave surface facing inward. Position the plates symmetrically, front and back. Prepare separating gel and seal with isopropanol. After 0.5 hours, discard the isopropanol and place on its side with a pump to dry. Prepare stacking gel, adding until overflowing, and insert a comb. After 45 minutes, remove the gel plate and attach it to the clamps in the electrophoresis tank. Add running buffer from the center of the tank until it overflows to 1/2 of the tank volume. Carefully remove the comb. Load the sample and perform electrophoresis at a constant voltage of 100 V for 1 hour. Transfer the membrane at a constant current of 300 mA on ice for 90 minutes.

将转膜完成后的PVDF膜取出,使用封闭液(5%脱脂奶粉,使用TBST缓冲液配制)室温孵育1小时。将封闭完成后的PVDF膜加入第一抗体(HA-Tag Mouse mAb,Cell Signaling Technology,目录号:6E2;β-Actin Mouse mAb,Cell Signaling Technology,目录号:8H10D10)置于4℃孵育过夜。使用HA抗体作为第一抗体来检测MYO7A蛋白全长蛋白的表达,使用β肌动蛋白抗体作为第一抗体来检测Western印迹的内参蛋白β肌动蛋白水平。β肌动蛋白的蛋白水平通常不会发生改变,因此能够用于检测Western印迹时上样量是否一致。After transfer, remove the PVDF membrane and incubate it with blocking solution (5% skim milk powder, prepared in TBST buffer) at room temperature for 1 hour. Add the primary antibody (HA-Tag Mouse mAb, Cell Signaling Technology, catalog number: 6E2; β-Actin Mouse mAb, Cell Signaling Technology, catalog number: 8H10D10) to the blocked PVDF membrane and incubate it at 4°C overnight. Use HA antibody as the primary antibody to detect the expression of the full-length MYO7A protein, and use β-actin antibody as the primary antibody to detect the level of β-actin, an internal control protein in Western blotting. The protein level of β-actin does not usually change, so it can be used to check whether the sample amount is consistent during Western blotting.

次日取出孵育好的PVDF膜,1X TBST漂洗3次,每次5min,使用第二抗体(HRP-conjugated Affinipure Goat Anti-Mouse IgG(H+L),Proteintech,目录号:SA00001-1)室温孵育1小时,1X TBST漂洗3次,每次5min;然后加入化学发光试剂(ECL)在暗室中进行显影。The next day, the incubated PVDF membrane was removed and rinsed three times with 1X TBST for 5 minutes each time. The membrane was incubated with the secondary antibody (HRP-conjugated Affinipure Goat Anti-Mouse IgG (H+L), Proteintech, catalog number: SA00001-1) at room temperature for 1 hour and rinsed three times with 1X TBST for 5 minutes each time. The chemiluminescent reagent (ECL) was then added for development in a dark room.

表3所示的13对质粒分别成对转染293T细胞后的目的蛋白质表达结果如图14所示。图14中,泳道“Ctrl”表示没有转染质粒的蛋白对照;泳道“FL”表示全长蛋白MYO7A,The expression results of target proteins after 13 pairs of plasmids shown in Table 3 were transfected into 293T cells are shown in Figure 14. In Figure 14, lane "Ctrl" represents the protein control without transfection plasmid; lane "FL" represents the full-length protein MYO7A,

泳道“1”表示pAAV-CMV-MYO7A-N-S1-Rm-N-内含肽质粒和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S1质粒共转染293T细胞后,MYO7A的蛋白表达量;Lane “1” indicates the protein expression level of MYO7A after co-transfection of 293T cells with pAAV-CMV-MYO7A-N-S1-Rm-N-intein plasmid and pAAV-CMV-Rm-C-intein-MYO7A-C-S1 plasmid;

泳道“2”表示pAAV-CMV-MYO7A-N-S2-Rm-N-内含肽质粒和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S2质粒共转染293T细胞后,MYO7A的蛋白表达量;…Lane "2" represents the protein expression level of MYO7A after co-transfection of 293T cells with pAAV-CMV-MYO7A-N-S2-Rm-N-intein plasmid and pAAV-CMV-Rm-C-intein-MYO7A-C-S2 plasmid; ...

泳道“13”表示pAAV-CMV-MYO7A-N-S13-Rm-N-内含肽质粒和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S13质粒共转染293T细胞后,MYO7A的蛋白表达量。Lane “13” indicates the protein expression level of MYO7A after co-transfection of pAAV-CMV-MYO7A-N-S13-Rm-N-intein plasmid and pAAV-CMV-Rm-C-intein-MYO7A-C-S13 plasmid into 293T cells.

由图14可见,MYO7A蛋白的内含肽切分位点S3、S4、S8、S9和S11对应的成对质粒共转染293T细胞后,产生显著更多的全长MYO7A的蛋白表达量。MYO7A蛋白的内含肽切分位点S3、S4、S8、S9和S11能够使得所构建的相应成对质粒在细胞中高效重组,具有非常好的应用前景,可作双AAV载体治疗的候选位点。As shown in Figure 14, co-transfection of 293T cells with paired plasmids corresponding to the MYO7A protein intein cleavage sites S3, S4, S8, S9, and S11 resulted in significantly higher expression of full-length MYO7A protein. The MYO7A protein intein cleavage sites S3, S4, S8, S9, and S11 enable efficient recombination of the corresponding paired plasmids in cells, demonstrating promising application prospects and serving as candidate sites for dual AAV vector therapy.

3.2包含截短的WPRE的双载体系统在细胞中的重组3.2 Recombination of the binary vector system containing truncated WPRE in cells

选择MYO7A蛋白的内含肽切分位点S3,构建了包含截短的WPRE的双载体系统,其中,将图3所示的pAAV-CMV-EGFP-WPRE-SV40质粒骨架中的SEQ ID NO:27所示的WPRE+SV40多聚(A)(717bp)核苷酸序列替换为SEQ ID NO:30所示的WPRE3-SV40晚期多聚(A)(432bp)核苷酸序列,并构建了包含截短的WPRE的第一载体质粒和第二载体质粒。The intein cleavage site S3 of the MYO7A protein was selected, and a dual-vector system containing a truncated WPRE was constructed, wherein the WPRE+SV40 poly(A) (717bp) nucleotide sequence shown in SEQ ID NO:27 in the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone shown in Figure 3 was replaced with the WPRE3-SV40 late poly(A) (432bp) nucleotide sequence shown in SEQ ID NO:30, and a first vector plasmid and a second vector plasmid containing a truncated WPRE were constructed.

将包含截短的WPRE的所述第一载体质粒和第二载体质粒共转染HEK-293T细胞,转染48小时后通过蛋白质印迹分析全长MYO7A蛋白的表达,并与实施例3.1的用MYO7A蛋白的内含肽切分位点S3的第一载体质粒和第二载体质粒(包含全长WPRE)共转染HEK-293T细胞的蛋白质印迹结果、以及与不包含全长WPRE或其截短变体、也不包含多聚A序列的第一载体质粒和第二载体质粒共转染HEK-293T细胞的蛋白质印迹结果进行比较。The first vector plasmid and the second vector plasmid containing the truncated WPRE were co-transfected into HEK-293T cells, and the expression of the full-length MYO7A protein was analyzed by Western blotting 48 hours after transfection. The results were compared with the Western blotting results of HEK-293T cells co-transfected with the first vector plasmid and the second vector plasmid (containing the full-length WPRE) using the intein cleavage site S3 of the MYO7A protein in Example 3.1, and the Western blotting results of HEK-293T cells co-transfected with the first vector plasmid and the second vector plasmid that did not contain the full-length WPRE or its truncated variants or the poly A sequence.

结果如图15所示,与不包含全长WPRE或其截短变体、也不包含多聚A序列的对照(“Ctrl”)相比较,全长WPRE和截短的WPRE与多聚A序列的组合增加了全长MYO7A蛋白的表达量。The results are shown in FIG15 . Compared with the control (“Ctrl”) containing neither the full-length WPRE nor its truncated variants nor the poly A sequence, the combination of the full-length WPRE and the truncated WPRE with the poly A sequence increased the expression of the full-length MYO7A protein.

实施例4.腺相关病毒的制备Example 4. Preparation of adeno-associated virus

将实施例2构建的MYO7A蛋白的内含肽切分位点S3、S4、S8、S9和S11对应的10个质粒分别与pHelper质粒(南京金斯瑞公司合成)和pRC质粒(包含PHP.B VP1基因,部分CDS序列ID:KU056473.1)以1:1:1的摩尔比混合,使用PEI转染试剂共转染入HEK-293T细胞中(约每一百万个细胞中加入共1μg所述三种质粒),使用含有10%胎牛血清的DMEM培养基在5%二氧化碳培养箱中37℃培养细胞3天,用PBS缓冲液洗涤细胞一次,收集细胞,并反复冻融五次,加入NaCl,使NaCl终浓度为500mM,以10,000g离心细胞半小时,取上清液用0.45μm滤膜过滤,纯化滤液,并浓缩,得到病毒滴度为4.19E+13GC/ml的腺相关病毒,并根据质粒的名称命名腺相关病毒,例如,将使用pAAV-CMV-MYO7A-N-S3-Rm-N-内含肽、pHelper质粒和pRC质粒包装的腺病毒命名为pAAV-CMV-MYO7A-N-S3-Rm-N-内含肽腺相关病毒(AAV);将使用pAAV-CMV-Rm-C-内含肽-MYO7A-C-S3、pHelper质粒和pRC质粒包装的腺病毒命名为pAAV-CMV-Rm-C-内含肽-MYO7A-C-S3 AAV。The 10 plasmids corresponding to the intein cleavage sites S3, S4, S8, S9, and S11 of the MYO7A protein constructed in Example 2 were mixed with pHelper plasmid (synthesized by Nanjing GenScript) and pRC plasmid (containing the PHP.B VP1 gene, partial CDS sequence ID: KU056473.1) at a molar ratio of 1:1:1, and co-transfected into HEK-293T cells using PEI transfection reagent (a total of 1 μg of the three plasmids was added per approximately one million cells). The cells were cultured in DMEM medium containing 10% fetal bovine serum in a 5% carbon dioxide incubator at 37°C for 3 days, washed once with PBS buffer, collected, and repeatedly frozen and thawed five times. NaCl was added to a final concentration of 500 mM, and 10,0 The cells were centrifuged at 00g for half an hour, and the supernatant was filtered with a 0.45μm filter membrane. The filtrate was purified and concentrated to obtain an adeno-associated virus with a virus titer of 4.19E+13GC/ml. The adeno-associated virus was named according to the name of the plasmid. For example, the adenovirus packaged with pAAV-CMV-MYO7A-N-S3-Rm-N-intein, pHelper plasmid and pRC plasmid was named pAAV-CMV-MYO7A-N-S3-Rm-N-intein adeno-associated virus (AAV); the adenovirus packaged with pAAV-CMV-Rm-C-intein-MYO7A-C-S3, pHelper plasmid and pRC plasmid was named pAAV-CMV-Rm-C-intein-MYO7A-C-S3 AAV.

实施例5.候选的内含肽切分位点的MYO7A蛋白在小鼠中的表达Example 5. Expression of MYO7A protein at candidate intein cleavage sites in mice

为验证候选的内含肽切分位点能否在内耳毛细胞表达MYO7A蛋白,分别将实施例4构建的如下成对腺相关病毒通过圆窗给药的方式注射到3只P3(即,出生后3天龄)Myo7a(p.Q720X)点突变KO小鼠(该小鼠委托赛业生物科技有限公司通过CRISPR/Cas9方法构建,Myo7a(p.Q720X)点突变会导致内耳毛细胞中的Myo7A蛋白表达缺失,进而导致毛细胞死亡和听力损伤)耳进行体内表达验证。To verify whether the candidate intein cleavage site can express MYO7A protein in the inner ear hair cells, the following paired adeno-associated viruses constructed in Example 4 were respectively injected into the ears of three P3 (i.e., 3 days after birth) Myo7a (p.Q720X) point mutation KO mice (the mice were commissioned by Saiye Biotechnology Co., Ltd. to construct using the CRISPR/Cas9 method. The Myo7a (p.Q720X) point mutation will lead to the loss of Myo7A protein expression in the inner ear hair cells, thereby leading to hair cell death and hearing loss) through the round window administration for in vivo expression verification.

(i)pAAV-CMV-MYO7A-N-S3-Rm-N-内含肽AAV;和(i) pAAV-CMV-MYO7A-N-S3-Rm-N-intein AAV; and

pAAV-CMV-Rm-C-内含肽-MYO7A-C-S3 AAV;pAAV-CMV-Rm-C-intein-MYO7A-C-S3 AAV;

(ii)pAAV-CMV-MYO7A-N-S4-Rm-N-内含肽AAV;和(ii) pAAV-CMV-MYO7A-N-S4-Rm-N-intein AAV; and

pAAV-CMV-Rm-C-内含肽-MYO7A-C-S4 AAV;pAAV-CMV-Rm-C-intein-MYO7A-C-S4 AAV;

(iii)pAAV-CMV-MYO7A-N-S8-Rm-N-内含肽AAV;和(iii) pAAV-CMV-MYO7A-N-S8-Rm-N-intein AAV; and

pAAV-CMV-Rm-C-内含肽-MYO7A-C-S8 AAV;pAAV-CMV-Rm-C-intein-MYO7A-C-S8 AAV;

(iv)pAAV-CMV-MYO7A-N-S9-Rm-N-内含肽AAV;和(iv) pAAV-CMV-MYO7A-N-S9-Rm-N-intein AAV; and

pAAV-CMV-Rm-C-内含肽-MYO7A-C-S9 AAV;pAAV-CMV-Rm-C-intein-MYO7A-C-S9 AAV;

(v)pAAV-CMV-MYO7A-N-S11-Rm-N-内含肽AAV;和(v) pAAV-CMV-MYO7A-N-S11-Rm-N-intein AAV; and

pAAV-CMV-Rm-C-内含肽-MYO7A-C-S11 AAV。pAAV-CMV-Rm-C-intein-MYO7A-C-S11 AAV.

首先是病毒注射,在冰中低温麻醉小鼠1-2分钟,麻醉后进行耳后切口将圆窗暴露到视野中,用玻璃微量移液管分别注射任一(i)-(v)中的成对腺相关病毒共2×1010个病毒基因组(即,各AAV为1×1010个病毒基因组)。注射后用3M Vethod组织胶水粘合皮肤伤口。两周后处死鼠,取出耳蜗用4%多聚甲醛(PFA)固定,接着用EDTA脱钙液脱钙备用。最后进行免疫荧光染色,解剖耳蜗的同时揭去盖膜,然后用封闭液封闭组织1h,第一抗体4℃孵育过夜(Proteus BioSciences,25-6790,抗Myo7a,1:1000;sigma,sab4200545,抗parvalbumin,1:1000);PBS洗涤3次,第二抗体室温孵育1h(Invitrogen,Donkey anti-rabbit Alexa Fluor 488,1:500;Invitrogen,Goat anti-mouse lgG1 Alexa Fluor 647,1:500),孵育结束用PBS洗涤3次后封片;用蔡司激光共聚焦显微镜拍摄耳蜗染色切片。First, the virus was injected. Mice were anesthetized hypothermically on ice for 1-2 minutes. After anesthesia, a postauricular incision was made to expose the round window to the visual field. A glass micropipette was used to inject a paired adeno-associated virus from (i) to (v) totaling 2×10 10 viral genomes (i.e., 1×10 10 viral genomes for each AAV). After injection, the skin wound was sealed with 3M Vethod tissue glue. Two weeks later, the mice were sacrificed, and the cochlea was removed and fixed with 4% paraformaldehyde (PFA). Then, the cochlea was decalcified with EDTA solution until ready for use. Finally, immunofluorescence staining was performed. The cochlea was dissected and the tectorial membrane was removed. The tissue was then blocked with blocking solution for 1 hour and incubated with the primary antibody at 4°C overnight (Proteus BioSciences, 25-6790, anti-Myo7a, 1:1000; Sigma, sab4200545, anti-parvalbumin, 1:1000); washed three times with PBS, and incubated with the secondary antibody at room temperature for 1 hour (Invitrogen, Donkey anti-rabbit Alexa Fluor 488, 1:500; Invitrogen, Goat anti-mouse IgG1 Alexa Fluor 647, 1:500). After incubation, the sections were washed three times with PBS and mounted. The stained cochlear sections were photographed using a Zeiss laser confocal microscope.

图16的免疫荧光结果显示,双AAV系统实现了小鼠耳蜗内MYO7A蛋白的表达。图中“WT”为野生型小鼠,蓝色为parvalbumin标记的毛细胞,绿色为Myo7a蛋白。如图16所示,Myo7a(p.Q720X)小鼠没有注射成对AAV病毒时,不表达Myo7a蛋白,“Myo7a(p.Q720X)+AAV”为Myo7a(p.Q720X)小鼠注射了成对AAV病毒,注射病毒后Myo7a(p.Q720X)小鼠内耳毛细胞中表达了Myo7a蛋白,且表达量与WT小鼠相近。The immunofluorescence results in Figure 16 show that the dual AAV system achieves the expression of MYO7A protein in the mouse cochlea. In the figure, "WT" is a wild-type mouse, blue is parvalbumin-labeled hair cells, and green is Myo7a protein. As shown in Figure 16, Myo7a (p.Q720X) mice do not express Myo7a protein when they are not injected with paired AAV viruses. "Myo7a (p.Q720X) + AAV" is Myo7a (p.Q720X) mice injected with paired AAV viruses. After virus injection, Myo7a (p.Q720X) mice expressed Myo7a protein in the inner ear hair cells, and the expression level was similar to that of WT mice.

实施例6候选的内含肽切分位点的MYO7A蛋白在Myo7a基因敲除小鼠中治疗听觉功能的检测分析Example 6 Detection and Analysis of Candidate Intein Cleavage Sites in MYO7A Proteins for the Treatment of Hearing Function in Myo7a Knockout Mice

Myo7a基因敲除小鼠为Myo7a(p.Q720X)点突变KO小鼠,委托赛业生物科技有限公司构建。为验证该候选的内含肽切分位点能否通过表达MYO7A蛋白来有效保护听力,分别将实施例5的成对腺相关病毒通过圆窗给药的方式注射到P3小鼠内耳进行体内表达验证。首先是病毒注射,在冰中低温麻醉小鼠1-2分钟,麻醉后进行耳后切口将圆窗暴露到视野中,用玻璃微量移液管注射病毒。注射后用3M Vethod组织胶水粘合皮肤伤口。Myo7a knockout mice are Myo7a (p.Q720X) point mutation KO mice, which were commissioned to Saiye Biotechnology Co., Ltd. to construct. To verify whether the candidate intein cleavage site can effectively protect hearing by expressing MYO7A protein, the paired adeno-associated viruses of Example 5 were injected into the inner ear of P3 mice through the round window for in vivo expression verification. First, the virus was injected. The mice were anesthetized in ice for 1-2 minutes. After anesthesia, a post-auricular incision was made to expose the round window to the field of view, and the virus was injected with a glass micropipette. After the injection, the skin wound was sealed with 3M Vethod tissue glue.

3周后通过ABR(auditory brainstem response,听觉脑干反应)检测Myo7a敲除小鼠分别经我们的AAV病毒制剂(内含肽切分位点S3,S4,S8,S9,S11)治疗后小鼠整体的听觉功能差异,其中Myo7a敲除小鼠和同窝野生小鼠作为对照小鼠参与听觉功能分析。Three weeks later, the auditory brainstem response (ABR) was used to detect the overall auditory function differences of Myo7a knockout mice after treatment with our AAV virus preparation (intein cleavage sites S3, S4, S8, S9, S11). Myo7a knockout mice and wild-type mice of the same litter served as control mice for auditory function analysis.

通过ABR检测听觉反应阈值、潜伏期和波间期等参数,分别用短声(Click),尤其是不同音频的短音(4KHz、8KHz、12KHz、16KHz、24KHz、32KHz)作为刺激声,检测小鼠对不同音频的听力阈值,分析小鼠的听力敏感性,从整体上判断小鼠从毛细胞到大脑皮层有无正常的听觉功能。ABR阈值越高,表明Myo7a基因敲除小鼠的听觉功能损失越严重。反之,ABR阈值越低,基因治疗的效果越好。Using ABR testing, auditory response thresholds, latency, and inter-wave duration were measured. Clicks, particularly those at different frequencies (4 kHz, 8 kHz, 12 kHz, 16 kHz, 24 kHz, and 32 kHz), were used as stimuli to measure the mice's hearing thresholds. This analysis of the mice's hearing sensitivity allowed them to assess overall hearing function, from hair cells to the cerebral cortex. Higher ABR thresholds indicate more severe hearing loss in Myo7a knockout mice. Conversely, lower ABR thresholds indicate greater efficacy of gene therapy.

测听结果显示(图17),带有内含肽切分位点S3、内含肽切分位点S4、内含肽切分位点S8、内含肽切分位点S9、内含肽切分位点S11的腺相关病毒制剂注射后都能有效地保护听力,其中相较于对照小鼠ABR阈值,pAAV-CMV-MYO7A-N-S4-Rm-N-内含肽AAV和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S4 AAV施用后的听力保护效果为,ABR阈值约上升20-40dB;pAAV-CMV-MYO7A-N-S11-Rm-N-内含肽AAV和pAAV-CMV-Rm-C-内含肽-MYO7A-C-S11 AAV施用后的听力保护效果最好,相较于对照小鼠ABR阈值约上升5-20dB。The audiometry results showed (Figure 17) that adeno-associated virus preparations carrying intein cleavage site S3, intein cleavage site S4, intein cleavage site S8, intein cleavage site S9, and intein cleavage site S11 could effectively protect hearing after injection. Compared with the ABR threshold of control mice, the hearing protection effect after administration of pAAV-CMV-MYO7A-N-S4-Rm-N-intein AAV and pAAV-CMV-Rm-C-intein-MYO7A-C-S4 AAV was that the ABR threshold increased by about 20-40dB; the hearing protection effect after administration of pAAV-CMV-MYO7A-N-S11-Rm-N-intein AAV and pAAV-CMV-Rm-C-intein-MYO7A-C-S11 AAV was the best, and the ABR threshold increased by about 5-20dB compared with the control mice.

以上描述了本发明的示例性实施方案,本领域技术人员应当理解的是,这些公开内容仅是示例性的,在本发明的范围内可以进行各种其它替换、适应和修改。因此,本发明不限于文中列举的具体实施方案。While the exemplary embodiments of the present invention have been described above, it should be understood by those skilled in the art that these disclosures are merely exemplary and that various other substitutions, adaptations, and modifications may be made within the scope of the present invention. Therefore, the present invention is not limited to the specific embodiments listed herein.

示例性序列Exemplary sequences

编码人肌球蛋白Ⅶa多肽的核苷酸序列SEQ ID NO:1


Nucleotide sequence encoding human myosin VIIa polypeptide SEQ ID NO: 1


编码人肌球蛋白Ⅶa多肽的氨基酸序列SEQ ID NO:2

Amino acid sequence encoding human myosin VIIa polypeptide SEQ ID NO: 2

编码人肌球蛋白Ⅶa多肽的第1-1061位氨基酸序列的核苷酸序列SEQ ID NO:3

Nucleotide sequence encoding amino acid sequence 1-1061 of human myosin VIIa polypeptide SEQ ID NO: 3

人肌球蛋白Ⅶa多肽的第1-1061位氨基酸序列SEQ ID NO:4
Amino acid sequence of human myosin VIIa polypeptide SEQ ID NO: 4

编码人肌球蛋白Ⅶa多肽的第1062-2215位氨基酸序列的核苷酸序列SEQ ID NO:5

Nucleotide sequence encoding amino acid sequence 1062-2215 of human myosin VIIa polypeptide SEQ ID NO: 5

人肌球蛋白Ⅶa多肽的第1062-2215位氨基酸序列SEQ ID NO:6
Amino acid sequence of human myosin VIIa polypeptide at positions 1062-2215 SEQ ID NO: 6

编码人肌球蛋白Ⅶa多肽的第1-1064位氨基酸序列的核苷酸序列SEQ ID NO:7
Nucleotide sequence encoding amino acid sequence 1-1064 of human myosin VIIa polypeptide SEQ ID NO: 7

人肌球蛋白ⅦA多肽的第1-1064位氨基酸序列SEQ ID NO:8
Amino acid sequence of human myosin VIIA polypeptide SEQ ID NO: 8

编码人肌球蛋白Ⅶa多肽的第1065-2215位氨基酸序列的核苷酸序列SEQ ID NO:9

Nucleotide sequence encoding amino acid sequence 1065-2215 of human myosin VIIa polypeptide SEQ ID NO: 9

人肌球蛋白ⅦA多肽的第1065-2215位氨基酸序列SEQ ID NO:10
Amino acid sequence of human myosin VIIA polypeptide at positions 1065-2215 SEQ ID NO: 10

编码人肌球蛋白Ⅶa多肽的第1-1104位氨基酸序列的核苷酸序列SEQ ID NO:11

Nucleotide sequence encoding amino acid sequence 1-1104 of human myosin VIIa polypeptide SEQ ID NO: 11

人肌球蛋白ⅦA多肽的第1-1104位氨基酸序列SEQ ID NO:12

Amino acid sequence of human myosin VIIA polypeptide SEQ ID NO: 12

编码人肌球蛋白Ⅶa多肽的第1105-2215位氨基酸序列的核苷酸序列SEQ ID NO:13

Nucleotide sequence encoding amino acid sequence 1105-2215 of human myosin VIIa polypeptide SEQ ID NO: 13

人肌球蛋白ⅦA多肽的第1105-2215位氨基酸序列SEQ ID NO:14
Amino acid sequence of human myosin VIIA polypeptide at positions 1105-2215 SEQ ID NO: 14

编码人肌球蛋白Ⅶa多肽的第1-1105位氨基酸序列的核苷酸序列SEQ ID NO:15

Nucleotide sequence encoding amino acid sequence 1-1105 of human myosin VIIa polypeptide SEQ ID NO: 15

人肌球蛋白ⅦA多肽的第1-1105位氨基酸序列SEQ ID NO:16
Amino acid sequence of human myosin VIIA polypeptide SEQ ID NO: 16

编码人肌球蛋白Ⅶa多肽的第1106-2215位氨基酸序列的核苷酸序列SEQ ID NO:17

Nucleotide sequence encoding amino acid sequence 1106-2215 of human myosin VIIa polypeptide SEQ ID NO: 17

人肌球蛋白ⅦA多肽的第1106-2215位氨基酸序列SEQ ID NO:18
Amino acid sequence of human myosin VIIA polypeptide at positions 1106-2215 SEQ ID NO: 18

编码人肌球蛋白Ⅶa多肽的第1-1119位氨基酸序列的核苷酸序列SEQ ID NO:19
Nucleotide sequence encoding amino acids 1-1119 of human myosin VIIa polypeptide SEQ ID NO: 19

人肌球蛋白ⅦA多肽的第1-1119位氨基酸序列SEQ ID NO:20
Amino acid sequence of human myosin VIIA polypeptide SEQ ID NO: 20

编码人肌球蛋白Ⅶa多肽的第1120-2215位氨基酸序列的核苷酸序列SEQ ID NO:21

Nucleotide sequence encoding amino acid sequence 1120-2215 of human myosin VIIa polypeptide SEQ ID NO: 21

人肌球蛋白ⅦA多肽的第1120-2215位氨基酸序列SEQ ID NO:22
Amino acid sequence of human myosin VIIA polypeptide SEQ ID NO: 22

RmaDnaB内含肽N端部分的氨基酸序列SEQ ID NO:23
The amino acid sequence of the N-terminal portion of the RmaDnaB intein is SEQ ID NO: 23

RmaDnaB内含肽C端部分的氨基酸序列SEQ ID NO:24
The amino acid sequence of the C-terminal portion of the RmaDnaB intein is SEQ ID NO: 24

HA标签的氨基酸序列SEQ ID NO:25
Amino acid sequence of HA tag SEQ ID NO: 25

图3所示的载体pAAV-CMV-EGFP-WPRE-SV40质粒的核苷酸序列如SEQ ID NO:26所示。

The nucleotide sequence of the vector pAAV-CMV-EGFP-WPRE-SV40 plasmid shown in Figure 3 is shown in SEQ ID NO:26.

WPRE+SV40 poly(A)signal(717bp)的核苷酸序列:SEQ ID NO:27
Nucleotide sequence of WPRE+SV40 poly(A) signal (717 bp): SEQ ID NO: 27

WPRE(589bp)的核苷酸序列:SEQ ID NO:28
Nucleotide sequence of WPRE (589 bp): SEQ ID NO: 28

SV40 poly(A)信号(122bp)的核苷酸序列:SEQ ID NO:29
Nucleotide sequence of SV40 poly(A) signal (122 bp): SEQ ID NO: 29

WPRE3-SV40 late poly(A)(432bp)的核苷酸序列:SEQ ID NO:30
Nucleotide sequence of WPRE3-SV40 late poly(A) (432 bp): SEQ ID NO: 30

WPRE3的核苷酸序列:SEQ ID NO:31
Nucleotide sequence of WPRE3: SEQ ID NO: 31

SV40 late poly(A)信号的核苷酸序列:SEQ ID NO:32
Nucleotide sequence of SV40 late poly(A) signal: SEQ ID NO: 32

polyA中的序列Sequences in polyA

AATAAA(SEQ ID NO:33)、ATTAAA(SEQ ID NO:34)、AGTAAA(SEQ ID NO:35)、CATAAA(SEQ ID NO:36)、TATAAA(SEQ ID NO:37)、GATAAA(SEQ ID NO:38)、ACTAAA(SEQ ID NO:39)、AATATA(SEQ ID NO:40)、AAGAAA(SEQ ID NO:41)、AATAAT(SEQ ID NO:42)、AAAAAA(SEQ ID NO:43),AATGAA(SEQ ID NO:44)、AATCAA(SEQ ID NO:45)、AACAAA(SEQ ID NO:46)、AATCAA(SEQ ID NO:47)、AATAAC(SEQ ID NO:48)、AATAGA(SEQ ID NO:49)、AATTAA(SEQ ID NO:50)或AATAAG(SEQ ID NO:51)AATAAA (SEQ ID NO: 33), ATTAAA (SEQ ID NO: 34), AGTAAA (SEQ ID NO: 35), CATAAA (SEQ ID NO: 36), TATAAA (SEQ ID N O: 37), GATAAA (SEQ ID NO: 38), ACTAAA (SEQ ID NO: 39), AATATA (SEQ ID NO: 40), AAGAAA (SEQ ID NO: 41), AATAAT (SEQ ID NO: 42), AAAAAA (SEQ ID NO: 43), AATGAA (SEQ ID NO: 44), AATCAA (SEQ ID NO: 45), AACAAA (SEQ ID NO: 46), AATC AA(SEQ ID NO:47), AATAAC(SEQ ID NO:48), AATAGA(SEQ ID NO:49), AATTAA(SEQ ID NO:50) or AATAAG(SEQ ID NO:51)

NpuDnaE内含肽N端部分的氨基酸序列SEQ ID NO:52
The amino acid sequence of the N-terminal portion of the NpuDnaE intein is SEQ ID NO: 52

NpuDnaE内含肽N端部分的核苷酸序列SEQ ID NO:53
The nucleotide sequence of the N-terminal portion of the NpuDnaE intein is SEQ ID NO: 53

NpuDnaE内含肽C端部分的氨基酸序列SEQ ID NO:54
The amino acid sequence of the C-terminal portion of the NpuDnaE intein is SEQ ID NO: 54

NpuDnaE内含肽C端部分的核苷酸序列SEQ ID NO:55
The nucleotide sequence of the C-terminal portion of the NpuDnaE intein is SEQ ID NO: 55

Claims (18)

用于表达MYO7A蛋白的双载体系统,其包含第一核酸载体和第二核酸载体,其中A dual vector system for expressing MYO7A protein, comprising a first nucleic acid vector and a second nucleic acid vector, wherein 第一核酸载体包含第一核苷酸序列;且第二核酸载体包含第二核苷酸序列;The first nucleic acid vector comprises a first nucleotide sequence; and the second nucleic acid vector comprises a second nucleotide sequence; 所述第一核苷酸序列包含插入在两个第一ITR序列之间的表达盒;The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences; 所述第二核苷酸序列包含插入在两个第二ITR序列之间的表达盒;The second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences; 所述第一核苷酸序列的表达盒包含启动子、MYO7A的N端编码序列、内含肽的N端编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A, an N-terminal coding sequence of an intein, and polyA; 所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的C端编码序列和polyA;和The expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A and polyA; and 在MYO7A氨基酸序列中设置有MYO7A切分位点,例如,所述MYO7A氨基酸序列如SEQ ID NO:2所示或其功能片段,例如,与SEQ ID NO:2具有至少80%序列同一性的氨基酸序列;A MYO7A cleavage site is provided in the MYO7A amino acid sequence, for example, the MYO7A amino acid sequence as set forth in SEQ ID NO: 2 or a functional fragment thereof, for example, an amino acid sequence having at least 80% sequence identity to SEQ ID NO: 2; MYO7A的N端编码序列是从MYO7A氨基酸序列的N端至MYO7A切分位点的核苷酸编码序列;MYO7A的C端编码序列是从MYO7A切分位点后一位的氨基酸至MYO7A氨基酸序列的C端的核苷酸编码序列。The N-terminal coding sequence of MYO7A is a nucleotide coding sequence from the N-terminus of the MYO7A amino acid sequence to the MYO7A cleavage site; the C-terminal coding sequence of MYO7A is a nucleotide coding sequence from the amino acid after the MYO7A cleavage site to the C-terminus of the MYO7A amino acid sequence. 根据权利要求1所述的用于表达MYO7A蛋白的双载体系统,其中,所述MYO7A切分位点位于MYO7A氨基酸序列中丝氨酸、苏氨酸或半胱氨酸的前一位氨基酸。The dual-vector system for expressing MYO7A protein according to claim 1, wherein the MYO7A cleavage site is located at the amino acid preceding serine, threonine or cysteine in the MYO7A amino acid sequence. 根据权利要求1所述的用于表达MYO7A蛋白的双载体系统,其中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒的启动子选自CAG启动子、CMV启动子、CBA启动子、UbC启动子、SFFV启动子、EF1α启动子、PGK启动子、或Myo7A、Myo15、Atoh1、POU4F3、Lhx3、Myo6、α9AchR、α10AchR、OTOF和STRC编码基因的启动子;The dual-vector system for expressing MYO7A protein according to claim 1, wherein the promoter of the expression cassette of the first nucleotide sequence or the second nucleotide sequence is selected from CAG promoter, CMV promoter, CBA promoter, UbC promoter, SFFV promoter, EF1α promoter, PGK promoter, or promoters of genes encoding Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, α9AchR, α10AchR, OTOF and STRC; 所述第一核苷酸序列或第二核苷酸序列的表达盒的polyA包含AATAAA(SEQ ID NO:33)和AATAAA的变体;所述AATAAA的变体包含ATTAAA(SEQ ID NO:34)、AGTAAA(SEQ ID NO:35)、CATAAA(SEQ ID NO:36)、TATAAA(SEQ ID NO:37)、GATAAA(SEQ ID NO:38)、ACTAAA(SEQ ID NO:39)、AATATA(SEQ ID NO:40)、AAGAAA(SEQ ID NO:41)、AATAAT(SEQ ID NO:42)、AAAAAA(SEQ ID NO:43),AATGAA(SEQ ID NO:44)、AATCAA(SEQ ID NO:45)、AACAAA(SEQ ID NO:46)、AATCAA(SEQ ID NO:47)、AATAAC(SEQ ID NO:48)、AATAGA(SEQ ID NO:49)、AATTAA(SEQ ID NO:50)或AATAAG(SEQ ID NO:51);例如,所述polyA是与SEQ ID NO:29或SEQ ID NO:32所示的多聚A信号序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;和The polyA of the expression cassette of the first nucleotide sequence or the second nucleotide sequence comprises AATAAA (SEQ ID NO: 33) and variants of AATAAA; the variants of AATAAA comprise ATTAAA (SEQ ID NO: 34), AGTAAA (SEQ ID NO: 35), CATAAA (SEQ ID NO: 36), TATAAA (SEQ ID NO: 37), GATAAA (SEQ ID NO: 38), ACTAAA (SEQ ID NO: 39), AATATA (SEQ ID NO: 40), AAGAAA (SEQ ID NO: 41), AATAAT (SEQ ID NO: 42), AAAAAA (SEQ ID NO: 43), and AAAAAA (SEQ ID NO: 44). NO: 43), AATGAA (SEQ ID NO: 44), AATCAA (SEQ ID NO: 45), AACAAA (SEQ ID NO: 46), AATCAA (SEQ ID NO: 47), AATAAC (SEQ ID NO: 48), AATAGA (SEQ ID NO: 49), AATTAA (SEQ ID NO: 50) or AATAAG (SEQ ID NO: 51); for example, the polyA is a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity to the poly A signal sequence set forth in SEQ ID NO: 29 or SEQ ID NO: 32; and 所述两个第一ITR序列和所述两个第二ITR序列中的每个ITR序列源自AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8或AAV9。Each of the two first ITR sequences and the two second ITR sequences is derived from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 or AAV9. 根据权利要求1所述的用于表达MYO7A蛋白的双载体系统,其中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒还包含表达调控元件和/或标签元件,例如,所述表达调控元件为土拨鼠肝炎转录后调控元件(WPRE)或其变体,优选地,WPRE截短变体,例如,与SEQ ID NO:28所示的核苷酸序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列,例如,SEQ ID NO:31所示的核苷酸序列;例如,所述标签元件为HA。The dual-vector system for expressing MYO7A protein according to claim 1, wherein the expression cassette of the first nucleotide sequence or the second nucleotide sequence further comprises an expression regulatory element and/or a tag element, for example, the expression regulatory element is a woodchuck hepatitis posttranscriptional regulatory element (WPRE) or a variant thereof, preferably a WPRE truncated variant, for example, a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence shown in SEQ ID NO: 28, for example, the nucleotide sequence shown in SEQ ID NO: 31; for example, the tag element is HA. 根据权利要求1所述的用于表达MYO7A蛋白的双载体系统,其中,内含肽源自MxeGyrA、pabPolIII、MjaKlbA、SspDnaB、SceVMA、SspDnaE、NpuDnaE、AvaDnaE、CraDnaE、CspDnaE、CwaDnaE、MchtDnaE、OliDnaE、TerDnaE、gp41-1、gp41-8、IMPDH-1或RmaDnaB,例如,内含肽源自RmaDnaB,例如,内含肽的N端为如SEQ ID NO:23所示的RmaDnaB内含肽N端,内含肽的C端为如SEQ ID NO:24所示的RmaDnaB内含肽C端;或者,内含肽源自NpuDnaE,例如,内含肽的N端为如SEQ ID NO:52所示的NpuDnaE内含肽N端,内含肽的C端为如SEQ ID NO:54所示的NpuDnaE内含肽C端。The dual vector system for expressing MYO7A protein according to claim 1, wherein the intein is derived from MxeGyrA, pabPolIII, MjaKlbA, SspDnaB, SceVMA, SspDnaE, NpuDnaE, AvaDnaE, CraDnaE, CspDnaE, CwaDnaE, MchtDnaE, OliDnaE, TerDnaE, gp41-1, gp41-8, IMPDH-1 or RmaDnaB, for example For example, the intein is derived from RmaDnaB, for example, the N-terminus of the intein is the N-terminus of the RmaDnaB intein as shown in SEQ ID NO:23, and the C-terminus of the intein is the C-terminus of the RmaDnaB intein as shown in SEQ ID NO:24; or, the intein is derived from NpuDnaE, for example, the N-terminus of the intein is the N-terminus of the NpuDnaE intein as shown in SEQ ID NO:52, and the C-terminus of the intein is the C-terminus of the NpuDnaE intein as shown in SEQ ID NO:54. 根据权利要求1所述的用于表达MYO7A蛋白的双载体系统,其中,将所述第一核苷酸序列插入在包含两个第一ITR序列的质粒中,将所述第二核苷酸序列插入在包含两个第二ITR序列的质粒中,例如,所述包含两个第一ITR序列的质粒和所述包含两个第二ITR序列的质粒相同或不同,例如,所述质粒是pAAV、pAAV-CMV、pX601、pX551或pAAV-MCS质粒。The dual-vector system for expressing MYO7A protein according to claim 1, wherein the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences, and the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences, for example, the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid. 根据权利要求1-6中任一项所述的用于表达MYO7A蛋白的双载体系统,其中,MYO7A切分位点如表1所示;优选地,使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1043位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The dual-vector system for expressing the MYO7A protein according to any one of claims 1 to 6, wherein the MYO7A cleavage site is as shown in Table 1; preferably, amino acid 1043 of the MYO7A amino acid sequence shown in SEQ ID NO: 2 is used as the MYO7A cleavage site, and the RmaDnaB intein is used; the N-terminal coding sequence of MYO7A is connected and fused to the N-terminal coding sequence of the RmaDnaB intein, to construct a first nucleotide sequence, and the pAAV-CMV plasmid is used as a vector; the C-terminal coding sequence of the RmaDnaB intein is connected and fused to the C-terminal coding sequence of MYO7A, to construct a second nucleotide sequence, and the pAAV-CMV plasmid is used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1058位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1058th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1061位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1061st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1064位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1064th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1071位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1071st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1076位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1076th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1081位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1081st amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1104位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1104th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1105位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1105th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1114位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1114th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1119位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1119th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1122位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;或者Using amino acid 1122 of the MYO7A amino acid sequence shown in SEQ ID NO: 2 as the MYO7A cleavage site and using the RmaDnaB intein; connecting and fusing the N-terminal coding sequence of MYO7A with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, using the pAAV-CMV plasmid as a vector; connecting and fusing the C-terminal coding sequence of the RmaDnaB intein with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, using the pAAV-CMV plasmid as a vector; or 使用SEQ ID NO:2所示的MYO7A氨基酸序列的第1126位氨基酸为MYO7A切分位点,并使用RmaDnaB内含肽;将MYO7A的N端编码序列与RmaDnaB内含肽的N端编码序列进行连接和融合后,构建第一核苷酸序列,以pAAV-CMV质粒为载体;将RmaDnaB内含肽的C端编码序列与MYO7A的C端编码序列进行连接和融合后,构建第二核苷酸序列,以pAAV-CMV质粒为载体;The 1126th amino acid of the MYO7A amino acid sequence shown in SEQ ID NO: 2 was used as the MYO7A cleavage site, and the RmaDnaB intein was used; the N-terminal coding sequence of MYO7A was connected and fused with the N-terminal coding sequence of the RmaDnaB intein to construct a first nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; the C-terminal coding sequence of the RmaDnaB intein was connected and fused with the C-terminal coding sequence of MYO7A to construct a second nucleotide sequence, and the pAAV-CMV plasmid was used as a vector; 例如,RmaDnaB内含肽的N端编码序列编码SEQ ID NO:23所示的RmaDnaB N端部分,RmaDnaB内含肽的C端编码序列编码SEQ ID NO:24所示的RmaDnaB C端部分。For example, the N-terminal coding sequence of the RmaDnaB intein encodes the N-terminal portion of RmaDnaB shown in SEQ ID NO:23, and the C-terminal coding sequence of the RmaDnaB intein encodes the C-terminal portion of RmaDnaB shown in SEQ ID NO:24. 根据权利要求1-7中任一项所述的用于表达MYO7A蛋白的双载体系统,其中,The dual vector system for expressing MYO7A protein according to any one of claims 1 to 7, wherein 所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:4所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:6所示C端的编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 4, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 6, and polyA; 所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:8所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:10所示C端的编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 8, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 10, and polyA; 所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:12所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:14所示C端的编码序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 12, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, a C-terminal coding sequence of an intein, a C-terminal coding sequence of MYO7A as shown in SEQ ID No: 14, and polyA; 所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:16所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:18所示C端的编码序列和polyA;或The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 16, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, an C-terminal coding sequence of MYO7A as shown in SEQ ID No: 18, and polyA; or 所述第一核苷酸序列的表达盒包含启动子、MYO7A的SEQ ID No:20所示N端的编码序列、内含肽的N端编码序列和polyA;所述第二核苷酸序列的表达盒包含启动子、内含肽的C端编码序列、MYO7A的SEQ ID No:22所示C端的编码序列和polyA。The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of MYO7A as shown in SEQ ID No: 20, an N-terminal coding sequence of an intein, and polyA; the expression cassette of the second nucleotide sequence comprises a promoter, an C-terminal coding sequence of an intein, an C-terminal coding sequence of MYO7A as shown in SEQ ID No: 22, and polyA. 根据权利要求1-8中任一项所述的用于表达MYO7A蛋白的双载体系统,其中所述第一核苷酸序列的表达盒和所述第二核苷酸序列的表达盒各自在3’ITR序列的N端包含WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:27所示的核苷酸序列或与SEQ ID NO:27具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;或者包含WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:30所示的核苷酸序列或与SEQ ID NO:30具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。A dual vector system for expressing MYO7A protein according to any one of claims 1 to 8, wherein the expression cassette of the first nucleotide sequence and the expression cassette of the second nucleotide sequence each comprise a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence at the N-terminus of the 3’ITR sequence, for example, a nucleotide sequence shown in SEQ ID NO: 27 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 27; or a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence, for example, a nucleotide sequence shown in SEQ ID NO: 30 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 30. 腺相关病毒的包装载体系统,其中,所述包装载体系统包含根据权利要求1-9中任一项所述的用于表达MYO7A蛋白的双载体系统、携带AAV rep和cap基因的载体和辅助病毒载体,经包装成为AAV载体,优选地,其中MYO7A蛋白的氨基酸序列如SEQ ID NO:2所示。An adeno-associated virus packaging vector system, wherein the packaging vector system comprises a dual-vector system for expressing MYO7A protein according to any one of claims 1-9, a vector carrying AAV rep and cap genes, and a helper virus vector, which is packaged into an AAV vector. Preferably, the amino acid sequence of the MYO7A protein is as shown in SEQ ID NO: 2. 根据权利要求10所述的腺相关病毒的包装载体系统,其中,所述携带AAV rep和cap基因的载体选自AAV1、AAV2、AAV5、AAV8、AAV9、Anc80、PHP.eB、AAV-DJ和AAVrh.10载体;所述辅助病毒载体为pHelper质粒。The adeno-associated virus packaging vector system according to claim 10, wherein the vector carrying the AAV rep and cap genes is selected from AAV1, AAV2, AAV5, AAV8, AAV9, Anc80, PHP.eB, AAV-DJ and AAVrh.10 vectors; and the helper virus vector is a pHelper plasmid. 一种腺相关病毒的包装方法,其中,将根据权利要求10或11所述的腺相关病毒的包装载体系统转入宿主细胞中进行包装。A method for packaging an adeno-associated virus, wherein the adeno-associated virus packaging vector system according to claim 10 or 11 is transferred into a host cell for packaging. 根据权利要求12所述的腺相关病毒的包装方法,其中,所述宿主细胞选自Hela-S3细胞、HEK-293细胞、HEK-293T细胞、HEK-293FT细胞、A549细胞和Sf9细胞。The method for packaging adeno-associated virus according to claim 12, wherein the host cell is selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells and Sf9 cells. 一种腺相关病毒,其是通过根据权利要求12或13所述的包装方法获得的。An adeno-associated virus obtained by the packaging method according to claim 12 or 13. 根据权利要求1-9中任一项所述的用于表达MYO7A蛋白的双载体系统或根据权利要求14所述的腺相关病毒的用途,用于制备治疗耳聋性疾病或听力损伤或听力功能障碍的药物或制剂。Use of the dual vector system for expressing MYO7A protein according to any one of claims 1 to 9 or the adeno-associated virus according to claim 14 for preparing a medicament or preparation for treating deafness, hearing loss or hearing dysfunction. 一种用于治疗耳聋疾病或听力损伤或听力功能障碍的药物或制剂,其由根据权利要求1-9中任一项所述的用于表达MYO7A蛋白的双载体系统或根据权利要求14所述的腺相关病毒制备,其中,所述腺相关病毒是将腺相关病毒的包装载体系统转入宿主细胞进行包装得到的,所述腺相关病毒的包装载体系统包括表达MYO7A蛋白的双载体系统、携带AAVrep和cap基因的载体,以及辅助病毒载体。A medicine or preparation for treating deafness, hearing loss or hearing dysfunction, which is prepared by the dual-vector system for expressing MYO7A protein according to any one of claims 1 to 9 or the adeno-associated virus according to claim 14, wherein the adeno-associated virus is obtained by transferring the adeno-associated virus packaging vector system into a host cell for packaging, and the adeno-associated virus packaging vector system includes a dual-vector system for expressing MYO7A protein, a vector carrying AAV rep and cap genes, and a helper virus vector. 根据权利要求16所述的药物或制剂,其中,所述药物或制剂还包含中性盐缓冲剂、酸性盐缓冲剂、碱性盐缓冲剂、葡萄糖、甘露糖、甘露糖醇、蛋白质、多肽、氨基酸、抗生素、螯合剂、佐剂、防腐剂、纳米颗粒、脂质体和阳性脂质颗粒。The drug or preparation according to claim 16, wherein the drug or preparation further comprises a neutral salt buffer, an acidic salt buffer, an alkaline salt buffer, glucose, mannose, mannitol, proteins, polypeptides, amino acids, antibiotics, chelating agents, adjuvants, preservatives, nanoparticles, liposomes and positive lipid particles. 根据权利要求16或17所述的药物或制剂,其中,通过耳蜗的圆窗、卵圆窗、半规管、总管进行注射给药;以及终生单次给药或多次给药,总剂量为1×109-1×1013病毒基因组。The drug or preparation according to claim 16 or 17, wherein the drug is administered by injection through the round window, oval window, semicircular canal, or common canal of the cochlea; and the drug is administered single or multiple times throughout life, with a total dose of 1×10 9 -1×10 13 viral genomes.
PCT/CN2025/074724 2024-02-01 2025-01-24 Dual-vector system for expressing myo7a protein, and use thereof Pending WO2025162267A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202410142670.0 2024-02-01
CN202410142670.0A CN120400250A (en) 2024-02-01 2024-02-01 Dual vector system for expressing MYO7A protein and its use

Publications (1)

Publication Number Publication Date
WO2025162267A1 true WO2025162267A1 (en) 2025-08-07

Family

ID=96510687

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2025/074724 Pending WO2025162267A1 (en) 2024-02-01 2025-01-24 Dual-vector system for expressing myo7a protein, and use thereof

Country Status (2)

Country Link
CN (1) CN120400250A (en)
WO (1) WO2025162267A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140256802A1 (en) * 2011-11-16 2014-09-11 University Of Florida Research Foundation, Inc. Dual-AAV Vector-Based Systems and Methods for Delivering Oversized Genes to Mammalian Cells
WO2021209574A1 (en) * 2020-04-15 2021-10-21 Fondazione Telethon Constructs comprising inteins
CN115074369A (en) * 2022-06-24 2022-09-20 成都金唯科生物科技有限公司 Adeno-associated virus vector for expressing full-length ABCA4 gene and application thereof
CN115666658A (en) * 2020-04-01 2023-01-31 佛罗里达州大学研究基金会 Dual AAV-MYO7A vectors with improved safety for treatment of USH1B
US20230242942A1 (en) * 2022-05-17 2023-08-03 Eye & Ent Hospital Of Fudan University Dual-vector system for treating hearing impairment and use thereof
CN117106824A (en) * 2022-05-17 2023-11-24 复旦大学附属眼耳鼻喉科医院 A dual-carrier system for treating hearing loss and its application

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140256802A1 (en) * 2011-11-16 2014-09-11 University Of Florida Research Foundation, Inc. Dual-AAV Vector-Based Systems and Methods for Delivering Oversized Genes to Mammalian Cells
CN115666658A (en) * 2020-04-01 2023-01-31 佛罗里达州大学研究基金会 Dual AAV-MYO7A vectors with improved safety for treatment of USH1B
WO2021209574A1 (en) * 2020-04-15 2021-10-21 Fondazione Telethon Constructs comprising inteins
US20230242942A1 (en) * 2022-05-17 2023-08-03 Eye & Ent Hospital Of Fudan University Dual-vector system for treating hearing impairment and use thereof
CN117106824A (en) * 2022-05-17 2023-11-24 复旦大学附属眼耳鼻喉科医院 A dual-carrier system for treating hearing loss and its application
CN115074369A (en) * 2022-06-24 2022-09-20 成都金唯科生物科技有限公司 Adeno-associated virus vector for expressing full-length ABCA4 gene and application thereof

Also Published As

Publication number Publication date
CN120400250A (en) 2025-08-01

Similar Documents

Publication Publication Date Title
JP6453307B2 (en) Effective delivery of large genes by dual AAV vectors
JP7303816B2 (en) AAV vector
JP7595013B2 (en) AAV-mediated gene therapy to restore the otoferin gene
US12467066B2 (en) Compositions and methods for treating retinal disorders
WO2025016099A1 (en) Composition and method for expression of otof gene by dual vector system
US11827680B2 (en) Compounds and methods of treating usher syndrome III
CN118475699A (en) AAV particles comprising hepadnavin and alpha-galactosidase and their use for treating brix disease
JP2022524140A (en) Improved treatment for rare eye diseases with gene substitutions
WO2025162267A1 (en) Dual-vector system for expressing myo7a protein, and use thereof
EP4071247A1 (en) Trans-splicing ribozyme that is specific to rhodopsin transcriptome, and use thereof
CN117377771A (en) carrier system
WO2025201333A1 (en) Dual-vector system for delivering otof gene and use thereof
WO2025157179A1 (en) Dual-vector system for expressing strc protein and use thereof
CN111088264A (en) Adeno-associated virus vector carrying C3 gene expression cassette and its application
CN120322449A (en) ATP7B gene therapy
CN120641570A (en) Two-vector system for delivering Otof gene and its use
WO2024239226A1 (en) Cochlear hair cell specific promoter and use thereof
KR20240052746A (en) KCNV2 gene therapy
CN115997012A (en) Treatment of RPE65-Related Eye Diseases and Disorders
HK40030486A (en) Compositions and methods for treating retinal disorders

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 25747835

Country of ref document: EP

Kind code of ref document: A1