[go: up one dir, main page]

WO2025201333A1 - Dual-vector system for delivering otof gene and use thereof - Google Patents

Dual-vector system for delivering otof gene and use thereof

Info

Publication number
WO2025201333A1
WO2025201333A1 PCT/CN2025/084737 CN2025084737W WO2025201333A1 WO 2025201333 A1 WO2025201333 A1 WO 2025201333A1 CN 2025084737 W CN2025084737 W CN 2025084737W WO 2025201333 A1 WO2025201333 A1 WO 2025201333A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
otof
seq
nucleotide sequence
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2025/084737
Other languages
French (fr)
Chinese (zh)
Inventor
柴人杰
齐洁玉
谈方志
余朝荣
张李燕
周吟毅
张善中
谭畅
陆玲
丁泽阳
崔志平
宋怀恩
孙思睫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Otovia Therapeutics
Original Assignee
Otovia Therapeutics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Otovia Therapeutics filed Critical Otovia Therapeutics
Priority to CN202580000485.0A priority Critical patent/CN120641570A/en
Publication of WO2025201333A1 publication Critical patent/WO2025201333A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • A61K38/16Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • A61K38/17Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P27/00Drugs for disorders of the senses
    • A61P27/16Otologicals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/85Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
    • C12N15/86Viral vectors
    • C12N15/864Parvoviral vectors, e.g. parvovirus, densovirus

Definitions

  • Deafness is the most common disabling disease in clinical practice, severely impacting normal human life and placing a significant burden on society.
  • WHO World Health Organization
  • nearly 1.5 billion people worldwide have varying degrees of hearing loss, and 466 million, or 5% of the total population, suffer from disabling hearing loss, including 34 million children. It is estimated that by 2050, 2.5 billion people worldwide will suffer from varying degrees of hearing loss, with 700 million suffering from disabling hearing loss.
  • the Otof gene has five transcripts, which are expressed in both the inner ear and brain. In the brain, there are two OTOF transcripts that terminate in exon 47 or 48; in the cochlea, there is only one transcript that terminates in exon 48.
  • the OTOF protein, encoded by the Otof gene is concentrated in the basolateral region of cochlear inner hair cells and is a crucial component of the presynaptic membrane structure of inner hair cells. OTOF triggers membrane fusion at the ribbon synapses of inner hair cells and plays a crucial role in the exocytosis of synaptic vesicles, which is associated with calcium ions.
  • cochlear implants are currently the only effective treatment option for hearing aid failure.
  • cochlear implants are effective for patients with OTOF gene mutations, with only 300,000 patients worldwide receiving cochlear implants, the audience for cochlear implants is only a small fraction of deaf patients.
  • Cochlear implants have low frequency sensitivity, and difficulties in speech discrimination and perception in noisy environments still cannot meet the normal life needs of patients.
  • special care must be taken during use.
  • the present invention provides a two-vector system for delivering large gene sequences (e.g., genes larger than 4 kB; such as OTOF genes), comprising a first nucleic acid vector and a second nucleic acid vector, wherein:
  • the first nucleic acid vector comprises a first nucleotide sequence
  • the second nucleic acid vector comprises a second nucleotide sequence
  • the first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences
  • the second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of OTOF, a splice donor (SD) sequence, a sequence for homologous recombination with the second nucleotide sequence, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, a sequence that undergoes homologous recombination with the first nucleotide sequence, a splice acceptor (SA) sequence, a C-terminal coding sequence of OTOF, and polyA; and
  • the N-terminal coding sequence of OTOF is the nucleotide coding sequence from the N-terminus of the OTOF amino acid sequence to the OTOF cleavage site;
  • the C-terminal coding sequence of OTOF is the nucleotide coding sequence from the amino acid after the OTOF cleavage site to the C-terminus of the OTOF amino acid sequence.
  • the OTOF cleavage site is located at the 3' terminal amino acid residue of any exon 17-29 in the OTOF amino acid sequence.
  • the polyA of the expression cassette of the first nucleotide sequence or the second nucleotide sequence comprises AATAAA (SEQ ID NO: 68) and variants of AATAAA; the variants of AATAAA comprise ATTAAA (SEQ ID NO: 69), AGTAAA (SEQ ID NO: 70), CATAAA (SEQ ID NO: 71), TATAAA (SEQ ID NO: 72), GATAAA (SEQ ID NO: 73), ACTAAA (SEQ ID NO: 74), AATATA (SEQ ID NO: 75), AAGAAA (SEQ ID NO: 76), AATAAT (SEQ ID NO: 77), AAAAAA (SEQ ID NO: 78), AATGAA (SEQ ID NO: 79), NO: 79), AATCAA (SEQ ID NO: 80), AACAAA (SEQ ID NO: 81), AATCAA (SEQ ID NO: 82), AATAAC
  • the expression cassette of the first nucleotide sequence or the second nucleotide sequence further comprises an expression regulatory element and/or a tag element
  • the expression regulatory element is a woodchuck hepatitis posttranscriptional regulatory element (WPRE) or a variant thereof, preferably a WPRE truncated variant, for example, a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence shown in SEQ ID NO: 61, for example, the nucleotide sequence shown in SEQ ID NO: 64; for example, the tag element is HA.
  • WPRE woodchuck hepatitis posttranscriptional regulatory element
  • the sequence that undergoes homologous recombination with the first nucleotide sequence and the sequence that undergoes homologous recombination with the second nucleotide sequence are intron sequences, for example, intron sequences that do not naturally exist in the genomic sequence of the gene encoding the OTOF protein.
  • the OTOF N-terminus and OTOF C-terminus divided by the OTOF cleavage site are as shown in Table 1A.
  • the OTOF cleavage site is located between the amino acid residue encoded by exon 20 and the amino acid residue encoded by exon 21;
  • the OTOF cleavage site is located between the amino acid residues encoded by exon 21 and the amino acid residues encoded by exon 22;
  • the OTOF N-terminal coding sequence is SEQ ID NO: 11 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 37, and the OTOF C-terminal coding sequence is SEQ ID NO: 24 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 50;
  • the OTOF N-terminal coding sequence is SEQ ID NO: 13 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 39, and the OTOF C-terminal coding sequence is SEQ ID NO: 26 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 52;
  • the OTOF N-terminal coding sequence is SEQ ID NO: 14 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 40, and the OTOF C-terminal coding sequence is SEQ ID NO: 27 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 53; or
  • the OTOF N-terminal coding sequence is SEQ ID NO: 20 or a sequence having at least 90% sequence identity therewith, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 46, and the OTOF C-terminal coding sequence is SEQ ID NO: 33 or a sequence having at least 90% sequence identity therewith, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 59.
  • the expression cassette of the first nucleotide sequence and the expression cassette of the second nucleotide sequence each contain a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence at the N-terminus of the 3’ITR sequence, for example, a nucleotide sequence shown in SEQ ID NO: 60 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 60; or contain a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence, for example, a nucleotide sequence shown in SEQ ID NO: 63 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 63.
  • the present invention provides an adeno-associated virus packaging vector system, which comprises the dual-vector system for expressing the OTOF protein described in the first aspect of the present invention, a vector carrying the AAV rep and cap genes, and a helper virus vector, which are packaged into an AAV vector.
  • the amino acid sequence of the OTOF protein is as shown in SEQ ID NO: 1.
  • the present invention provides a method for packaging an adeno-associated virus, wherein the adeno-associated virus packaging vector system described in the second aspect of the present invention is transferred into a host cell for packaging.
  • the host cell is selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells, and Sf9 cells.
  • the present invention provides a dual adeno-associated virus vector, which is obtained by the packaging method according to the second aspect of the present invention.
  • the present invention provides the use of the dual vector system for expressing OTOF protein described in the first aspect of the present invention or the dual adeno-associated virus vector described in the fourth aspect of the present invention for preparing a drug or preparation for treating deafness diseases or hearing loss or hearing dysfunction.
  • the present invention provides a medicine or preparation for treating deafness, hearing loss or hearing dysfunction, which is prepared by the dual-vector system for expressing OTOF protein described in the first aspect of the present invention or the adeno-associated virus described in the fourth aspect of the present invention, wherein the adeno-associated virus is obtained by transferring the packaging vector system of the adeno-associated virus into a host cell for packaging, and the packaging vector system of the adeno-associated virus includes a dual-vector system for expressing OTOF protein, a vector carrying AAVrep and cap genes, and a helper virus vector.
  • the drug or formulation of the present invention further comprises a neutral salt buffer, an acidic salt buffer, an alkaline salt buffer, glucose, mannose, mannitol, proteins, polypeptides, amino acids, antibiotics, chelating agents, adjuvants, preservatives, nanoparticles, liposomes and positive lipid particles.
  • the drug or preparation of the present invention is administered by injection through the round window, oval window, semicircular canal, posterior semicircular canal, or common canal of the cochlea; and is administered once or multiple times throughout life, with a total dose of 1 ⁇ 10 9 -1 ⁇ 10 13 viral genomes.
  • Figure 1 shows a schematic diagram of dual-vector-mediated OTOF protein expression using an inverted terminal repeat (ITR), a CMV promoter, a splice donor SD signal sequence, a splice acceptor SA signal sequence, and an HD sequence for homologous recombination splicing, which relies on a trans-splicing mechanism for homologous recombination.
  • ITR inverted terminal repeat
  • CMV CMV promoter
  • a splice donor SD signal sequence a splice acceptor SA signal sequence
  • HD sequence for homologous recombination splicing which relies on a trans-splicing mechanism for homologous recombination.
  • HD represents the homeodomain, for example, whose sequence contains a short AK recombination-initiating sequence from f1 phage
  • 5'cDNA-otof and 3'cDNA-erlin represent the nucleotide sequence encoding the OTOF N-terminus (also referred to as "5'Otof NT”) and the nucleotide sequence encoding the OTOF C-terminus (also referred to as "3'Otof CT”), respectively, after the full-length OTOF protein is cleaved into the OTOF N-terminus and OTOF C-terminus.
  • Figure 2 illustrates the vector backbone elements in the dual vector system, for example, its sequence can be shown as SEQ ID NO:7, wherein the 5’ITR sequence is located at 1bp-141bp, the CMV promoter sequence is located at 169bp-752bp, the Kozak sequence is located at 792bp-797bp, the EGFP sequence is located at 801bp-1517bp, the WPRE sequence is located at 1536bp-2124bp, the SV40 PolyA sequence is located at 2131bp-2252bp, the 3’ITR sequence is located at 2290bp-2430bp, the f1 Ori sequence is located at 2505bp-2960bp, the kana resistance sequence is located at 3242bp-4156bp, and the Ori sequence is located at 4327bp-4515bp.
  • SEQ ID NO:7 wherein the 5’ITR sequence is located at 1bp-141bp, the CMV promoter sequence is located at 169bp-752b
  • Figure 3 illustrates the plasmid map of the first nucleic acid vector in the dual-vector system constructed after the full-length OTOF protein is split into the OTOF N-terminus and the OTOF C-terminus between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24, which contains the coding nucleotide sequence of 5’Otof NT, SD sequence and AK sequence.
  • Figure 4 illustrates the plasmid map of the second nucleic acid vector in the dual-vector system constructed after the full-length OTOF protein was split into the OTOF N-terminus and the OTOF C-terminus between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24, which contains the AK sequence, SA sequence and the coding nucleotide sequence of 3’Otof CT, wherein an HA tag is connected to the C-terminus of the OTOF C-terminal protein for verifying the in vitro expression of the OTOF protein.
  • Figure 5 illustrates the plasmid map of the second nucleic acid vector in the dual-vector system constructed after the full-length OTOF protein was split into the OTOF N-terminus and the OTOF C-terminus between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24, containing the AK sequence, SA sequence and the coding nucleotide sequence of 3’Otof CT, wherein the HA tag is not connected to the C-terminus of the OTOF C-terminal protein for in vivo injection therapy experiments in mice.
  • Figure 6 shows the Western blot results of HEK-293T cells co-transfected with a plasmid vector encoding the N-terminal portion of the OTOF protein (Otof-NT) and a plasmid vector encoding the C-terminal portion of the OTOF protein (Otof-CT), resulting in the expression of full-length Otof and full-length OTOF proteins at different cleavage sites.
  • the leftmost lane represents a molecular marker ladder
  • Lane NC control (293T cells not transfected with plasmid vectors)
  • Lane 1 co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein is cleaved between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 (also referred to as "17N+18C" in the text);
  • Lane 2 co-transfection of the amino acid residues encoded by exon 18.
  • Lane 3 co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residue encoded by exon 19 and the amino acid residue encoded by exon 20 (also referred to as "19N+20C” in the text);
  • Lane 4 co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residue encoded by exon 20 and the amino acid residue encoded by exon 21 (also referred to as "20N+21C” in the text);
  • Lane 5 co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residue encoded by exon 21 and the amino acid residue encoded by exon 22.
  • Lane 6 Otof-NT plasmid vector and Otof-CT plasmid vector (also referred to as "21N+22C” in the text); Lane 7: Otof-NT plasmid vector and Otof-CT plasmid vector (also referred to as "22N+23C” in the text) after co-transfection of the OTOF full-length protein cleaved between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23; Lane 8: Otof-NT plasmid vector and Otof-CT plasmid vector (also referred to as "22N+23C” in the text) after co-transfection of the OTOF full-length protein cleaved between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24.
  • Lane 8 co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 (also referred to as "24N+25C” in the text); Lane 9: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 (also referred to as "25N+26C” in the text); Lane 10: co-transfection of the Otof-NT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27.
  • Lane 11 Co-transfection of the plasmid vectors of Otof-NT and Otof-CT (also referred to as "26N+27C” in the text); Lane 12: Co-transfection of the plasmid vectors of Otof-NT and Otof-CT (also referred to as "27N+28C” in the text) after the full-length OTOF protein was cleaved between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29.
  • Lane 13 Co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 (also referred to as "29N+30C" in the text).
  • Figure 7 is the auditory brainstem response (ABR) results of the Otof knockout mouse (Otof KO mouse) model that received treatment in Example 3, showing that in the Otof knockout mouse model that received treatment, the ABR threshold was lowered and some frequency bands of hearing were significantly restored.
  • ABR auditory brainstem response
  • Figure 8 shows the ABR threshold recovery observed by audiometry 5 days after injection of a dual AAV virus preparation constructed with the cleavage site exon23/24 (shown as “23N+24C” in Figure 8 ) and the cleavage site exon20/21 (shown as "20N+21C” in Figure 8 ) into the posterior semicircular canal of C57BL/6 mice with a point mutation in the Otof gene at 27 days of age.
  • the term "about" when used in conjunction with a numerical value is intended to encompass numerical values within a range having a lower limit that is 5% less than the specified numerical value and an upper limit that is 5% greater than the specified numerical value.
  • the term is also intended to encompass values within ⁇ 1%, ⁇ 0.5%, or ⁇ 0.1% of the specified number.
  • the term “comprising” or “including” means including the recited elements, integers, steps, or groups of elements, integers, or steps, but does not exclude any other elements, integers, or steps, or other groups of elements, integers, or steps.
  • the term “comprising” or “including” also encompasses the situation consisting of the recited elements, integers, or steps. For example, when referring to a polynucleotide "comprising" a particular sequence, it is intended to encompass a polynucleotide consisting of that particular sequence.
  • hearing loss refers to hearing below the normal hearing threshold level determined by audiometry, including mild, moderate, severe and profound hearing loss, and deafness.
  • Hearing loss can be described by a percentage of hearing loss, for example, 30%, 60%, 80% or even 100% hearing loss, or by a grading description of hearing loss.
  • the hearing loss can be hearing loss caused by or associated with a gene defect, such as congenital deafness and pre-lingual deafness caused by genetic factors, or hearing loss associated with genetic factors, induced by environmental factors (for example, aging, noise, drugs or infection-induced).
  • Hearing loss can be asymptomatic (that is, there is no associated visible outer ear or other organ abnormality) or symptomatic.
  • hearing loss is sensorineural hearing loss.
  • a “hearing loss-associated gene” refers to a gene whose variation can cause hearing loss or susceptibility to hearing loss by altering the ability of the inner ear to function normally. In this article, such genes are also referred to as “hearing loss genes.” More than 100 genes have been identified as being associated with hearing loss (see Hereditary Hearing Loss Homepage, https://hereditaryhearingloss.org/, which lists the gene locations and identification data for currently known single-gene asymptomatic hearing loss). In the case of causing susceptibility to hearing loss, individuals carrying the hearing loss gene variation may show a greater susceptibility to hearing loss due to environmental factors, such as aging, noise, drugs, or infections, relative to healthy individuals.
  • the "Otof gene” encodes otoferlin (also known as "OTOF protein”).
  • otoferlin also known as "OTOF protein”
  • Other members of the same family of proteins include dysferlin and myoferlin, which are homologous to the sperm formation factor (FER-1) of nematodes.
  • FER-1 sperm formation factor
  • the OTOF protein may be related to vesicle membrane fusion.
  • the OTOF protein is expressed in trace amounts in the spiral ganglion cells and inner and outer hair cells in the cochlea of late mouse embryos and newly born mice. In adult mice, it is only expressed in the inner hair cells, mainly concentrated in the base of the inner hair cells.
  • the base of the inner capillary is distributed with many ribbon-like synapses that can continuously release large amounts of neurotransmitters.
  • OTOF protein is a calcium ion sensor for exocytosis of inner hair cells and an important component of the afferent nerve synapse.
  • the C2 domain of the OTOF protein binds Ca2 + in a Ca2 + -dependent manner; it can bind to phospholipids and proteins, playing an important role in cell membrane trafficking and signal transduction. It can also bind to cytosolic phospholipase A2, synaptotagmin I, and Ras-related GTP-binding protein, participating in neurotransmitter release. Therefore, it is speculated that OTOF protein is involved in calcium-dependent synaptic vesicle fusion and neurotransmitter release.
  • OTOF -/- mice have no obvious structural abnormalities in their ribbon synapses and can secrete Ca2 + , but they exhibit complete hearing loss and abnormal synaptic vesicle release, suggesting that OTOF protein is a key mediator of synaptic vesicle release. Because neurotransmitters nourish postsynapses, a lack of nutrition can lead to postsynaptic degeneration, potentially explaining the progressive hearing loss seen in humans.
  • the Otof gene is expressed in the cochlea, vestibule, and brain tissues. It has five transcripts, two of which are expressed in the brain.
  • the encoded proteins are divided into long and short forms, terminating at exon 47 and exon 48, respectively.
  • the short form of OTOF protein has three C2 domains and one carboxyl transmembrane domain, and this form only exists in humans; the long form of OTOF protein has six C2 domains and one transmembrane domain, and is expressed in humans and mice, but only the transcript terminating at exon 48 exists in the human cochlea.
  • the transcript variant 5 sequence (NM_001287489.1) of the human Otof gene is modified for gene therapy in the cochlea.
  • the nucleotide sequence of transcript variant 5 of the Otof gene is shown in SEQ ID NO: 2
  • the amino acid sequence of the encoded OTOF isoform 5 is shown in SEQ ID NO: 1.
  • cochlear inner hair cells refer to isolated or in vitro cochlear inner hair cells, cell lines, or cell populations derived from a mammal, or inner hair cells in the cochlea of a mammal.
  • cochlear outer hair cells refer to cochlear outer hair cells, cell lines, or cell populations isolated from or in vitro of a mammal, or outer hair cells in the cochlea of a mammal.
  • an "isolated" nucleic acid refers to a nucleic acid molecule that has been artificially synthesized or separated from at least some components of its natural environment.
  • an isolated nucleic acid can be part of a larger nucleic acid, or part of a vector or composition of matter, or can be contained within a cell and still be “isolated” provided that the larger nucleic acid, vector, composition of matter, or specific cell is not the natural environment of the nucleic acid.
  • operably linked also referred to as “effectively linked” or “functionally linked,” means that two or more polynucleotide (e.g., DNA) segments are in a relationship that allows them to function in the intended manner.
  • a promoter sequence is operably linked to a coding sequence if it stimulates or regulates the transcription of the coding sequence in a suitable host cell or other expression system.
  • promoters that are operably linked to a transcribable sequence are contiguous with the transcribable sequence, i.e., they are cis-acting.
  • some transcriptional regulatory sequences e.g., enhancers
  • AAV adeno-associated virus
  • AAV is a dependent virus that requires other viruses such as adenovirus, herpes simplex virus, human papillomavirus, or auxiliary factors to provide auxiliary functional proteins for replication.
  • the first AAV virus isolated was serotype 2 (AAV2).
  • AAV2 serotype 2
  • the AAV2 genome is approximately 4.7 kb long, flanked by 145-bp inverted terminal repeats (ITRs) at each end, forming a palindromic hairpin structure.
  • the genome contains two large open reading frames (ORFs), encoding the rep and cap genes, respectively.
  • ITRs are cis-acting elements of the AAV vector genome, playing a crucial role in AAV integration, rescue, replication, and genome packaging.
  • the ITR sequence contains the Rep binding site (RBS) and the terminal resolution site (TRs), which are recognized by the Rep protein and produce a cleavage at the TRs.
  • the ITR sequence also forms a unique "T"-shaped secondary structure, which plays a crucial role in the AAV life cycle.
  • the rest of the AAV2 genome can be divided into two functional regions, the rep gene region and the cap gene region.
  • the rep gene region encodes four Rep proteins: Rep78, Rep68, Rep52, and Rep40.
  • Rep proteins play an important role in the replication, integration, rescue, and packaging of AAV viruses.
  • Rep78 and Rep68 specifically bind to the terminal melting sites trs and GAGY repeat motifs in the ITR, initiating the replication of the AAV genome from single-stranded to double-stranded.
  • the trs and GAGC repeat motifs and/or GAGY repeat motifs in the ITR are the center of AAV genome replication. Therefore, although the ITR sequences are different in various serotypes of AAV viruses, they can all form a hairpin structure and contain Rep binding sites.
  • Rep52 and Rep40 have ATP-dependent DNA helicase activity but do not have the function of binding to DNA.
  • the cap gene encodes the AAV capsid proteins VP1, VP2, and VP3.
  • VP3 has the smallest molecular weight but is the most abundant. In mature AAV particles, the ratio of VP1, VP2, and VP3 is approximately 1:1:10.
  • VP1 is essential for the formation of infectious AAV; VP2 facilitates VP3 entry into the cell nucleus; and VP3 is the primary protein in AAV particles.
  • AAV vector refers to an efficient exogenous gene transfer tool, i.e., an AAV vector, that has been transformed from wild-type AAV virus as people gain a better understanding of the AAV virus life cycle and its related molecular biological mechanisms.
  • the modified AAV vector genome only contains the ITR sequence of the AAV virus and the exogenous sequence to be transferred.
  • the Rep and Cap proteins required for AAV virus packaging are provided in trans by other exogenous plasmids, thereby reducing the possible harm caused by packaging the rep and cap genes into the AAV vector.
  • the AAV virus itself is not pathogenic, which makes the AAV vector recognized as one of the safest viral vectors.
  • AAV virus serotypes There are many AAV virus serotypes, and different serotypes have different tissue infection tropisms. Therefore, the use of AAV vectors can transport exogenous genes to specific organs and tissues.
  • the existing technology has a relatively mature packaging system for AAV vectors, which facilitates the large-scale production of AAV vectors.
  • vector genome refers to the nucleic acid sequence that is packaged within the rAAV capsid to form the rAAV vector.
  • mammals include, but are not limited to, humans, non-human primates (e.g., cynomolgus monkeys, rhesus monkeys), rodents, and other mammals, such as cattle, pigs, horses, and dogs.
  • mammals include individuals at all stages of development, including embryonic and fetal stages.
  • treatment refers to clinical intervention intended to alter the natural course of a disease in the individual being treated. Desired therapeutic effects include, but are not limited to, preventing the onset or recurrence of the disease, alleviating symptoms, reducing any direct or indirect pathological consequences of the disease, preventing metastasis, reducing the rate of disease progression, ameliorating or palliating the disease state, and alleviating or improving prognosis.
  • treatment also encompasses modification or improvement of at least one physical parameter, including physical parameters that may not be discernible by the patient.
  • prevention refers to preventing or delaying the onset or development or progression of a disease or condition.
  • prevention generally refers to hospital intervention performed before at least one symptom of a disease occurs.
  • the present invention provides a dual vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:
  • the first nucleic acid vector comprises a first nucleotide sequence
  • the second nucleic acid vector comprises a second nucleotide sequence
  • the first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences
  • the second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences
  • the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of OTOF, a splice donor (SD) sequence, a sequence for homologous recombination with the second nucleotide sequence, and polyA;
  • the expression cassette of the second nucleotide sequence comprises a promoter, a sequence that undergoes homologous recombination with the first nucleotide sequence, a splice acceptor (SA) sequence, a C-terminal coding sequence of OTOF, and polyA; and
  • an OTOF cleavage site is provided in the OTOF amino acid sequence, for example, the OTOF amino acid sequence is as shown in SEQ ID NO: 1 or a functional fragment thereof, for example, an amino acid sequence having at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO: 1;
  • the N-terminal coding sequence of OTOF is the nucleotide coding sequence from the N-terminus of the OTOF amino acid sequence to the OTOF cleavage site;
  • the C-terminal coding sequence of OTOF is the nucleotide coding sequence from the amino acid after the OTOF cleavage site to the C-terminus of the OTOF amino acid sequence.
  • the sequence in which the first nucleotide sequence and the second nucleotide sequence undergo homologous recombination can be an intron sequence, for example, an intron sequence that does not naturally occur in the genomic sequence of the gene encoding the OTOF protein.
  • the intron is a synthetic alkaline phosphatase (AP) intron.
  • the intron sequence used in the dual vector system of the present invention may comprise a splice donor sequence and a splice acceptor sequence.
  • the dual vector system may comprise one or more AP intron spliceosomal recognition sites, such as one or more AP splice acceptor (APSA) domains or AP splice donor (APSD) domains.
  • these vectors comprise both an APSA and an APSD.
  • the first vector comprises an APSA and the second vector comprises an APSD.
  • the first vector comprises an APSD and the second vector comprises an APSA.
  • the intron sequence is a recombination-inducing AK sequence of F1 phage, such as the sequence shown in SEQ ID NO: 6, thereby mediated by homologous recombination to reconstruct the expression cassette of the full-length OTOF protein.
  • Splice acceptor sequence and splice donor sequence can be selected by those skilled in the art from known splice acceptor sequence and splice donor sequence.
  • Spliceosomal introns are usually located in the sequence of protein-coding genes of eukaryotic cells.In introns, splice donor site (located at the 5 ' end of the intron) and splice acceptor site (located at the 3 ' end of the intron) are required for splicing.
  • the splice donor site is located at the 5 ' end of the intron in a larger, less highly conserved region and comprises an almost constant sequence GU.
  • the splice acceptor site at the 3 ' end of the intron terminates the intron with an almost constant AG sequence.Upstream of AG (5 ' direction), there is a pyrimidine-rich (C and U) region or a polypyrimidine sequence.
  • the splice donor (SD) sequence in the dual vector system basically consists of the sequence shown in SEQ ID NO: 4; the splice acceptor (SA) sequence basically consists of the sequence shown in SEQ ID NO: 5.
  • the OTOF protein comprises or consists of the amino acid sequence of SEQ ID NO:1.
  • a cleavage site is set in the amino acid sequence of the OTOF protein to divide the OTOF protein into the N-terminal portion of the OTOF protein (also referred to herein as the "N-terminus of OTOF") and the C-terminal portion of the OTOF protein (also referred to herein as the "C-terminus of OTOF”).
  • the N-terminus of OTOF is the sequence from the N-terminus of the OTOF amino acid sequence to the cleavage site
  • the C-terminus of OTOF is the sequence from the amino acid residue immediately adjacent to the cleavage site to the C-terminus of the OTOF amino acid sequence.
  • the cleavage site of OTOF is located at the 3' terminal amino acid residue of any exon 1-47 in the OTOF amino acid sequence.
  • the OTOF N-terminal coding sequence is SEQ ID NO: 11 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 37, and the OTOF C-terminal coding sequence is SEQ ID NO: 24 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 50;
  • the OTOF N-terminal coding sequence is SEQ ID NO: 12 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 38, and the OTOF C-terminal coding sequence is SEQ ID NO: 25 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 51;
  • the OTOF N-terminal coding sequence is SEQ ID NO: 13 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 39, and the OTOF C-terminal coding sequence is SEQ ID NO: 26 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 52;
  • the OTOF N-terminal coding sequence is SEQ ID NO: 14 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 40, and the OTOF C-terminal coding sequence is SEQ ID NO: 27 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 53; or
  • the OTOF N-terminal coding sequence is SEQ ID NO: 20 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 46, and the OTOF C-terminal coding sequence is SEQ ID NO: 33 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 59.
  • the vector plasmid of the present invention can be any plasmid that can replicate in a host cell and express a corresponding polypeptide.
  • the vector plasmid comprises two ITR sequences, namely a 5' inverted terminal repeat (5'ITR) sequence and a 3' inverted terminal repeat (3'ITR) sequence.
  • 5'ITR 5' inverted terminal repeat
  • 3'ITR 3' inverted terminal repeat
  • the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences
  • the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences
  • the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.
  • the OTOF protein N-terminal coding sequence and the OTOF protein C-terminal coding sequence are operably linked after transcription, and the OTOF protein is produced through translation.
  • tissue-specific promoter is used in a two-vector system, for example, a promoter that mediates expression in the ear, such as the synapsin promoter or the GFAP promoter.
  • any one of the following promoters is used in the binary vector system: cytomegalovirus (CMV) promoter, SV40 promoter, Rous sarcoma virus (RSV) promoter, CAG promoter, chimeric CMV/chicken beta actin (CBA) promoter, truncated CBA (smCBA) promoter, UbC promoter, SFFV promoter, EF1 ⁇ promoter, PGK promoter, or promoters of Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, ⁇ 9AchR, ⁇ 10AchR, OTOF and STRC encoding genes.
  • the promoter is a CMV promoter.
  • the regulatory sequence may be any sequence that promotes transgene expression, i.e., serves to increase transcript expression, improve nuclear export of mRNA, or enhance its stability.
  • Such regulatory sequences include, for example, enhancer elements, post-transcriptional regulatory elements, and polyadenylation sequences.
  • the preferred post-transcriptional regulatory element used in the dual vector system of the present invention is the woodchuck hepatitis post-transcriptional regulatory element (WPRE) or a variant thereof.
  • WPRE woodchuck hepatitis post-transcriptional regulatory element
  • an AAV vector containing WPRE or a variant thereof increases the expression of OTOF protein.
  • the dual vector system of the present invention comprises a WPRE nucleotide sequence as set forth in SEQ ID NO:61.
  • the dual vector system of the present invention comprises a post-transcriptional regulatory element having a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to the WPRE nucleotide sequence as set forth in SEQ ID NO:61, wherein the nucleotide sequence substantially retains the functional activity of the post-transcriptional regulatory element as set forth in SEQ ID NO:61, for example, a truncated variant of WPRE.
  • the truncated variant of WPRE has the WPRE3 nucleotide sequence as set forth in SEQ ID NO:64.
  • the dual vector system of the present invention comprises a polyadenylation sequence, for example, a bovine growth hormone polyadenylation sequence, an SV40 polyadenylation sequence, and/or an SV40 late polyadenylation sequence.
  • the dual vector system of the present invention comprises an SV40 polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 62.
  • the dual vector system of the present invention comprises an SV40 late polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 65.
  • the dual vector system of the present invention comprises a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence.
  • the dual vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 60, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 60.
  • the combination of the WPRE nucleotide sequence and the SV40 polyadenylation sequence allows for high-level expression of the transgene.
  • the dual vector system of the present invention allows for the expression of homologous polypeptides having an amino acid sequence that is at least 70% identical and/or similar to SEQ ID NO: 1. More preferably, the homologous sequence has at least 75%, even more preferably at least 80%, or at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 99%, or at least 99% identity and/or similarity to SEQ ID NO: 1. When the homologous polypeptide is much shorter than SEQ ID NO: 1, local alignment may be considered.
  • the dual vector system of the present invention can allow for the expression of functional fragments of an OTOF protein polypeptide.
  • functional fragment herein refers to any fragment that retains at least one biological function of an OTOF protein polypeptide of interest.
  • the full-length OTOF protein can be obtained by transforming host cells using the dual vector system of the present invention.
  • the host cells are selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells and Sf9 cells.
  • the dual-vector system of the present invention is used to administer to patients suffering from DFNB9-induced deafness.
  • Patient suffering from DFNB9-induced deafness refers to a patient, particularly a human patient, who is believed to have (or has been diagnosed with) a mutation in a gene encoding an OTOF protein that triggers abnormal expression, abnormal function, or both of the OTOF protein.
  • the dual vector system of the present invention is a dual AAV vector system.
  • the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having a capsid of the same or different AAV origin, for example, the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV-Anc80 capsid or an AAV vector with a chimeric capsid, in particular an AAV vector with an AAV-Anc80 capsid.
  • the synthetic adeno-associated virus vector Anc80L65 is used, which has been shown to have the highest transduction efficiency of inner ear hair cells reported to date (Suzuki et al., Sci. Rep. 7: 45524 (2017)).
  • the dual AAV vector system after administration to a subject, treats OTOF mutation-associated autosomal recessive deafness 9 (DFNB9) disease by increasing the expression of wild-type OTOF protein or providing wild-type OTOF protein to the subject.
  • DFNB9 OTOF mutation-associated autosomal recessive deafness 9
  • the dual-vector system of the present invention can trigger the expression of the full-length OTOF protein polypeptide, or a functional fragment thereof, in inner ear cells, inner hair cells, or outer hair cells.
  • the patients to whom the dual vector system of the present invention is administered are preferably newborn human infants, usually less than 6 months old, or even less than 3 months old (if they were diagnosed with DFNB9 deafness in childhood). These human infants are more preferably between 3 months and 1 year old.
  • the two-vector system of the present invention can also be administered to, for example, infants (2-6 years), children (6-12 years), adolescents (12-18 years), or adults (18 years and older).
  • the term “treating” is intended to mean administering a therapeutically effective amount of the dual vector system of the present invention to a patient suffering from DFNB9 deafness to partially or completely restore the patient's hearing. Such restoration can be assessed by testing auditory brainstem responses (ABRs) using electrophysiological equipment. "Treatment of hearing loss induced by OTOF mutations” is specifically intended to refer to complete restoration of hearing function. The term “preventing” refers to reducing or delaying hearing loss within the auditory frequency range.
  • the first nucleic acid vector ( Figure 3) comprises in the 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a CMV promoter sequence, a cDNA sequence containing the N-terminal portion (5'Otof NT) encoding the OTOF protein, a splicing donor (SD) sequence, a splicing sequence HD (for example, an AK sequence, which is a recombination-initiating sequence from f1 phage) that undergoes homologous recombination with the second nucleotide sequence, and a 3' inverted terminal repeat (3'ITR) sequence;
  • the second nucleic acid vector ( Figure 3) comprises: a 5' inverted terminal repeat (5'ITR) sequence, a CMV promoter sequence, a cDNA sequence containing the N-terminal portion (5'Otof NT) encoding the OTOF protein, a splic
  • Both the first and second nucleic acid vectors also contain an Ori sequence and a resistance sequence.
  • Figure 1 shows a schematic diagram of dual-vector-mediated OTOF protein expression using an inverted terminal repeat (ITR), a CMV promoter, a splice donor SD signal sequence, a splice acceptor SA signal sequence, and an HD sequence for homologous recombination splicing, which relies on a trans-splicing mechanism for homologous recombination.
  • ITR inverted terminal repeat
  • the first nucleic acid vector and the second nucleic acid vector in the dual-vector system were modified based on the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone shown in FIG2 .
  • the EGFP reporter gene sequence in the pAAV-CMV-EGFP-WPRE-SV40 plasmid was replaced with a target sequence comprising a nucleotide sequence encoding the 5' Otof NT, SD sequence, and AK sequence using EcoRI and EcoRV double digestion, resulting in the first nucleic acid vector pAAV-CMV-Otof-N-SD-AK.
  • the EGFP reporter gene sequence in the pAAV-CMV-EGFP-WPRE-SV40 plasmid was replaced with a target sequence comprising an AK sequence, SA sequence, and a nucleotide sequence encoding the 3' Otof CT using EcoRI and EcoRV double digestion, resulting in the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40.
  • the target sequence synthesis and vector construction were commissioned to Nanjing GenScript Biotechnology Co., Ltd.
  • dual-vector AAV delivery for the restoration of deafness caused by OTOF mutation is an important method for the treatment of deafness caused by OTOF mutation.
  • the efficiency of delivering the Otof gene using dual-vector AAV is a difficult problem, in which sequence recombination in the dual vectors is an important step.
  • the OTOF protein was divided into 13 groups as shown in Table 1A, and 13 groups of 5’Otof NT (also referred to as “N-Otof” in Table 1B) and 3’Otof CT (also referred to as “C-Otof” in Table 1B) were obtained.
  • the coding sequence of the OTOF protein was also codon-optimized to obtain the sequence shown in SEQ ID NO: 3.
  • Table 1A OTOF N-terminus and OTOF C-terminus of the full-length OTOF protein (SEQ ID NO: 1) cleaved by the cleavage site
  • the codon-optimized N-Otof sequences of groups 1-13 are shown in SEQ ID NO: 34-46, respectively, which are inserted into the vector backbone elements shown in Figure 2 to obtain the first nucleic acid vector;
  • the codon-optimized C-Otof sequences of groups 1-13 are shown in SEQ ID NO: 47-59, respectively, which are inserted into the vector backbone elements shown in Figure 2 to obtain the second nucleic acid vector.
  • the obtained first nucleic acid vector pAAV-CMV-Otof-N-SD-AK and the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40 were co-transfected into HEK-293T cells (obtained from the Chinese Academy of Sciences Cell Bank, SCSP-502) at the same molar ratio.
  • the expression of full-length OTOF was analyzed by Western blotting 48 hours after transfection.
  • the specific experimental method is as follows.
  • HEK-293T cells human embryonic kidney 293T cells, hereinafter referred to as "293T cells" to 70-90% density and prepare for transfection.
  • Tube B 125 ⁇ L serum-free DMEM medium + 2 ⁇ g of the first nucleic acid vector pAAV-CMV-Otof-N-SD-AK + 2 ⁇ g of the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40 + 8 ⁇ L P3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly. Add the mixture in tube B to tube A, mix gently and thoroughly, and let it stand at room temperature for 10-15 minutes. Tube A contains a mixture of culture medium and Lipofectamine 3000 transfection reagent, while Tube B contains a mixture of culture medium, nucleic acid carrier DNA, and transfection enhancer P3000. The resulting DNA-liposome complex was added to 293T cells for transfection, and the cells were incubated at 37°C in 95% air and 5% CO2 . Cells were harvested by centrifugation 48 hours after transfection.
  • the cell pellet harvested by centrifugation was fully resuspended in an appropriate amount of RIPA lysis buffer (Thermo Fisher Scientific, catalog number: 89900) (supplemented with 1% protease inhibitor cocktail (Thermo Fisher Scientific, catalog number: 87786), 1% PMSF), and lysed on ice for 30 minutes. Vortex every 10 minutes during this period to fully resuspend the cell pellet in the lysis buffer.
  • Western blotting for OTOF protein expression Wash the glass plates and secure them flat on a rack, clamping them with the concave surface facing inward. Position the plates symmetrically, front and back. Prepare separating gel and seal with isopropanol. After 0.5 hours, discard the isopropanol and place on its side with a pump to dry. Prepare stacking gel, adding until overflowing, and insert a comb. After 45 minutes, remove the gel plate and attach it to the clamps in the electrophoresis tank. Add running buffer from the center of the tank until it overflows to 1/2 of the tank volume. Carefully remove the comb. Load the sample, perform electrophoresis at a constant voltage of 100V for 1 hour, and transfer the membrane at a constant current of 300mA on ice for 90 minutes.
  • the incubated PVDF membrane was removed and rinsed three times with 1X TBST for 5 minutes each time.
  • the membrane was incubated with the secondary antibody (HRP-conjugated Affinipure Goat Anti-Mouse IgG (H+L), Proteintech, catalog number: SA00001-1) at room temperature for 1 hour and rinsed three times with 1X TBST for 5 minutes each time.
  • the chemiluminescent reagent (ECL) was then added for development in a dark room.
  • FIG. 6 The results of Western blot analysis are shown in Figure 6.
  • lane NC control (293T cells not transfected with the dual-vector plasmid); lanes 1-13: correspond to cells transfected with the dual-vector plasmids from groups 1-13 in Table 1B, respectively.
  • the Western blot results in Figure 6 demonstrate that the dual-vector plasmid systems constructed using the various cleavage sites of the full-length OTOF sequence were able to express the full-length OTOF protein upon transfection.
  • the dual-vector plasmid systems from groups 4-7 and 13 significantly increased the expression of the full-length OTOF protein upon transfection and intracellular recombination.
  • paired adeno-associated viruses were prepared for infection of mice.
  • two plasmids (a first nucleic acid vector plasmid and a second nucleic acid vector plasmid) corresponding to the 4th group of cleavage sites of the full-length OTOF protein constructed in Example 1 and two plasmids (a first nucleic acid vector plasmid and a second nucleic acid vector plasmid) corresponding to the 7th group of cleavage sites were used to prepare four adeno-associated viruses (AAVs) in HEK-293T cells, respectively, and paired into two pairs of double AAV viruses.
  • AAVs adeno-associated viruses
  • Tube A 4880 ⁇ L serum-free culture medium DMEM + 120 ⁇ L PEI transfection reagent, mix thoroughly.
  • iodixanol is irritating to the eyes, skin, and respiratory tract; personal protective equipment is required and the procedure must be performed in a fume hood.
  • the supernatant obtained above was carefully dropped onto the surface of 15% iodixanol and ultracentrifuged at 301580 g for 1 h 40 min at 12°C using a fixed-angle titanium rotor. The maximum acceleration and deceleration were used during the centrifugal acceleration and deceleration process.
  • the purified AAV virus was located between the 40% and 60% iodixanol interfaces. Carefully insert and aspirate with a stainless steel blunt needle to obtain the N-terminal AAV virus and C-terminal AAV virus corresponding to each cleavage site.
  • AAV titer determination AAV titer was determined using qPCR. The titer was expressed as GC/mL, where GC represents the genomic particle concentration. The purified virus titer was verified to be above 10e+12 GC/mL.
  • qPCR primers were designed to target the ITR sequence. The primer sequences were: upstream primer fwd ITR primer, 5'-GGAACCCCTAGTGATGGAGTT-3' (SEQ ID NO: 66); downstream primer rev ITR primer, 5'-CGGCCTCAGTGAGCGA-3' (SEQ ID NO: 67).
  • Titer 1000000*power(10 x )
  • Example 3 Restoration of deafness in Otof gene knockout mice by dual-vector AAV delivery
  • the Otof(Q939*) point mutation represents a deletion of the amino acid residue Q at position 939 of the OTOF protein.
  • the auditory response threshold, latency, and inter-wave duration are detected through ABR.
  • Short sounds especially short sounds of different frequencies (4KHz, 8KHz, 12KHz, 16KHz, 24KHz, 32KHz), are used as stimuli to detect the hearing thresholds of mice to different frequencies, analyze the hearing sensitivity of mice, and judge whether the mice have normal auditory function from hair cells to cerebral cortex as a whole.
  • Figure 8 is the original waveform of the 8kHz ABR audiometry data.
  • the "SPL” in the vertical axis “dB SPL” is the abbreviation of "Sound Pressure Levels", which represents the intensity of the stimulating sound.
  • the minimum sound intensity produced by the induced waveform is the ABR threshold.
  • the ABR threshold of WT mice is 30dB
  • the ABR threshold of KO mice is greater than 90dB.
  • the threshold of KO mice injected with the dual AAV vector "20N+21C” is 55dB
  • the threshold of KO mice injected with the dual AAV vector "23N+24C” is 30dB.
  • the experimental results also show that the OTOF gene cut site exon23/24 and the OTOF gene cut site exon20/21 both significantly reduce the ABR threshold, and the OTOF gene cut site exon23/24 makes the ABR threshold lower.
  • Codon-optimized sequence encoding OTOF isoform 5
  • nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 19 and the amino acid residues encoded by exon 20 is shown in SEQ ID NO: 10
  • nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23 is shown in SEQ ID NO: 13
  • nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 14
  • nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 is shown in SEQ ID NO: 15
  • nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 is shown in SEQ ID NO: 16
  • nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27 is shown in SEQ ID NO: 17
  • nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 27 and the amino acid residues encoded by exon 28 is shown in SEQ ID NO: 18
  • nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 is shown in SEQ ID NO: 20
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 is shown in SEQ ID NO: 21
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 18 and the amino acid residues encoded by exon 19 is shown in SEQ ID NO: 22
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 20 and the amino acid residues encoded by exon 21 is shown in SEQ ID NO: 24
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23 is shown in SEQ ID NO: 26
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 27
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 is shown in SEQ ID NO: 28
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 is shown in SEQ ID NO: 29
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27 is shown in SEQ ID NO: 30
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 27 and the amino acid residues encoded by exon 28 is shown in SEQ ID NO: 31
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29 is shown in SEQ ID NO: 32
  • nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 is shown in SEQ ID NO: 33
  • nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 is shown in SEQ ID NO: 34
  • nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 18 and the amino acid residues encoded by exon 19 is shown in SEQ ID NO: 35
  • nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 19 and the amino acid residues encoded by exon 20 is shown in SEQ ID NO: 36
  • nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 20 and the amino acid residues encoded by exon 21 is shown in SEQ ID NO: 37
  • nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 40
  • Nucleotide sequence of SV40 poly(A) signal (122 bp): SEQ ID NO: 62
  • Nucleotide sequence of SV40 late poly(A) signal SEQ ID NO: 65
  • Upstream primer 5’-GGAACCCCTAGTGATGGAGTT-3’ (SEQ ID NO: 66)
  • Downstream primer 5’-CGGCCTCAGTGAGCGA-3’ (SEQ ID NO: 67)
  • AATAAA (SEQ ID NO: 68), ATTAAA (SEQ ID NO: 69), AGTAAA (SEQ ID NO: 70), CATAAA (SEQ ID NO: 71), TATAAA (SEQ ID N O: 72), GATAAA (SEQ ID NO: 73), ACTAAA (SEQ ID NO: 74), AATATA (SEQ ID NO: 75), AAGAAA (SEQ ID NO: 76), AATAAT (SEQ ID NO: 77), AAAAAA (SEQ ID NO: 78), AATGAA (SEQ ID NO: 79), AATCAA (SEQ ID NO: 80), AACAAA (SEQ ID NO: 81), AATC AA(SEQ ID NO:82), AATAAC(SEQ ID NO:83), AATAGA(SEQ ID NO:84), AATTAA(SEQ ID NO:85) or AATAAG(SEQ ID NO:86)

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Zoology (AREA)
  • Public Health (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Animal Behavior & Ethology (AREA)
  • Veterinary Medicine (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biophysics (AREA)
  • Epidemiology (AREA)
  • Biochemistry (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Immunology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Virology (AREA)
  • Physics & Mathematics (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Plant Pathology (AREA)
  • Microbiology (AREA)
  • Toxicology (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

The present invention relates to a dual-vector system for expressing an OTOF protein. The dual-vector system comprises a first nucleic acid vector and a second nucleic acid vector, wherein the first nucleic acid vector comprises a first nucleotide sequence, and the second nucleic acid vector comprises a second nucleotide sequence; the first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences, and the second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences; and the expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of OTOF, a splice donor sequence, a sequence undergoing homologous recombination with the second nucleotide sequence, and a polyA, and the expression cassette of the second nucleotide sequence comprises a promoter, a sequence undergoing homologous recombination with the first nucleotide sequence, a splice receptor sequence, a C-terminal coding sequence of OTOF, and a polyA. The present invention further relates to a packaging vector system for an adeno-associated virus, a method for packaging an adeno-associated virus, and an adeno-associated virus obtained thereby. The dual-vector system or adeno-associated virus for expressing an OTOF protein of the present invention can be used for treating autosomal recessive hereditary hearing loss associated with OTOF mutations.

Description

递送Otof基因的双载体系统和其用途Two-vector system for delivering Otof gene and its use 技术领域Technical Field

本发明涉及递送超过5Kb的大基因的双载体系统、及其在基因治疗中的应用,尤其是在治疗听力损失中的应用。The present invention relates to a two-vector system for delivering large genes exceeding 5 Kb and its use in gene therapy, in particular in the treatment of hearing loss.

背景技术Background Art

耳聋是临床上最常见的致残性疾病,已经严重影响了人类的正常生活,给社会造成了巨大的负担。据WHO报道,全球几乎有15亿人存在不同程度的听力损失,4.66亿人患残疾性听力损失,占总人口的5%,其中儿童占3400万。预计到2050年全球将有25亿人患不同程度的听力损失,将有7亿人患有残疾性听力损失。Deafness is the most common disabling disease in clinical practice, severely impacting normal human life and placing a significant burden on society. According to the WHO, nearly 1.5 billion people worldwide have varying degrees of hearing loss, and 466 million, or 5% of the total population, suffer from disabling hearing loss, including 34 million children. It is estimated that by 2050, 2.5 billion people worldwide will suffer from varying degrees of hearing loss, with 700 million suffering from disabling hearing loss.

遗传和环境是致聋的两大因素,环境原因主要是与使用耳毒性药物、孕期感染、新生儿缺氧及放射线照射等多种环境因素或某些并发症相关。而遗传因素主要是因为个体耳聋基因缺陷,导致不同程度听力下降,致病基因通过不同的遗传方式传递给下一代,并且可以在任何年龄发生,且影响是永久性的。遗传是致聋的一个主要原因,由遗传因素导致的耳聋约占60%,目前已发现了120多个基因与耳聋相关,涉及1500多种致病变异,至今临床上尚无任何可用于治疗遗传性耳聋的药物。Heredity and environment are the two main contributing factors to deafness. Environmental factors are primarily related to various environmental factors or complications, such as the use of ototoxic drugs, infections during pregnancy, neonatal hypoxia, and radiation exposure. Genetic factors, on the other hand, are primarily due to individual genetic defects in hearing loss, leading to varying degrees of hearing loss. The pathogenic genes are passed on to the next generation through various genetic pathways, can occur at any age, and the effects are permanent. Heredity is a major cause of deafness, accounting for approximately 60% of deafness. Currently, over 120 genes associated with deafness have been identified, involving over 1,500 pathogenic variants. To date, there are no clinically available drugs to treat hereditary deafness.

根据除听觉系统外是否有其它器官的临床症状,遗传性耳聋可分为综合征型听力障碍(syndromic hearing loss,SHL)和非综合征型听力障碍(non-syndromic hearing loss,NSHL)。综合征型听力障碍(SHL)常伴有其它系统的临床表现,包括眼睛、心脏、肾脏、神经系统、皮肤、骨骼等,占遗传性耳聋的30%。SHL中以Pendred综合征、Usher综合征(USH)和Waardenburg综合征(WS)最为著名;而USH、Pendred综合征和Jervell and Lange-Nielsen综合征(JLNS)已在临床前动物模型研究中成功获得内耳基因治疗。非综合征型听力障碍(NSHL)有四种类型:常染色体显性(DFNA)、常染色体隐性(DFNB)、X染色体连锁(DFNX)和线粒体非综合征性耳聋。约70%的遗传性耳聋患者为非综合征型。NSHL最常见的遗传方式是常染色体隐性遗传(75%-80%),其次是常染色体显性遗传(20%),X染色体连锁遗传(<2%)和线粒体遗传(<1%)。迄今为止,已有120多个基因被报道与NSHL相关,其中Otof基因突变导致的听觉神经病变谱系障碍(Auditory neuropathyspectrum disorder,ANSD)是常见的非综合征隐性遗传耳聋病症,该基因的致病突变目前报道已达到1000多个。Based on the presence of clinical symptoms affecting organs other than the auditory system, hereditary hearing loss can be divided into syndromic hearing loss (SHL) and non-syndromic hearing loss (NSHL). Syndromic hearing loss (SHL) is often accompanied by clinical manifestations affecting other systems, including the eyes, heart, kidneys, nervous system, skin, and bones, and accounts for 30% of hereditary hearing loss. Among SHL, Pendred syndrome, Usher syndrome (USH), and Waardenburg syndrome (WS) are the most well-known. In preclinical animal models, USH, Pendred syndrome, and Jervell and Lange-Nielsen syndrome (JLNS) have been successfully treated with inner ear gene therapy. There are four types of non-syndromic hearing loss (NSHL): autosomal dominant (DFNA), autosomal recessive (DFNB), X-linked (DFNX), and mitochondrial non-syndromic hearing loss. Approximately 70% of patients with hereditary hearing loss have the non-syndromic type. The most common mode of inheritance for NSHL is autosomal recessive inheritance (75%-80%), followed by autosomal dominant inheritance (20%), X-linked inheritance (<2%), and mitochondrial inheritance (<1%). To date, more than 120 genes have been reported to be associated with NSHL, among which auditory neuropathy spectrum disorder (ANSD), caused by mutations in the Otof gene, is a common non-syndromic recessive hearing loss disorder. Over 1,000 pathogenic mutations in this gene have been reported.

每种耳聋类型按耳聋基因的命名原则描述,例如DFNA1是第一个发现的常染色体显性耳聋类型。DFNB9是第九个被描述的常染色体隐性非综合征性耳聋,由Otof基因突变引起。Otof基因定位于人染色体2p23.3,全长101496bp,含48个外显子,编码耳畸蛋白(otoferlin,也称为“OTOF蛋白”)。OTOF蛋白为钙离子的感应器,包含1997个氨基酸,其中具有6个结合钙离子的C2结构域,主要参与Ca2+相关内毛细胞的突触囊泡融合及神经递质释放。Otof基因共5个转录本,在内耳和大脑均有表达,在大脑有2种OTOF转录本,终止于47号或48号外显子;在耳蜗中只有1种转录本,终止于48号外显子。Otof基因编码的OTOF蛋白集中表达于耳蜗内毛细胞基底外侧部,是内毛细胞突触前膜结构的重要组成部分。OTOF蛋白在内毛细胞带状突触处触发膜融合,在与钙离子相关的突触囊泡的胞吐过程发挥重要作用。Each type of hearing loss is described according to the nomenclature of the deafness gene. For example, DFNA1 was the first autosomal dominant type of hearing loss to be discovered. DFNB9 is the ninth autosomal recessive nonsyndromic hearing loss to be described and is caused by mutations in the Otof gene. The Otof gene is located on human chromosome 2p23.3. It is 101,496 bp long and contains 48 exons. It encodes otoferlin (also known as "OTOF protein"). The OTOF protein is a calcium sensor consisting of 1,997 amino acids, including six calcium-binding C2 domains. It is primarily involved in Ca2 + -dependent synaptic vesicle fusion and neurotransmitter release in inner hair cells. The Otof gene has five transcripts, which are expressed in both the inner ear and brain. In the brain, there are two OTOF transcripts that terminate in exon 47 or 48; in the cochlea, there is only one transcript that terminates in exon 48. The OTOF protein, encoded by the Otof gene, is concentrated in the basolateral region of cochlear inner hair cells and is a crucial component of the presynaptic membrane structure of inner hair cells. OTOF triggers membrane fusion at the ribbon synapses of inner hair cells and plays a crucial role in the exocytosis of synaptic vesicles, which is associated with calcium ions.

OTOF基因突变导致的耳聋表型是听神经病(auditory neuropathy,AN),也称为听觉神经病谱障碍(auditory neuropathy spectrum disorder,ANSD)。该疾病除了内毛细胞的突触囊泡无法释放之外,毛细胞的存活和功能不受影响。临床表现为语言理解障碍,听脑干反应的阈值正常或者严重受损。听力学检测中听脑干反应无明显分化的波形或者严重异常,但是因为毛细胞功能正常耳声发射和耳蜗微音电位可以正常引出。The deafness phenotype caused by mutations in the OTOF gene is auditory neuropathy (AN), also known as auditory neuropathy spectrum disorder (ANSD). This disease, except for the inability to release synaptic vesicles from inner hair cells, does not affect the survival and function of hair cells. Clinical manifestations include speech comprehension impairment and normal or severely impaired thresholds for auditory brainstem responses. Audiological testing shows no clearly differentiated waveforms or severe abnormalities in the auditory brainstem responses, but because hair cells function normally, otoacoustic emissions and cochlear microphonic potentials can be elicited normally.

正是因为该突变没有直接影响螺旋神经节神经元的活性,目前人工耳蜗植入是助听器失效后唯一有效的治疗方案。虽然人工耳蜗对OTOF基因突变的患者治疗效果良好,但是从全世界仅有30万名患者接受人工耳蜗植入而言,人工耳蜗的受众仅是耳聋患者的一小部分。人工耳蜗频率敏感性低,言语分辨以及噪声环境下感知困难依然不能满足患者的正常生活需求。此外,人工耳蜗作为一种假肢装置伴随患者一生,在使用过程中需要特别小心。Because this mutation does not directly affect the activity of spiral ganglion neurons, cochlear implants are currently the only effective treatment option for hearing aid failure. Although cochlear implants are effective for patients with OTOF gene mutations, with only 300,000 patients worldwide receiving cochlear implants, the audience for cochlear implants is only a small fraction of deaf patients. Cochlear implants have low frequency sensitivity, and difficulties in speech discrimination and perception in noisy environments still cannot meet the normal life needs of patients. In addition, as a prosthetic device that accompanies the patient throughout their life, special care must be taken during use.

基因治疗是指通过在DNA或RNA层面对缺陷基因进行纠正、补偿或抑制,从而使得受试者从体内核酸序列异常或表达异常所引起的疾病中恢复健康,达到疾病治疗目的的方法。基因治疗作为一种新兴的治疗策略在遗传性听力损失中有着巨大应用的前景。目前大部分基因治疗都需要载体递送,腺相关病毒(Adeno-Asociated Virus,AAV)是其中一种安全、高效的递送载体,它的包装容量约为4.5Kb。然而,由于Otof基因的编码区长度为6kb,连同相关调控元件使得表达Otof基因受到来自AAV载体的包装方面的限制。Gene therapy refers to a method of correcting, compensating for, or inhibiting defective genes at the DNA or RNA level, thereby allowing the subject to recover from diseases caused by abnormal nucleic acid sequences or expressions in the body, thereby achieving the purpose of treating the disease. As an emerging treatment strategy, gene therapy has great application prospects in hereditary hearing loss. Currently, most gene therapies require vector delivery, and adeno-associated virus (AAV) is one of the safe and efficient delivery vectors with a packaging capacity of approximately 4.5Kb. However, due to the 6kb length of the coding region of the Otof gene, together with related regulatory elements, the expression of the Otof gene is limited by the packaging of the AAV vector.

使用双载体系统(例如,双AAV载体系统)递送大基因序列(例如,大于4kB的基因,如Otof基因)能够突破单个AAV载体对基因大小的限制。The use of a dual-vector system (eg, a dual AAV vector system) to deliver large gene sequences (eg, genes larger than 4 kB, such as the Otof gene) can overcome the gene size limitation of a single AAV vector.

发明概述SUMMARY OF THE INVENTION

本发明的目的是,提供用于患有例如DFNB9听力损失的受试者的基因治疗,由此预防和/或恢复受试者的听力。It is an object of the present invention to provide gene therapy for a subject suffering from, for example, DFNB9 hearing loss, thereby preventing and/or restoring hearing in the subject.

在第一方面,本发明提供了用于递送大基因序列(例如,大于4kB的基因;如OTOF基因)的双载体系统,其包含第一核酸载体和第二核酸载体,其中:In a first aspect, the present invention provides a two-vector system for delivering large gene sequences (e.g., genes larger than 4 kB; such as OTOF genes), comprising a first nucleic acid vector and a second nucleic acid vector, wherein:

第一核酸载体包含第一核苷酸序列;且第二核酸载体包含第二核苷酸序列;The first nucleic acid vector comprises a first nucleotide sequence; and the second nucleic acid vector comprises a second nucleotide sequence;

所述第一核苷酸序列包含插入在两个第一ITR序列之间的表达盒;The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences;

所述第二核苷酸序列包含插入在两个第二ITR序列之间的表达盒;The second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences;

所述第一核苷酸序列的表达盒包含启动子、OTOF的N端编码序列、剪接供体(SD)序列、与第二核苷酸序列发生同源重组的序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of OTOF, a splice donor (SD) sequence, a sequence for homologous recombination with the second nucleotide sequence, and polyA;

所述第二核苷酸序列的表达盒包含启动子、与第一核苷酸序列发生同源重组的序列、剪接受体(SA)序列、OTOF的C端编码序列和polyA;和The expression cassette of the second nucleotide sequence comprises a promoter, a sequence that undergoes homologous recombination with the first nucleotide sequence, a splice acceptor (SA) sequence, a C-terminal coding sequence of OTOF, and polyA; and

在OTOF氨基酸序列中设置有OTOF切分位点,例如,所述OTOF氨基酸序列如SEQ ID NO:1所示或其功能片段,例如,与SEQ ID NO:1具有至少80%、至少85%、至少90%、至少95%、至少96%、至少97%、至少98%、至少99%序列同一性的氨基酸序列;An OTOF cleavage site is provided in the OTOF amino acid sequence, for example, the OTOF amino acid sequence is as shown in SEQ ID NO: 1 or a functional fragment thereof, for example, an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 1;

OTOF的N端编码序列是从OTOF氨基酸序列的N端至OTOF切分位点的核苷酸编码序列;OTOF的C端编码序列是从OTOF切分位点后一位的氨基酸至OTOF氨基酸序列的C端的核苷酸编码序列。The N-terminal coding sequence of OTOF is the nucleotide coding sequence from the N-terminus of the OTOF amino acid sequence to the OTOF cleavage site; the C-terminal coding sequence of OTOF is the nucleotide coding sequence from the amino acid after the OTOF cleavage site to the C-terminus of the OTOF amino acid sequence.

在一些实施方案中,所述OTOF切分位点位于OTOF氨基酸序列中任一外显子17-29的3’末端氨基酸残基处。In some embodiments, the OTOF cleavage site is located at the 3' terminal amino acid residue of any exon 17-29 in the OTOF amino acid sequence.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒的启动子选自CAG启动子、CMV启动子、CBA启动子、UbC启动子、SFFV启动子、EF1α启动子、PGK启动子、或Myo7A、Myo15、Atoh1、POU4F3、Lhx3、Myo6、α9AchR、α10AchR、OTOF和STRC编码基因的启动子;In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the promoter of the expression cassette of the first nucleotide sequence or the second nucleotide sequence is selected from CAG promoter, CMV promoter, CBA promoter, UbC promoter, SFFV promoter, EF1α promoter, PGK promoter, or promoters of Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, α9AchR, α10AchR, OTOF and STRC encoding genes;

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述第一核苷酸序列或第二核苷酸序列的表达盒的polyA包含AATAAA(SEQ ID NO:68)和AATAAA的变体;所述AATAAA的变体包含ATTAAA(SEQ ID NO:69)、AGTAAA(SEQ ID NO:70)、CATAAA(SEQ ID NO:71)、TATAAA(SEQ ID NO:72)、GATAAA(SEQ ID NO:73)、ACTAAA(SEQ ID NO:74)、AATATA(SEQ ID NO:75)、AAGAAA(SEQ ID NO:76)、AATAAT(SEQ ID NO:77)、AAAAAA(SEQ ID NO:78),AATGAA(SEQ ID NO:79)、AATCAA(SEQ ID NO:80)、AACAAA(SEQ ID NO:81)、AATCAA(SEQ ID NO:82)、AATAAC(SEQ ID NO:83)、AATAGA(SEQ ID NO:84)、AATTAA(SEQ ID NO:85)或AATAAG(SEQ ID NO:86);例如,所述polyA是与SEQ ID NO:62或SEQ ID NO:65所示的多聚A信号序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;和所述两个第一ITR序列和所述两个第二ITR序列中的每个ITR序列源自AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8或AAV9。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the polyA of the expression cassette of the first nucleotide sequence or the second nucleotide sequence comprises AATAAA (SEQ ID NO: 68) and variants of AATAAA; the variants of AATAAA comprise ATTAAA (SEQ ID NO: 69), AGTAAA (SEQ ID NO: 70), CATAAA (SEQ ID NO: 71), TATAAA (SEQ ID NO: 72), GATAAA (SEQ ID NO: 73), ACTAAA (SEQ ID NO: 74), AATATA (SEQ ID NO: 75), AAGAAA (SEQ ID NO: 76), AATAAT (SEQ ID NO: 77), AAAAAA (SEQ ID NO: 78), AATGAA (SEQ ID NO: 79), NO: 79), AATCAA (SEQ ID NO: 80), AACAAA (SEQ ID NO: 81), AATCAA (SEQ ID NO: 82), AATAAC (SEQ ID NO: 83), AATAGA (SEQ ID NO: 84), AATTAA (SEQ ID NO: 85) or AATAAG (SEQ ID NO: 86); for example, the polyA is a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the poly A signal sequence shown in SEQ ID NO: 62 or SEQ ID NO: 65; and each of the two first ITR sequences and the two second ITR sequences is derived from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 or AAV9.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒还包含表达调控元件和/或标签元件,例如,所述表达调控元件为土拨鼠肝炎转录后调控元件(WPRE)或其变体,优选地,WPRE截短变体,例如,与SEQ ID NO:61所示的核苷酸序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列,例如,SEQ ID NO:64所示的核苷酸序列;例如,所述标签元件为HA。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the expression cassette of the first nucleotide sequence or the second nucleotide sequence further comprises an expression regulatory element and/or a tag element, for example, the expression regulatory element is a woodchuck hepatitis posttranscriptional regulatory element (WPRE) or a variant thereof, preferably a WPRE truncated variant, for example, a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence shown in SEQ ID NO: 61, for example, the nucleotide sequence shown in SEQ ID NO: 64; for example, the tag element is HA.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述与第一核苷酸序列发生同源重组的序列和所述与第二核苷酸序列发生同源重组的序列是内含子序列,例如,非天然存在于编码OTOF蛋白的基因的基因组序列中的内含子序列,In some embodiments, in the binary vector system for expressing an OTOF protein of the present invention, the sequence that undergoes homologous recombination with the first nucleotide sequence and the sequence that undergoes homologous recombination with the second nucleotide sequence are intron sequences, for example, intron sequences that do not naturally exist in the genomic sequence of the gene encoding the OTOF protein.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述与第一核苷酸序列发生同源重组的序列和所述与第二核苷酸序列发生同源重组的序列是内含子序列,例如,所述内含子序列是F1噬菌体的引起重组的AK序列,例如SEQ ID NO:6所示的序列。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the sequence that undergoes homologous recombination with the first nucleotide sequence and the sequence that undergoes homologous recombination with the second nucleotide sequence are intron sequences. For example, the intron sequence is the recombination-causing AK sequence of F1 phage, such as the sequence shown in SEQ ID NO: 6.

在一些实施方案中,在本发明的用于表达STRC蛋白的双载体系统中,将所述第一核苷酸序列插入在包含两个第一ITR序列的质粒中,将所述第二核苷酸序列插入在包含两个第二ITR序列的质粒中,例如,所述包含两个第一ITR序列的质粒和所述包含两个第二ITR序列的质粒相同或不同,例如,所述质粒是pAAV、pAAV-CMV、pX601、pX551或pAAV-MCS质粒。In some embodiments, in the dual-vector system for expressing STRC protein of the present invention, the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences, and the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences, for example, the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,经OTOF切分位点切分的OTOF N端和OTOF C端如表1A所示。In some embodiments, in the dual-vector system of the present invention for expressing OTOF protein, the OTOF N-terminus and OTOF C-terminus divided by the OTOF cleavage site are as shown in Table 1A.

在一些具体实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述OTOF切分位点位于外显子20编码的氨基酸残基和外显子21编码的氨基酸残基之间;In some specific embodiments, in the binary vector system for expressing the OTOF protein of the present invention, the OTOF cleavage site is located between the amino acid residue encoded by exon 20 and the amino acid residue encoded by exon 21;

OTOF切分位点位于外显子21编码的氨基酸残基和外显子22编码的氨基酸残基之间;The OTOF cleavage site is located between the amino acid residues encoded by exon 21 and the amino acid residues encoded by exon 22;

OTOF切分位点位于外显子22编码的氨基酸残基和外显子23编码的氨基酸残基之间;The OTOF cleavage site is located between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23;

OTOF切分位点位于外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间;或The OTOF cleavage site is located between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24; or

OTOF切分位点位于外显子29编码的氨基酸残基和外显子30编码的氨基酸残基之间。The OTOF cleavage site is located between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30.

在一些优选的实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述OTOF N端编码序列是SEQ ID NO:11或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:37所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:24或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:50所示的核酸序列;In some preferred embodiments, in the dual-vector system for expressing the OTOF protein of the present invention, the OTOF N-terminal coding sequence is SEQ ID NO: 11 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 37, and the OTOF C-terminal coding sequence is SEQ ID NO: 24 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 50;

所述OTOF N端编码序列是SEQ ID NO:12或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:38所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:25或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:51所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 12 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 38, and the OTOF C-terminal coding sequence is SEQ ID NO: 25 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 51;

所述OTOF N端编码序列是SEQ ID NO:13或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:39所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:26或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:52所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 13 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 39, and the OTOF C-terminal coding sequence is SEQ ID NO: 26 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 52;

所述OTOF N端编码序列是SEQ ID NO:14或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:40所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:27或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:53所示的核酸序列;或The OTOF N-terminal coding sequence is SEQ ID NO: 14 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 40, and the OTOF C-terminal coding sequence is SEQ ID NO: 27 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 53; or

所述OTOF N端编码序列是SEQ ID NO:20或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:46所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:33或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:59所示的核酸序列。The OTOF N-terminal coding sequence is SEQ ID NO: 20 or a sequence having at least 90% sequence identity therewith, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 46, and the OTOF C-terminal coding sequence is SEQ ID NO: 33 or a sequence having at least 90% sequence identity therewith, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 59.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,使用pAAV-CMV质粒为载体。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, pAAV-CMV plasmid is used as a vector.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述剪接供体(SD)序列基本由SEQ ID NO:4所示的序列组成;所述剪接受体(SA)序列基本由SEQ ID NO:5所示的序列组成。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the splice donor (SD) sequence basically consists of the sequence shown in SEQ ID NO: 4; the splice acceptor (SA) sequence basically consists of the sequence shown in SEQ ID NO: 5.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,所述第一核苷酸序列的表达盒和所述第二核苷酸序列的表达盒各自在3’ITR序列的N端包含WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:60所示的核苷酸序列或与SEQ ID NO:60具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;或者包含WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:63所示的核苷酸序列或与SEQ ID NO:63具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the expression cassette of the first nucleotide sequence and the expression cassette of the second nucleotide sequence each contain a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence at the N-terminus of the 3’ITR sequence, for example, a nucleotide sequence shown in SEQ ID NO: 60 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 60; or contain a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence, for example, a nucleotide sequence shown in SEQ ID NO: 63 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 63.

在第二方面,本发明提供了一种腺相关病毒的包装载体系统,所述包装载体系统包含本发明第一方面所述的用于表达OTOF蛋白的双载体系统、携带AAV rep和cap基因的载体和辅助病毒载体,经包装成为AAV载体,优选地,其中OTOF蛋白的氨基酸序列如SEQ ID NO:1所示。In a second aspect, the present invention provides an adeno-associated virus packaging vector system, which comprises the dual-vector system for expressing the OTOF protein described in the first aspect of the present invention, a vector carrying the AAV rep and cap genes, and a helper virus vector, which are packaged into an AAV vector. Preferably, the amino acid sequence of the OTOF protein is as shown in SEQ ID NO: 1.

在一些实施方案中,在本发明的腺相关病毒的包装载体系统中,所述携带AAV rep和cap基因的载体选自AAV1、AAV2、AAV5、AAV8、AAV9、Anc80、PHP.eB、AAV-DJ和AAVrh.10载体;所述辅助病毒载体为pHelper质粒。In some embodiments, in the adeno-associated virus packaging vector system of the present invention, the vector carrying the AAV rep and cap genes is selected from AAV1, AAV2, AAV5, AAV8, AAV9, Anc80, PHP.eB, AAV-DJ and AAVrh.10 vectors; and the helper virus vector is a pHelper plasmid.

在第三方面,本发明提供了一种腺相关病毒的包装方法,其中,将本发明第二方面所述的腺相关病毒的包装载体系统转入宿主细胞中进行包装。In a third aspect, the present invention provides a method for packaging an adeno-associated virus, wherein the adeno-associated virus packaging vector system described in the second aspect of the present invention is transferred into a host cell for packaging.

在一些实施方案中,所述宿主细胞选自Hela-S3细胞、HEK-293细胞、HEK-293T细胞、HEK-293FT细胞、A549细胞和Sf9细胞。In some embodiments, the host cell is selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells, and Sf9 cells.

在第四方面,本发明提供了双腺相关病毒载体,其是通过根据本发明第二方面所述的包装方法获得的。In a fourth aspect, the present invention provides a dual adeno-associated virus vector, which is obtained by the packaging method according to the second aspect of the present invention.

在第五方面,本发明提供了本发明第一方面所述的用于表达OTOF蛋白的双载体系统或本发明第四方面所述的双腺相关病毒载体的用途,用于制备治疗耳聋性疾病或听力损伤或听力功能障碍的药物或制剂。In the fifth aspect, the present invention provides the use of the dual vector system for expressing OTOF protein described in the first aspect of the present invention or the dual adeno-associated virus vector described in the fourth aspect of the present invention for preparing a drug or preparation for treating deafness diseases or hearing loss or hearing dysfunction.

在第六方面,本发明提供了一种用于治疗耳聋疾病或听力损伤或听力功能障碍的药物或制剂,其由本发明第一方面所述的用于表达OTOF蛋白的双载体系统或本发明第四方面所述的腺相关病毒制备,其中,所述腺相关病毒是将腺相关病毒的包装载体系统转入宿主细胞进行包装得到的,所述腺相关病毒的包装载体系统包括表达OTOF蛋白的双载体系统、携带AAVrep和cap基因的载体,以及辅助病毒载体。In the sixth aspect, the present invention provides a medicine or preparation for treating deafness, hearing loss or hearing dysfunction, which is prepared by the dual-vector system for expressing OTOF protein described in the first aspect of the present invention or the adeno-associated virus described in the fourth aspect of the present invention, wherein the adeno-associated virus is obtained by transferring the packaging vector system of the adeno-associated virus into a host cell for packaging, and the packaging vector system of the adeno-associated virus includes a dual-vector system for expressing OTOF protein, a vector carrying AAVrep and cap genes, and a helper virus vector.

在一些实施方案中,本发明的所述药物或制剂还包含中性盐缓冲剂、酸性盐缓冲剂、碱性盐缓冲剂、葡萄糖、甘露糖、甘露糖醇、蛋白质、多肽、氨基酸、抗生素、螯合剂、佐剂、防腐剂、纳米颗粒、脂质体和阳性脂质颗粒。In some embodiments, the drug or formulation of the present invention further comprises a neutral salt buffer, an acidic salt buffer, an alkaline salt buffer, glucose, mannose, mannitol, proteins, polypeptides, amino acids, antibiotics, chelating agents, adjuvants, preservatives, nanoparticles, liposomes and positive lipid particles.

在一些实施方案中,将本发明的所述药物或制剂通过耳蜗的圆窗、卵圆窗、半规管、后半规管、总管进行注射给药;以及终生单次给药或多次给药,总剂量为1×109-1×1013病毒基因组。In some embodiments, the drug or preparation of the present invention is administered by injection through the round window, oval window, semicircular canal, posterior semicircular canal, or common canal of the cochlea; and is administered once or multiple times throughout life, with a total dose of 1×10 9 -1×10 13 viral genomes.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1显示了使用反向末端重复序列(ITR)、CMV启动子、剪接供体SD信号序列、剪接受体SA信号序列、用于同源重组拼接的HD序列依赖于反式剪接机制同源重组的双载体介导的OTOF蛋白表达的示意图。图1中,HD表示同源结构域(Homeodomain),例如,其序列包含来自f1噬菌体的短AK重组引发性序列,5’cDNA-otof和3’cDNA-erlin分别表示将OTOF全长蛋白切分为OTOF N端和OTOF C端后的编码OTOF N端(文中也表示为“5’Otof NT”)的核苷酸序列和编码OTOF C端(文中也表示为“3’Otof CT”)的核苷酸序列。Figure 1 shows a schematic diagram of dual-vector-mediated OTOF protein expression using an inverted terminal repeat (ITR), a CMV promoter, a splice donor SD signal sequence, a splice acceptor SA signal sequence, and an HD sequence for homologous recombination splicing, which relies on a trans-splicing mechanism for homologous recombination. In Figure 1, HD represents the homeodomain, for example, whose sequence contains a short AK recombination-initiating sequence from f1 phage, and 5'cDNA-otof and 3'cDNA-erlin represent the nucleotide sequence encoding the OTOF N-terminus (also referred to as "5'Otof NT") and the nucleotide sequence encoding the OTOF C-terminus (also referred to as "3'Otof CT"), respectively, after the full-length OTOF protein is cleaved into the OTOF N-terminus and OTOF C-terminus.

图2例示了双载体系统中的载体骨架元件,例如,其序列可以如SEQ ID NO:7所示,其中5’ITR序列位于1bp-141bp,CMV启动子序列位于169bp-752bp,Kozak序列位于792bp-797bp,EGFP序列位于801bp-1517bp,WPRE序列位于1536bp-2124bp,SV40 PolyA序列位于2131bp-2252bp,3’ITR序列位于2290bp-2430bp,f1 Ori序列位于2505bp-2960bp,kana抗性序列位于3242bp-4156bp,Ori序列位于4327bp-4515bp。Figure 2 illustrates the vector backbone elements in the dual vector system, for example, its sequence can be shown as SEQ ID NO:7, wherein the 5’ITR sequence is located at 1bp-141bp, the CMV promoter sequence is located at 169bp-752bp, the Kozak sequence is located at 792bp-797bp, the EGFP sequence is located at 801bp-1517bp, the WPRE sequence is located at 1536bp-2124bp, the SV40 PolyA sequence is located at 2131bp-2252bp, the 3’ITR sequence is located at 2290bp-2430bp, the f1 Ori sequence is located at 2505bp-2960bp, the kana resistance sequence is located at 3242bp-4156bp, and the Ori sequence is located at 4327bp-4515bp.

图3例示了在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间将OTOF全长蛋白切分为OTOF N端和OTOF C端后,构建的双载体系统中的第一核酸载体的质粒图谱,包含5’Otof NT的编码核苷酸序列、SD序列和AK序列。Figure 3 illustrates the plasmid map of the first nucleic acid vector in the dual-vector system constructed after the full-length OTOF protein is split into the OTOF N-terminus and the OTOF C-terminus between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24, which contains the coding nucleotide sequence of 5’Otof NT, SD sequence and AK sequence.

图4例示了在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间将OTOF全长蛋白切分为OTOF N端和OTOF C端后,构建的双载体系统中的第二核酸载体的质粒图谱,包含AK序列、SA序列和3’Otof CT的编码核苷酸序列,其中在OTOF C端蛋白的C末端连接有HA标签,用于验证OTOF蛋白的体外表达。Figure 4 illustrates the plasmid map of the second nucleic acid vector in the dual-vector system constructed after the full-length OTOF protein was split into the OTOF N-terminus and the OTOF C-terminus between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24, which contains the AK sequence, SA sequence and the coding nucleotide sequence of 3’Otof CT, wherein an HA tag is connected to the C-terminus of the OTOF C-terminal protein for verifying the in vitro expression of the OTOF protein.

图5例示了在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间将OTOF全长蛋白切分为OTOF N端和OTOF C端后,构建的双载体系统中的第二核酸载体的质粒图谱,包含AK序列、SA序列和3’Otof CT的编码核苷酸序列,其中在OTOF C端蛋白的C末端未连接HA标签,用于小鼠体内注射治疗实验。Figure 5 illustrates the plasmid map of the second nucleic acid vector in the dual-vector system constructed after the full-length OTOF protein was split into the OTOF N-terminus and the OTOF C-terminus between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24, containing the AK sequence, SA sequence and the coding nucleotide sequence of 3’Otof CT, wherein the HA tag is not connected to the C-terminus of the OTOF C-terminal protein for in vivo injection therapy experiments in mice.

图6显示了用编码OTOF蛋白的N端部分(Otof-NT)的质粒载体和用编码OTOF蛋白的C端部分(Otof-CT)的质粒载体两者共转染的HEK-293T细胞导致全长Otof表达以及不同切分位点的全长OTOF蛋白表达情况的Western印迹结果。图6中,最左侧的泳道表示分子标志物梯带;泳道NC:对照(未转染质粒载体的293T细胞);泳道1:共转染在外显子17编码的氨基酸残基和外显子18编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“17N+18C”);泳道2:共转染在外显子18编码的氨基酸残基和外显子19编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“18N+19C”);泳道3:共转染在外显子19编码的氨基酸残基和外显子20编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“19N+20C”);泳道4:共转染在外显子20编码的氨基酸残基和外显子21编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“20N+21C”);泳道5:共转染在外显子21编码的氨基酸残基和外显子22编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“21N+22C”);泳道6:共转染在外显子22编码的氨基酸残基和外显子23编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“22N+23C”);泳道7:共转染在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“23N+24C”);泳道8:共转染在外显子24编码的氨基酸残基和外显子25编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“24N+25C”);泳道9:共转染在外显子25编码的氨基酸残基和外显子26编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“25N+26C”);泳道10:共转染在外显子26编码的氨基酸残基和外显子27编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“26N+27C”);泳道11:共转染在外显子27编码的氨基酸残基和外显子28编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“27N+28C”);泳道12:共转染在外显子28编码的氨基酸残基和外显子29编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“28N+29C”);泳道13:共转染在外显子29编码的氨基酸残基和外显子30编码的氨基酸残基之间切分OTOF全长蛋白后的Otof-NT的质粒载体和Otof-CT的质粒载体(文中也简称为“29N+30C”)。Figure 6 shows the Western blot results of HEK-293T cells co-transfected with a plasmid vector encoding the N-terminal portion of the OTOF protein (Otof-NT) and a plasmid vector encoding the C-terminal portion of the OTOF protein (Otof-CT), resulting in the expression of full-length Otof and full-length OTOF proteins at different cleavage sites. In Figure 6, the leftmost lane represents a molecular marker ladder; Lane NC: control (293T cells not transfected with plasmid vectors); Lane 1: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein is cleaved between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 (also referred to as "17N+18C" in the text); Lane 2: co-transfection of the amino acid residues encoded by exon 18. Lane 3: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residue encoded by exon 19 and the amino acid residue encoded by exon 20 (also referred to as "19N+20C" in the text); Lane 4: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residue encoded by exon 20 and the amino acid residue encoded by exon 21 (also referred to as "20N+21C" in the text); Lane 5: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residue encoded by exon 21 and the amino acid residue encoded by exon 22. Lane 6: Otof-NT plasmid vector and Otof-CT plasmid vector (also referred to as "21N+22C" in the text); Lane 7: Otof-NT plasmid vector and Otof-CT plasmid vector (also referred to as "22N+23C" in the text) after co-transfection of the OTOF full-length protein cleaved between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23; Lane 8: Otof-NT plasmid vector and Otof-CT plasmid vector (also referred to as "22N+23C" in the text) after co-transfection of the OTOF full-length protein cleaved between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24. Lane 8: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 (also referred to as "24N+25C" in the text); Lane 9: co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 (also referred to as "25N+26C" in the text); Lane 10: co-transfection of the Otof-NT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27. Lane 11: Co-transfection of the plasmid vectors of Otof-NT and Otof-CT (also referred to as "26N+27C" in the text); Lane 12: Co-transfection of the plasmid vectors of Otof-NT and Otof-CT (also referred to as "27N+28C" in the text) after the full-length OTOF protein was cleaved between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29. Lane 13: Co-transfection of the Otof-NT plasmid vector and the Otof-CT plasmid vector after the OTOF full-length protein was cleaved between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 (also referred to as "29N+30C" in the text).

图7是实施例3的接受治疗的Otof基因敲除小鼠(Otof KO小鼠)模型的听性脑干反应(ABR)结果,显示了在接受治疗的Otof敲除小鼠模型中,ABR阈值降低,听力部分频段显著恢复,图7中,“20N+21C”对应图6的泳道4的质粒包装后的双AAV病毒的结果;“23N+24C”对应图6的泳道7的质粒包装后的双AAV病毒的结果;“WT”表示野生型小鼠;“KO”表示Otof基因敲除小鼠。Figure 7 is the auditory brainstem response (ABR) results of the Otof knockout mouse (Otof KO mouse) model that received treatment in Example 3, showing that in the Otof knockout mouse model that received treatment, the ABR threshold was lowered and some frequency bands of hearing were significantly restored. In Figure 7, "20N+21C" corresponds to the results of the double AAV virus after plasmid packaging in lane 4 of Figure 6; "23N+24C" corresponds to the results of the double AAV virus after plasmid packaging in lane 7 of Figure 6; "WT" represents wild-type mice; "KO" represents Otof knockout mice.

图8显示对Otof基因点突变的C57BL/6小鼠在出生后27天龄,用切分位点exon23/24(图8中显示为“23N+24C”)以及切分位点exon20/21(图8中显示为“20N+21C”)构建的双AAV病毒制剂注射后半规管5天之后,测听观察ABR阈值恢复情况。Figure 8 shows the ABR threshold recovery observed by audiometry 5 days after injection of a dual AAV virus preparation constructed with the cleavage site exon23/24 (shown as "23N+24C" in Figure 8 ) and the cleavage site exon20/21 (shown as "20N+21C" in Figure 8 ) into the posterior semicircular canal of C57BL/6 mice with a point mutation in the Otof gene at 27 days of age.

发明详述Detailed Description of the Invention

除非下文中另外定义,否则本说明书中所用的全部技术与科学术语具有如本发明所属领域的普通技术人员通常理解的相同含义。本文所提及的全部出版物、专利申请、专利和其他参考文献通过引用的方式完整地并入本文作为参考。此外,本文中所述的材料、方法和例子仅是说明性的并且不意在是限制性的。本发明的其他特征、目的和优点将从本说明书及附图并且从后附的权利要求书中显而易见。Unless otherwise defined hereinafter, all technical and scientific terms used in this specification have the same meaning as those of ordinary skill in the art to which the present invention pertains. All publications, patent applications, patents and other references mentioned herein are incorporated herein by reference in their entirety. In addition, the materials, methods and examples described herein are merely illustrative and are not intended to be restrictive. Other features, objects and advantages of the present invention will become apparent from this specification and the accompanying drawings and from the appended claims.

定义definition

在本文中,术语“约”在与数字数值联合使用时意为涵盖具有比指定数字数值小5%的下限和比指定数字数值大5%的上限的范围内的数字数值。该术语也旨在涵盖在指定数字±1%、±0.5%或±0.1%范围内的数值。As used herein, the term "about" when used in conjunction with a numerical value is intended to encompass numerical values within a range having a lower limit that is 5% less than the specified numerical value and an upper limit that is 5% greater than the specified numerical value. The term is also intended to encompass values within ±1%, ±0.5%, or ±0.1% of the specified number.

在本文中,术语“包含”或“包括”意指包括所述及的要素、整数或步骤、或要素组、整数组或步骤组,但是不排除任何其他的要素、整数或步骤、或其他要素组、整数组或步骤组。在本文中,当使用术语“包含”或“包括”时,除非另有指明,否则也涵盖由所述及的要素、整数或步骤组成的情形。例如,当提及“包含”某个具体序列的多核苷酸时,也旨在涵盖由该具体序列组成的多核苷酸。As used herein, the term "comprising" or "including" means including the recited elements, integers, steps, or groups of elements, integers, or steps, but does not exclude any other elements, integers, or steps, or other groups of elements, integers, or steps. As used herein, unless otherwise indicated, the term "comprising" or "including" also encompasses the situation consisting of the recited elements, integers, or steps. For example, when referring to a polynucleotide "comprising" a particular sequence, it is intended to encompass a polynucleotide consisting of that particular sequence.

在本文中,表述“和/或”在与两个或两个以上项目联合使用时,意在表示所列相关项目中的任何一个、或所列相关项目中的任何多个或所有的可能组合。Herein, the expression "and/or," when used in conjunction with two or more items, is intended to mean any one of the associated listed items, or any multiple or all possible combinations of the associated listed items.

在本文中,“听力损失”是指,通过听力测量确定的低于正常听力阈值水平的听力,包括轻度、中度、重度和极重度听力损失、以及耳聋。听力损失可以通过听力损失百分比描述,例如,30%、60%、80%或甚至100%听力损失,或通过听力损失分级描述。所述听力损失可以是由基因缺陷引起的或与之相关的听力损失,例如由遗传因素引起的先天性耳聋和学语前聋,或与遗传因素相关的、由环境因素诱发的(例如,衰老、噪音、药物或感染诱导的)听力损失。听力损失可以是无症状的(即,不存在相关的可见外耳或其他器官异常)或有症状的。在一些实施方案中,听力损失是感音神经性听力损失。In this article, "hearing loss" refers to hearing below the normal hearing threshold level determined by audiometry, including mild, moderate, severe and profound hearing loss, and deafness. Hearing loss can be described by a percentage of hearing loss, for example, 30%, 60%, 80% or even 100% hearing loss, or by a grading description of hearing loss. The hearing loss can be hearing loss caused by or associated with a gene defect, such as congenital deafness and pre-lingual deafness caused by genetic factors, or hearing loss associated with genetic factors, induced by environmental factors (for example, aging, noise, drugs or infection-induced). Hearing loss can be asymptomatic (that is, there is no associated visible outer ear or other organ abnormality) or symptomatic. In some embodiments, hearing loss is sensorineural hearing loss.

在本文中,“听力损失相关基因”是指,所述基因上的变异可通过改变内耳正常发挥功能的能力而引起听力损失或造成听力损失易感性。在本文中,这样的基因也称作“听损基因”。目前已经鉴定了100多个基因与听力损失相关(参见,Hereditary Hearing Loss Homepage,https://hereditaryhearingloss.org/,其中列出了目前已知的单基因无症状听力损失的基因位置和鉴定数据)。在引起听力损失易感性的情况下,相对于健康个体,携带所述听损基因变异的个体可表现出更容易因环境因素,例如衰老、噪音、药物或感染而引发听力损失。As used herein, a "hearing loss-associated gene" refers to a gene whose variation can cause hearing loss or susceptibility to hearing loss by altering the ability of the inner ear to function normally. In this article, such genes are also referred to as "hearing loss genes." More than 100 genes have been identified as being associated with hearing loss (see Hereditary Hearing Loss Homepage, https://hereditaryhearingloss.org/, which lists the gene locations and identification data for currently known single-gene asymptomatic hearing loss). In the case of causing susceptibility to hearing loss, individuals carrying the hearing loss gene variation may show a greater susceptibility to hearing loss due to environmental factors, such as aging, noise, drugs, or infections, relative to healthy individuals.

“Otof基因”编码耳畸蛋白(otoferlin,也称为“OTOF蛋白”),该蛋白同家族的成员还有dysferlin和myoferlin,它们与线虫的精子形成因子(FER-1)具有同源性,由此可推断OTOF蛋白或许与囊泡膜融合相关。OTOF蛋白在小鼠胚胎后期以及刚刚出生幼鼠耳蜗中的螺旋神经节细胞以及内外毛细胞中均有微量表达。小鼠成年后则只在内毛细胞表达,主要集中在内毛细胞的基底部。而内毛细基底部分布了许多可以持续大量释放神经递质的带状突触。先前研究认为,OTOF蛋白是内毛细胞胞吐的钙离子感受器,是传入神经突触的重要组成部分。OTOF蛋白的C2域可以结合Ca2+并伴有Ca2+依赖性;可以与磷脂及蛋白质结合,并在细胞膜运输和信号传导方面起重要作用;可以与胞浆磷脂酶A2、突触结合蛋白Ⅰ、Ras相关GTP结合蛋白等相结合,参与神经递质的释放,由此可以推测OTOF蛋白与钙离子相关的突触囊泡融合及神经递质的释放有关。OTOF-/-小鼠的带状突触结构无明显异常并且Ca2+也能分泌,但是听力完全丧失,并伴随突触囊泡释放异常,这表明OTOF蛋白是介导突触囊泡释放的重要元件。因为神经递质对突触后具有滋养作用,缺少营养会导致突触后的退行,这或许能解释在人类病例中表现为渐进性听力损失的原因。The "Otof gene" encodes otoferlin (also known as "OTOF protein"). Other members of the same family of proteins include dysferlin and myoferlin, which are homologous to the sperm formation factor (FER-1) of nematodes. It can be inferred that the OTOF protein may be related to vesicle membrane fusion. The OTOF protein is expressed in trace amounts in the spiral ganglion cells and inner and outer hair cells in the cochlea of late mouse embryos and newly born mice. In adult mice, it is only expressed in the inner hair cells, mainly concentrated in the base of the inner hair cells. The base of the inner capillary is distributed with many ribbon-like synapses that can continuously release large amounts of neurotransmitters. Previous studies have shown that the OTOF protein is a calcium ion sensor for exocytosis of inner hair cells and an important component of the afferent nerve synapse. The C2 domain of the OTOF protein binds Ca2 + in a Ca2 + -dependent manner; it can bind to phospholipids and proteins, playing an important role in cell membrane trafficking and signal transduction. It can also bind to cytosolic phospholipase A2, synaptotagmin I, and Ras-related GTP-binding protein, participating in neurotransmitter release. Therefore, it is speculated that OTOF protein is involved in calcium-dependent synaptic vesicle fusion and neurotransmitter release. OTOF -/- mice have no obvious structural abnormalities in their ribbon synapses and can secrete Ca2 + , but they exhibit complete hearing loss and abnormal synaptic vesicle release, suggesting that OTOF protein is a key mediator of synaptic vesicle release. Because neurotransmitters nourish postsynapses, a lack of nutrition can lead to postsynaptic degeneration, potentially explaining the progressive hearing loss seen in humans.

Otof基因在耳蜗、前庭和大脑组织中均有表达,其具有5个转录本,其中两个转录本在大脑中表达,编码的蛋白质分长和短两种形式,分别终止于47号外显子和48号外显子。短形式OTOF蛋白具有3个C2域和一个羧基跨膜域,这种形式仅仅存在于人类;长形式OTOF蛋白具有6个C2域和一个跨膜域,人和小鼠中均有表达,但人类耳蜗中只有终止于48号外显子的转录本存在。在一些实施方案中,改造人Otof基因的转录本变体5序列(NM_001287489.1)来用于耳蜗中的基因治疗。在一个实施方案中,Otof基因的转录本变体5的核苷酸序列如SEQ ID NO:2所示,其编码的OTOF同工型(isoform)5的氨基酸序列如SEQ ID NO:1所示。先前研究发现在OTOF蛋白的C2A-F这6个C2结构域中,不同的结构域存在不同的突变发生率,其中C2F域发生突变的频率最高。目前认为不同的C2结构域构型不同,或者在听功能形成过程中不同的C2结构域结合的物质不同导致其突变率产生差异。The Otof gene is expressed in the cochlea, vestibule, and brain tissues. It has five transcripts, two of which are expressed in the brain. The encoded proteins are divided into long and short forms, terminating at exon 47 and exon 48, respectively. The short form of OTOF protein has three C2 domains and one carboxyl transmembrane domain, and this form only exists in humans; the long form of OTOF protein has six C2 domains and one transmembrane domain, and is expressed in humans and mice, but only the transcript terminating at exon 48 exists in the human cochlea. In some embodiments, the transcript variant 5 sequence (NM_001287489.1) of the human Otof gene is modified for gene therapy in the cochlea. In one embodiment, the nucleotide sequence of transcript variant 5 of the Otof gene is shown in SEQ ID NO: 2, and the amino acid sequence of the encoded OTOF isoform 5 is shown in SEQ ID NO: 1. Previous studies have found that within the six C2 domains of the OTOF protein, C2A-F, different domains have different mutation rates, with the C2F domain having the highest mutation frequency. It is currently believed that the differences in mutation rates are due to different configurations of the different C2 domains or to the different substances they bind to during the development of hearing function.

在本文中,“耳蜗内毛细胞”是指,来自哺乳动物的离体或体外的耳蜗内毛细胞或细胞系或细胞群体,或在哺乳动物耳蜗中的内毛细胞。As used herein, "cochlear inner hair cells" refer to isolated or in vitro cochlear inner hair cells, cell lines, or cell populations derived from a mammal, or inner hair cells in the cochlea of a mammal.

在本文中,“耳蜗外毛细胞”是指,来自哺乳动物的离体或体外的耳蜗外毛细胞或细胞系或细胞群体,或在哺乳动物耳蜗中的外毛细胞。As used herein, "cochlear outer hair cells" refer to cochlear outer hair cells, cell lines, or cell populations isolated from or in vitro of a mammal, or outer hair cells in the cochlea of a mammal.

在本文中,“分离的”核酸是指,人工合成的、或自包含其的天然环境的至少一些组分中分离出来的核酸分子。例如,分离的核酸可以是一个更大的核酸的一部分、或是载体或物质组合物的一部分,或者可以包含在细胞内,并且仍然是“分离的”,条件是该更大的核酸、载体、物质组合物或特定细胞不是所述核酸的天然环境。As used herein, an "isolated" nucleic acid refers to a nucleic acid molecule that has been artificially synthesized or separated from at least some components of its natural environment. For example, an isolated nucleic acid can be part of a larger nucleic acid, or part of a vector or composition of matter, or can be contained within a cell and still be "isolated" provided that the larger nucleic acid, vector, composition of matter, or specific cell is not the natural environment of the nucleic acid.

在本文中,术语“可操作连接”也称作“有效连接”或“功能性连接”,意指两个或更多个多核苷酸(例如DNA)区段处于一种允许它们以预期的方式起作用的关系中。举例而言,若启动子序列在合适的宿主细胞或其他表达系统中刺激或调节编码序列的转录,则该启动子序列有效连接到该编码序列。一般而言,有效连接到可转录序列的启动子,与该可转录序列是连续的,即它们是顺式作用(cis-acting)。然而,一些转录调控序列(如增强子)无须物理邻接于或紧密接近于其增强转录的编码序列。As used herein, the term "operably linked," also referred to as "effectively linked" or "functionally linked," means that two or more polynucleotide (e.g., DNA) segments are in a relationship that allows them to function in the intended manner. For example, a promoter sequence is operably linked to a coding sequence if it stimulates or regulates the transcription of the coding sequence in a suitable host cell or other expression system. Generally, promoters that are operably linked to a transcribable sequence are contiguous with the transcribable sequence, i.e., they are cis-acting. However, some transcriptional regulatory sequences (e.g., enhancers) do not need to be physically adjacent to or in close proximity to the coding sequence whose transcription they enhance.

如本文中所用,术语“腺相关病毒(Adeno-associated virus,AAV)”因在腺病毒制品中发现而得名。AAV是微小病毒科(Parvovirus)成员,包含多种血清型,其基因组为单链DNA。As used herein, the term "adeno-associated virus (AAV)" is named after its discovery in adenovirus products. AAV is a member of the Parvovirus family, which includes multiple serotypes and has a single-stranded DNA genome.

AAV是依赖性病毒,需要其它病毒如腺病毒、单纯疱疹病毒、人乳头瘤病毒、或辅助因素提供辅助功能蛋白才能复制。AAV is a dependent virus that requires other viruses such as adenovirus, herpes simplex virus, human papillomavirus, or auxiliary factors to provide auxiliary functional proteins for replication.

最早分离到的AAV病毒是血清型2型AAV(AAV2)。AAV2基因组长约4.7kb,基因组两端为长度145bp的“反向末端重复序列”(inverted terminal repeat,ITR),呈回文-发卡结构。基因组中有两个大开放阅读框(ORF),分别编码rep和cap基因。The first AAV virus isolated was serotype 2 (AAV2). The AAV2 genome is approximately 4.7 kb long, flanked by 145-bp inverted terminal repeats (ITRs) at each end, forming a palindromic hairpin structure. The genome contains two large open reading frames (ORFs), encoding the rep and cap genes, respectively.

ITR是AAV载体基因组的顺式作用元件,在AAV病毒的整合、拯救、复制和基因组包装中发挥重要作用。ITR序列中包含Rep蛋白结合位点(Rep binding site,RBS)和末端解链位点trs(terminal resolution site),能够被Rep蛋白结合识别并在trs处产生切口。ITR序列还可形成独特的“T”字母型二级结构,在AAV病毒的生活周期中发挥重要作用。ITRs are cis-acting elements of the AAV vector genome, playing a crucial role in AAV integration, rescue, replication, and genome packaging. The ITR sequence contains the Rep binding site (RBS) and the terminal resolution site (TRs), which are recognized by the Rep protein and produce a cleavage at the TRs. The ITR sequence also forms a unique "T"-shaped secondary structure, which plays a crucial role in the AAV life cycle.

AAV2基因组其余部分可分为2个功能区,rep基因区和cap基因区。The rest of the AAV2 genome can be divided into two functional regions, the rep gene region and the cap gene region.

rep基因区编码Rep78、Rep68、Rep52和Rep40四种Rep蛋白。Rep蛋白对于AAV病毒的复制、整合、拯救和包装都具有重要作用。其中Rep78和Rep68与ITR中的末端解链位点trs和GAGY重复基序特异性结合,启动AAV基因组由单链向双链的复制过程。ITR中trs和GAGC重复基序和/或GAGY重复基序是AAV基因组复制的中心,因此虽然在各种血清型的AAV病毒中ITR序列都不尽相同,但是都能形成发卡结构和存在Rep结合位点。在AAV2基因组图谱位置19处有p19启动子,启动分别表达Rep52和Rep40。Rep52和Rep40具有ATP依赖的DNA解旋酶活性,但没有结合DNA的功能。The rep gene region encodes four Rep proteins: Rep78, Rep68, Rep52, and Rep40. Rep proteins play an important role in the replication, integration, rescue, and packaging of AAV viruses. Rep78 and Rep68 specifically bind to the terminal melting sites trs and GAGY repeat motifs in the ITR, initiating the replication of the AAV genome from single-stranded to double-stranded. The trs and GAGC repeat motifs and/or GAGY repeat motifs in the ITR are the center of AAV genome replication. Therefore, although the ITR sequences are different in various serotypes of AAV viruses, they can all form a hairpin structure and contain Rep binding sites. There is a p19 promoter at position 19 on the AAV2 genome map, which initiates the expression of Rep52 and Rep40, respectively. Rep52 and Rep40 have ATP-dependent DNA helicase activity but do not have the function of binding to DNA.

cap基因编码AAV病毒的衣壳蛋白VP1、VP2和VP3。其中,VP3分子量最小,但数量最多,在成熟的AAV颗粒中VP1、VP2、VP3的比例大致为1:1:10。VP1是形成有感染性的AAV所必需的;VP2协助VP3进入细胞核;VP3是组成AAV颗粒的主要蛋白。The cap gene encodes the AAV capsid proteins VP1, VP2, and VP3. VP3 has the smallest molecular weight but is the most abundant. In mature AAV particles, the ratio of VP1, VP2, and VP3 is approximately 1:1:10. VP1 is essential for the formation of infectious AAV; VP2 facilitates VP3 entry into the cell nucleus; and VP3 is the primary protein in AAV particles.

如本文中所用,术语“AAV载体”是人们随着对AAV病毒生活周期及其相关分子生物学机制的了解,将野生型AAV病毒改造成的一种高效的外源基因转移工具,即AAV载体。改造后的AAV载体基因组中只包含AAV病毒的ITR序列和携带待转运的外源序列。AAV病毒包装需要的Rep和Cap蛋白通过其他外源质粒反式提供,由此降低了rep和cap基因包装入AAV载体可能带来的危害。进一步地,AAV病毒本身不具有致病性,这使得AAV载体成为公认的最安全的病毒载体之一。As used herein, the term "AAV vector" refers to an efficient exogenous gene transfer tool, i.e., an AAV vector, that has been transformed from wild-type AAV virus as people gain a better understanding of the AAV virus life cycle and its related molecular biological mechanisms. The modified AAV vector genome only contains the ITR sequence of the AAV virus and the exogenous sequence to be transferred. The Rep and Cap proteins required for AAV virus packaging are provided in trans by other exogenous plasmids, thereby reducing the possible harm caused by packaging the rep and cap genes into the AAV vector. Furthermore, the AAV virus itself is not pathogenic, which makes the AAV vector recognized as one of the safest viral vectors.

AAV病毒血清型众多,不同的血清型具有不同的组织感染嗜性,因此应用AAV载体能够将外源基因转运至特定的器官和组织。There are many AAV virus serotypes, and different serotypes have different tissue infection tropisms. Therefore, the use of AAV vectors can transport exogenous genes to specific organs and tissues.

现有技术中对AAV载体具有相对成熟的包装系统,这便于规模化生产AAV载体。The existing technology has a relatively mature packaging system for AAV vectors, which facilitates the large-scale production of AAV vectors.

术语“载体基因组(vg)”是指包装在rAAV衣壳内形成rAAV载体的核酸序列。The term "vector genome (vg)" refers to the nucleic acid sequence that is packaged within the rAAV capsid to form the rAAV vector.

在本文中,“个体”和“受试者”可互换地使用,是指哺乳动物。哺乳动物的实例包括,但不限于,人、非人灵长类动物(例如,食蟹猴、恒河猴)、啮齿类动物,以及其他哺乳动物,例如,牛、猪、马、狗。在本文中,哺乳动物包括处于所有发育期(包括胚胎和胎儿期)的个体。As used herein, "individual" and "subject" are used interchangeably to refer to mammals. Examples of mammals include, but are not limited to, humans, non-human primates (e.g., cynomolgus monkeys, rhesus monkeys), rodents, and other mammals, such as cattle, pigs, horses, and dogs. As used herein, mammals include individuals at all stages of development, including embryonic and fetal stages.

在本文中,术语“治疗”指意欲改变正在接受治疗的个体中疾病之天然过程的临床介入。想要的治疗效果包括但不限于防止疾病出现或复发、减轻症状、减小疾病的任何直接或间接病理学后果、防止转移、降低病情进展速率、改善或缓和疾病状态,以及缓解或改善预后。术语“治疗”也涵盖改良或改善至少一项身体参数,包括患者可能无法辨别的身体参数。As used herein, the term "treatment" refers to clinical intervention intended to alter the natural course of a disease in the individual being treated. Desired therapeutic effects include, but are not limited to, preventing the onset or recurrence of the disease, alleviating symptoms, reducing any direct or indirect pathological consequences of the disease, preventing metastasis, reducing the rate of disease progression, ameliorating or palliating the disease state, and alleviating or improving prognosis. The term "treatment" also encompasses modification or improvement of at least one physical parameter, including physical parameters that may not be discernible by the patient.

在本文中,术语“预防”是指预防或延缓疾病或病症的发作或发展或进程。在本文中,“预防”通常是指在疾病的至少一个症状发生前实施的医院干预。As used herein, the term "prevention" refers to preventing or delaying the onset or development or progression of a disease or condition. As used herein, "prevention" generally refers to hospital intervention performed before at least one symptom of a disease occurs.

以下对本发明的各个方面进行描述。Various aspects of the present invention are described below.

II.双载体系统II. Dual Vector System

本发明提供了一种双载体系统,其包含第一核酸载体和第二核酸载体,其中:The present invention provides a dual vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:

第一核酸载体包含第一核苷酸序列;且第二核酸载体包含第二核苷酸序列;The first nucleic acid vector comprises a first nucleotide sequence; and the second nucleic acid vector comprises a second nucleotide sequence;

所述第一核苷酸序列包含插入在两个第一ITR序列之间的表达盒;The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences;

所述第二核苷酸序列包含插入在两个第二ITR序列之间的表达盒;The second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences;

所述第一核苷酸序列的表达盒包含启动子、OTOF的N端编码序列、剪接供体(SD)序列、与第二核苷酸序列发生同源重组的序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of OTOF, a splice donor (SD) sequence, a sequence for homologous recombination with the second nucleotide sequence, and polyA;

所述第二核苷酸序列的表达盒包含启动子、与第一核苷酸序列发生同源重组的序列、剪接受体(SA)序列、OTOF的C端编码序列和polyA;和The expression cassette of the second nucleotide sequence comprises a promoter, a sequence that undergoes homologous recombination with the first nucleotide sequence, a splice acceptor (SA) sequence, a C-terminal coding sequence of OTOF, and polyA; and

在OTOF氨基酸序列中设置有OTOF切分位点,例如,所述OTOF氨基酸序列如SEQ ID NO:1所示或其功能片段,例如,与SEQ ID NO:1具有至少80%、85%、90%、95%、96%、97%、98%、99%或100%序列同一性的氨基酸序列;An OTOF cleavage site is provided in the OTOF amino acid sequence, for example, the OTOF amino acid sequence is as shown in SEQ ID NO: 1 or a functional fragment thereof, for example, an amino acid sequence having at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO: 1;

OTOF的N端编码序列是从OTOF氨基酸序列的N端至OTOF切分位点的核苷酸编码序列;OTOF的C端编码序列是从OTOF切分位点后一位的氨基酸至OTOF氨基酸序列的C端的核苷酸编码序列。The N-terminal coding sequence of OTOF is the nucleotide coding sequence from the N-terminus of the OTOF amino acid sequence to the OTOF cleavage site; the C-terminal coding sequence of OTOF is the nucleotide coding sequence from the amino acid after the OTOF cleavage site to the C-terminus of the OTOF amino acid sequence.

同源重组序列以及剪接供体序列和剪接接受体序列Homologous recombination sequences as well as splice donor and acceptor sequences

本发明的双载体系统中,第一核苷酸序列与第二核苷酸序列发生同源重组的序列可以是内含子序列,例如,非天然存在于编码OTOF蛋白的基因的基因组序列中的内含子序列。在一些具体实施方案中,内含子是合成的碱性磷酸酶(AP)内含子。In the dual vector system of the present invention, the sequence in which the first nucleotide sequence and the second nucleotide sequence undergo homologous recombination can be an intron sequence, for example, an intron sequence that does not naturally occur in the genomic sequence of the gene encoding the OTOF protein. In some specific embodiments, the intron is a synthetic alkaline phosphatase (AP) intron.

在一些实施方案中,本发明的双载体系统中使用的内含子序列可以包含剪接供体序列和剪接接受体序列。例如,双载体系统可以包含一个或多个AP内含子剪接体识别位点,例如一个或多个AP剪接受体(APSA)结构域或AP剪接供体(APSD)结构域。在一些示例性实施方案中,这些载体包含APSA和APSD。在一些实施方案中,第一载体包含APSA而第二载体包含APSD。在一些实施方案中,第一载体包含APSD而第二载体包含APSA。In some embodiments, the intron sequence used in the dual vector system of the present invention may comprise a splice donor sequence and a splice acceptor sequence. For example, the dual vector system may comprise one or more AP intron spliceosomal recognition sites, such as one or more AP splice acceptor (APSA) domains or AP splice donor (APSD) domains. In some exemplary embodiments, these vectors comprise both an APSA and an APSD. In some embodiments, the first vector comprises an APSA and the second vector comprises an APSD. In some embodiments, the first vector comprises an APSD and the second vector comprises an APSA.

在一些实施方案中,内含子序列是F1噬菌体的引起重组的AK序列,例如SEQ ID NO:6所示的序列,由此,通过AK序列介导同源重组以重构全长OTOF蛋白的表达盒。In some embodiments, the intron sequence is a recombination-inducing AK sequence of F1 phage, such as the sequence shown in SEQ ID NO: 6, thereby mediated by homologous recombination to reconstruct the expression cassette of the full-length OTOF protein.

剪接受体序列和剪接供体序列可以由本领域技术人员在已知的剪接受体序列和剪接供体序列中选择。剪接体内含子通常位于真核细胞的蛋白质编码基因的序列内。在内含子中,需要剪接供体位点(位于内含子的5’端)和剪接受体位点(位于内含子的3’端)来进行剪接。剪接供体位点在较大、较不高度保守的区域内的位于内含子5’端处包含几乎不变的序列GU。内含子3’端的剪接受体位点以几乎不变的AG序列终止该内含子。在AG的上游(5'方向),存在富嘧啶(C和U)区域或多聚嘧啶序列。Splice acceptor sequence and splice donor sequence can be selected by those skilled in the art from known splice acceptor sequence and splice donor sequence.Spliceosomal introns are usually located in the sequence of protein-coding genes of eukaryotic cells.In introns, splice donor site (located at the 5 ' end of the intron) and splice acceptor site (located at the 3 ' end of the intron) are required for splicing.The splice donor site is located at the 5 ' end of the intron in a larger, less highly conserved region and comprises an almost constant sequence GU.The splice acceptor site at the 3 ' end of the intron terminates the intron with an almost constant AG sequence.Upstream of AG (5 ' direction), there is a pyrimidine-rich (C and U) region or a polypyrimidine sequence.

在一些实施方案中,双载体系统中的所述剪接供体(SD)序列基本由SEQ ID NO:4所示的序列组成;所述剪接受体(SA)序列基本由SEQ ID NO:5所示的序列组成。In some embodiments, the splice donor (SD) sequence in the dual vector system basically consists of the sequence shown in SEQ ID NO: 4; the splice acceptor (SA) sequence basically consists of the sequence shown in SEQ ID NO: 5.

OTOF蛋白OTOF protein

在一些实施方案中,OTOF蛋白包含SEQ ID NO:1的氨基酸序列或由其组成。In some embodiments, the OTOF protein comprises or consists of the amino acid sequence of SEQ ID NO:1.

在一些实施方案中,在OTOF蛋白的氨基酸序列上设置切分位点,将OTOF蛋白切分为OTOF蛋白N端部分(本文中也简称为“OTOF的N端”)和OTOF蛋白C端部分(本文中也简称为“OTOF的C端”),OTOF的N端为OTOF氨基酸序列的N端至切分位点的序列,OTOF的C端是从紧邻切分位点的氨基酸残基到OTOF氨基酸序列的C端的序列。OTOF的切分位点有多种选择,例如,OTOF切分位点位于OTOF氨基酸序列中任一外显子1-47的3’末端氨基酸残基处。In some embodiments, a cleavage site is set in the amino acid sequence of the OTOF protein to divide the OTOF protein into the N-terminal portion of the OTOF protein (also referred to herein as the "N-terminus of OTOF") and the C-terminal portion of the OTOF protein (also referred to herein as the "C-terminus of OTOF"). The N-terminus of OTOF is the sequence from the N-terminus of the OTOF amino acid sequence to the cleavage site, and the C-terminus of OTOF is the sequence from the amino acid residue immediately adjacent to the cleavage site to the C-terminus of the OTOF amino acid sequence. There are multiple options for the cleavage site of OTOF, for example, the OTOF cleavage site is located at the 3' terminal amino acid residue of any exon 1-47 in the OTOF amino acid sequence.

在一些优选的实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,OTOF N端编码序列是SEQ ID NO:11或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:37所示的核酸序列,且OTOF C端编码序列是SEQ ID NO:24或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:50所示的核酸序列;In some preferred embodiments, in the dual-vector system for expressing the OTOF protein of the present invention, the OTOF N-terminal coding sequence is SEQ ID NO: 11 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 37, and the OTOF C-terminal coding sequence is SEQ ID NO: 24 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 50;

OTOF N端编码序列是SEQ ID NO:12或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:38所示的核酸序列,且OTOF C端编码序列是SEQ ID NO:25或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:51所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 12 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 38, and the OTOF C-terminal coding sequence is SEQ ID NO: 25 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 51;

OTOF N端编码序列是SEQ ID NO:13或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:39所示的核酸序列,且OTOF C端编码序列是SEQ ID NO:26或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:52所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 13 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 39, and the OTOF C-terminal coding sequence is SEQ ID NO: 26 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 52;

OTOF N端编码序列是SEQ ID NO:14或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:40所示的核酸序列,且OTOF C端编码序列是SEQ ID NO:27或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:53所示的核酸序列;或The OTOF N-terminal coding sequence is SEQ ID NO: 14 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 40, and the OTOF C-terminal coding sequence is SEQ ID NO: 27 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 53; or

OTOF N端编码序列是SEQ ID NO:20或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:46所示的核酸序列,且OTOF C端编码序列是SEQ ID NO:33或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:59所示的核酸序列。The OTOF N-terminal coding sequence is SEQ ID NO: 20 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 46, and the OTOF C-terminal coding sequence is SEQ ID NO: 33 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 59.

载体质粒vector plasmid

本发明的载体质粒可以是任何能够在宿主细胞中复制并表达相应多肽的质粒。The vector plasmid of the present invention can be any plasmid that can replicate in a host cell and express a corresponding polypeptide.

在一些实施方案中,载体质粒包含两个ITR序列,分别为5’反向末端重复(5’ITR)序列和3’反向末端重复(3’ITR)序列。In some embodiments, the vector plasmid comprises two ITR sequences, namely a 5' inverted terminal repeat (5'ITR) sequence and a 3' inverted terminal repeat (3'ITR) sequence.

在一些实施方案中,在本发明的用于表达OTOF蛋白的双载体系统中,将第一核苷酸序列插入在包含两个第一ITR序列的质粒中,将第二核苷酸序列插入在包含两个第二ITR序列的质粒中,例如,所述包含两个第一ITR序列的质粒和所述包含两个第二ITR序列的质粒相同或不同,例如,所述质粒是pAAV、pAAV-CMV、pX601、pX551或pAAV-MCS质粒。In some embodiments, in the dual-vector system for expressing OTOF protein of the present invention, the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences, and the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences, for example, the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid.

双载体系统Dual vector system

本发明提供了一种双载体系统,其包含第一核酸载体和第二核酸载体,其中:The present invention provides a dual vector system comprising a first nucleic acid vector and a second nucleic acid vector, wherein:

第一核酸载体以5’-3’方向包含:5’反向末端重复(5’ITR)序列、启动子、OTOF的N端编码序列、剪接供体(SD)序列、与第二核苷酸序列发生同源重组的序列、polyA和3’反向末端重复(3’ITR)序列;The first nucleic acid vector comprises, in 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a promoter, an N-terminal coding sequence of OTOF, a splice donor (SD) sequence, a sequence for homologous recombination with the second nucleotide sequence, polyA, and a 3' inverted terminal repeat (3'ITR) sequence;

第二核酸载体以5’-3’方向包含:5’ITR序列、启动子、与第一核苷酸序列发生同源重组的序列、剪接受体(SA)序列、OTOF的C端编码序列、polyA和3’ITR序列,且The second nucleic acid vector comprises, in 5'-3' direction: a 5'ITR sequence, a promoter, a sequence for homologous recombination with the first nucleotide sequence, a splicing acceptor (SA) sequence, a C-terminal coding sequence of OTOF, polyA and a 3'ITR sequence, and

任选地在将所述第一核酸载体和所述第二核酸载体导入宿主细胞后,所述OTOF蛋白N端编码序列和OTOF蛋白C端编码序列在转录后有效连接,经翻译产生OTOF蛋白。Optionally, after the first nucleic acid vector and the second nucleic acid vector are introduced into a host cell, the OTOF protein N-terminal coding sequence and the OTOF protein C-terminal coding sequence are operably linked after transcription, and the OTOF protein is produced through translation.

在一些实施方案中,双载体系统中的所述ITR的核苷酸序列分别衍生自同一AAV血清型或不同AAV血清型,例如,AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8或AAV9血清型。在一些实施方案中,所述第一核酸载体的5’-ITR和3’-ITR和所述第二核酸载体的5’-ITR和3’-ITR衍生自同一AAV血清型。在一些实施方案中,所述第一核酸载体的5’-ITR和3’-ITR和所述第二核酸载体的5’-ITR和3’-ITR分别来自不同AAV血清型。In some embodiments, the nucleotide sequences of the ITRs in the dual vector system are derived from the same AAV serotype or different AAV serotypes, for example, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9 serotypes. In some embodiments, the 5'-ITR and 3'-ITR of the first nucleic acid vector and the 5'-ITR and 3'-ITR of the second nucleic acid vector are derived from the same AAV serotype. In some embodiments, the 5'-ITR and 3'-ITR of the first nucleic acid vector and the 5'-ITR and 3'-ITR of the second nucleic acid vector are derived from different AAV serotypes.

在一些实施方案中,双载体系统中使用了组织特异性启动子。例如,在耳中介导表达的启动子,例如,突触蛋白启动子或GFAP启动子。In some embodiments, a tissue-specific promoter is used in a two-vector system, for example, a promoter that mediates expression in the ear, such as the synapsin promoter or the GFAP promoter.

在一些实施方案中,双载体系统中使用了以下启动子中的任一种:巨细胞病毒(CMV)启动子、SV40启动子、劳斯肉瘤病毒(RSV)启动子、CAG启动子、嵌合CMV/鸡β肌动蛋白(CBA)启动子、截短形式的CBA(smCBA)启动子、UbC启动子、SFFV启动子、EF1α启动子、PGK启动子、或Myo7A、Myo15、Atoh1、POU4F3、Lhx3、Myo6、α9AchR、α10AchR、OTOF和STRC编码基因的启动子。在一些实施方案中,启动子是CMV启动子。In some embodiments, any one of the following promoters is used in the binary vector system: cytomegalovirus (CMV) promoter, SV40 promoter, Rous sarcoma virus (RSV) promoter, CAG promoter, chimeric CMV/chicken beta actin (CBA) promoter, truncated CBA (smCBA) promoter, UbC promoter, SFFV promoter, EF1α promoter, PGK promoter, or promoters of Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, α9AchR, α10AchR, OTOF and STRC encoding genes. In some embodiments, the promoter is a CMV promoter.

本发明的双载体系统还可以包含一个或多个可以在转录之前或转录之后起作用的其他调控序列。调控序列可以是天然转基因基因座的一部分或者可以是异源的调控序列。本发明的双载体系统中可以包含天然转基因转录物的5'UTR或3'UTR的一部分。The dual vector system of the present invention may also include one or more additional regulatory sequences that can function before or after transcription. The regulatory sequences may be part of the native transgenic locus or may be heterologous regulatory sequences. A portion of the 5'UTR or 3'UTR of the native transgenic transcript may be included in the dual vector system of the present invention.

调控序列可以是促进转基因表达的任何序列,即用于增加转录物表达,改善mRNA的核输出或增强其稳定性。这种调控序列包括例如增强子元件、转录后调控元件和多聚腺苷酸化序列。The regulatory sequence may be any sequence that promotes transgene expression, i.e., serves to increase transcript expression, improve nuclear export of mRNA, or enhance its stability. Such regulatory sequences include, for example, enhancer elements, post-transcriptional regulatory elements, and polyadenylation sequences.

增强子是影响同一分子DNA上基因转录的顺式调控元件。增强子可以位于它们调节的基因的上游、下游、内含子内,或甚至相对远离它们调节的基因。Enhancers are cis-regulatory elements that affect the transcription of genes on the same molecule of DNA. Enhancers can be located upstream, downstream, within introns, or even relatively far from the genes they regulate.

在本发明的双载体系统中使用的优选转录后调控元件是土拨鼠肝炎转录后调控元件(WPRE)或其变体。与无WPRE或其变体的AAV载体相比,包含WPRE或其变体的AAV载体增加OTOF蛋白的表达。The preferred post-transcriptional regulatory element used in the dual vector system of the present invention is the woodchuck hepatitis post-transcriptional regulatory element (WPRE) or a variant thereof. Compared with an AAV vector without WPRE or a variant thereof, an AAV vector containing WPRE or a variant thereof increases the expression of OTOF protein.

在一个实施方案中,本发明的双载体系统包含具有SEQ ID NO:61所示的WPRE核苷酸序列。在另一个实施方案中,本发明的双载体系统包含与SEQ ID NO:61所示的WPRE核苷酸序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列的转录后调控元件,其中该核苷酸序列基本上保留了如SEQ ID NO:61所示的转录后调控元件的功能性活性,例如,WPRE的截短变体。缩小AAV基因组的尺寸使得能够增加除了转基因以外在载体中引入其他调控元件的灵活性。在一个实施方案中,WPRE的截短变体具有SEQ ID NO:64所示的WPRE3核苷酸序列。In one embodiment, the dual vector system of the present invention comprises a WPRE nucleotide sequence as set forth in SEQ ID NO:61. In another embodiment, the dual vector system of the present invention comprises a post-transcriptional regulatory element having a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to the WPRE nucleotide sequence as set forth in SEQ ID NO:61, wherein the nucleotide sequence substantially retains the functional activity of the post-transcriptional regulatory element as set forth in SEQ ID NO:61, for example, a truncated variant of WPRE. Reducing the size of the AAV genome enables increased flexibility in introducing other regulatory elements into the vector in addition to transgenes. In one embodiment, the truncated variant of WPRE has the WPRE3 nucleotide sequence as set forth in SEQ ID NO:64.

在一个实施方案中,本发明的双载体系统包含多聚腺苷酸化序列,例如,牛生长激素多聚腺苷酸化序列、SV40多聚腺苷酸化序列和/或SV40晚期多聚腺苷酸化序列。在一个实施方案中,本发明的双载体系统包含与SEQ ID NO:62所示的核苷酸序列具有至少90%、95%、96%、97%、98%、99%或100%同一性的SV40多聚腺苷酸化序列。在一个实施方案中,本发明的双载体系统包含与SEQ ID NO:65所示的核苷酸序列具有至少90%、95%、96%、97%、98%、99%或100%同一性的SV40晚期多聚腺苷酸化序列。In one embodiment, the dual vector system of the present invention comprises a polyadenylation sequence, for example, a bovine growth hormone polyadenylation sequence, an SV40 polyadenylation sequence, and/or an SV40 late polyadenylation sequence. In one embodiment, the dual vector system of the present invention comprises an SV40 polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 62. In one embodiment, the dual vector system of the present invention comprises an SV40 late polyadenylation sequence that is at least 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the nucleotide sequence set forth in SEQ ID NO: 65.

在一些实施方案中,本发明的双载体系统包含WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合,例如,本发明的双载体系统具有SEQ ID NO:60所示的核苷酸序列或与SEQ ID NO:60具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合使得可以高水平表达转基因。In some embodiments, the dual vector system of the present invention comprises a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence. For example, the dual vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 60, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 60. The combination of the WPRE nucleotide sequence and the SV40 polyadenylation sequence allows for high-level expression of the transgene.

在一些实施方案中,本发明的双载体系统包含WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合,例如,本发明的双载体系统具有SEQ ID NO:63所示的核苷酸序列或与SEQ ID NO:63具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合(也简称为“W3SL”)可以高效率表达更大的外源基因,且占用AAV更少的包装容量。In some embodiments, the dual vector system of the present invention comprises a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence. For example, the dual vector system of the present invention has the nucleotide sequence set forth in SEQ ID NO: 63, or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 63. The combination of the WPRE3 nucleotide sequence and the SV40 late polyadenylation sequence (also referred to as "W3SL") can efficiently express larger exogenous genes while occupying less AAV packaging capacity.

本发明通过使用所述双载体系统,将OTOF编码基因分成两个部分提供至内耳细胞、内毛细胞或外毛细胞,在那里转录的OTOF N端mRNA和OTOF C端mRNA发生反式剪接,形成全长OTOF mRNA,通过翻译实现全长OTOF蛋白的表达。本发明证实了,本发明的用于表达OTOF蛋白的双载体系统能够有效地转导所靶向的内耳细胞、内毛细胞或外毛细胞,在所述细胞中产生OTOF蛋白,并以持久的方式恢复因OTOF基因敲除而产生的听力损伤。The present invention uses the dual-vector system to deliver the OTOF-encoding gene in two parts to inner ear cells, inner hair cells, or outer hair cells. The transcribed OTOF N-terminal mRNA and OTOF C-terminal mRNA undergo trans-splicing to form full-length OTOF mRNA, which is then translated to express the full-length OTOF protein. The present invention demonstrates that the dual-vector system for expressing the OTOF protein can effectively transduce the targeted inner ear cells, inner hair cells, or outer hair cells, producing the OTOF protein in these cells and durably restoring hearing loss caused by OTOF gene knockout.

在优选的实施方案中,本发明的双载体系统允许表达氨基酸序列与SEQ ID NO:1具有至少70%的同一性和/或相似性的同源多肽。所述同源序列更优选地与SEQ ID NO:1具有至少75%、甚至更优选至少80%、或至少90%、至少91%、至少92%、至少93%、至少94%、至少95%、至少96%、至少97%、至少99%、至少99%的同一性和/或相似性。当同源多肽比SEQ ID NO:1短得多时,可以考虑局部比对。In a preferred embodiment, the dual vector system of the present invention allows for the expression of homologous polypeptides having an amino acid sequence that is at least 70% identical and/or similar to SEQ ID NO: 1. More preferably, the homologous sequence has at least 75%, even more preferably at least 80%, or at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 99%, or at least 99% identity and/or similarity to SEQ ID NO: 1. When the homologous polypeptide is much shorter than SEQ ID NO: 1, local alignment may be considered.

在另一个实施方案中,本发明的双载体系统可以允许表达OTOF蛋白多肽的功能片段。术语“功能片段”本文是指保留目的OTOF蛋白多肽的至少一种生物学功能的任何片段。In another embodiment, the dual vector system of the present invention can allow for the expression of functional fragments of an OTOF protein polypeptide. The term "functional fragment" herein refers to any fragment that retains at least one biological function of an OTOF protein polypeptide of interest.

可以使用本发明所述的双载体系统转化宿主细胞而获得全长OTOF蛋白。在一些实施方案中,所述宿主细胞选自Hela-S3细胞、HEK-293细胞、HEK-293T细胞、HEK-293FT细胞、A549细胞和Sf9细胞。The full-length OTOF protein can be obtained by transforming host cells using the dual vector system of the present invention. In some embodiments, the host cells are selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells and Sf9 cells.

III.双载体系统的用途III. Uses of the Dual Vector System

本发明的双载体系统用于施与患有DFNB9耳聋的患者。“患有DFNB9耳聋的患者”是指这样的患者、特别是人患者,其被认为具有(或已被诊断为具有)在编码OTOF蛋白的基因中的突变,所述突变触发OTOF蛋白的异常表达、异常功能或两者都有。The dual-vector system of the present invention is used to administer to patients suffering from DFNB9-induced deafness. "Patient suffering from DFNB9-induced deafness" refers to a patient, particularly a human patient, who is believed to have (or has been diagnosed with) a mutation in a gene encoding an OTOF protein that triggers abnormal expression, abnormal function, or both of the OTOF protein.

在一些实施方案中,本发明的双载体系统为双AAV载体系统。在一些实施方案中,双AAV载体系统中的第一AAV载体和第二AAV载体为各自具有相同或不同的AAV来源的衣壳的载体,例如,双AAV载体系统中的第一AAV载体和第二AAV载体为各自具有AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8、AAV9、AAV10、AAV-Anc80衣壳的载体或具有嵌合衣壳的AAV载体,特别是具有AAV-Anc80衣壳的AAV载体。优选地,使用合成的腺相关病毒载体Anc80L65,其已被证明具有迄今为止报道的内耳毛细胞的最高转导效率(Suzuki等人,Sci.Rep.7:45524(2017))。In some embodiments, the dual vector system of the present invention is a dual AAV vector system. In some embodiments, the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having a capsid of the same or different AAV origin, for example, the first AAV vector and the second AAV vector in the dual AAV vector system are vectors each having an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV-Anc80 capsid or an AAV vector with a chimeric capsid, in particular an AAV vector with an AAV-Anc80 capsid. Preferably, the synthetic adeno-associated virus vector Anc80L65 is used, which has been shown to have the highest transduction efficiency of inner ear hair cells reported to date (Suzuki et al., Sci. Rep. 7: 45524 (2017)).

双AAV载体系统在向受试者施用后,通过增加野生型OTOF蛋白的表达或向受试者提供野生型OTOF蛋白来治疗OTOF突变相关常染色体隐性遗传性耳聋9(DFNB9)疾病。The dual AAV vector system, after administration to a subject, treats OTOF mutation-associated autosomal recessive deafness 9 (DFNB9) disease by increasing the expression of wild-type OTOF protein or providing wild-type OTOF protein to the subject.

本发明的双载体系统在向受试者施用后,可以触发在内耳细胞、内毛细胞或外毛细胞中全长OTOF蛋白多肽、或其功能片段的表达。After administration to a subject, the dual-vector system of the present invention can trigger the expression of the full-length OTOF protein polypeptide, or a functional fragment thereof, in inner ear cells, inner hair cells, or outer hair cells.

本发明的双载体系统施与的患者优选为新生人婴儿,通常小于6月龄,甚至小于3月龄(如果他们在幼时被诊断出来患DFNB9耳聋)。这些人婴儿更优选在3月龄和1岁之间。The patients to whom the dual vector system of the present invention is administered are preferably newborn human infants, usually less than 6 months old, or even less than 3 months old (if they were diagnosed with DFNB9 deafness in childhood). These human infants are more preferably between 3 months and 1 year old.

还可以将本发明的双载体系统施用于例如幼儿(2-6岁)、儿童(6-12岁)、青少年(12-18岁)或成人(18岁及以上)。The two-vector system of the present invention can also be administered to, for example, infants (2-6 years), children (6-12 years), adolescents (12-18 years), or adults (18 years and older).

如本文所用,术语“治疗”旨在表示将治疗有效量的本发明的双载体系统施用至患有DFNB9耳聋的患者,以便部分地或完全恢复所述患者的听力。通过使用电生理设备测试听觉脑干反应(ABR)可以评估所述恢复。“治疗OTOF突变诱发的听力损伤”特别旨在指听力功能的完全恢复。术语“预防”指减少或延迟听觉频率范围内的听力丧失。As used herein, the term "treating" is intended to mean administering a therapeutically effective amount of the dual vector system of the present invention to a patient suffering from DFNB9 deafness to partially or completely restore the patient's hearing. Such restoration can be assessed by testing auditory brainstem responses (ABRs) using electrophysiological equipment. "Treatment of hearing loss induced by OTOF mutations" is specifically intended to refer to complete restoration of hearing function. The term "preventing" refers to reducing or delaying hearing loss within the auditory frequency range.

实施例Example

下面对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明的一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。如无特殊说明,实施例中涉及的各种反应试剂均可以通过商业渠道购买得到。The technical solutions in the embodiments of the present invention are clearly and completely described below. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all of the embodiments. Based on the embodiments in the present invention, all other embodiments obtained by those of ordinary skill in the art without making creative work are within the scope of protection of the present invention. Unless otherwise specified, the various reaction reagents involved in the embodiments can be purchased through commercial channels.

实施例1 OTOF全长蛋白切分位点的筛选和在HEK-293T细胞中的表达Example 1 Screening of OTOF full-length protein cleavage sites and expression in HEK-293T cells

本实施例的双载体系统如图3和图4或图5所示,其中第一核酸载体(图3)以5’-3’方向包含:5’反向末端重复(5’ITR)序列、CMV启动子序列、含有编码OTOF蛋白N端(5’Otof NT)部分的cDNA序列、剪接供体(SD)序列、与第二核苷酸序列发生同源重组的拼接序列HD(例如,AK序列,为来自f1噬菌体的重组引发性序列)和3’反向末端重复(3’ITR)序列;第二核酸载体(图4或图5)以5’-3’方向包含:5’ITR序列、CMV启动子序列、与第一核苷酸序列发生同源重组的拼接序列HD(例如,AK序列)、剪接受体(SA)序列、含有编码OTOF蛋白C端(3’Otof CT)部分的cDNA序列、土拨鼠肝炎病毒转录后调节元件(WPRE)序列(SEQ ID NO:61)、SV40 PolyA序列(SEQ ID NO:62)和3’ITR序列。第一核酸载体和第二核酸载体均还含有Ori序列及抗性序列。图1显示了使用反向末端重复序列(ITR)、CMV启动子、剪接供体SD信号序列、剪接受体SA信号序列、用于同源重组拼接的HD序列依赖于反式剪接机制同源重组的双载体介导的OTOF蛋白表达示意图。The dual vector system of this embodiment is shown in Figures 3 and 4 or 5, wherein the first nucleic acid vector (Figure 3) comprises in the 5'-3' direction: a 5' inverted terminal repeat (5'ITR) sequence, a CMV promoter sequence, a cDNA sequence containing the N-terminal portion (5'Otof NT) encoding the OTOF protein, a splicing donor (SD) sequence, a splicing sequence HD (for example, an AK sequence, which is a recombination-initiating sequence from f1 phage) that undergoes homologous recombination with the second nucleotide sequence, and a 3' inverted terminal repeat (3'ITR) sequence; the second nucleic acid vector (Figure 3) comprises: a 5' inverted terminal repeat (5'ITR) sequence, a CMV promoter sequence, a cDNA sequence containing the N-terminal portion (5'Otof NT) encoding the OTOF protein, a splicing donor (SD) sequence, a splicing sequence HD that undergoes homologous recombination with the second nucleotide sequence (for example, an AK sequence, which is a recombination-initiating sequence from f1 phage), and a 3' inverted terminal repeat (3'ITR) sequence; The nucleic acid vector (Figure 4 or Figure 5) contains, in the 5'-3' direction: a 5'ITR sequence, a CMV promoter sequence, a splicing sequence HD (e.g., an AK sequence) for homologous recombination with the first nucleotide sequence, a splice acceptor (SA) sequence, a cDNA sequence encoding the C-terminus (3'Otof CT) of the OTOF protein, a woodchuck hepatitis virus post-transcriptional regulatory element (WPRE) sequence (SEQ ID NO: 61), an SV40 PolyA sequence (SEQ ID NO: 62), and a 3'ITR sequence. Both the first and second nucleic acid vectors also contain an Ori sequence and a resistance sequence. Figure 1 shows a schematic diagram of dual-vector-mediated OTOF protein expression using an inverted terminal repeat (ITR), a CMV promoter, a splice donor SD signal sequence, a splice acceptor SA signal sequence, and an HD sequence for homologous recombination splicing, which relies on a trans-splicing mechanism for homologous recombination.

双载体系统中的第一核酸载体和第二核酸载体是在图2所示的pAAV-CMV-EGFP-WPRE-SV40质粒骨架上改造的。The first nucleic acid vector and the second nucleic acid vector in the dual-vector system were modified based on the pAAV-CMV-EGFP-WPRE-SV40 plasmid backbone shown in FIG2 .

具体地,使用EcoRI和EcoRV双酶切将pAAV-CMV-EGFP-WPRE-SV40质粒中的EGFP报告基因序列替换为包含编码5’Otof NT的核苷酸序列、SD序列和AK序列的目的序列,获得第一核酸载体pAAV-CMV-Otof-N-SD-AK。使用EcoRI和EcoRV双酶切将pAAV-CMV-EGFP-WPRE-SV40质粒中的EGFP报告基因序列替换为包含AK序列、SA序列和编码3’Otof CT的核苷酸序列的目的序列,获得第二核酸载体pAAV-CMV-AK-SA-Otof-C-WPRE-SV40。所述目的序列合成及载体构建均委托南京金斯瑞生物科技有限公司完成。Specifically, the EGFP reporter gene sequence in the pAAV-CMV-EGFP-WPRE-SV40 plasmid was replaced with a target sequence comprising a nucleotide sequence encoding the 5' Otof NT, SD sequence, and AK sequence using EcoRI and EcoRV double digestion, resulting in the first nucleic acid vector pAAV-CMV-Otof-N-SD-AK. The EGFP reporter gene sequence in the pAAV-CMV-EGFP-WPRE-SV40 plasmid was replaced with a target sequence comprising an AK sequence, SA sequence, and a nucleotide sequence encoding the 3' Otof CT using EcoRI and EcoRV double digestion, resulting in the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40. The target sequence synthesis and vector construction were commissioned to Nanjing GenScript Biotechnology Co., Ltd.

使用双载体AAV递送进行OTOF突变的耳聋恢复是OTOF突变耳聋治疗的重要方法。但是,使用双载体AAV递送Otof基因的效率是一个难题,其中双载体中的序列重组是一个重要步骤。为了获得反式剪接和重组效率提高的双载体AAV,对OTOF蛋白进行了表1A所示的13组切分,分别获得了13组5’Otof NT(表1B中也简称为“N-Otof”)和3’Otof CT(表1B中也简称为“C-Otof”)。另外,也对OTOF蛋白的编码序列进行了密码子优化,获得了SEQ ID NO:3所示的序列。The use of dual-vector AAV delivery for the restoration of deafness caused by OTOF mutation is an important method for the treatment of deafness caused by OTOF mutation. However, the efficiency of delivering the Otof gene using dual-vector AAV is a difficult problem, in which sequence recombination in the dual vectors is an important step. In order to obtain dual-vector AAV with improved trans-splicing and recombination efficiency, the OTOF protein was divided into 13 groups as shown in Table 1A, and 13 groups of 5’Otof NT (also referred to as “N-Otof” in Table 1B) and 3’Otof CT (also referred to as “C-Otof” in Table 1B) were obtained. In addition, the coding sequence of the OTOF protein was also codon-optimized to obtain the sequence shown in SEQ ID NO: 3.

表1A.全长OTOF蛋白(SEQ ID NO:1)经切分位点切分的OTOF N端和OTOF C端
Table 1A. OTOF N-terminus and OTOF C-terminus of the full-length OTOF protein (SEQ ID NO: 1) cleaved by the cleavage site

表1B.编码切分的OTOF N端和OTOF C端的核苷酸序列

Table 1B. Nucleotide sequences encoding the cleaved OTOF N-terminus and OTOF C-terminus

其中,第1-13组的密码子优化后的N-Otof序列分别如SEQ ID NO:34-46所示,分别插入图2所示的载体骨架元件,获得第一核酸载体;第1-13组的密码子优化后的C-Otof序列分别如SEQ ID NO:47-59所示,分别插入图2所示的载体骨架元件,获得第二核酸载体。Among them, the codon-optimized N-Otof sequences of groups 1-13 are shown in SEQ ID NO: 34-46, respectively, which are inserted into the vector backbone elements shown in Figure 2 to obtain the first nucleic acid vector; the codon-optimized C-Otof sequences of groups 1-13 are shown in SEQ ID NO: 47-59, respectively, which are inserted into the vector backbone elements shown in Figure 2 to obtain the second nucleic acid vector.

将获得的第一核酸载体pAAV-CMV-Otof-N-SD-AK和第二核酸载体pAAV-CMV-AK-SA-Otof-C-WPRE-SV40按照相同摩尔比共转染HEK-293T细胞(获自中国科学院细胞库,SCSP-502),转染48小时后通过蛋白质印迹分析全长OTOF的表达。具体实验方法如下。The obtained first nucleic acid vector pAAV-CMV-Otof-N-SD-AK and the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40 were co-transfected into HEK-293T cells (obtained from the Chinese Academy of Sciences Cell Bank, SCSP-502) at the same molar ratio. The expression of full-length OTOF was analyzed by Western blotting 48 hours after transfection. The specific experimental method is as follows.

细胞转染:接种HEK-293T细胞(为人胚肾(Human Embryonic Kidney)293T细胞,下文中也缩写为“293T细胞”)至70-90%密度时准备转染。配制A管:125μL无血清DMEM培养基+8μL Lipofectamine 3000试剂(Invitrogen,目录号:L3000015),充分混匀。配制B管:125μL无血清DMEM培养基+2μg第一核酸载体pAAV-CMV-Otof-N-SD-AK+2μg第二核酸载体pAAV-CMV-AK-SA-Otof-C-WPRE-SV40+8μL P3000试剂(Invitrogen,目录号:L3000015),充分混匀。将B管混合物加到A管中,轻轻充分混匀,室温静置10-15min。所述A管是培养基和Lipofectamine 3000转染试剂的混合物,所述B管是培养基和核酸载体DNA及转染加强剂P3000的混合物。向293T细胞添加所获得的DNA-脂质体复合物进行转染,37℃、95%空气和5%的CO2孵育,转染48小时后离心收获细胞。Cell transfection: Seed HEK-293T cells (human embryonic kidney 293T cells, hereinafter referred to as "293T cells") to 70-90% density and prepare for transfection. Prepare tube A: 125 μL serum-free DMEM medium + 8 μL Lipofectamine 3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly. Prepare tube B: 125 μL serum-free DMEM medium + 2 μg of the first nucleic acid vector pAAV-CMV-Otof-N-SD-AK + 2 μg of the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40 + 8 μL P3000 reagent (Invitrogen, catalog number: L3000015) and mix thoroughly. Add the mixture in tube B to tube A, mix gently and thoroughly, and let it stand at room temperature for 10-15 minutes. Tube A contains a mixture of culture medium and Lipofectamine 3000 transfection reagent, while Tube B contains a mixture of culture medium, nucleic acid carrier DNA, and transfection enhancer P3000. The resulting DNA-liposome complex was added to 293T cells for transfection, and the cells were incubated at 37°C in 95% air and 5% CO₂ . Cells were harvested by centrifugation 48 hours after transfection.

将细胞转染后离心收获的细胞沉淀在适量RIPA裂解液(Thermo Fisher Scientific,目录号:89900)(添加有1%蛋白酶抑制剂cocktail(Thermo Fisher Scientific,目录号:87786),1%PMSF)里充分重悬,冰上裂解30min,期间每10min涡旋一下,使细胞沉淀充分重悬于裂解液中。12000rpm离心15min收集上清(不要吸到沉淀),使用5X上样缓冲液(Loading Buffer)按比例加入样品,75℃煮15min,冰上冷却,离心沉降后,取上清进行Western印迹分析。After cell transfection, the cell pellet harvested by centrifugation was fully resuspended in an appropriate amount of RIPA lysis buffer (Thermo Fisher Scientific, catalog number: 89900) (supplemented with 1% protease inhibitor cocktail (Thermo Fisher Scientific, catalog number: 87786), 1% PMSF), and lysed on ice for 30 minutes. Vortex every 10 minutes during this period to fully resuspend the cell pellet in the lysis buffer. Centrifuge at 12000rpm for 15 minutes to collect the supernatant (do not aspirate the precipitate), use 5X loading buffer (Loading Buffer) to add the sample in proportion, boil at 75℃ for 15 minutes, cool on ice, and after centrifugation, take the supernatant for Western blotting analysis.

Western印迹检测OTOF蛋白表达:洗净玻璃板,将玻璃板平整固定在架子上,凹面朝里夹紧,前后两块对称放置。配分离胶,用异丙醇液封,0.5h后将异丙醇倒掉,侧置用泵吸干。配浓缩胶,加到溢出为止,插梳子。45min后,将胶板取下,装到电泳槽的夹子上,从槽中心加入运行缓冲液,直到运行缓冲液溢出到槽体积的1/2处,用力小心拔出梳子。上样,100v恒压1h进行电泳和冰上300mA恒流转膜90min。Western blotting for OTOF protein expression: Wash the glass plates and secure them flat on a rack, clamping them with the concave surface facing inward. Position the plates symmetrically, front and back. Prepare separating gel and seal with isopropanol. After 0.5 hours, discard the isopropanol and place on its side with a pump to dry. Prepare stacking gel, adding until overflowing, and insert a comb. After 45 minutes, remove the gel plate and attach it to the clamps in the electrophoresis tank. Add running buffer from the center of the tank until it overflows to 1/2 of the tank volume. Carefully remove the comb. Load the sample, perform electrophoresis at a constant voltage of 100V for 1 hour, and transfer the membrane at a constant current of 300mA on ice for 90 minutes.

将转膜完成后的PVDF膜取出,使用封闭液(5%脱脂奶粉,使用TBST缓冲液配制)室温孵育1小时。将封闭完成后的PVDF膜加入第一抗体(HA-Tag Mouse mAb,Cell Signaling Technology(CST),目录号:6E2;β-Actin Mouse mAb,Cell Signaling,目录号:3700S)置于4℃孵育过夜。使用HA抗体作为第一抗体来检测OTOF全长蛋白的表达,使用β肌动蛋白抗体作为第一抗体来检测Western印迹的内参蛋白β肌动蛋白水平。β肌动蛋白的蛋白水平通常不会发生改变,因此能够用于检测Western印迹时上样量是否一致。After transfer, remove the PVDF membrane and incubate it with blocking solution (5% skim milk powder, prepared with TBST buffer) at room temperature for 1 hour. Add the primary antibody (HA-Tag Mouse mAb, Cell Signaling Technology (CST), catalog number: 6E2; β-Actin Mouse mAb, Cell Signaling, catalog number: 3700S) to the blocked PVDF membrane and incubate it at 4°C overnight. Use HA antibody as the primary antibody to detect the expression of OTOF full-length protein, and use β-actin antibody as the primary antibody to detect the level of β-actin, an internal reference protein in Western blotting. The protein level of β-actin usually does not change, so it can be used to check whether the sample amount is consistent during Western blotting.

次日取出孵育好的PVDF膜,1X TBST漂洗3次,每次5min,使用第二抗体(HRP-conjugated Affinipure Goat Anti-Mouse IgG(H+L),Proteintech,目录号:SA00001-1)室温孵育1小时,1X TBST漂洗3次,每次5min;然后加入化学发光试剂(ECL)在暗室中进行显影。The next day, the incubated PVDF membrane was removed and rinsed three times with 1X TBST for 5 minutes each time. The membrane was incubated with the secondary antibody (HRP-conjugated Affinipure Goat Anti-Mouse IgG (H+L), Proteintech, catalog number: SA00001-1) at room temperature for 1 hour and rinsed three times with 1X TBST for 5 minutes each time. The chemiluminescent reagent (ECL) was then added for development in a dark room.

Western印迹分析结果如图6所示。在图6中,泳道NC:对照(未转染双载体质粒的293T细胞);泳道1-13:分别对应转染表1B中第1-13组的双载体质粒。由图6的Western印迹结果可见,使用OTOF全长序列的各切分位点构建的双载体质粒系统在转染细胞后,均能够获得全长OTOF蛋白的表达,且第4-7组、13组的双载体质粒系统在转染细胞并在细胞内重组后,OTOF全长蛋白的表达量有明显提高。The results of Western blot analysis are shown in Figure 6. In Figure 6, lane NC: control (293T cells not transfected with the dual-vector plasmid); lanes 1-13: correspond to cells transfected with the dual-vector plasmids from groups 1-13 in Table 1B, respectively. The Western blot results in Figure 6 demonstrate that the dual-vector plasmid systems constructed using the various cleavage sites of the full-length OTOF sequence were able to express the full-length OTOF protein upon transfection. Furthermore, the dual-vector plasmid systems from groups 4-7 and 13 significantly increased the expression of the full-length OTOF protein upon transfection and intracellular recombination.

实施例2.腺相关病毒的制备Example 2. Preparation of adeno-associated virus

在本实施例中,制备了用于感染小鼠的成对腺相关病毒。In this example, paired adeno-associated viruses were prepared for infection of mice.

具体地,使用实施例1构建的全长OTOF蛋白的第4组切分位点对应的2个质粒(第一核酸载体质粒和第二核酸载体质粒)和第7组切分位点对应的2个质粒(第一核酸载体质粒和第二核酸载体质粒)分别在HEK-293T细胞中制备4种腺相关病毒(AAV),并配对为2对双AAV病毒。具体方法如下。病毒包装:准备10盘150mm平皿的HEK-293T细胞:细胞密度为80%-90%汇片。配制A管:4880μL无血清培养基DMEM+120μL PEI转染试剂,充分混匀。配制B管:4958μL无血清培养基DMEM+pAnc80L65质粒(GenBank:KT235804.1,提供Anc80L65的衣壳蛋白)(Addgene plasmid#68837)15μL+pHelper质粒(GenBank:AF369965.1)15μL+实施例1构建的第一核酸载体pAAV-CMV-Otof-N-SD-AK或无HA标签的第二核酸载体pAAV-CMV-AK-SA-Otof-C-WPRE-SV40多聚(A)12μL,充分混匀。将A管中液体加至B管中,逐滴加入轻轻混匀,室温静置20-25分钟将混合液加入至准备好的HEK-293T细胞中,每盘加入1ml DNA-脂质体复合物(可批量配制)。12h后换液,48h后收集上清至无菌瓶中,4℃保存。加上新鲜的培养基继续培养48h后将细胞及上清收集至前述无菌瓶中。Specifically, two plasmids (a first nucleic acid vector plasmid and a second nucleic acid vector plasmid) corresponding to the 4th group of cleavage sites of the full-length OTOF protein constructed in Example 1 and two plasmids (a first nucleic acid vector plasmid and a second nucleic acid vector plasmid) corresponding to the 7th group of cleavage sites were used to prepare four adeno-associated viruses (AAVs) in HEK-293T cells, respectively, and paired into two pairs of double AAV viruses. The specific method is as follows. Virus packaging: prepare 10 plates of 150mm flat dishes of HEK-293T cells: the cell density is 80%-90% confluence. Prepare tube A: 4880μL serum-free culture medium DMEM + 120μL PEI transfection reagent, mix thoroughly. Prepare tube B: 4958 μL serum-free DMEM medium + pAnc80L65 plasmid (GenBank: KT235804.1, providing Anc80L65 capsid protein) (Addgene plasmid #68837) 15 μL + pHelper plasmid (GenBank: AF369965.1) 15 μL + the first nucleic acid vector pAAV-CMV-Otof-N-SD-AK constructed in Example 1 or the second nucleic acid vector pAAV-CMV-AK-SA-Otof-C-WPRE-SV40 poly (A) without HA tag 12 μL, and mix thoroughly. Add the liquid in tube A to tube B, add dropwise and mix gently, let stand at room temperature for 20-25 minutes, add the mixture to the prepared HEK-293T cells, and add 1 ml DNA-liposome complex to each plate (can be prepared in batches). After 12 hours, the medium was changed and after 48 hours, the supernatant was collected into a sterile bottle and stored at 4°C. Fresh medium was added and cultured for another 48 hours before the cells and supernatant were collected into the sterile bottle mentioned above.

AAV纯化:收集上清和细胞后,利用循环冷热交替方法裂解细胞,将细胞交替放置于干冰预冷的乙醇和37℃水浴三次,然后进行4℃离心,1167g,15min;然后去除基因组和质粒DNA,并将上清转移至50mL离心管,加入DNase和Rnase至终浓度分别为10U/ml和10mg/mL,37℃孵育30min;在4℃,13490rpm的速度下离心20min;使用无菌50mL注射器和0.22um滤膜过滤上一步离心所得的上清液;然后准备梯度碘克醇:15%(v/v)碘克沙醇8mL,25%(v/v)碘克沙醇5.5mL,40%(v/v)碘克沙醇5mL和60%(v/v)碘克沙醇4.5mL。本步骤中,碘克沙醇对眼睛、皮肤和呼吸道有刺激,需要做好个人防护并在通风橱中进行。将上述获得的上清小心滴在15%碘克沙醇表面进行超速离心:使用固定角度钛转子,以301580g的速度在12℃下离心1h 40min,离心加速和减速过程使用最大加、减速度;在40%和60%碘克沙醇界面之间即为纯化的AAV病毒,小心地使用不锈钢钝针插入吸取,分别获得各切分位点对应的N端AAV病毒:C端AAV病毒;AAV purification: After collecting the supernatant and cells, cells were lysed using a hot-cold-hot cycle. Cells were alternately placed in dry ice-cold ethanol and a 37°C water bath three times, followed by centrifugation at 1167g for 15 minutes at 4°C. Genomic and plasmid DNA was then removed, and the supernatant was transferred to a 50mL centrifuge tube. DNase and RNase were added to final concentrations of 10 U/mL and 10 mg/mL, respectively, and incubated at 37°C for 30 minutes. The supernatant was centrifuged at 13490 rpm for 20 minutes at 4°C. The supernatant from the previous centrifugation was filtered using a sterile 50mL syringe and a 0.22μm filter. A gradient of iodixanol was then prepared: 8mL of 15% (v/v) iodixanol, 5.5mL of 25% (v/v) iodixanol, 5mL of 40% (v/v) iodixanol, and 4.5mL of 60% (v/v) iodixanol. During this step, iodixanol is irritating to the eyes, skin, and respiratory tract; personal protective equipment is required and the procedure must be performed in a fume hood. The supernatant obtained above was carefully dropped onto the surface of 15% iodixanol and ultracentrifuged at 301580 g for 1 h 40 min at 12°C using a fixed-angle titanium rotor. The maximum acceleration and deceleration were used during the centrifugal acceleration and deceleration process. The purified AAV virus was located between the 40% and 60% iodixanol interfaces. Carefully insert and aspirate with a stainless steel blunt needle to obtain the N-terminal AAV virus and C-terminal AAV virus corresponding to each cleavage site.

AAV滴度测定:使用qPCR法测定AAV的滴度,含量以GC/ml表示,GC为基因组病毒粒子浓度(gonomic particle concentration),验证提纯病毒滴度在10e+12GC/mL以上即可。qPCR的引物设计在ITR序列上。引物序列为:上游引物fwd ITR primer,5'-GGAACCCCTAGTGATGGAGTT-3’(SEQ ID NO:66);下游引物rev ITR primer,5'-CGGCCTCAGTGAGCGA-3’(SEQ ID NO:67)。AAV titer determination: AAV titer was determined using qPCR. The titer was expressed as GC/mL, where GC represents the genomic particle concentration. The purified virus titer was verified to be above 10e+12 GC/mL. qPCR primers were designed to target the ITR sequence. The primer sequences were: upstream primer fwd ITR primer, 5'-GGAACCCCTAGTGATGGAGTT-3' (SEQ ID NO: 66); downstream primer rev ITR primer, 5'-CGGCCTCAGTGAGCGA-3' (SEQ ID NO: 67).

滴度计算公式:滴度=1000000*power(10x)Titer calculation formula: Titer = 1000000*power(10 x )

x的计算公式为:x=(38.71-y)/3.54The formula for calculating x is: x = (38.71-y)/3.54

其中,y为CT值,即,qPCR中的阈值循环数Threshold cycle(Ct),qPCR结束后可导出CT值,将CT值带入x=(38.71-y)/3.54,计算得到x值。Where y is the CT value, that is, the threshold cycle (Ct) in qPCR. The CT value can be derived after qPCR is completed. Substitute the CT value into x = (38.71 - y) / 3.54 to calculate the x value.

例如,qPCR结果CT值为13.925=y,带入上述公式,则x=(38.71-13.925)/3.54=7,由此,病毒滴度值为=1000000*power(107),即,1.00E+13。For example, the CT value of the qPCR result is 13.925=y. Substituting into the above formula, x=(38.71-13.925)/3.54=7. Therefore, the virus titer value is =1000000*power(10 7 ), that is, 1.00E+13.

实施例3双载体AAV递送对Otof基因敲除小鼠耳聋的恢复Example 3: Restoration of deafness in Otof gene knockout mice by dual-vector AAV delivery

3.1 Otof基因敲除小鼠模型的构建3.1 Construction of Otof gene knockout mouse model

委托苏州赛业生物科技有限公司构建Otof(Q939*)点突变的C57BL/6小鼠模型,该点突变纯合C57BL/6小鼠完全丧失听力。Otof(Q939*)点突变表示将OTOF蛋白的第939位的氨基酸残基Q缺失。Suzhou Saiye Biotechnology Co., Ltd. was commissioned to develop a C57BL/6 mouse model with an Otof(Q939*) point mutation. Homozygous C57BL/6 mice with this point mutation experience complete hearing loss. The Otof(Q939*) point mutation represents a deletion of the amino acid residue Q at position 939 of the OTOF protein.

3.2 AAV病毒在Otof基因敲除小鼠中的施用3.2 AAV administration in Otof knockout mice

将实施例3.1制备的Otof基因点突变C57BL/6小鼠在P27(即,出生后27天龄)进行后半规管(posterior semicircular canal,PSCC)注射,每个耳朵注射2ul实施例2制备的各切分位点对应的N端AAV病毒:C端AAV病毒的混合液(每组切分位点对应的N端病毒:C端病毒=1:1,N端病毒滴度为8.18e12 GC/ml,C端病毒滴度为8.18e12 GC/ml),5天后测听。以野生型C57BL/6小鼠作为对照小鼠进行听觉功能分析。C57BL/6 mice with the Otof gene point mutation prepared in Example 3.1 were injected into the posterior semicircular canal (PSCC) at P27 (i.e., 27 days after birth). Each ear was injected with 2 μl of a mixture of N-terminal AAV virus and C-terminal AAV virus corresponding to each cleavage site prepared in Example 2 (N-terminal virus:C-terminal virus for each cleavage site = 1:1, N-terminal virus titer of 8.18e12 GC/ml, C-terminal virus titer of 8.18e12 GC/ml). Audiometry was performed 5 days later. Wild-type C57BL/6 mice were used as controls for auditory function analysis.

通过ABR检测听觉反应阈值、潜伏期和波间期等参数,分别用短声(Click),尤其是不同音频的短音(4KHz、8KHz、12KHz、16KHz、24KHz、32KHz)作为刺激声,检测小鼠对不同音频的听力阈值,分析小鼠的听力敏感性,从整体上判断小鼠从毛细胞到大脑皮层有无正常的听觉功能。The auditory response threshold, latency, and inter-wave duration are detected through ABR. Short sounds (Click), especially short sounds of different frequencies (4KHz, 8KHz, 12KHz, 16KHz, 24KHz, 32KHz), are used as stimuli to detect the hearing thresholds of mice to different frequencies, analyze the hearing sensitivity of mice, and judge whether the mice have normal auditory function from hair cells to cerebral cortex as a whole.

ABR阈值越高,表明OTOF基因突变小鼠的听觉功能损失越严重。反之,ABR阈值越低,基因治疗的效果越好。听力学检测结果显示OTOF基因切分位点exon23/24(即,切分位点在外显子23和外显子24之间,图7中表示为“23N+24C”)和OTOF基因切分位点exon20/21(即,切分位点在外显子20和外显子21之间,图7中表示为“20N+21C”)均显著地降低ABR阈值,且OTOF基因切分位点exon23/24使得ABR阈值更低(图7)。The higher the ABR threshold, the more severe the hearing loss in the OTOF gene mutant mice. Conversely, the lower the ABR threshold, the better the effect of gene therapy. Audiological test results showed that the OTOF gene cut site exon23/24 (i.e., the cut site is between exon 23 and exon 24, represented as "23N+24C" in Figure 7) and the OTOF gene cut site exon20/21 (i.e., the cut site is between exon 20 and exon 21, represented as "20N+21C" in Figure 7) both significantly reduced the ABR threshold, and the OTOF gene cut site exon23/24 made the ABR threshold lower (Figure 7).

图8为ABR测听数据的8kHz的原始波形图,纵坐标“dB SPL”中的“SPL”是“Sound Pressure Levels(声压级)”的缩写,代表刺激声音强度。诱发波形产生的最小音强即为ABR的阈值。如图8所示,WT小鼠ABR的阈值为30dB,KO小鼠ABR的阈值大于90dB,注射双AAV载体“20N+21C”的KO小鼠阈值为55dB,注射双AAV载体“23N+24C”的KO小鼠阈值为30dB。该实验结果也说明OTOF基因切分位点exon23/24和OTOF基因切分位点exon20/21均显著地降低ABR阈值,且OTOF基因切分位点exon23/24使得ABR阈值更低。Figure 8 is the original waveform of the 8kHz ABR audiometry data. The "SPL" in the vertical axis "dB SPL" is the abbreviation of "Sound Pressure Levels", which represents the intensity of the stimulating sound. The minimum sound intensity produced by the induced waveform is the ABR threshold. As shown in Figure 8, the ABR threshold of WT mice is 30dB, and the ABR threshold of KO mice is greater than 90dB. The threshold of KO mice injected with the dual AAV vector "20N+21C" is 55dB, and the threshold of KO mice injected with the dual AAV vector "23N+24C" is 30dB. The experimental results also show that the OTOF gene cut site exon23/24 and the OTOF gene cut site exon20/21 both significantly reduce the ABR threshold, and the OTOF gene cut site exon23/24 makes the ABR threshold lower.

示例性序列:Example sequence:

OTOF同工型5的氨基酸序列:

Amino acid sequence of OTOF isoform 5:

编码OTOF同工型5的Otof基因转录本变体5的核苷酸序列:

Nucleotide sequence of Otof gene transcript variant 5 encoding OTOF isoform 5:

编码OTOF同工型5的密码子优化后的序列:


Codon-optimized sequence encoding OTOF isoform 5:


编码剪接供体SD的核苷酸序列:
Nucleotide sequence encoding splice donor SD:

编码剪接受体SA的核苷酸序列:
Nucleotide sequence encoding the splice acceptor SA:

用于同源重组的拼接AK核苷酸序列:
Splicing AK nucleotide sequence for homologous recombination:

说明书附图2所示的载体pAAV-CMV-EGFP的核苷酸序列:


The nucleotide sequence of the vector pAAV-CMV-EGFP shown in Figure 2 of the specification is:


在外显子17编码的氨基酸残基和外显子18编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:8所示
The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 is shown in SEQ ID NO: 8

在外显子18编码的氨基酸残基和外显子19编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:9所示

The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 18 and the amino acid residues encoded by exon 19 is shown in SEQ ID NO: 9

在外显子19编码的氨基酸残基和外显子20编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:10所示
The nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 19 and the amino acid residues encoded by exon 20 is shown in SEQ ID NO: 10

在外显子20编码的氨基酸残基和外显子21编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:11所示

The nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 20 and the amino acid residues encoded by exon 21 is shown in SEQ ID NO: 11

在外显子21编码的氨基酸残基和外显子22编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:12所示
The nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 21 and the amino acid residues encoded by exon 22 is shown in SEQ ID NO: 12

在外显子22编码的氨基酸残基和外显子23编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:13所示

The nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23 is shown in SEQ ID NO: 13

在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:14所示
The nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 14

在外显子24编码的氨基酸残基和外显子25编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:15所示

The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 is shown in SEQ ID NO: 15

在外显子25编码的氨基酸残基和外显子26编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:16所示

The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 is shown in SEQ ID NO: 16

在外显子26编码的氨基酸残基和外显子27编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:17所示
The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27 is shown in SEQ ID NO: 17

在外显子27编码的氨基酸残基和外显子28编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:18所示

The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 27 and the amino acid residues encoded by exon 28 is shown in SEQ ID NO: 18

在外显子28编码的氨基酸残基和外显子29编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:19所示

The nucleotide sequence of the wild-type N-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29 is shown in SEQ ID NO: 19

在外显子29编码的氨基酸残基和外显子30编码的氨基酸残基之间切分OTOF全长蛋白后的野生型N-Otof的核苷酸序列如SEQ ID NO:20所示
The nucleotide sequence of the wild-type N-Otof after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 is shown in SEQ ID NO: 20

在外显子17编码的氨基酸残基和外显子18编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:21所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 is shown in SEQ ID NO: 21

在外显子18编码的氨基酸残基和外显子19编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:22所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 18 and the amino acid residues encoded by exon 19 is shown in SEQ ID NO: 22

在外显子19编码的氨基酸残基和外显子20编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:23所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 19 and the amino acid residues encoded by exon 20 is shown in SEQ ID NO: 23

在外显子20编码的氨基酸残基和外显子21编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:24所示
The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 20 and the amino acid residues encoded by exon 21 is shown in SEQ ID NO: 24

在外显子21编码的氨基酸残基和外显子22编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:25所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 21 and the amino acid residues encoded by exon 22 is shown in SEQ ID NO: 25

在外显子22编码的氨基酸残基和外显子23编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:26所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23 is shown in SEQ ID NO: 26

在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:27所示
The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 27

在外显子24编码的氨基酸残基和外显子25编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:28所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 is shown in SEQ ID NO: 28

在外显子25编码的氨基酸残基和外显子26编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:29所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 is shown in SEQ ID NO: 29

在外显子26编码的氨基酸残基和外显子27编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:30所示
The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27 is shown in SEQ ID NO: 30

在外显子27编码的氨基酸残基和外显子28编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:31所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 27 and the amino acid residues encoded by exon 28 is shown in SEQ ID NO: 31

在外显子28编码的氨基酸残基和外显子29编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:32所示
The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29 is shown in SEQ ID NO: 32

在外显子29编码的氨基酸残基和外显子30编码的氨基酸残基之间切分OTOF全长蛋白后的野生型C-Otof的核苷酸序列如SEQ ID NO:33所示

The nucleotide sequence of the wild-type C-Otof after cleavage of the full-length OTOF protein between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 is shown in SEQ ID NO: 33

在外显子17编码的氨基酸残基和外显子18编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:34所示
The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 is shown in SEQ ID NO: 34

在外显子18编码的氨基酸残基和外显子19编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:35所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 18 and the amino acid residues encoded by exon 19 is shown in SEQ ID NO: 35

在外显子19编码的氨基酸残基和外显子20编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:36所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 19 and the amino acid residues encoded by exon 20 is shown in SEQ ID NO: 36

在外显子20编码的氨基酸残基和外显子21编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:37所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 20 and the amino acid residues encoded by exon 21 is shown in SEQ ID NO: 37

在外显子21编码的氨基酸残基和外显子22编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:38所示
The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 21 and the amino acid residues encoded by exon 22 is shown in SEQ ID NO: 38

在外显子22编码的氨基酸残基和外显子23编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:39所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23 is shown in SEQ ID NO: 39

在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:40所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 40

在外显子24编码的氨基酸残基和外显子25编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:41所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 is shown in SEQ ID NO: 41.

在外显子25编码的氨基酸残基和外显子26编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:42所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 is shown in SEQ ID NO: 42

在外显子26编码的氨基酸残基和外显子27编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:43所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27 is shown in SEQ ID NO: 43.

在外显子27编码的氨基酸残基和外显子28编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:44所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 27 and the amino acid residues encoded by exon 28 is shown in SEQ ID NO: 44

在外显子28编码的氨基酸残基和外显子29编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:45所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29 is shown in SEQ ID NO: 45

在外显子29编码的氨基酸残基和外显子30编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的N-Otof的核苷酸序列如SEQ ID NO:46所示

The nucleotide sequence of N-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 is shown in SEQ ID NO: 46

在外显子17编码的氨基酸残基和外显子18编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:47所示


The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 17 and the amino acid residues encoded by exon 18 is shown in SEQ ID NO: 47


在外显子18编码的氨基酸残基和外显子19编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:48所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 18 and the amino acid residues encoded by exon 19 is shown in SEQ ID NO: 48

在外显子19编码的氨基酸残基和外显子20编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:49所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 19 and the amino acid residues encoded by exon 20 is shown in SEQ ID NO: 49

在外显子20编码的氨基酸残基和外显子21编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:50所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 20 and the amino acid residues encoded by exon 21 is shown in SEQ ID NO: 50

在外显子21编码的氨基酸残基和外显子22编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:51所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 21 and the amino acid residues encoded by exon 22 is shown in SEQ ID NO: 51

在外显子22编码的氨基酸残基和外显子23编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:52所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 22 and the amino acid residues encoded by exon 23 is shown in SEQ ID NO: 52

在外显子23编码的氨基酸残基和外显子24编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:53所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 23 and the amino acid residues encoded by exon 24 is shown in SEQ ID NO: 53

在外显子24编码的氨基酸残基和外显子25编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:54所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 24 and the amino acid residues encoded by exon 25 is shown in SEQ ID NO: 54

在外显子25编码的氨基酸残基和外显子26编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:55所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 25 and the amino acid residues encoded by exon 26 is shown in SEQ ID NO: 55

在外显子26编码的氨基酸残基和外显子27编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:56所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 26 and the amino acid residues encoded by exon 27 is shown in SEQ ID NO: 56

在外显子27编码的氨基酸残基和外显子28编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:57所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 27 and the amino acid residues encoded by exon 28 is shown in SEQ ID NO: 57

在外显子28编码的氨基酸残基和外显子29编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:58所示
The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is cleaved between the amino acid residues encoded by exon 28 and the amino acid residues encoded by exon 29 is shown in SEQ ID NO: 58

在外显子29编码的氨基酸残基和外显子30编码的氨基酸残基之间切分OTOF全长蛋白后的密码子优化后的C-Otof的核苷酸序列如SEQ ID NO:59所示

The nucleotide sequence of C-Otof after codon optimization after the full-length OTOF protein is split between the amino acid residues encoded by exon 29 and the amino acid residues encoded by exon 30 is shown in SEQ ID NO: 59

WPRE+SV40 poly(A)signal(717bp)的核苷酸序列:SEQ ID NO:60
Nucleotide sequence of WPRE+SV40 poly(A) signal (717 bp): SEQ ID NO: 60

WPRE(589bp)的核苷酸序列:SEQ ID NO:61
Nucleotide sequence of WPRE (589 bp): SEQ ID NO: 61

SV40 poly(A)信号(122bp)的核苷酸序列:SEQ ID NO:62
Nucleotide sequence of SV40 poly(A) signal (122 bp): SEQ ID NO: 62

WPRE3-SV40 late poly(A)(432bp)的核苷酸序列:SEQ ID NO:63
Nucleotide sequence of WPRE3-SV40 late poly(A) (432 bp): SEQ ID NO: 63

WPRE3的核苷酸序列:SEQ ID NO:64
Nucleotide sequence of WPRE3: SEQ ID NO: 64

SV40 late poly(A)信号的核苷酸序列:SEQ ID NO:65
Nucleotide sequence of SV40 late poly(A) signal: SEQ ID NO: 65

设计在ITR序列上的qPCR的引物Design of qPCR primers on ITR sequences

上游引物:5’-GGAACCCCTAGTGATGGAGTT-3’(SEQ ID NO:66)Upstream primer: 5’-GGAACCCCTAGTGATGGAGTT-3’ (SEQ ID NO: 66)

下游引物:5’-CGGCCTCAGTGAGCGA-3’(SEQ ID NO:67)Downstream primer: 5’-CGGCCTCAGTGAGCGA-3’ (SEQ ID NO: 67)

polyA中的序列:Sequences in polyA:

AATAAA(SEQ ID NO:68)、ATTAAA(SEQ ID NO:69)、AGTAAA(SEQ ID NO:70)、CATAAA(SEQ ID NO:71)、TATAAA(SEQ ID NO:72)、GATAAA(SEQ ID NO:73)、ACTAAA(SEQ ID NO:74)、AATATA(SEQ ID NO:75)、AAGAAA(SEQ ID NO:76)、AATAAT(SEQ ID NO:77)、AAAAAA(SEQ ID NO:78),AATGAA(SEQ ID NO:79)、AATCAA(SEQ ID NO:80)、AACAAA(SEQ ID NO:81)、AATCAA(SEQ ID NO:82)、AATAAC(SEQ ID NO:83)、AATAGA(SEQ ID NO:84)、AATTAA(SEQ ID NO:85)或AATAAG(SEQ ID NO:86)AATAAA (SEQ ID NO: 68), ATTAAA (SEQ ID NO: 69), AGTAAA (SEQ ID NO: 70), CATAAA (SEQ ID NO: 71), TATAAA (SEQ ID N O: 72), GATAAA (SEQ ID NO: 73), ACTAAA (SEQ ID NO: 74), AATATA (SEQ ID NO: 75), AAGAAA (SEQ ID NO: 76), AATAAT (SEQ ID NO: 77), AAAAAA (SEQ ID NO: 78), AATGAA (SEQ ID NO: 79), AATCAA (SEQ ID NO: 80), AACAAA (SEQ ID NO: 81), AATC AA(SEQ ID NO:82), AATAAC(SEQ ID NO:83), AATAGA(SEQ ID NO:84), AATTAA(SEQ ID NO:85) or AATAAG(SEQ ID NO:86)

以上描述了本发明的示例性实施方案,本领域技术人员应当理解的是,这些公开内容仅是示例性的,在本发明的范围内可以进行各种其它替换、适应和修改。因此,本发明不限于文中列举的具体实施方案。While the exemplary embodiments of the present invention have been described above, it should be understood by those skilled in the art that these disclosures are merely exemplary and that various other substitutions, adaptations, and modifications may be made within the scope of the present invention. Therefore, the present invention is not limited to the specific embodiments listed herein.

Claims (18)

用于表达OTOF蛋白的双载体系统,其包含第一核酸载体和第二核酸载体,其中A dual vector system for expressing an OTOF protein, comprising a first nucleic acid vector and a second nucleic acid vector, wherein 第一核酸载体包含第一核苷酸序列;且第二核酸载体包含第二核苷酸序列;The first nucleic acid vector comprises a first nucleotide sequence; and the second nucleic acid vector comprises a second nucleotide sequence; 所述第一核苷酸序列包含插入在两个第一ITR序列之间的表达盒;The first nucleotide sequence comprises an expression cassette inserted between two first ITR sequences; 所述第二核苷酸序列包含插入在两个第二ITR序列之间的表达盒;The second nucleotide sequence comprises an expression cassette inserted between two second ITR sequences; 所述第一核苷酸序列的表达盒包含启动子、OTOF的N端编码序列、剪接供体(SD)序列、与第二核苷酸序列发生同源重组的序列和polyA;The expression cassette of the first nucleotide sequence comprises a promoter, an N-terminal coding sequence of OTOF, a splice donor (SD) sequence, a sequence for homologous recombination with the second nucleotide sequence, and polyA; 所述第二核苷酸序列的表达盒包含启动子、与第一核苷酸序列发生同源重组的序列、剪接受体(SA)序列、OTOF的C端编码序列和polyA;和The expression cassette of the second nucleotide sequence comprises a promoter, a sequence that undergoes homologous recombination with the first nucleotide sequence, a splice acceptor (SA) sequence, a C-terminal coding sequence of OTOF, and polyA; and 在OTOF氨基酸序列中设置有OTOF切分位点,例如,所述OTOF氨基酸序列如SEQ ID NO:1所示或其功能片段,例如,与SEQ ID NO:1具有至少80%序列同一性的氨基酸序列;An OTOF cleavage site is provided in the OTOF amino acid sequence, for example, the OTOF amino acid sequence is as shown in SEQ ID NO: 1 or a functional fragment thereof, for example, an amino acid sequence having at least 80% sequence identity with SEQ ID NO: 1; OTOF的N端编码序列是从OTOF氨基酸序列的N端至OTOF切分位点的核苷酸编码序列;OTOF的C端编码序列是从OTOF切分位点后一位的氨基酸至OTOF氨基酸序列的C端的核苷酸编码序列。The N-terminal coding sequence of OTOF is the nucleotide coding sequence from the N-terminus of the OTOF amino acid sequence to the OTOF cleavage site; the C-terminal coding sequence of OTOF is the nucleotide coding sequence from the amino acid after the OTOF cleavage site to the C-terminus of the OTOF amino acid sequence. 根据权利要求1所述的用于表达OTOF蛋白的双载体系统,其中,所述OTOF切分位点位于OTOF氨基酸序列中任一编码的外显子17-29的3’末端氨基酸残基处。The dual-vector system for expressing OTOF protein according to claim 1, wherein the OTOF cleavage site is located at the 3'-terminal amino acid residue of any one of exons 17-29 encoded in the OTOF amino acid sequence. 根据权利要求1所述的用于表达OTOF蛋白的双载体系统,其中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒的启动子选自CAG启动子、CMV启动子、CBA启动子、UbC启动子、SFFV启动子、EF1α启动子、PGK启动子、或Myo7A、Myo15、Atoh1、POU4F3、Lhx3、Myo6、α9AchR、α10AchR、OTOF和STRC编码基因的启动子;The dual-vector system for expressing OTOF protein according to claim 1, wherein the promoter of the expression cassette of the first nucleotide sequence or the second nucleotide sequence is selected from CAG promoter, CMV promoter, CBA promoter, UbC promoter, SFFV promoter, EF1α promoter, PGK promoter, or promoters of Myo7A, Myo15, Atoh1, POU4F3, Lhx3, Myo6, α9AchR, α10AchR, OTOF and STRC encoding genes; 所述第一核苷酸序列或第二核苷酸序列的表达盒的polyA包含AATAAA(SEQ ID NO:68)和AATAAA的变体;所述AATAAA的变体包含ATTAAA(SEQ ID NO:69)、AGTAAA(SEQ ID NO:70)、CATAAA(SEQ ID NO:71)、TATAAA(SEQ ID NO:72)、GATAAA(SEQ ID NO:73)、ACTAAA(SEQ ID NO:74)、AATATA(SEQ ID NO:75)、AAGAAA(SEQ ID NO:76)、AATAAT(SEQ ID NO:77)、AAAAAA(SEQ ID NO:78),AATGAA(SEQ ID NO:79)、AATCAA(SEQ ID NO:80)、AACAAA(SEQ ID NO:81)、AATCAA(SEQ ID NO:82)、AATAAC(SEQ ID NO:83)、AATAGA(SEQ ID NO:84)、AATTAA(SEQ ID NO:85)或AATAAG(SEQ ID NO:86);例如,所述polyA是与SEQ ID NO:62或SEQ ID NO:65所示的多聚A信号序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;和The polyA of the expression cassette of the first nucleotide sequence or the second nucleotide sequence comprises AATAAA (SEQ ID NO: 68) and variants of AATAAA; the variants of AATAAA comprise ATTAAA (SEQ ID NO: 69), AGTAAA (SEQ ID NO: 70), CATAAA (SEQ ID NO: 71), TATAAA (SEQ ID NO: 72), GATAAA (SEQ ID NO: 73), ACTAAA (SEQ ID NO: 74), AATATA (SEQ ID NO: 75), AAGAAA (SEQ ID NO: 76), AATAAT (SEQ ID NO: 77), AAAAAA (SEQ ID NO: 78), and AAAAAA (SEQ ID NO: 79). NO: 78), AATGAA (SEQ ID NO: 79), AATCAA (SEQ ID NO: 80), AACAAA (SEQ ID NO: 81), AATCAA (SEQ ID NO: 82), AATAAC (SEQ ID NO: 83), AATAGA (SEQ ID NO: 84), AATTAA (SEQ ID NO: 85), or AATAAG (SEQ ID NO: 86); for example, the polyA is a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to the poly A signal sequence set forth in SEQ ID NO: 62 or SEQ ID NO: 65; and 所述两个第一ITR序列和所述两个第二ITR序列中的每个ITR序列源自AAV1、AAV2、AAV3、AAV4、AAV5、AAV6、AAV7、AAV8或AAV9。Each of the two first ITR sequences and the two second ITR sequences is derived from AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8 or AAV9. 根据权利要求1所述的用于表达OTOF蛋白的双载体系统,其中,所述第一核苷酸序列或所述第二核苷酸序列的表达盒还包含表达调控元件和/或标签元件,例如,所述表达调控元件为土拨鼠肝炎转录后调控元件(WPRE)或其变体,优选地,WPRE截短变体,例如,与SEQ ID NO:61所示的核苷酸序列具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列,例如,SEQ ID NO:64所示的核苷酸序列;例如,所述标签元件为HA。The dual-vector system for expressing OTOF protein according to claim 1, wherein the expression cassette of the first nucleotide sequence or the second nucleotide sequence further comprises an expression regulatory element and/or a tag element, for example, the expression regulatory element is a woodchuck hepatitis posttranscriptional regulatory element (WPRE) or a variant thereof, preferably a WPRE truncated variant, for example, a nucleotide sequence having at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity with the nucleotide sequence shown in SEQ ID NO: 61, for example, the nucleotide sequence shown in SEQ ID NO: 64; for example, the tag element is HA. 根据权利要求1所述的用于表达OTOF蛋白的双载体系统,其中,所述与第一核苷酸序列发生同源重组的序列和所述与第二核苷酸序列发生同源重组的序列是内含子序列,The dual-vector system for expressing an OTOF protein according to claim 1, wherein the sequence that undergoes homologous recombination with the first nucleotide sequence and the sequence that undergoes homologous recombination with the second nucleotide sequence are intron sequences, 例如,非天然存在于编码OTOF蛋白的基因的基因组序列中的内含子序列,For example, an intron sequence that does not naturally occur in the genomic sequence of a gene encoding an OTOF protein, 例如,所述内含子序列是F1噬菌体的引起重组的AK序列,例如SEQ ID NO:6所示的序列。For example, the intron sequence is the recombination-causing AK sequence of the F1 phage, such as the sequence shown in SEQ ID NO: 6. 根据权利要求1所述的用于表达OTOF蛋白的双载体系统,其中,将所述第一核苷酸序列插入在包含两个第一ITR序列的质粒中,将所述第二核苷酸序列插入在包含两个第二ITR序列的质粒中,例如,所述包含两个第一ITR序列的质粒和所述包含两个第二ITR序列的质粒相同或不同,例如,所述质粒是pAAV、pAAV-CMV、pX601、pX551或pAAV-MCS质粒。The dual-vector system for expressing OTOF protein according to claim 1, wherein the first nucleotide sequence is inserted into a plasmid comprising two first ITR sequences, and the second nucleotide sequence is inserted into a plasmid comprising two second ITR sequences, for example, the plasmid comprising two first ITR sequences and the plasmid comprising two second ITR sequences are the same or different, for example, the plasmid is pAAV, pAAV-CMV, pX601, pX551 or pAAV-MCS plasmid. 根据权利要求1-6中任一项所述的用于表达OTOF蛋白的双载体系统,其中,经OTOF切分位点切分的OTOF N端和OTOF C端如表1A所示;The dual-vector system for expressing an OTOF protein according to any one of claims 1 to 6, wherein the OTOF N-terminus and the OTOF C-terminus cleaved by the OTOF cleavage site are as shown in Table 1A; 优选地,所述OTOF N端编码序列是SEQ ID NO:11或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:37所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:24或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:50所示的核酸序列;Preferably, the OTOF N-terminal coding sequence is SEQ ID NO: 11 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 37, and the OTOF C-terminal coding sequence is SEQ ID NO: 24 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 50; 所述OTOF N端编码序列是SEQ ID NO:12或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:38所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:25或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:51所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 12 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 38, and the OTOF C-terminal coding sequence is SEQ ID NO: 25 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 51; 所述OTOF N端编码序列是SEQ ID NO:13或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:39所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:26或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:52所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 13 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 39, and the OTOF C-terminal coding sequence is SEQ ID NO: 26 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 52; 所述OTOF N端编码序列是SEQ ID NO:14或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:40所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:27或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:53所示的核酸序列;或The OTOF N-terminal coding sequence is SEQ ID NO: 14 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 40, and the OTOF C-terminal coding sequence is SEQ ID NO: 27 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 53; or 所述OTOF N端编码序列是SEQ ID NO:20或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:46所示的核酸序列,且所述OTOF C端编码序列是SEQ ID NO:33或与其具有至少90%序列同一性的序列,例如,优化的序列,如SEQ ID NO:59所示的核酸序列;The OTOF N-terminal coding sequence is SEQ ID NO: 20 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 46, and the OTOF C-terminal coding sequence is SEQ ID NO: 33 or a sequence having at least 90% sequence identity thereto, for example, an optimized sequence, such as the nucleic acid sequence shown in SEQ ID NO: 59; 例如,以pAAV-CMV质粒为载体。For example, pAAV-CMV plasmid is used as a vector. 根据权利要求1-7中任一项所述的用于表达OTOF蛋白的双载体系统,其中,所述剪接供体(SD)序列基本由SEQ ID NO:4所示的序列组成;所述剪接受体(SA)序列基本由SEQ ID NO:5所示的序列组成。The dual-vector system for expressing OTOF protein according to any one of claims 1 to 7, wherein the splice donor (SD) sequence basically consists of the sequence shown in SEQ ID NO: 4; and the splice acceptor (SA) sequence basically consists of the sequence shown in SEQ ID NO: 5. 根据权利要求1-8中任一项所述的用于表达OTOF蛋白的双载体系统,其中所述第一核苷酸序列的表达盒和所述第二核苷酸序列的表达盒各自在3’ITR序列的N端包含WPRE核苷酸序列和SV40多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:60所示的核苷酸序列或与SEQ ID NO:60具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列;或者包含WPRE3核苷酸序列和SV40晚期多聚腺苷酸化序列的组合,例如,具有SEQ ID NO:63所示的核苷酸序列或与SEQ ID NO:63具有至少70%、80%、90%、95%、96%、97%、98%、99%或100%同一性的核苷酸序列。A dual vector system for expressing OTOF protein according to any one of claims 1 to 8, wherein the expression cassette of the first nucleotide sequence and the expression cassette of the second nucleotide sequence each comprise a combination of a WPRE nucleotide sequence and an SV40 polyadenylation sequence at the N-terminus of the 3’ITR sequence, for example, a nucleotide sequence shown in SEQ ID NO: 60 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 60; or a combination of a WPRE3 nucleotide sequence and an SV40 late polyadenylation sequence, for example, a nucleotide sequence shown in SEQ ID NO: 63 or a nucleotide sequence that is at least 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 63. 腺相关病毒的包装载体系统,其中,所述包装载体系统包含根据权利要求1-9中任一项所述的用于表达OTOF蛋白的双载体系统、携带AAV rep和cap基因的载体和辅助病毒载体,经包装成为AAV载体,优选地,其中OTOF蛋白的氨基酸序列如SEQ ID NO:1所示。An adeno-associated virus packaging vector system, wherein the packaging vector system comprises a dual-vector system for expressing an OTOF protein according to any one of claims 1-9, a vector carrying AAV rep and cap genes, and a helper virus vector, which is packaged into an AAV vector. Preferably, the amino acid sequence of the OTOF protein is as shown in SEQ ID NO: 1. 根据权利要求10所述的腺相关病毒的包装载体系统,其中,所述携带AAV rep和cap基因的载体选自AAV1、AAV2、AAV5、AAV8、AAV9、Anc80、PHP.eB、AAV-DJ和AAVrh.10载体;所述辅助病毒载体为pHelper质粒。The adeno-associated virus packaging vector system according to claim 10, wherein the vector carrying the AAV rep and cap genes is selected from AAV1, AAV2, AAV5, AAV8, AAV9, Anc80, PHP.eB, AAV-DJ and AAVrh.10 vectors; and the helper virus vector is a pHelper plasmid. 一种腺相关病毒的包装方法,其中,将根据权利要求10或11所述的腺相关病毒的包装载体系统转入宿主细胞中进行包装。A method for packaging an adeno-associated virus, wherein the adeno-associated virus packaging vector system according to claim 10 or 11 is transferred into a host cell for packaging. 根据权利要求12所述的腺相关病毒的包装方法,其中,所述宿主细胞选自Hela-S3细胞、HEK-293细胞、HEK-293T细胞、HEK-293FT细胞、A549细胞和Sf9细胞。The method for packaging adeno-associated virus according to claim 12, wherein the host cell is selected from Hela-S3 cells, HEK-293 cells, HEK-293T cells, HEK-293FT cells, A549 cells and Sf9 cells. 一种腺相关病毒,其是通过根据权利要求12或13所述的包装方法获得的。An adeno-associated virus obtained by the packaging method according to claim 12 or 13. 根据权利要求1-9中任一项所述的用于表达OTOF蛋白的双载体系统或根据权利要求14所述的腺相关病毒的用途,用于制备治疗耳聋性疾病或听力损伤或听力功能障碍的药物或制剂。Use of the dual vector system for expressing OTOF protein according to any one of claims 1 to 9 or the adeno-associated virus according to claim 14 for preparing a medicament or preparation for treating deafness, hearing loss or hearing dysfunction. 一种用于治疗耳聋疾病或听力损伤或听力功能障碍的药物或制剂,其由根据权利要求1-9中任一项所述的用于表达OTOF蛋白的双载体系统或根据权利要求14所述的腺相关病毒制备,其中,所述腺相关病毒是将腺相关病毒的包装载体系统转入宿主细胞进行包装得到的,所述腺相关病毒的包装载体系统包括表达OTOF蛋白的双载体系统、携带AAVrep和cap基因的载体,以及辅助病毒载体。A medicine or preparation for treating deafness, hearing loss or hearing dysfunction, which is prepared by the dual-vector system for expressing OTOF protein according to any one of claims 1 to 9 or the adeno-associated virus according to claim 14, wherein the adeno-associated virus is obtained by transferring the adeno-associated virus packaging vector system into a host cell for packaging, and the adeno-associated virus packaging vector system includes a dual-vector system for expressing OTOF protein, a vector carrying AAV rep and cap genes, and a helper virus vector. 根据权利要求16所述的药物或制剂,其中,所述药物或制剂还包含中性盐缓冲剂、酸性盐缓冲剂、碱性盐缓冲剂、葡萄糖、甘露糖、甘露糖醇、蛋白质、多肽、氨基酸、抗生素、螯合剂、佐剂、防腐剂、纳米颗粒、脂质体和阳性脂质颗粒。The drug or preparation according to claim 16, wherein the drug or preparation further comprises a neutral salt buffer, an acidic salt buffer, an alkaline salt buffer, glucose, mannose, mannitol, proteins, polypeptides, amino acids, antibiotics, chelating agents, adjuvants, preservatives, nanoparticles, liposomes and positive lipid particles. 根据权利要求16或17所述的药物或制剂,其中,通过耳蜗的圆窗、卵圆窗、半规管、后半规管、总管进行注射给药;以及终生单次给药或多次给药,总剂量为1×109-1×1013病毒基因组。The drug or preparation according to claim 16 or 17, wherein the drug is administered by injection through the round window, oval window, semicircular canal, posterior semicircular canal, or common canal of the cochlea; and the drug is administered single or multiple times throughout life, with a total dose of 1×10 9 -1×10 13 viral genomes.
PCT/CN2025/084737 2024-03-28 2025-03-25 Dual-vector system for delivering otof gene and use thereof Pending WO2025201333A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202580000485.0A CN120641570A (en) 2024-03-28 2025-03-25 Two-vector system for delivering Otof gene and its use

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202410365323 2024-03-28
CN202410365323.4 2024-03-28

Publications (1)

Publication Number Publication Date
WO2025201333A1 true WO2025201333A1 (en) 2025-10-02

Family

ID=97216100

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2025/084737 Pending WO2025201333A1 (en) 2024-03-28 2025-03-25 Dual-vector system for delivering otof gene and use thereof

Country Status (1)

Country Link
WO (1) WO2025201333A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110892062A (en) * 2017-05-05 2020-03-17 佛罗里达大学研究基金会 Compositions and methods for expressing otic teratoproteins
CN116096905A (en) * 2020-02-21 2023-05-09 阿库斯股份有限公司 Compositions and methods for treating non-age-related hearing impairment in human subjects
CN117106824A (en) * 2022-05-17 2023-11-24 复旦大学附属眼耳鼻喉科医院 A dual-carrier system for treating hearing loss and its application
CN117295529A (en) * 2021-02-19 2023-12-26 分贝治疗公司 Method for treating sensorineural hearing loss using an otoxin dual carrier system
CN117305367A (en) * 2023-08-21 2023-12-29 复旦大学附属眼耳鼻喉科医院 Double AAV vector system for expressing full-length otodontin and application thereof

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110892062A (en) * 2017-05-05 2020-03-17 佛罗里达大学研究基金会 Compositions and methods for expressing otic teratoproteins
CN116096905A (en) * 2020-02-21 2023-05-09 阿库斯股份有限公司 Compositions and methods for treating non-age-related hearing impairment in human subjects
CN117295529A (en) * 2021-02-19 2023-12-26 分贝治疗公司 Method for treating sensorineural hearing loss using an otoxin dual carrier system
CN117106824A (en) * 2022-05-17 2023-11-24 复旦大学附属眼耳鼻喉科医院 A dual-carrier system for treating hearing loss and its application
CN117305367A (en) * 2023-08-21 2023-12-29 复旦大学附属眼耳鼻喉科医院 Double AAV vector system for expressing full-length otodontin and application thereof

Similar Documents

Publication Publication Date Title
US9567376B2 (en) Enhanced AAV-mediated gene transfer for retinal therapies
US10266845B2 (en) Enhanced AAV-mediated gene transfer for retinal therapies
JP6453307B2 (en) Effective delivery of large genes by dual AAV vectors
US20170007720A1 (en) Methods and compositions for gene delivery to on bipolar cells
US12467066B2 (en) Compositions and methods for treating retinal disorders
US20240131189A1 (en) Variant adeno-associated virus (aav) capsid polypeptides and gene therapeutics thereof for treatment of hearing loss
JP7616668B2 (en) UBE3A GENE AND EXPRESSION CASSETTES AND USES THEREOF
US20240252681A1 (en) Gene therapy constructs and methods for treatment of hearing loss
WO2025016099A1 (en) Composition and method for expression of otof gene by dual vector system
US20230285596A1 (en) Compositions and methods for the treatment of niemann-pick type c1 disease
WO2025201333A1 (en) Dual-vector system for delivering otof gene and use thereof
JP7637060B2 (en) Neuroprotective gene therapy targeting the AKT pathway
CN120641570A (en) Two-vector system for delivering Otof gene and its use
KR20230112672A (en) Gene therapy for neurodegenerative diseases
WO2025157179A1 (en) Dual-vector system for expressing strc protein and use thereof
CN120574893A (en) Two-vector system for delivering Strc gene and its use
WO2025162267A1 (en) Dual-vector system for expressing myo7a protein, and use thereof
EP4667573A1 (en) Use of multiple promoters efficiently targeting on-type bipolar cells in gene therapy technology
CN120322449A (en) ATP7B gene therapy
HK40030486B (en) Compositions and methods for treating retinal disorders
HK40030486A (en) Compositions and methods for treating retinal disorders
HK40013460A (en) Methods and compositions for treatment of disorders and diseases involving rdh12
HK40012997A (en) Neuropeptide-expressing vectors and methods for the treatment of epilepsy

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 202580000485.0

Country of ref document: CN

WWP Wipo information: published in national office

Ref document number: 202580000485.0

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 25777133

Country of ref document: EP

Kind code of ref document: A1