WO2023138617A1 - Engineered casx nuclease, effector protein and use thereof - Google Patents
Engineered casx nuclease, effector protein and use thereof Download PDFInfo
- Publication number
- WO2023138617A1 WO2023138617A1 PCT/CN2023/072900 CN2023072900W WO2023138617A1 WO 2023138617 A1 WO2023138617 A1 WO 2023138617A1 CN 2023072900 W CN2023072900 W CN 2023072900W WO 2023138617 A1 WO2023138617 A1 WO 2023138617A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- casx
- engineered
- nucleic acid
- nuclease
- effector protein
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases [RNase]; Deoxyribonucleases [DNase]
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
- A61K48/005—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy characterised by an aspect of the 'active' part of the composition delivered, i.e. the nucleic acid delivered
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P27/00—Drugs for disorders of the senses
- A61P27/02—Ophthalmic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P9/00—Drugs for disorders of the cardiovascular system
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6816—Hybridisation assays characterised by the detection means
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPR]
Definitions
- This application belongs to the field of biotechnology. More specifically, the present application relates to CasX nucleases, effector proteins, and uses thereof having enhanced catalytic activity (eg, gene editing activity).
- Genome editing is an important and useful technique in genome research.
- Several systems are available for genome editing, including clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems, transcription activator-like effector nuclease (TALEN) systems, and zinc finger nuclease (ZFN) systems.
- CRISPR clustered regularly interspaced short palindromic repeat
- TALEN transcription activator-like effector nuclease
- ZFN zinc finger nuclease
- the CRISPR-Cas system is an efficient and cost-effective genome editing technology that can be widely applied in a range of eukaryotes from yeast and plants to zebrafish and humans (see review: Van der Oost 2013, Science 339:768-770, and Charpentier and Doudna, 2013, Nature 495:50-51).
- the CRISPR-Cas system provides adaptive immunity in archaea and bacteria by combining CasX effector proteins and CRISPR RNA (crRNA).
- crRNA CRISPR RNA
- two classes (classes 1 and 2) of CRISPR-Cas systems including six (types I–VI) have been characterized based on the system's outstanding functional and evolutionary modularity.
- type II Cas9 system and type V-A/B/E/J Cas12a/Cas12b/Cas12e/Cas12j system have been exploited for genome editing and offer broad prospects for biomedical research.
- An engineered CasX nuclease It comprises one or more mutations based on the reference CasX nuclease, the mutation is: the amino acid that interacts with the nucleic acid in the reference CasX nuclease is replaced with a positively charged amino acid; the reference CasX nuclease is a natural wild-type CasX nuclease.
- the nucleic acid is a PAM.
- the engineered CasX nuclease as described in item 4, wherein, the replacement of one or more amino acids interacting with nucleic acid in the reference CasX nuclease with a positively charged amino acid refers to one or more of the following replacements: T26R, K610R, K640, S759R, K808R; 640R; (4) S759R; (5) K808R; (6) T26R and K610R; (7) T26R and K610R and K808R.
- An engineered CasX nuclease comprising an amino acid sequence as shown in any one of SEQ ID NO.2 ⁇ 8 or having at least 95% or more identity with the amino acid sequence shown in any one of SEQ ID NO.2 ⁇ 8.
- An engineered CasX effector protein comprising the engineered CasX nuclease or a functional derivative thereof according to any one of items 1-6.
- the engineered CasX effector protein according to any one of items 7 to 9, which further comprises a functional domain fused with the engineered CasX nuclease.
- the engineered CasX effector protein according to any one of items 7 to 11, comprising a first polypeptide and a second polypeptide, the first polypeptide comprising amino acid residues 1 to X of the N-terminal part of the engineered CasX nuclease described in any one of items 1-6, and the second polypeptide comprising the engineered CasX nuclease described in any one of items 1-6 Amino acid residue X+1 of the CasX nuclease of UL to the C-terminus of the CasX nuclease, wherein the first polypeptide and the second polypeptide are capable of associating with each other in the presence of a guide RNA comprising a guide sequence to form a Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) complex specifically binding to a target nucleic acid comprising a target sequence complementary to the guide sequence.
- CRISPR Clustered Regularly Interspaced Short Palindromic Repeat
- An engineered CRISPR-CasX system comprising:
- a guide RNA comprising a guide sequence complementary to a target sequence, or one or more nucleic acids encoding said guide RNA
- the engineered CasX effector protein and the guide RNA are capable of forming a CRISPR complex that specifically binds to a target nucleic acid comprising the target sequence and induces modification of the target nucleic acid.
- a method for detecting target nucleic acid in a sample comprising:
- the engineered CRISPR-CasX system as described in item 13 in the preparation of a drug for treating a disease or a disease associated with a target nucleic acid in an individual's cells; preferably, the disease or disease is selected from the group consisting of cancer, cardiovascular disease, hereditary disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection and viral infection.
- a method of modifying a target nucleic acid comprising a target sequence comprising contacting said target nucleic acid with the engineered CRISPR-CasX system of item 13.
- a kit comprising: one or more AAV vectors encoding any one of the engineered CasX nuclease according to any one of items 1-6, the engineered CasX effector protein according to items 7-12 and the engineered CRISPR-CasX system according to item 13.
- the engineered CasX nuclease and its effector proteins in this application have higher activity, such as catalytic efficiency of cutting nucleic acid substrates and gene editing efficiency in cells.
- the engineered Plm CasX nuclease in this application has superior gene editing efficiency in mammalian cells (such as human cells) than existing conventional Cas gene editing tools; the gene editing efficiency in human cells has reached about 80%.
- the engineered CasX nuclease and its effector protein of the present application also have the following advantages: small protein, crRNA The components are simple, the PAM sequence is simple, and the protein itself can process the precursor crRNA.
- Figure 1 Comparison of gene editing efficiencies of 14 engineered CRISPR-PlmCasXs with single point mutations at two loci (AAVS1-2 and CCR5-2) in human cells (with each other and with wild-type PlmCasX enzymes).
- Figure 2 In human cells, at four gene loci (AAVS1-2, AAVS1-7, CCR5-2 and CD34-1), the gene editing efficiency comparison of three kinds of engineered CRISPR-PlmCasX with single point mutation and two point mutations (compared with each other and compared with wild-type PlmCasX enzyme).
- Figure 3 Comparison of gene editing efficiencies of 16 engineered CRISPR-PlmCasX with single point mutations at two loci (AAVS1-2 and CCR5-2) in human cells (compared to each other and to wild-type PlmCasX enzymes).
- Figure 4 In human cells, at four gene loci (AAVS1-2, AAVS1-7, CCR5-2 and CD34-1), the gene editing efficiency comparison of three kinds of engineered CRISPR-PlmCasX with single point mutation and multiple point mutation (compared with each other and compared with wild-type PlmCasX enzyme).
- effector protein refers to a protein having an activity such as site-specific binding activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity, single-strand RNA cleavage activity or transcription regulation activity.
- guide RNA and “gRNA” are used interchangeably herein to refer to an RNA capable of forming a complex with a CasX effector protein and a target nucleic acid (eg, double-stranded DNA).
- target nucleic acid eg, double-stranded DNA
- crRNA or “CRISPR RNA” comprises a guide sequence with sufficient complementarity to the target sequence of a target nucleic acid (e.g., double-stranded DNA) that directs sequence-specific binding of the CRISPR complex to the target nucleic acid.
- a target nucleic acid e.g., double-stranded DNA
- nucleic acid As used herein, the terms “nucleic acid,” “polynucleotide,” and “nucleotide sequence” are used interchangeably to refer to a polymeric form of nucleotides of any length, including deoxyribonucleotides, ribonucleotides, combinations thereof, and analogs thereof. "Oligonucleotide” and “oligonucleotide” are used interchangeably to refer to short polynucleotides having no more than about 50 nucleotides.
- complementarity refers to the ability of a nucleic acid to form hydrogen bonds with another nucleic acid through conventional Watson-Crick base pairing. Percent complementarity represents the percentage of residues in a nucleic acid molecule that can form hydrogen bonds (i.e., Watson-Crick base pairing) with a second nucleic acid (e.g., 5, 6, 7, 8, 9, 10 out of 10, about 50%, 60%, 70%, 80%, 90% and 100% complementary, respectively). "Perfectly complementary” means that all contiguous residues of a nucleic acid sequence form hydrogen bonds with the same number of contiguous residues in a second nucleic acid sequence.
- substantially complementary refers to a degree of complementarity of at least about any of 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100 over a region of about 40, 50, 60, 70, 80, 100, 150, 200, 250 or more nucleotides, or to two nucleic acids that hybridize under stringent conditions.
- stringent conditions for hybridization refers to conditions under which a nucleic acid having complementarity to a target sequence primarily hybridizes to the target sequence and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent and vary depending on many factors. In general, the longer the sequence, the higher the temperature at which the sequence will specifically hybridize to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes, Part I, Chapter 2 “Overview of principles of hybridization and the strategy of nuclear acid probe assay," Else demo, N, Y.
- Hybridization refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized by hydrogen bonding between the bases of the nucleotide residues. Hydrogen bonding can occur through Watson-Crick base pairing, Hoogstein bonding, or in any other sequence-specific manner. A sequence that is capable of hybridizing to a given sequence is called the "complement" of that given sequence.
- Percent (%) sequence identity for a nucleic acid sequence is defined as the percentage of nucleotides in a candidate sequence that are identical to nucleotides in a specific nucleic acid sequence after aligning the sequences by allowing gaps, if necessary, to achieve the maximum percent sequence identity.
- Percent sequence identity (%) for a peptide, polypeptide or protein sequence is the percentage of amino acid residues in a candidate sequence that are identical to those in a particular peptide or amino acid sequence after aligning the sequences by allowing gaps, if necessary, to achieve the maximum percent sequence identity.
- Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are within the skill in the art, for example, using publicly available computer software such as BLAST, BLAST-2, ALIGN or MEGALIGN TM (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
- polypeptide and “peptide” are used interchangeably herein to refer to a polymer of amino acids of any length.
- the polymer may be linear or branched, it may contain modified amino acids, and it may be interrupted by non-amino acids.
- a protein can have one or more polypeptides.
- the term also encompasses amino acid polymers that have been modified; eg, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation (such as conjugation with a labeling component).
- variant is interpreted as a polynucleotide or polypeptide that differs from a reference polynucleotide or polypeptide, respectively, but retains essential properties.
- a typical variant of a polynucleotide differs from the nucleic acid sequence of another reference polynucleotide. Changes in the nucleic acid sequence of a variant may or may not alter the amino acid sequence of the polypeptide encoded by the reference polynucleotide. Nucleotide changes may result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as described below.
- a typical variant of a polypeptide differs in amino acid sequence from another reference polypeptide. Usually, the differences are limited such that the sequences of the reference polypeptide and the variant are very similar overall and identical in many regions.
- the amino acid sequence of a variant and reference polypeptide may differ by any combination of one or more substitutions, additions, deletions.
- the substituted or inserted amino acid residues may or may not be those encoded by the genetic code.
- Variants of a polynucleotide or polypeptide may be naturally occurring (such as allelic variants), or may be variants that are not known to occur naturally.
- Non-naturally occurring variants of polynucleotides and polypeptides can be prepared by mutagenesis techniques, by direct synthesis, and by other recombinant methods known to those skilled in the art.
- wild-type has the meaning commonly understood by those skilled in the art, meaning a typical form of an organism, strain, gene or characteristic that distinguishes it from a mutant or variant as it exists in nature. It can be isolated from resources in nature and has not been deliberately modified.
- nucleic acid molecule or polypeptide when used to describe a nucleic acid molecule or polypeptide, mean that the nucleic acid molecule or polypeptide is at least substantially free of at least one other component with which it is naturally associated or occurs in nature.
- an "ortholog" of a protein as referred to herein refers to a protein belonging to a different species that performs the same or a similar function as the protein that is its ortholog.
- identity is used to denote a sequence match between two polypeptides or between two nucleic acids.
- a position in two compared sequences is occupied by the same base or subunit of an amino acid monomer (for example, a position in each of two DNA molecules is occupied by an adenine, or a position in each of two polypeptides is occupied by a lysine)
- each molecule is identical at that position.
- the "percent identity" between these two sequences is a function of the number of matching positions shared by the two sequences divided by the number of positions being compared x 100. For example, two sequences are 60% identical if 6 out of 10 positions of the two sequences match.
- the DNA sequences CTGACT and CAGGTT share 50% identity (3 out of a total of 6 positions match).
- comparisons are made when two sequences are aligned to yield maximum identity.
- Such alignment can be achieved, for example, by the method of Needleman et al. (1970) J. Mol. Biol. 48:443-453, which can be conveniently performed by a computer program such as the Align program (DNAstar, Inc.).
- the PAM 120 weighted residue table can also be used, integrated into the ALIGN program (version 2.0) using the algorithm of E. Meyers and W. Miller (Comput. Appl Biosci., 4:11-17 (1988)).
- a gap length penalty of 12 and a gap penalty of 4 are used to determine the percent identity between two amino acid sequences.
- the Needleman and Wunsch (J MoI Biol. 48:444-453 (1970)) algorithm integrated into the GAP program of the GCG software package can be used to determine the distance between two amino acid sequences using the Blossum 62 matrix or the PAM250 matrix with gap weights of 16, 14, 12, 10, 8, 6, or 4 and length weights of 1, 2, 3, 4, 5, or 6. percent identity.
- Cell as used herein is understood not only to refer to a specific single cell, but also to the progeny or potential progeny of that cell. Because certain modifications may have occurred in the progeny, due to mutations or environmental influences, such progeny may in fact differ from the parental cells and still be included within the scope of the term herein.
- transduction and “transfection” include methods known in the art to introduce DNA into cells to express a protein or molecule of interest using an infectious agent such as a virus or otherwise.
- infectious agent such as a virus or otherwise.
- chemical-based transfection methods such as the use of calcium phosphate, dendrimers non-chemical methods such as electroporation, cell squeezing, sonoporation, optical transfection, impalefection, protoplast fusion, plasmid delivery or transposons; particle-based methods such as the use of gene guns, magnetofection or magnet-assisted transfection, particle bombardment; and hybridization methods such as nucleofection ).
- transfected refers to the process of transferring or introducing exogenous nucleic acid into a host cell.
- a “transfected”, “transformed” or “transduced” cell is a cell that has been transfected, transformed or transduced with an exogenous nucleic acid.
- in vivo refers to the organism from which cells are obtained. "Ex vivo” or “in vitro” means outside the organism from which cells are obtained.
- treatment/treating is a method used to obtain beneficial or desired results, including clinical results.
- a beneficial or desired clinical outcome includes, but is not limited to, one or more of the following: alleviation of one or more symptoms caused by the disease, reduction of the extent of the disease, stabilization of the disease (e.g., prevention or delay of progression of the disease), prevention or delay of spread of the disease (e.g., metastasis), prevention or delay of recurrence of the disease, reduction of the rate of recurrence of the disease, delay or slowing of the progression of the disease, amelioration of the disease state, provision of remission (partial or total) of the disease, reduction in the dose of one or more other drugs required to treat the disease, delay of the progression of the disease, improvement of life quality, and/or prolong survival.
- Treatment also includes reducing the pathological consequences of a disorder, condition or disease. The methods of the invention contemplate any one or more of these aspects of treatment.
- the term "effective amount” refers to an amount of a compound or composition sufficient to treat a particular disorder, condition or disease, eg, ameliorate, alleviate, lessen and/or delay one or more symptoms thereof.
- an "effective amount” may be administered in one or more doses, ie, a single dose or multiple doses may be required to achieve the desired therapeutic endpoint.
- Subject “Subject,” “individual,” or “patient” are used interchangeably herein for purposes of treatment and refer to any animal classified as a mammal, including humans, livestock, and farm animals, as well as zoo, farm, or pet animals such as dogs, horses, cats, cows, etc.
- the individual is a human individual.
- a reference to a "not” value or parameter generally means and describes an "except” value or parameter.
- the method is not used to treat type X cancer, meaning that the method is used to treat cancer other than type X.
- the term “and/or” in words such as “A and/or B” is intended to include both A and B; A or B; A (alone); and B (alone).
- the term “and/or” in words such as "A, B and/or C” is intended to include each of the following: A, B and C; A, B or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
- the application provides engineered CasX nuclease in a first aspect
- an engineered CasX nuclease comprises one or more mutations based on the reference CasX nuclease, the mutation is: one or more amino acids in the reference CasX nuclease that interact with nucleic acid are replaced with positively charged amino acids; the reference CasX nuclease is a natural wild-type CasX nuclease.
- the natural wild-type CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- the present application provides methods for engineering enzymes by introducing amino acid mutations based on the engineering principles described above, which lead to increased enzyme activity in vitro and in vivo.
- the engineered CasX nuclease contains one or more specific mutations as described below.
- any one or more of the mutations described herein can be combined with existing CasX mutations to provide an engineered CasX nuclease with higher activity.
- an engineered CasX nuclease comprises a mutation that replaces one or more amino acids interacting with nucleic acid in the reference CasX nuclease with a positively charged amino acid.
- the one or more amino acids interacting with the nucleic acid are amino acids within 9 angstroms in the three-dimensional structure, especially one or more amino acids at the following positions: 26, 27, 29, 105, 195, 198, 204, 222, 230, 512, 564, 565, 610, 640; wherein, the amino acid position numbers are as defined in SEQ ID NO.1.
- the positively charged amino acid is R or K; preferably positively charged Amino acid is R.
- the replacement of one or more amino acids interacting with nucleic acid in the reference CasX nuclease with a positively charged amino acid refers to one or more of the following replacements: T26R, K610R, K640, S759R, K808R; preferably, the engineered CasX nuclease comprises any mutation or combination of mutations in the following group: (1) T26R; (2) K610R; (3) K640R; (4) S75 9R; (5) K808R; (6) T26R and K610R; (7) T26R and K610R and K808R; wherein, the amino acid position numbering is as defined in SEQ ID NO.1.
- the engineered CasX nuclease of the amino acid sequence shown in any one of SEQ ID NO.2 ⁇ 8 is provided.
- T26R is; in the cited amino acid sequence, the No. 26 amino acid T (threonine) is replaced by R (arginine); here, common amino acids and their three-letter and single-letter abbreviations are listed as follows: Alanine AlaA; Arginine Arg R; Aspartate Asp D; Cysteine Cys C; Glutamine Gln Q; Glutamate Glu E; Histidine His H; Isoleucine Ile I; Glycine Gly G; Asparagine AsnN; Leucine Leu L; Lysine Lys K; Methionine MetM; Phenylalanine Phe F; Proline Pro P; Serine Ser S; Threonine ThrT; Tryptophan Trp W; Tyrosine TyrY; Valine ValV.
- the amino acid is at position X, wherein said amino acid position numbering is as defined in SEQ ID NO.1
- the amino acid residue is located at a certain position of the reference enzyme CasX, which is equivalent to the position X of SEQ ID NO:1, and the amino acid sequence of the reference enzyme CasX is aligned with the amino acid sequence of SEQ ID NO:1 based on sequence homology.
- Figure 6 shows the homology alignment of the amino acid sequences of CasX2 (SEQ ID NO.1) and CasX1 (SEQ ID NO.13).
- the reference PlmCasX (Planctomycetes CasX) nuclease is native PlmCasX.
- the reference PlmCasX nuclease is an engineered PlmCasX nuclease.
- CasX is a new type of CRISPR-Cas gene editing system excavated from metagenomics, which was first reported in 2017 (article: New CRISPR–Cas systems from uncultivated microbes). According to the latest CRISPR system classification criteria, CasX belongs to the V-E subclass.
- CasX enzymes comprise a distinct family of RNA-guided genome editors
- DpbCasX Deltaproteobacteria
- CasX CasX
- PlmCasX CasX
- this study showed that CasX was inefficient in mammalian cells, and the article did not explore the editing efficiency of PlmCasX at endogenous genomic loci.
- the engineered CasX of the present application is an endonuclease, which binds to a specific site of a target sequence and cuts under the guidance of a guide RNA, and has DNA and RNA endonuclease activity.
- the CasX is capable of autonomous crRNA biogenesis by processing a precursor crRNA array. Autonomous pre-crRNA processing facilitates CasX delivery, enabling double-nicking applications, as two separate genomic sites can be targeted by a single crRNA transcript.
- the CasX protein then processes the CRISPR array into two homologous crRNAs, forming a paired nicking complex.
- the guide RNA comprises a precursor crRNA expressed by a CRISPR array consisting of target sequences interleaved with unprocessed DR sequences, repeated by intrinsic precursor crRNA processing of the effector protein to enable simultaneous targeting of one, two or more sites.
- CasX nucleases from a variety of organisms can be used as the reference CasX nucleases to provide the engineered CasX nucleases and effector proteins of the present application.
- the CasX reference CasX nuclease has enzymatic activity.
- the reference CasX is a nuclease, ie, cleaves both strands of a target duplex nucleic acid (eg, duplex DNA).
- the reference CasX is a nickase, ie, cleaves a single strand of a target duplex nucleic acid (eg, duplex DNA).
- the CasX reference CasX nuclease is enzymatically inactive. Orthologs having a certain sequence identity (such as at least about 60%, 70%, 80%, 85%, 90%, 95%, 98% or more) with CasX or its functional derivatives can be used as the basis for designing the engineered CasX nuclease or effector protein of the present application.
- the engineered CasX nuclease is based on a functional variant of a naturally occurring CasX nuclease.
- the functional variant has one or more mutations, such as amino acid substitutions, insertions, and deletions.
- the functional variant may comprise any of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acid substitutions compared to a wild-type naturally occurring CasX nuclease.
- the one or more substitutions are conservative substitutions.
- the functional variant has all domains of a naturally occurring CasX nuclease. In some embodiments, the functional variant does not possess one or more domains of a naturally occurring CasX nuclease.
- the present application provides engineered CasX effector proteins in a second aspect.
- the aforementioned engineered CasX nuclease or a functional derivative thereof is provided.
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- an engineered CasX effector protein wherein the effector protein is capable of inducing double-strand breaks or single-strand breaks in DNA molecules.
- an engineered CasX effector protein wherein the functional derivative of the engineered CasX nuclease is an enzyme inactive mutant.
- an engineered CasX effector protein which further comprises a functional domain fused with the engineered CasX nuclease.
- an engineered CasX effector protein is provided in a specific embodiment, wherein the functional domain is selected from the group consisting of a translation initiation domain, a transcriptional repression domain, a transactivation domain, an epigenetic modification domain, a nucleobase editing domain, a reverse transcriptase domain, a reporter domain and a nuclease domain.
- an engineered CasX effector protein comprising a first polypeptide and a second polypeptide, the first polypeptide comprising amino acid residues 1 to X of the N-terminal part of the aforementioned engineered CasX nuclease, and the second polypeptide comprising amino acid residues X+1 of the aforementioned engineered CasX nuclease to the C-terminal of the CasX nuclease, wherein the first polypeptide and the second polypeptide can associate with each other in the presence of a guide RNA comprising a guide sequence to form a clustered regularly interspaced short palindromic repeat sequence (CRISPR) that specifically binds to a target nucleic acid ) complex, the target nucleic acid comprising a target sequence complementary to the guide sequence.
- CRISPR clustered regularly interspaced short palindromic repeat sequence
- an engineered CasX effector protein based on any of the engineered CasX nucleases described herein is provided.
- the engineered CasX Effector proteins have enzymatic activity.
- the engineered CasX effector protein is a nuclease that cleaves both strands of a target duplex nucleic acid (eg, duplex DNA).
- the engineered CasX effector protein is a nickase, ie, cleaves a single strand of a target duplex nucleic acid (eg, duplex DNA).
- the engineered CasX effector protein comprises an enzyme-inactive mutant of the engineered CasX nuclease.
- the engineered CasX enzymes provided herein can be modified to have reduced nuclease activity, e.g., the nuclease is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 97%, or 100% inactive compared to a wild-type CasX enzyme.
- the nuclease activity can be reduced by several methods, for example, introducing mutations into the nucleic acid interaction domain of the nuclease or CasX enzyme.
- catalytic residues for nuclease activity are identified, and these amino acid residues can be replaced with different amino acid residues (eg, glycine or alanine) to reduce the nuclease activity.
- the engineered CasX nuclease has increased activity compared to the reference CasX nuclease.
- the activity is target DNA binding activity.
- the activity is a site-specific nuclease activity.
- the activity is double-stranded DNA cleavage activity.
- the activity is single-stranded DNA cleavage activity, including, for example, site-specific DNA cleavage activity or non-specific DNA cleavage activity.
- the activity is single-stranded RNA cleavage activity, eg, site-specific RNA cleavage activity or non-specific RNA cleavage activity.
- the activity is measured in vitro.
- the activity is measured in cells, such as bacterial cells, plant cells, or eukaryotic cells. In some embodiments, the activity is measured in mammalian cells, such as rodent cells or human cells. In some embodiments, the activity is measured in human cells, such as 293T cells. In some embodiments, the activity is measured in mouse cells, eg, Hepal-6 cells.
- the engineered CasX nuclease has at least about any of 20%, 30%, 40%, 60%, 70%, 80%, 90%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold or more increased site-specific nuclease activity compared to a reference CasX nuclease.
- the site-specific nuclease activity of the engineered CasX nuclease can be measured using methods known in the art, including, for example, gel shift assays, agarose gel electrophoresis-based in vitro cleavage assays as described in the Examples provided herein.
- the activity is gene editing activity in the cell.
- the cells are bacterial cells, plant cells, or eukaryotic cells.
- the cells are mammalian cells such as rodent cells or human cells.
- the cells are 293T cells.
- the activity is measured in mouse cells, eg, Hepal-6 cells.
- the activity is an indel forming activity at a target genomic site in the cell, such as site-specific cleavage of a target nucleic acid by the engineered CasX nuclease and DNA repair by a non-homologous end joining (NHEJ) mechanism.
- NHEJ non-homologous end joining
- the activity is insertion of an exogenous nucleic acid sequence at a target genomic site in the cell, such as site-specific cleavage of the target nucleic acid by the engineered CasX nuclease and DNA repair by a homologous recombination (HR) mechanism.
- HR homologous recombination
- the engineered CasX nuclease increases the gene editing (e.g., indel formation) activity of any of at least about 20%, 30%, 40%, 60%, 70%, 80%, 90%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold or more at a genomic locus of a cell (e.g., a human cell such as a 293T cell or a mouse Hepa1-6 cell) compared to a reference CasX nuclease.
- a cell e.g., a human cell such as a 293T cell or a mouse Hepa1-6 cell
- the engineered CasX nuclease is at least about 20%, 30%, 40%, 60%, 70%, 80%, 90%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold increased at multiple (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 or more) genomic sites in a cell (e.g., a human cell such as a 293T cell or a mouse Hepa1-6 cell) compared to a reference CasX2 nuclease
- the engineered CasX nuclease is capable of editing a greater number of genomic sites than the reference CasX nuclease.
- the consensus nucleic acid sequence of the engineered CasX nuclease is identical to the reference CasX nuclease.
- the gene editing efficiency of an engineered CasX nuclease in a cell can be determined using methods known in the art, including, for example, a T7 endonuclease 1 (T7E1 ) assay, sequencing of target DNA (including, for example, Sanger sequences, and next-generation sequencing), tracking indels by decomposition (TIDE) assays, or indel detection by amplicon analysis (IDAA) assays.
- T7E1 T7 endonuclease 1
- sequencing of target DNA including, for example, Sanger sequences, and next-generation sequencing
- TIDE tracking indels by decomposition
- IDAA indel detection by amplicon analysis
- the gene editing efficiency of the engineered CasX nuclease in cells is measured using targeted next generation sequencing (NGS), eg, as described in the Examples herein.
- NGS next generation sequencing
- Exemplary genomic sites for determining the gene editing efficiency of the engineered CasX nuclease include, but are not limited to, AAVS1, CCR5, CD34, and the like.
- the gene editing efficiency (eg indel rate) of the engineered CasX nuclease At least 20%, 30%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85% or more.
- the present application provides engineered CasX effector proteins with improved activities, such as target binding, double-strand cleavage activity, nickase activity, and/or gene editing activity.
- an engineered CasX effector protein e.g., a CasX nuclease, a CasX nickase, a CasX fusion effector protein, or a split CasX effector protein
- an engineered CasX effector protein comprising any of the engineered CasX nucleases described herein or a functional derivative thereof is provided.
- the present application provides engineered CasX effector proteins comprising functional variants of the engineered CasX nucleases described herein.
- the amino acid sequence of the functional variant differs by at least one amino acid residue (eg, with a deletion, insertion, substitution, and/or fusion) when compared to the amino acid sequence of a corresponding engineered CasX nuclease.
- the functional variant has one or more mutations, such as amino acid substitutions, insertions and/or deletions.
- the functional variant may comprise any of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acid substitutions compared to the engineered CasX nuclease.
- the one or more substitutions are conservative substitutions.
- the functional variant has all domains of the engineered CasX nuclease. In some embodiments, the functional variant does not have one or more domains of the engineered CasX nuclease.
- the CasX variant can include a CasX protein sequence having the same parameters described above (e.g., domains present, percent identity, etc.).
- the functional variant has a different catalytic activity than the non-mutated form of its engineered CasX nuclease.
- the mutation eg, amino acid substitution, insertion, and/or deletion
- the variant is in the catalytic domain (eg, RuvC domain) of the CasX effector protein.
- the variant comprises mutations in multiple catalytic domains.
- a CasX effector protein that cleaves one strand of a double-stranded target nucleic acid but not the other is referred to herein as a "nickase" (eg, a "CasX nickase").
- CasX proteins that are substantially devoid of nuclease activity are referred to herein as inactive CasX proteins ("dCasX") (disclaimer: in the case of fusion CasX effector proteins, a heterologous polypeptide (fusion partner) can provide nuclease activity, which will be described in detail below).
- a CasX effector protein is considered to lack substantially all DNA cleavage activity when the mutant enzyme has less than about 25%, 10%, 5%, 1%, 0.1%, 0.01% or less DNA cleavage activity relative to its non-mutated form.
- the present application also relates to an engineered CasX effector protein according to the foregoing, which comprises a first polypeptide and a second polypeptide, the first polypeptide comprising amino acid residues 1 to X of the N-terminal part of the aforementioned engineered CasX nuclease, and the second polypeptide comprising amino acid residues X+1 of the aforementioned engineered CasX nuclease to the C-terminus of the CasX nuclease, wherein the first polypeptide and the second polypeptide can associate with each other in the presence of a guide RNA comprising a guide sequence to form a clustered regularly interspaced short palindromic repeat (CRISPR) complex that specifically binds to a target nucleic acid A target nucleic acid comprising a target sequence complementary to said guide sequence.
- CRISPR clustered regularly interspaced short palindromic repeat
- the application also provides split CasX effector proteins based on any of the engineered CasX effector proteins described herein. Split CasX effectors may be advantageous for delivery.
- the engineered CasX effector protein is split into two parts of the enzyme, which can be reconstituted together to provide a substantially functioning CasX effector protein.
- CasX effector proteins can be provided using known methods, for example, fragmented forms of the Cas12 and Cas9 proteins have been described in, for example, WO2016/112242, WO2016/205749 and PCT/CN 2020/111057, which are incorporated herein by reference in their entirety.
- a split-type CasX effector protein comprising a first polypeptide and a second polypeptide, the first polypeptide comprising an N-terminal portion of any one of the engineered CasX nucleases described herein or a functional derivative thereof, the second polypeptide comprising a C-terminal portion of the engineered CasX nuclease or a functional derivative thereof, wherein the first polypeptide and the second polypeptide can associate with each other in the presence of a guide RNA comprising a guide sequence to form a CRISPR complex specifically binding to a target nucleic acid, the target nucleic acid comprising a target sequence complementary to the guide sequence.
- the first polypeptide and the second polypeptide each comprise a dimerization domain. In some embodiments, the first dimerization domain and the second dimerization domain associate with each other in the presence of an inducing agent (eg, rapamycin). In some embodiments, the first polypeptide and the second polypeptide do not comprise a dimerization domain. In some embodiments, the segmented CasX effector protein is autoinducible.
- CasX effector proteins can act as nucleases (including nicking enzymes) or can be inactivated enzymes, which are essentially RNA-guided DNA-binding proteins with little or no catalytic activity (eg, due to mutations in their catalytic domain).
- the nuclease lobe and the ⁇ -helical lobe of the CasX protein are expressed as separate polypeptides.
- RNA guide sequences recruit them into a complex that recapitulates the activity of the full-length CasX enzyme and catalyzes site-specific DNA cleavage.
- modified RNA guide sequences can be used to prevent dimerization by Instead, the activity of the split-type enzyme is eliminated, allowing the development of an inducible dimerization system.
- split-type enzymes are described, for example, in Wright, Addison V., et al. "Rational design of a split-Cas9 enzyme complex," Proc. Nat'l. Acad. Sci., 112.10(2015):2984-2989, which is incorporated herein by reference in its entirety.
- the split CasX effector protein portion described herein can be designed to be split in half by splitting (i.e., splitting) a reference engineered CasX effector protein (e.g., full-length engineered CasX) at a split position, which is the point at which the N-terminal portion of the reference CasX effector protein is separated from the C-terminal portion.
- a reference engineered CasX effector protein e.g., full-length engineered CasX
- the N-terminal portion comprises amino acid residues 1 to X
- the C-terminal portion comprises amino acid residues X+1 to the C-terminus of the reference CasX effector protein.
- the numbering is sequential, but this is not required, as it is also contemplated that amino acids (or nucleotides encoding them) may be trimmed from either split ends and/or mutations (e.g., insertions, deletions, and substitutions) in the interior region of the polypeptide chain, provided that the reconstituted CasX effector protein retains sufficient DNA binding activity (if desired), DNA nickase or cleavage activity, e.g., at least 40%, 50%, 60%, 70%, 80%, 90% compared to the reference CasX effector protein % or 95% active.
- Cutpoints can be designed in silico and cloned into constructs. During this process, mutations can be introduced into segmented CasX effector proteins and non-functional domains can be removed. In some embodiments, the two parts or fragments (i.e., N-terminal and C-terminal fragments) of the split CasX effector protein can form a complete CasX effector protein comprising, for example, at least 70%, at least 80%, at least 90%, at least 95%, or at least 99% of the complete CasX effector protein sequence.
- the segmented CasX effector proteins may each comprise one or more dimerization domains.
- the first polypeptide comprises a first dimerization domain fused to a first segmented CasX effector protein portion
- the second polypeptide comprises a second dimerization domain fused to a second segmented CasX effector protein portion.
- the dimerization domain can be fused to the segmented CasX effector protein portion by a peptide linker (eg, a flexible peptide linker such as a GS linker) or a chemical bond.
- the dimerization domain is fused to the N-terminus of the segmented CasX effector protein portion.
- the dimerization domain is fused to the C-terminus of the segmented CasX effector portion.
- the segmented CasX effector protein does not comprise any dimerization domains.
- the dimerization domain facilitates the association of two segmented CasX effector protein moieties.
- the segmented CasX effector portion is induced by an inducer to associate or dimerize into a functional CasX effector.
- the Cut-type CasX effector proteins contain an inducible dimerization domain.
- the dimerization domain is not an inducible dimerization domain, ie, the dimerization domain dimerizes in the absence of an inducing agent.
- An inducing agent can be an inducing energy source or an inducing molecule other than a guide RNA (eg, sgRNA).
- the inducer partially remodels the two segmented CasX effector proteins into a functional CasX effector protein through the induced dimerization of the dimerization domain.
- the inducing agent brings together the two segmented CasX effector protein moieties by inducing association of the inducible dimerization domain.
- the two segmented CasX effector protein moieties do not associate with each other to remodel into a functional CasX effector protein in the absence of an inducing agent.
- two separate CasX effector protein moieties can associate with each other in the presence of a guide RNA (eg, crRNA) to reconstitute into a functional CasX effector protein.
- the inducer of the present application may be heat, ultrasound, electromagnetic energy or chemical compounds.
- the inducing agent is an antibiotic, small molecule, hormone, hormone derivative, steroid, or steroid derivative.
- the inducer is abscisic acid (ABA), doxycycline (DOX), cumate, rapamycin, 4-hydroxytamoxifen (4OHT), estrogen, or ecdysone.
- the segmented CasX effector system is an inducer-controlled system selected from the group consisting of an antibiotic-based induction system, an electromagnetic energy-based induction system, a small molecule-based induction system, a nuclear receptor-based induction system, and a hormone-based induction system.
- the segmented CasX effector system is an inducer-controlled system selected from the group consisting of a tetracycline (Tet)/DOX inducible system, a light inducible system, an ABA inducible system, a cumate repressor/operator system, a 4OHT/estrogen inducible system, an ecdysone-based inducible system, and a FKBP12/FRAP (FKBP12-rapamycin complex) inducible system.
- inducers are also discussed herein and in PCT/US2013/051418, which is hereby incorporated by reference in its entirety.
- pairs of split CasX effector proteins are separated and inactive until dimerization of the dimerization domains (eg, FRB and FKBP) is induced, which dimerization results in reassembly of functional CasX effector protein nucleases.
- the first split CasX effector protein comprising the first half of an inducible dimer e.g., FRB
- the second split CasX effector protein comprising the second half of the inducible dimer (eg, FKBP).
- FKBP-based induction systems that can be used in the inducer-controlled split CasX effector systems described herein include, but are not limited to: FKBP that dimerizes with calcineurin (CNA) in the presence of FK506; FKBP that dimerizes with CyP-Fas in the presence of FKCsA; FKBP that dimerizes with FRB in the presence of rapamycin; GyrB that dimerizes with GryB in the presence of coumarycin; GAI that dimerizes with GID1 in the presence of GA; or Snap-tag that dimerizes with HaloTag in the presence of HaXS.
- CNA calcineurin
- FKBPs homodimerize (ie, one FKBP dimerizes with another FKBP) in the presence of FK1012.
- the dimerization domain is FKBP and the inducer is FK1012. In some embodiments, the dimerization domain is GryB and the inducer is Coumarin. In some embodiments, the dimerization domain is ABA and the inducing agent is gibberellin.
- the segmented CasX effector portion can be autoinduced (ie, autoactivated or autoinduced) in the absence of an inducing agent to associate/dimerize into a functional CasX effector.
- the auto-induction of the segmented CasX effector portion may be mediated by binding to a guide RNA such as crRNA.
- the first polypeptide and the second polypeptide do not comprise a dimerization domain.
- the first polypeptide and the second polypeptide comprise a dimerization domain.
- the reconstituted CasX effector protein of the split CasX effector system described herein has an editing efficiency of at least 70% (such as at least about any of 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more efficiency, or 100% efficiency) relative to a reference CasX effector protein editing efficiency.
- the reconstituted CasX effector protein of the inducer-controlled split CasX effector system described herein has an editing efficiency of no more than 50% (such as no more than any of about 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, 5% or less efficiency, or 0% efficiency) relative to the editing efficiency of a reference CasX effector protein in the absence of the inducer (i.e., due to auto-induction).
- the present application also provides engineered CasX effector proteins comprising additional protein domains and/or components, such as linkers, nuclear localization/export sequences, functional domains and/or reporter proteins.
- the engineered CasX effector protein is a protein complex comprising one or more heterologous protein domains (e.g., about or greater than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains) and the nucleic acid targeting domain of the engineered CasX nuclease or a functional derivative thereof.
- the engineered CasX effector protein is a fusion protein comprising one or more heterologous protein domains (e.g., about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains) fused to the engineered CasX nuclease.
- the engineered CasX effector protein of the present application may comprise (for example, through a fusion protein, such as through one or more peptide linkers, such as a GS peptide linker, etc.) one or more functional domains or associate (for example, through co-expression of multiple proteins) with it.
- the one or more functional domains are enzymatic domains. These functional domains can have various activities, such as DNA and/or RNA methylase activity, demethylase activity, transcriptional activation activity, transcriptional repression activity, transcriptional release factor activity, histone modification activity, RNA cleavage activity, DNA cleavage activity, nucleic acid binding activity, and switching activity (e.g., light-induced).
- the one or more functional domains are transcriptional activation domains (ie, transactivation domains) or repressor domains. In some embodiments, the one or more functional domains are histone modification domains. In some embodiments, the one or more functional domains are transposase domains, HR (homologous recombination) machinery domains, recombinase domains, and/or integrase domains. In some embodiments, the functional domain is Krüppel-associated box (KRAB), VP64, VP16, Fok1, P65, HSF1, MyoD1, Biotin-APEX, APOBEC1, AID, PmCDA1, Tad1, and M-MLV reverse transcriptase.
- KRAB Krüppel-associated box
- the functional domain is selected from the group consisting of a translation initiation domain, a transcriptional repression domain, a transactivation domain, an epigenetic modification domain, a nucleobase editing domain (eg, a CBE or ABE domain), a reverse transcriptase domain, a reporter domain (eg, a fluorescent domain), and a nuclease domain.
- the positioning of one or more functional domains in the engineered CasX effector protein allows correct spatial orientation of the functional domains to affect the target with the conferred functional effect.
- the functional domain is a transcriptional activator (eg, VP16, VP64, or p65)
- the transcriptional activator is placed in a spatial orientation that enables it to affect the transcription of the target.
- a transcriptional repressor is positioned to affect the transcription of a target
- a nuclease eg, Fok1
- the functional domain is located at the N-terminus of the engineered CasX effector protein.
- the functional domain is located at the C-terminus of the engineered CasX effector protein.
- the engineered CasX effector The response protein contains a first functional domain at the N-terminus and a second functional domain at the C-terminus.
- the engineered CasX effector protein comprises a catalytically inactive mutant of any of the engineered CasX nucleases described herein fused to one or more functional domains.
- the engineered CasX effector protein is a transcriptional activator.
- the engineered CasX effector protein comprises an enzyme-inactive variant of any of the engineered CasX nucleases described herein fused to a transactivation domain.
- the transactivation domain is selected from the group consisting of VP64, p65, HSF1, VP16, MyoD1, HSF1, RTA, SET7/9, and combinations thereof.
- the transactivation domain comprises VP64, p65, and HSF1.
- the engineered CasX effector protein comprises two split CasX effector polypeptides, each fused to a transactivation domain.
- the engineered CasX effector protein is a transcriptional repressor.
- the engineered CasX effector protein comprises an enzyme-inactive variant of any of the engineered CasX nucleases described herein fused to a transcriptional repression domain.
- the transcriptional repressor domain is selected from the group consisting of Krüppel-associated box (KRAB), EnR, NuE, NcoR, SID, SID4X, and combinations thereof.
- the engineered CasX effector protein comprises two split CasX effector polypeptides, each fused to a transcriptional repression domain.
- the engineered CasX effector protein is a base editor, such as a cytosine editor or an adenosine editor.
- the engineered CasX effector protein comprises an enzyme-inactive variant of any of the engineered CasX nucleases described herein fused to a nucleobase editing domain, such as a cytosine base editing (CBE) domain or an adenosine base editing (ABE) domain.
- the nucleobase editing domain is a DNA editing domain.
- the nucleobase editing domain has deaminase activity.
- the nucleobase editing domain is a cytosine deaminase domain.
- the nucleobase editing domain is an adenosine deaminase domain.
- Exemplary base editors based on Cas nucleases are described, eg, in WO2018/165629A1 and WO2019/226953A1, which are incorporated herein by reference in their entirety.
- Exemplary CBE domains include, but are not limited to, activation-induced cytidine deaminase or AID (e.g., hAID), apolipoprotein B mRNA editing complex or APOBEC (e.g., rat APOBEC1, hAPOBEC3 A/B/C/D/E/F/G), and PmCDA1.
- Exemplary ABE domains include, but are not limited to: TadA, ABE8, and variants thereof (see, eg, Gaudelli et al., 2017, Nature 551:464-471; and Richter et al., 2020, Nature Biotechnology 38:883-891).
- the functional domain is APOBEC1 domain, such as a rat APOBEC1 domain.
- the functional domain is a TadA domain, such as an E. coli TadA domain.
- the engineered CasX effector protein further comprises one or more nuclear localization sequences.
- the engineered CasX effector protein is a master editor. Cas9-based master editors are described, e.g., in A. Anzalone et al., Nature, 2019, 576(7785):149-157, which is hereby incorporated by reference in its entirety.
- the engineered CasX effector protein comprises a nickase variant of any one of the engineered CasX nucleases described herein fused to a reverse transcriptase domain.
- the functional domain is a reverse transcriptase domain.
- the reverse transcriptase domain is M-MLV reverse transcriptase or a variant thereof, such as an M-MLV reverse transcriptase having one or more mutations of D200N, T306K, W313F, T330P, and L603W.
- an engineered CRISPR/CasX system comprising said master editor is provided.
- the engineered CRISPR/CasX system further comprises a second CasX nickase, eg, based on the same engineered CasX nuclease as the primary editor.
- the engineered CRISPR/CasX system comprises a master editor guide RNA (pegRNA) comprising a primer binding site and a reverse transcriptase (RT) template sequence.
- pegRNA master editor guide RNA
- RT reverse transcriptase
- the application provides segmented CasX effector systems having one or more (e.g., 1, 2, 3, 4, 5, 6 or more) functional domains associated with (i.e., bound or fused to) one or both of the segmented CasX effector protein moieties.
- Said functional domain may be provided as part of said first and/or second segmented CasX effector protein, as a fusion within the construct.
- the functional domain is usually fused to other parts of the segmented CasX effector protein (eg, segmented CasX effector protein portion) via a peptide linker (such as a GS linker).
- These functional domains can be used to reprogram the function of this segmented CasX effector system based on catalytically inactive CasX effector proteins.
- the engineered CasX effector protein comprises one or more nuclear localization sequences (NLS) and/or one or more nuclear export sequences (NES).
- NLS sequences include, eg, PKKKRKVPG and ASPKKKRKV.
- NLS and/or NES can be operably linked to the N-terminus and/or C-terminus of the engineered CasX effector protein or a polypeptide chain in the engineered CasX effector protein.
- the engineered CasX effector protein can encode additional components, such as a reporter protein.
- the engineered CasX effector protein comprises a fluorescent protein, such as GFP.
- GFP fluorescent protein
- Such systems may allow imaging of genomic loci (see, for example, "Dynamic Imaging of Genomic Loci in Living Human Cells by an Optimized CRISPR/Cas System" Chen B et al. Cell 2013).
- the engineered CasX effector protein is an inducible segmentable CasX effector system that can be used to image genomic loci.
- an engineered CasX effector protein is provided, wherein the effector protein is capable of inducing double-strand breaks or single-strand breaks in DNA molecules.
- an engineered CasX effector protein wherein the functional derivative of the engineered CasX nuclease is an enzyme inactive mutant.
- an engineered CasX effector protein which further comprises a functional domain fused with the engineered CasX nuclease.
- an engineered CasX effector protein wherein the functional domain is selected from the group consisting of a translation initiation domain, a transcriptional repression domain, a transactivation domain, an epigenetic modification domain, a nucleobase editing domain, a reverse transcriptase domain, a reporter domain and a nuclease domain.
- the present application provides an engineered CRISPR-CasX system in a third aspect
- an engineered CRISPR-CasX system comprising: (a) the aforementioned engineered CasX effector protein; and (b) a guide RNA comprising a guide sequence complementary to a target sequence, or one or more nucleic acids encoding the guide RNA, wherein the engineered CasX effector protein and the guide RNA can form a CRISPR complex, and the CRISPR complex specifically binds to the target nucleic acid comprising the target sequence and induces modification of the target nucleic acid.
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- the engineered CRISPR-CasX system comprises: (a) any of the engineered CasX effector proteins described herein (e.g., an engineered CasX nuclease, nickase, segmented CasX, transcription repressor, transcription activator, base editor, or master editor); and (b) a guide RNA comprising a guide sequence complementary to a target sequence, or one or more nucleic acids encoding the guide RNA, wherein the engineered CasX effector protein and the guide RNA are capable of forming a CRISPR complex, the The CRISPR complex specifically binds to a target nucleic acid comprising the target sequence and induces modification of the target nucleic acid.
- any of the engineered CasX effector proteins described herein e.g., an engineered CasX nuclease, nickase, segmented CasX, transcription repressor, transcription activator, base editor, or master editor
- a guide RNA comprising a
- the engineered CRISPR-CasX system comprises one or more nucleic acids encoding the engineered CasX effector protein and/or the guide RNA.
- the engineered CRISPR-CasX The system comprises an array of precursor guide RNAs that can be processed into multiple crRNAs, eg, by the engineered CasX effector proteins.
- the engineered CRISPR-CasX system comprises one or more vectors encoding the engineered CasX effector protein and/or the guide RNA.
- the engineered CRISPR-CasX system comprises a ribonucleoprotein (RNP) complex comprising the engineered CasX effector protein bound to the guide RNA.
- RNP ribonucleoprotein
- the engineered CRISPR-CasX system of the present application can comprise any suitable guide RNA.
- a guide RNA may comprise a guide sequence capable of hybridizing to a target sequence in a target nucleic acid of interest, such as a genomic site of interest in a cell.
- the gRNA comprises a CRISPR RNA (crRNA) sequence comprising the guide sequence.
- the crRNAs described herein include direct repeat sequences and spacer sequences.
- the crRNA comprises, consists essentially of, or consists of a direct repeat sequence linked to a guide sequence or a spacer sequence.
- the crRNA includes a direct repeat sequence, a spacer sequence, and a direct repeat sequence (DR-spacer sequence-DR), which are typical features of a precursor crRNA (pre-crRNA) configuration.
- the crRNA includes truncated direct repeat and spacer sequences, which are typical features of processed or mature crRNA.
- the CRISPR-CasX effector protein forms a complex with an RNA guide sequence, and the spacer sequence directs the complex to sequence-specific binding to a target nucleic acid that is complementary to the spacer sequence.
- the guide RNA is a crRNA comprising a guide sequence.
- the engineered CRISPR-CasX system comprises an array of precursor guide RNAs encoding a plurality of crRNAs.
- the CasX effector protein cleaves the array of precursor guide RNAs to generate a plurality of crRNAs.
- the engineered CRISPR-CasX system comprises an array of precursor guide RNAs encoding multiple crRNAs, wherein each crRNA comprises a different guide sequence.
- the guide sequence can be of suitable length. In some embodiments, the guide sequence is between about 18 and about 35 nucleotides, including, for example, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 nucleotides.
- the guide sequence may be at least about 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% complementary to the target sequence of the target nucleic acid.
- constructs, vectors, and expression systems encoding any of the engineered CasX effector proteins (eg, engineered CasX nucleases) described herein.
- the construct, vector or expression system further comprises one or more gRNA or crRNA arrays.
- a “vector” is a composition of matter comprising an isolated nucleic acid and which can be used to deliver the isolated nucleic acid to the interior of a cell.
- Many vectors are known in the art including, but not limited to, linear polynucleotides, polynucleotides associated with ionic or amphiphilic compounds, plasmids, and viruses.
- suitable vectors contain an origin of replication functional in at least one organism, a promoter sequence, convenient restriction endonuclease sites and one or more selectable markers.
- the term “vector” should also be construed to include non-plasmid and non-viral compounds that facilitate the transfer of nucleic acids into cells, such as, for example, polylysine compounds, liposomes, and the like.
- the vector is a viral vector.
- viral vectors include, but are not limited to, adenoviral vectors, adeno-associated viral vectors, lentiviral vectors, retroviral vectors, vaccinia vectors, herpes simplex virus vectors, and derivatives thereof.
- the vector is a phage vector. Viral vector technology is well known in the art and is described, for example, in Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York), among other handbooks of virology and molecular biology.
- retroviruses provide a convenient platform for gene delivery systems.
- the heterologous nucleic acid can be inserted into a vector and packaged in retroviral particles using techniques known in the art.
- Recombinant virus can then be isolated and delivered to the engineered mammalian cells in vitro or ex vivo.
- Many retroviral systems are known in the art.
- an adenoviral vector is used.
- Many adenoviral vectors are known in the art.
- lentiviral vectors are used.
- a self-inactivating lentiviral vector is used.
- the vector is an adeno-associated virus (AAV) vector, such as AAV2, AAV8, or AAV9, which can administer the adenovirus or adeno-associated virus in a single dose comprising at least 1 x 105 particles (also known as particle units, pu).
- AAV adeno-associated virus
- the administered amount is at least about 1 ⁇ 10 6 particles, at least about 1 ⁇ 10 7 particles, at least about 1 ⁇ 10 8 particles, or at least about 1 ⁇ 10 9 particles of adeno-associated virus. Delivery methods and dosages are described, for example, in WO 2016205764 and US Patent No. 8,454,972, which are incorporated herein by reference in their entirety.
- the vector is a recombinant adeno-associated virus (rAAV) vector.
- modified AAV vectors can be used for delivery.
- Modified AAV vectors can to be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, AAV8.2, AAV9, AAV rh10, modified AAV vectors (e.g. modified AAV2, modified AAV3, modified AAV6) and pseudotyped AAV (e.g. AAV2/8, AAV2/5 and AAV2/6).
- Exemplary AAV vectors and techniques that can be used to generate rAAV particles are known in the art (see, e.g., Aponte-Ubillus et al. (2016) Appl. Microbiol. Biotechnol. 102(3):1045-54; Zhong et al. (2012) J. Genet. Syndr. Gene Ther. 160:38-47 (1987); Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-60; US Patent Nos. 4,797,368 and 5,173,414; International Publication Nos. WO2015/054653 and WO93/24641, each of which is incorporated herein by reference).
- AAV vector used to deliver Cas9 and other Cas proteins can be used to deliver the engineered CasX system of the present application.
- Vectors can be transferred into host cells by physical, chemical or biological means.
- vectors into host cells include: calcium phosphate precipitation, lipofection, particle bombardment, microinjection, electroporation, and the like. Methods for producing cells comprising vectors and/or exogenous nucleic acids are well known in the art. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York. In some embodiments, the vector is introduced into the cells by electroporation.
- Biological methods for introducing heterologous nucleic acids into host cells include the use of DNA and RNA vectors.
- Viral vectors have become the most widely used method for inserting genes into mammalian, eg human, cells.
- Chemical methods for introducing vectors into host cells include colloidal dispersion systems such as macromolecular complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes.
- An exemplary colloidal system for use as an in vitro delivery vehicle is a liposome (eg, an artificial membrane vesicle).
- the engineered CRISPR-CasX system is delivered in nanoparticles as RNPs.
- the vector or expression system encoding the CRISPR-CasX system or components thereof comprises one or more selectable or detectable markers that provide a means to isolate or efficiently select cells that contain and/or have been modified (e.g., at an early stage and at a large scale) by the CRISPR-CasX system.
- Reporter genes can be used to identify potentially transfected cells and assess the function of regulatory sequences.
- a reporter gene is a gene that is absent or not expressed in the recipient organism or tissue, and the expression of the polypeptide it encodes Reaching is evidenced by some readily detectable property, such as enzymatic activity. Expression of the reporter gene is measured at an appropriate time after introduction of the DNA into the recipient cells.
- Suitable reporter genes may include genes encoding luciferase, ⁇ -galactosidase, chloramphenicol acetyltransferase, secreted alkaline phosphatase, or the green fluorescent protein gene (eg, Ui-Tei et al. FEBS Letters 479:79-82 (2000)).
- heterologous nucleic acid in host cells include, for example, molecular biological assays, such as Southern and Northern blots, RT-PCR and PCR, well known to those skilled in the art; biochemical assays, such as detection of the presence or absence of specific peptides by immunological methods such as ELISA and Western blotting.
- molecular biological assays such as Southern and Northern blots, RT-PCR and PCR, well known to those skilled in the art
- biochemical assays such as detection of the presence or absence of specific peptides by immunological methods such as ELISA and Western blotting.
- the nucleic acid sequence encoding the engineered CasX effector protein and/or the guide RNA is operably linked to a promoter.
- the promoter is an endogenous promoter relative to the cell engineered using the engineered CRISPR-CasX system.
- the nucleic acid encoding the engineered CasX effector protein can be knocked into the genome of an engineered mammalian cell downstream of an endogenous promoter using any method known in the art.
- the endogenous promoter is the promoter of an abundant protein such as ⁇ -actin.
- the endogenous promoter is an inducible promoter, eg, inducible by an endogenous activation signal of the engineered mammalian cell.
- the promoter is a T cell activation dependent promoter (such as the IL-2 promoter, NFAT promoter or NF ⁇ B promoter).
- the promoter is a heterologous promoter relative to the cell engineered using the engineered CRISPR-CasX system.
- a variety of promoters have been explored for expression of genes in mammalian cells, and any promoter known in the art may be used in this application. Promoters can be broadly classified as constitutive promoters or regulated promoters, such as inducible promoters.
- the nucleic acid sequence encoding the engineered CasX effector protein and/or the guide RNA is operably linked to a constitutive promoter.
- Constitutive promoters allow the constitutive expression of heterologous genes (also known as transgenes) in host cells.
- Exemplary constitutive promoters contemplated herein include, but are not limited to: cytomegalovirus (CMV) promoter, human elongation factor-1 ⁇ (hEF1 ⁇ ), ubiquitin C promoter (UbiC), phosphoglycerol kinase promoter (PGK), Simian virus 40 early promoter (SV40), and chicken ⁇ -actin promoter coupled to CMV early enhancer (CAG).
- CMV cytomegalovirus
- hEF1 ⁇ human elongation factor-1 ⁇
- UbiC ubiquitin C promoter
- PGK phosphoglycerol kinase promoter
- SV40 Simian virus 40 early promoter
- CAG CMV early
- the promoter is a CAG promoter comprising a cytomegalovirus (CMV) early enhancer element, Promoter, first exon and first intron of chicken ⁇ -actin gene, and splice acceptor of rabbit ⁇ -globin gene.
- CMV cytomegalovirus
- the nucleic acid sequence encoding the engineered CRISPR-CasX effector protein and/or the guide RNA is operably linked to an inducible promoter.
- Inducible promoters are a type of regulated promoter.
- the inducible promoter can be induced by one or more conditions, such as physical conditions, the microenvironment or the physiological state of the host cell, an inducer (ie, an inducer), or a combination thereof.
- the inducing conditions are selected from the group consisting of inducing agent, irradiation (e.g., ionizing radiation, light), temperature (e.g., heat), redox state, tumor environment, and activation state of the cells to be engineered by the engineered CRISPR-CasX system.
- the promoter is inducible by a small molecule inducing agent such as a chemical compound.
- the small molecule is selected from the group consisting of doxycycline, tetracycline, alcohol, metal, or steroid. Chemically inducible promoters have been most extensively studied.
- Such promoters include promoters whose transcriptional activity is regulated by the presence or absence of small molecule chemicals such as doxycycline, tetracycline, alcohols, steroids, metals and other compounds.
- the doxycycline-inducible system with retrotetracycline-controlled transactivator (rtTA) and tetracycline-responsive element promoter (TRE) is currently the most mature system.
- rtTA retrotetracycline-controlled transactivator
- TRE tetracycline-responsive element promoter
- WO9429442 describes the tight control of gene expression in eukaryotic cells by tetracycline-responsive promoters.
- WO9601313 discloses tetracycline regulated transcriptional regulators.
- Tet technologies such as the Tet-on system have been described at, for example, the TetSystems.com website.
- any known chemically regulated promoter can be used to drive the expression encoding the engineered CRISPR-CasX
- the nucleic acid sequence encoding the engineered CasX effector protein is codon optimized.
- an expression construct comprising a codon-optimized sequence encoding the engineered CasX effector protein ligated to a BPK2104-ccdB vector.
- the expression construct encodes a tag (eg, a 10xHis tag) operably linked to the C-terminus of the engineered CasX effector protein.
- expression constructs for use in mammalian cells comprising a nucleic acid sequence encoding said engineered CasX effector protein.
- the expression construct comprises a codon optimized sequence encoding the engineered CasX effector protein inserted into the pCAG-2A-eGFP vector such that the CasX protein is operably linked to eGFP.
- a second vector is provided for expression of a guide RNA (eg, crRNA or pre-crRNA array) in a mammalian cell (eg, a human cell).
- the sequence encoding the guide RNA is expressed in the pUC19-U6-i2-cr RNA vector backbone.
- the present application provides a method for detecting target nucleic acid in a sample in a fourth aspect
- a method for detecting a target nucleic acid in a sample comprising: (a) contacting the sample with the aforementioned engineered CRISPR-CasX system and a tagged detection nucleic acid, the detection nucleic acid is single-stranded and does not hybridize with the guide sequence of the guide RNA; and (b) measuring a detectable signal generated by cutting the tagged detection nucleic acid with the engineered CasX effector protein, thereby detecting the target nucleic acid.
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- the present application also provides methods for detecting target nucleic acids using any of the engineered CasX effector proteins or CRISPR-CasX systems with improved activity.
- CasX effector proteins as detection reagents can take advantage of the discovery that, once activated by detection of target DNA, type V CRISPR/Cas proteins (e.g., CasX) can promiscuously cleave non-targeted single-stranded DNA (ssDNA or RNA, i.e., single-stranded nucleic acid to which the guide sequence of a guide RNA does not hybridize).
- target DNA double-stranded or single-stranded
- the result is cleavage of the single-stranded nucleic acid in the sample, which can be detected using any convenient detection method (e.g., using tagged single-stranded detection nucleic acids such as DNA or RNA).
- CasX can cleave ssDNA and ssRNA.
- methods using Cas proteins as detection reagents are described in US10253365 and WO2020/056924, which are hereby incorporated by reference in their entirety.
- a method of detecting target DNA comprising: (a) contacting the sample with: (i) any of the engineered CasX effector proteins described herein; (ii) a guide RNA comprising a guide sequence that hybridizes to the target DNA; and (b) measuring a detectable signal produced by cleavage of the single-stranded detection nucleic acid by the engineered CasX effector protein.
- the single-stranded detection nucleic acid includes a fluorescence emitting dye pair (eg, the fluorescence emitting dye pair is a fluorescence resonance energy transfer (FRET) pair, a quencher/fluorescent pair).
- the target DNA is viral DNA (eg, papillomavirus, hepadnavirus, herpesvirus, adenovirus, poxvirus, parvovirus, etc.).
- the single-stranded detection nucleic acid is DNA.
- the single-stranded detection nucleic acid is RNA.
- the method for detecting target DNA (single-stranded or double-stranded) in a sample of the present disclosure can detect target DNA with high sensitivity.
- the engineered CasX effector protein described herein can detect target DNA with higher sensitivity than the reference CasX nuclease.
- the engineered CasX effector protein can detect target DNA with a sensitivity of 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or higher compared to the reference CasX nuclease.
- the present application relates to a method of modifying a target nucleic acid comprising a target sequence
- a method of modifying a target nucleic acid comprising a target sequence comprising contacting the target nucleic acid with the aforementioned engineered CRISPR-CasX system.
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- the present application provides methods of modifying a target nucleic acid comprising a target sequence comprising contacting the target nucleic acid with any of the engineered CRISPR-CasX systems described herein.
- the method is performed in vitro.
- the target nucleic acid is present in a cell.
- the cells are bacterial cells, yeast cells, Mammalian cells, plant cells or animal cells.
- the method is performed ex vivo. In some embodiments, the method is performed in vivo.
- the target nucleic acid is cleaved or a target sequence in the target nucleic acid is altered by the engineered CRISPR-CasX system. In some embodiments, expression of the target nucleic acid is altered by the engineered CRISPR-CasX system. In some embodiments, the target nucleic acid is genomic DNA. In some embodiments, the target sequence is associated with a disease or condition. In some embodiments, the engineered CRISPR-CasX system comprises an array of precursor guide RNAs encoding multiple crRNAs, wherein each crRNA comprises a different guide sequence.
- the present application provides methods of treating a disease or condition associated with a target nucleic acid in a cell of an individual comprising using any of the methods described herein to modify the target nucleic acid in the cell of the individual, thereby treating the disease or condition.
- the disease or condition is selected from the group consisting of cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection.
- the engineered CRISPR-CasX systems described herein can modify target nucleic acids in cells in a variety of ways, depending on the type of CasX effector protein engineered in the CRISPR-CasX system.
- the methods induce site-specific cleavage in the target nucleic acid.
- the methods cleave genomic DNA in cells such as bacterial cells, plant cells, or animal cells (eg, mammalian cells).
- the method kills the cell by cleavage of genomic DNA in the cell.
- the methods cleave viral nucleic acid in the cell.
- the methods alter (eg, increase or decrease) the expression level of the target nucleic acid in a cell.
- the method uses an engineered CasX effector protein to increase the expression level of the target nucleic acid in a cell, eg, based on an enzymatically inactive CasX protein fused to a transactivation domain.
- the method uses an engineered CasX effector protein to reduce the expression level of the target nucleic acid in a cell, eg, based on an enzymatically inactive CasX protein fused to a transcriptional repression domain.
- the method uses an engineered CasX effector protein to introduce an epigenetic modification into the target nucleic acid in a cell, eg, based on an enzymatically inactive CasX protein fused to an epigenetic modification domain.
- the engineered CasX system described herein can be used to introduce other modifications into the target nucleic acid, depending on the functional domains comprised by the engineered CasX effector protein.
- the method alters a target sequence in the target nucleic acid in a cell.
- the method introduces a mutation into the target nucleic acid in a cell.
- the methods use one or more endogenous DNA repair pathways, such as non-homologous end joining (NHEJ) or homology-directed recombination (HDR), to repair double-strand breaks induced in the target DNA in the cell as a result of sequence-specific cleavage by the CRISPR complex.
- Exemplary mutations include, but are not limited to, insertions, deletions, substitutions, and frameshifts.
- the method inserts donor DNA at the target site.
- insertion of donor DNA results in the introduction of a selectable marker or reporter protein into the cell. In some embodiments, insertion of donor DNA results in knock-in of the gene. In some embodiments, insertion of donor DNA results in a knockout mutation. In some embodiments, insertion of donor DNA results in substitutional mutations such as single nucleotide substitutions. In some embodiments, the method induces a phenotypic change in the cell.
- the engineered CRISPR-CasX system is used as part of a genetic circuit, or to insert a genetic circuit into the genomic DNA of a cell.
- the inducer-controlled engineered segmented CasX effector proteins described herein are particularly useful as components of genetic circuits.
- Genetic circuits can be used in gene therapy. Methods and techniques for designing and using genetic circuits are known in the art. Further reference can be made to eg Brophy, Jennifer AN, and Christopher A. Voigt.”Principles of genetic circuit design.” Nature methods 11.5(2014):508.
- the target nucleic acid is in a cell.
- the target nucleic acid is genomic DNA.
- the target nucleic acid is extrachromosomal DNA.
- the target nucleic acid is foreign to the cell.
- the target nucleic acid is viral nucleic acid such as viral DNA.
- the target nucleic acid is a plasmid in a cell.
- the target nucleic acid is a horizontally transferred plasmid.
- the target nucleic acid is RNA.
- the target nucleic acid is an isolated nucleic acid such as isolated DNA. In some embodiments, the target nucleic acid is present in a cell-free environment. In some embodiments, the target nucleic acid is an isolated vector such as a plasmid. In some embodiments, the target nucleic acid is an isolated linear DNA fragment.
- the cells are bacteria, yeast cells, fungal cells, algal cells, plant cells, or animal cells (eg, mammalian cells, such as human cells).
- the cells are naturally derived from Such as cells isolated from a tissue biopsy.
- the cells are cells isolated from cell lines cultured in vitro.
- the cells are from a primary cell line.
- the cells are from an immortalized cell line.
- the cells are genetically engineered cells.
- the cell is an animal cell of an organism selected from the group consisting of cattle, sheep, goats, horses, pigs, deer, chickens, ducks, geese, rabbits, and fish.
- the cell is a plant cell of an organism selected from the group consisting of corn, wheat, barley, oat, rice, soybean, oil palm, safflower, sesame, tobacco, flax, cotton, sunflower, pearl millet, millet, sorghum, canola, hemp, vegetable crops, forage crops, industrial crops, tree crops, and biomass crops.
- the cells are mammalian cells. In some embodiments, the cells are human cells. In some embodiments, the human cells are human embryonic kidney 293T (HEK293T or 293T) cells or HeLa cells. In some embodiments, the cells are human embryonic kidney (HEK293T) cells. In some embodiments, the cells are mouse Hepal-6 cells. In some embodiments, the mammalian cells are selected from the group consisting of immune cells, liver cells, tumor cells, stem cells, zygotes, muscle cells, and skin cells.
- the cell is an immune cell selected from the group consisting of cytotoxic T cells, helper T cells, natural killer (NK) T cells, iNK-T cells, NK-T-like cells, ⁇ T cells, tumor infiltrating T cells, and dendritic cell (DC) activated T cells.
- the methods generate modified immune cells, such as CAR-T cells or TCR-T cells.
- the cells are embryonic stem (ES) cells, induced pluripotent stem (iPS) cells, progenitor cells of gametes, cells in gametes, zygotes, or embryos.
- ES embryonic stem
- iPS induced pluripotent stem
- the methods described herein can be used to modify target cells in vivo, ex vivo, or in vitro, and can be done in such a way that the cells are altered such that, once modified, progeny or cell lines of the modified cells retain the altered phenotype.
- the modified cells and progeny may be part of a multicellular organism such as a plant or animal with ex vivo or in vivo applications such as genome editing and gene therapy.
- the method is performed ex vivo.
- the modified cells eg, mammalian cells
- the modified cells are propagated ex vivo.
- the modified cells are cultured to propagate for at least about any of 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 7 days, 10 days, 12 days, or 14 days.
- the modified cells are cultured for no more than about 1 day, 2 days, 3 days Any of days, 4 days, 5 days, 6 days, 7 days, 10 days, 12 days or 14 days.
- the modified cells are further evaluated or screened to select cells with one or more desired phenotypes or properties.
- the target sequence is a sequence associated with a disease or condition.
- diseases or conditions include, but are not limited to, cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection.
- the disease or condition is a genetic disease.
- the disease or condition is a monogenic disease or condition.
- the disease or condition is a polygenic disease or condition.
- the target sequence has a mutation compared to the wild-type sequence. In some embodiments, the target sequence has a single nucleotide polymorphism (SNP) associated with a disease or condition.
- SNP single nucleotide polymorphism
- the donor DNA inserted into the target nucleic acid encodes a biological product selected from the group consisting of reporter proteins, antigen-specific receptors, therapeutic proteins, antibiotic resistance proteins, RNAi molecules, cytokines, kinases, antigens, antigen-specific receptors, cytokine receptors, and suicide polypeptides.
- the donor DNA encodes a therapeutic protein.
- the donor DNA encodes a therapeutic protein that can be used in gene therapy.
- the donor DNA encodes a therapeutic antibody.
- the donor DNA encodes an engineered receptor, such as a chimeric antigen receptor (CAR) or an engineered TCR.
- the donor DNA encodes a therapeutic RNA, such as a small RNA (eg, siRNA, shRNA, or miRNA) or a long non-coding RNA (lincRNA).
- the methods described herein can be used for multiplex gene editing or modulation at two or more (eg, 2, 3, 4, 5, 6, 8, 10 or more) different target sites.
- the methods detect or modify multiple target nucleic acids or target nucleic acid sequences.
- the method comprises contacting a target nucleic acid with a guide RNA comprising a plurality (eg, 2, 3, 4, 5, 6, 8, 10 or more) of crRNA sequences, wherein each crRNA comprises a different target sequence.
- engineered cells comprising a modified target nucleic acid produced using any of the methods described herein.
- the engineered cells can be used in cell therapy.
- Autologous or allogeneic cells can be used to prepare engineered cells using the methods described herein for cell therapy.
- the methods described herein can also be used to generate isogenic lines of cells (eg, mammalian cells) to study genetic variants.
- engineered non-human animals comprising engineered cells described herein.
- the engineered non-human animal is a genome-edited non-human animal.
- the engineered non-human animals can be used as disease models.
- Non-human genome edited or transgenic animals include, but are not limited to: pronuclear microinjection, viral infection, transformation of embryonic stem cells and induced pluripotent stem (iPS) cells.
- iPS induced pluripotent stem
- Detailed methods that can be used include, but are not limited to, those described by Sundberg and Ichiki (2006, Genetically Engineered Mice Handbook, CRC Press) and those described by Gibson (2004, A Primer Of Genome Science 2nd ed. Sunderland, Mass.: Sinauer).
- the engineered animal can be of any suitable species including, but not limited to: cattle, horses, sheep, dogs, deer, felines, goats, pigs, primates, and lesser known mammals such as elephants, deer, zebras, or camels.
- the present application relates to the use of CRISPR-CasX system to target nuclear cells in cells for the preparation of treatments and individuals.
- Use in medicine for an acid-related disease or condition relates to the use of CRISPR-CasX system to target nuclear cells in cells for the preparation of treatments and individuals.
- a use of the aforementioned engineered CRISPR-CasX system in the preparation of a medicament for treating a disease or disorder associated with a target nucleic acid in a cell of an individual is provided; preferably, the disease or disorder is selected from the group consisting of cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection.
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- the present application provides a method of treating a disease or condition associated with a target nucleic acid in a cell of an individual comprising contacting the target nucleic acid with any of the engineered CRISPR-CasX systems described herein, wherein the guide sequence of the guide RNA is complementary to the target sequence of the target nucleic acid, wherein the engineered CasX effector protein and the guide RNA associate with each other to bind to the target nucleic acid to modify the target nucleic acid such that the disease or condition is treated.
- mutations are introduced into the target nucleic acid.
- expression of the target nucleic acid is enhanced.
- expression of the target nucleic acid is inhibited.
- the present application provides methods of treating a disease or condition in an individual comprising administering to the individual an effective amount of any of the engineered CRISPR-CasX systems described herein and donor DNA encoding a therapeutic agent, wherein the guide sequence of the guide RNA is Complementary to a target sequence of a target nucleic acid of the individual, wherein the engineered CasX effector protein and the guide RNA bind to each other to bind to the target nucleic acid and insert donor DNA into the target sequence, thereby allowing the disease or condition to be treated.
- the present application provides a method of treating a disease or condition in an individual comprising administering to the individual an effective amount of an engineered cell comprising a modified target nucleic acid, wherein the engineered cell is prepared by contacting the cell with any of the engineered CRISPR-CasX systems described herein, wherein the guide sequence of the guide RNA is complementary to the target sequence of the target nucleic acid, wherein the engineered CasX effector protein and the guide RNA associate with each other to bind to the target nucleic acid to modify the target nucleic acid.
- the engineered cells are immune cells.
- the individual is a human.
- the individual is an animal, eg, a model animal such as a rodent, pet, or farm animal.
- the individual is a mammal.
- the disease or condition is selected from the group consisting of cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection.
- the target nucleic acid is PCSK9.
- the disease or condition is cardiovascular disease.
- the disease or condition is coronary artery disease.
- the method reduces cholesterol levels in the individual.
- the method treats diabetes in the individual.
- the present application relates to a delivery method in the seventh aspect
- the engineered CRISPR-CasX system described herein, or components thereof, nucleic acid molecules thereof, or nucleic acid molecules encoding or providing components thereof can be delivered to a host cell by a variety of delivery systems such as plasmids or viruses (e.g., any of the vectors described in the "Constructs and Vectors" section above).
- the engineered CRISPR-CasX system can be delivered by other methods, such as nucleofection or electroporation of a ribonucleoprotein complex consisting of the engineered CasX effector protein and its one or more cognate RNA guide sequences.
- delivery is via nanoparticles or exosomes.
- paired CasX nickase complexes can be delivered directly using nanoparticles or other direct protein delivery methods such that complexes comprising two paired crRNA elements are co-delivered.
- proteins can be delivered to cells via viral vectors or directly followed by CRISPR arrays containing two paired spacers for double nicks.
- the RNA can be conjugated to at least one sugar moiety such as N-acetylgalactosamine (GalNAc) (particularly triantennary GalNAc).
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- the application relates to kits and articles of manufacture in an eighth aspect
- a kit is provided in a specific embodiment, comprising: one or more AAV vectors encoding any of the aforementioned engineered CasX nuclease, the aforementioned engineered CasX effector protein, and the aforementioned engineered CRISPR-CasX system.
- the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
- a “vector” is a composition of matter comprising an isolated nucleic acid and which can be used to deliver the isolated nucleic acid to the interior of a cell.
- Many vectors are known in the art including, but not limited to, linear polynucleotides, polynucleotides associated with ionic or amphiphilic compounds, plasmids, and viruses.
- suitable vectors contain an origin of replication functional in at least one organism, a promoter sequence, convenient restriction endonuclease sites and one or more selectable markers.
- the term “vector” should also be construed to include non-plasmid and non-viral compounds that facilitate the transfer of nucleic acids into cells, such as, for example, polylysine compounds, liposomes, and the like.
- the vector is a viral vector.
- viral vectors include, but are not limited to, adenoviral vectors, adeno-associated viral vectors, lentiviral vectors, retroviral vectors, vaccinia vectors, herpes simplex virus vectors, and derivatives thereof.
- the vector is a phage vector. Viral vector technology is well known in the art and is described, for example, in Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York), among other handbooks of virology and molecular biology.
- retroviruses provide a convenient platform for gene delivery systems.
- the heterologous nucleic acid can be inserted into a vector and packaged in retroviral particles using techniques known in the art.
- Recombinant virus can then be isolated and delivered to the engineered mammalian cells in vitro or ex vivo.
- Many retroviral systems are known in the art.
- an adenoviral vector is used.
- Many adenoviral vectors are known in the art.
- lentiviral vectors are used.
- a self-inactivating lentiviral vector is used.
- the vector is an adeno-associated virus (AAV) vector, such as AAV2, AAV8, or AAV9, which can administer the adenovirus or adeno-associated virus in a single dose comprising at least 1 x 105 particles (also known as particle units, pu).
- AAV adeno-associated virus
- the administered amount is at least about 1 ⁇ 10 6 particles, at least about 1 ⁇ 10 7 particles, at least about 1 ⁇ 10 8 particles, or at least about 1 ⁇ 10 9 particles of adeno-associated virus. Delivery methods and dosages are described, for example, in WO 2016205764 and US Patent No. 8,454,972, which are incorporated herein by reference in their entirety.
- the vector is a recombinant adeno-associated virus (rAAV) vector.
- modified AAV vectors can be used for delivery.
- Modified AAV vectors can be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, AAV8.2, AAV9, AAV rh10, modified AAV vectors (e.g., modified AAV2, modified AAV3, modified AAV6) and pseudotyped AAV (e.g., AAV2/8, AAV2/5, and AAV2/6).
- Exemplary AAV vectors and techniques that can be used to generate rAAV particles are known in the art (see, e.g., Aponte-Ubillus et al. (2016) Appl. Microbiol. Biotechnol. 102(3): 1045-54; Zhong et al. (2012) J. Genet. Syndr. Gene Ther. 160:38-47 (1987); Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-60; U.S. Patent Nos. 4,797,368 and 5,173,414; International Publication Nos. WO2015/054653 and WO93/24641, each of which is incorporated herein by reference).
- AAV vector used to deliver Cas9 and other Cas proteins can be used to deliver the engineered CasX system of the present application.
- compositions, kits, unit doses, and articles of manufacture comprising one or more components of any of the engineered CasX nucleases, engineered CasX effector proteins, or engineered CRISPR-CasX systems described herein.
- kits comprising: one or more AAV vectors encoding any of the engineered CasX nucleases, engineered CasX effector proteins, or engineered CRISPR-CasX systems described herein.
- the kit further comprises one or more guide RNAs.
- the kit further comprises donor DNA.
- the kit further comprises cells such as human cells.
- the kit may comprise one or more additional components, such as containers, reagents, media, cytokines, buffers, antibodies, etc., to allow propagation of the engineered cells.
- the kit may also comprise a device for administering the composition.
- the kit may also comprise instructions for using the engineered CRISPR-CasX system described herein, such as methods for detecting or modifying target nucleic acids.
- the kit comprises instructions for treating or diagnosing the disease or condition.
- the instructions pertaining to the use of the kit components will generally include information on the amount to be administered, the schedule of administration and the route of administration for such deliberate treatment.
- the container can be a unit dose, bulk package (eg, a multi-dose package), or a subunit dose.
- kits comprising sufficient doses of a composition disclosed herein can be provided to provide effective treatment of an individual over an extended period of time.
- the kit can also include a plurality of unit doses of the composition and instructions for use packaged in quantities sufficient for storage and use in a pharmacy (eg, hospital pharmacy and compounding pharmacy).
- kits of the invention are in suitable packaging.
- suitable packaging includes, but is not limited to, vials, bottles, jars, flexible packaging (eg, sealed mylar or plastic bags), and the like. Kits may optionally provide additional components such as buffers and explanatory information. Accordingly, the present application also provides an article of manufacture including vials (eg, sealed vials), bottles, jars, flexible packaging, and the like.
- the article of manufacture may comprise a container and a label or package insert on or adhered to the container.
- Suitable containers include, for example, bottles, vials, syringes, and the like.
- the container can be formed from a variety of materials such as glass or plastic.
- the container contains a composition effective to treat a disease or condition described herein, and may have a sterile access port (eg, the container may be a bag of intravenous solution or a vial with a stopper pierceable by a hypodermic needle).
- the label or package insert indicates that the composition is used to treat a particular condition in an individual.
- the label or package insert will further include instructions for administering the composition to the individual.
- Package Insert means the instructions commonly included in commercial packages of therapeutic products that contain information on the indications, usage, dosage, administration, contraindications and/or warnings regarding the use of such therapeutic products.
- the article of manufacture may also include a second container comprising a pharmaceutically acceptable buffer, such as bacteriostatic water for injection (BWFI), phosphate buffered saline, Ringer's solution, and dextrose solution. From a commercial and user perspective, it can also include other materials, including other buffers, diluents, filters, needles, and syringes.
- BWFI bacteriostatic water for injection
- Example 1 Replace the amino acid that interacts with PAM in the reference PlmCasX enzyme with a positively charged amino acid, and verify its gene editing efficiency.
- a vector expressing Cas protein crRNA or sgRNA in 293T was constructed by ligating the annealed oligonucleotide containing the target sequence into the BasI-digested pUC19-U6-crRNA/sgRNA backbone.
- HEK293T cells were cultured in DMEM (Gibco) containing 1% penicillin-streptomycin (Gibco) and 10% fetal bovine serum (Gibco). Cells were seeded in 24-cell culture dishes (Corning) for 16 hours until the cell density reached 70%.
- DMEM fetal bovine serum
- Lipofectamine 3000 Invitrogen
- 600ng of a plasmid encoding a Cas protein and 300ng of a plasmid encoding a crRNA were transfected into cells cultured in a 24-well cell culture dish. 120 h after transfection, cells were digested with trypsin-EDTA (0.05%) (Gibco), and then subjected to fluorescence-activated cell sorting (FACS).
- FACS fluorescence-activated cell sorting
- FACS-sorted GFP-positive 293FT cells were lysed with buffer L and incubated at 55 °C for 3 h, then at 95 °C for 10 min.
- dsDNA fragments containing target sites in different genomic loci were PCR amplified using the corresponding primers.
- target loci are directly amplified by barcoded PCR using cell lysates as templates.
- PCR products were purified and pooled into several libraries for high-throughput sequencing. The frequency (%) of indels was analyzed using CRISPResso2 software by calculating the ratio of reads containing indels or indels. Reads with an amount less than 0.05% of complete reads were discarded.
- Example 1-A selects three engineered PlmCasXs with single amino acid substitutions
- mutants with the following amino acid substitutions: T26R, K610R and E640R can effectively improve the gene editing efficiency, especially T26R, which can increase the efficiency by 36 times (this is the gene editing efficiency of the reference enzyme for AAVS1-2) and 37 times (this is the gene editing efficiency of the reference enzyme for CCR5-2).
- the mutants S230R, Q512R, and S564R have serious negative effects on the improvement of PlmCasX efficiency.
- Example 1-B selects engineered PlmCasX with two amino acid substitutions
- Example 2 Replace the amino acids in the reference PlmCasX enzyme that interact with the ssDNA substrate with positively charged amino acids, and verify its gene editing efficiency.
- a vector expressing Cas protein crRNA or sgRNA in 293T was constructed by ligating the annealed oligonucleotide containing the target sequence into the BasI-digested pUC19-U6-crRNA/sgRNA backbone.
- HEK293T cells were cultured in DMEM (Gibco) containing 1% penicillin-streptomycin (Gibco) and 10% fetal bovine serum (Gibco). Cells were seeded in 24-cell culture dishes (Corning) for 16 hours until the cell density reached 70%.
- DMEM fetal bovine serum
- Lipofectamine 3000 Invitrogen
- 600ng of a plasmid encoding a Cas protein and 300ng of a plasmid encoding a crRNA were transfected into cells cultured in a 24-well cell culture dish. 120 h after transfection, cells were digested with trypsin-EDTA (0.05%) (Gibco), and then subjected to fluorescence-activated cell sorting (FACS).
- FACS fluorescence-activated cell sorting
- FACS-sorted GFP-positive 293T cells were lysed with buffer L and incubated at 55 °C for 3 h, then at 95 °C for 10 min.
- dsDNA fragments containing target sites in different genomic loci were PCR amplified using the corresponding primers.
- target loci are directly amplified by barcoded PCR using cell lysates as templates.
- PCR products were purified and pooled into several libraries for high-throughput sequencing. The frequency (%) of indels was analyzed using CRISPResso2 software by calculating the ratio of reads containing indels or indels. Reads with an amount less than 0.05% of complete reads were discarded.
- Example 3 Combine some of the amino acid mutants obtained from the screening in Examples 1-2 that can improve the gene editing efficiency of the wild-type PlmCasX, and verify their gene editing efficiency.
- a vector expressing Cas protein crRNA or sgRNA in 293T was constructed by ligating the annealed oligonucleotide containing the target sequence into the BasI-digested pUC19-U6-crRNA/sgRNA backbone.
- HEK293T cells were cultured in DMEM (Gibco) containing 1% penicillin-streptomycin (Gibco) and 10% fetal bovine serum (Gibco). Cells were seeded in 24-cell culture dishes (Corning) for 16 hours until the cell density reached 70%.
- DMEM fetal bovine serum
- Lipofectamine 3000 Invitrogen
- 600ng of a plasmid encoding a Cas protein and 300ng of a plasmid encoding a crRNA were transfected into cells cultured in a 24-well cell culture dish. 120 h after transfection, cells were digested with trypsin-EDTA (0.05%) (Gibco), and then subjected to fluorescence-activated cell sorting (FACS).
- FACS fluorescence-activated cell sorting
- FACS-sorted GFP-positive 293FT cells were lysed with buffer L and incubated at 55 °C for 3 h, then at 95 °C for 10 min.
- dsDNA fragments containing target sites in different genomic loci were PCR amplified using the corresponding primers.
- target loci are directly amplified by barcoded PCR using cell lysates as templates.
- PCR products were purified and pooled into several libraries for high-throughput sequencing. The frequency (%) of indels was analyzed using CRISPResso2 software by calculating the ratio of reads containing indels or indels. Reads with an amount less than 0.05% of complete reads were discarded.
- the engineered PlmCasX enzymes with single and multiple mutations in the amino acid sequence were respectively expressed, and the preferred amino acid replacement method and its corresponding gene editing efficiency are shown in Figure 4 and Table 4.
- Figure 4 we further combined the optimal point mutation combination T26R+K610R obtained in Example 1-B and the optimal point mutation K808R in Example 2 into a protein, By comparing the gene editing efficiency of this new mutant (T26R+K610R+K808R) and wild-type PlmCasX at four genomic loci: AAVS1-2, AAVS1-7, CCR5-2 and CD34-1 in 293T cells, we found that the mutant T26R+K610R+K808R had the highest gene editing efficiency at all four tested genomic loci.
- the gene editing efficiency of T26R+K610R+K808R was increased by 474 times (this is the gene editing efficiency of the reference enzyme for AAVS1-2), 16 times (this is the gene editing efficiency of the reference enzyme for AAVS1-7), 152 times (this is the gene editing efficiency of the reference enzyme for CCR5-2) and 287 times (this is the gene editing efficiency of the reference enzyme for CD34-1 efficiency).
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medicinal Chemistry (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Immunology (AREA)
- Oncology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Communicable Diseases (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Diabetes (AREA)
- Cardiology (AREA)
- Heart & Thoracic Surgery (AREA)
- Plant Pathology (AREA)
- Hematology (AREA)
- Obesity (AREA)
- Neurology (AREA)
- Neurosurgery (AREA)
- Ophthalmology & Optometry (AREA)
- Analytical Chemistry (AREA)
Abstract
Description
本申请属于生物技术领域。更具体地说,本申请涉及具有提高的催化活性(例如基因编辑活性)的CasX核酸酶、效应蛋白及其用途。This application belongs to the field of biotechnology. More specifically, the present application relates to CasX nucleases, effector proteins, and uses thereof having enhanced catalytic activity (eg, gene editing activity).
基因组编辑是在基因组研究中的重要且有用的技术。有多个系统可用于基因组编辑,包括成簇规律间隔短回文重复序列(CRISPR)-Cas系统、转录激活子样效应因子核酸酶(TALEN)系统以及锌指核酸酶(ZFN)系统。Genome editing is an important and useful technique in genome research. Several systems are available for genome editing, including clustered regularly interspaced short palindromic repeat (CRISPR)-Cas systems, transcription activator-like effector nuclease (TALEN) systems, and zinc finger nuclease (ZFN) systems.
CRISPR-Cas系统是一种高效且具有成本效益的基因组编辑技术,可广泛应用于从酵母、植物到斑马鱼和人类的一系列真核生物中(参见综述:Van der Oost 2013,Science 339:768-770,以及Charpentier and Doudna,2013,Nature 495:50-51)。CRISPR-Cas系统通过结合CasX效应蛋白和CRISPR RNA(crRNA)在古细菌和细菌中提供适应性免疫。迄今为止,基于该系统的突出的功能上和进化上的模块性,已经对包括六型(I–VI型)两类(第1类和第2类)的CRISPR-Cas系统进行了表征。在第2类CRISPR-Cas系统中,II型Cas9系统和V型-A/B/E/J Cas12a/Cas12b/Cas12e/Cas12j系统已被利用来进行基因组编辑,并为生物医学研究提供了广阔的前景。The CRISPR-Cas system is an efficient and cost-effective genome editing technology that can be widely applied in a range of eukaryotes from yeast and plants to zebrafish and humans (see review: Van der Oost 2013, Science 339:768-770, and Charpentier and Doudna, 2013, Nature 495:50-51). The CRISPR-Cas system provides adaptive immunity in archaea and bacteria by combining CasX effector proteins and CRISPR RNA (crRNA). To date, two classes (classes 1 and 2) of CRISPR-Cas systems including six (types I–VI) have been characterized based on the system's outstanding functional and evolutionary modularity. In the second category of CRISPR-Cas systems, type II Cas9 system and type V-A/B/E/J Cas12a/Cas12b/Cas12e/Cas12j system have been exploited for genome editing and offer broad prospects for biomedical research.
但是,当前的CRISPR-Cas系统具有多种局限性,包括有限的基因编辑效率。因此,需要改进方法和系统以进行有效的跨多基因座的基因组编辑。However, current CRISPR-Cas systems have several limitations, including limited gene editing efficiency. Therefore, improved methods and systems are needed for efficient genome editing across multiple loci.
发明内容Contents of the invention
本申请提供了下述技术方案:The application provides the following technical solutions:
1.一种工程化的CasX核酸酶;其包含一个或多个基于参比CasX核酸酶的突变,所述突变为:将参比CasX核酸酶中与核酸有相互作用的氨基酸替换为带正电的氨基酸;所述参比CasX核酸酶为天然野生型CasX核酸酶。优选地,所述核酸是PAM。1. An engineered CasX nuclease; It comprises one or more mutations based on the reference CasX nuclease, the mutation is: the amino acid that interacts with the nucleic acid in the reference CasX nuclease is replaced with a positively charged amino acid; the reference CasX nuclease is a natural wild-type CasX nuclease. Preferably, the nucleic acid is a PAM.
2.如项1所述的工程化的CasX核酸酶,其中,所述CasX核酸酶选自 下组的任一项:PlmCasX、DpbCasX。2. The engineered CasX nuclease as described in item 1, wherein, the CasX nuclease is selected from Any one of the following group: PlmCasX, DpbCasX.
3.如项2所述的工程化的CasX核酸酶,其中,所述与核酸相互作用的一个或多个氨基酸是与核酸在三维结构上距离在9埃以内的氨基酸,尤其是下述位置的一个或多个氨基酸:26、27、29、105、195、198、204、222、230、512、564、565、610、640;其中,所氨基酸位置编号如SEQ ID NO.1所定义。3. The engineered CasX nuclease as described in item 2, wherein, the one or more amino acids interacting with the nucleic acid are amino acids within 9 angstroms in the three-dimensional structure, especially one or more amino acids at the following positions: 26, 27, 29, 105, 195, 198, 204, 222, 230, 512, 564, 565, 610, 640; wherein, the amino acid position numbering is as shown in SEQ ID NO.1 define.
4.如项3所述的工程化的CasX核酸酶,其中,所述带正电的氨基酸是R或K;优选带正电的氨基酸为R。4. The engineered CasX nuclease as described in item 3, wherein the positively charged amino acid is R or K; preferably the positively charged amino acid is R.
5.如项4所述的工程化的CasX核酸酶,其中,所述将参比CasX核酸酶中与核酸相互作用的一个或多个氨基酸替换为带正电的氨基酸是指如下替换中的一种或多种:T26R、K610R、K640、S759R、K808R;优选地,所述工程化CasX核酸酶包含下组的任一项的突变或突变组合:(1)T26R;(2)K610R;(3)K640R;(4)S759R;(5)K808R;(6)T26R和K610R;(7)T26R和K610R以及K808R。5. The engineered CasX nuclease as described in item 4, wherein, the replacement of one or more amino acids interacting with nucleic acid in the reference CasX nuclease with a positively charged amino acid refers to one or more of the following replacements: T26R, K610R, K640, S759R, K808R; 640R; (4) S759R; (5) K808R; (6) T26R and K610R; (7) T26R and K610R and K808R.
6.包含如SEQ ID NO.2~8中任一项所示氨基酸序列或与SEQ ID NO.2~8中任一项所示氨基酸序列有至少95%以上同一性的工程化CasX核酸酶。6. An engineered CasX nuclease comprising an amino acid sequence as shown in any one of SEQ ID NO.2~8 or having at least 95% or more identity with the amino acid sequence shown in any one of SEQ ID NO.2~8.
7.一种工程化的CasX效应蛋白,其包含项1~6中任一项所述的工程化的CasX核酸酶或其功能衍生物。7. An engineered CasX effector protein, comprising the engineered CasX nuclease or a functional derivative thereof according to any one of items 1-6.
8.如项7所述的工程化的CasX效应蛋白,其中所述效应蛋白能够诱导DNA分子中的双链断裂或单链断裂。8. The engineered CasX effector protein as described in item 7, wherein said effector protein can induce double-strand breaks or single-strand breaks in DNA molecules.
9.如项7所述的工程化的CasX效应蛋白,其中所述工程化的CasX核酸酶的功能衍生物为酶失活突变体。9. The engineered CasX effector protein as described in item 7, wherein the functional derivative of the engineered CasX nuclease is an enzyme inactive mutant.
10.如项7~9中任一项所述的工程化的CasX效应蛋白,其还包含与所述工程化的CasX核酸酶融合的功能结构域。10. The engineered CasX effector protein according to any one of items 7 to 9, which further comprises a functional domain fused with the engineered CasX nuclease.
11.如项10所述的工程化的CasX效应蛋白,其中所述功能结构域选自下组:翻译起始结构域、转录阻遏结构域、反式激活结构域、表观遗传修饰结构域、核碱基编辑结构域、逆转录酶结构域、报告分子结构域和核酸酶结构域。11. The engineered CasX effector protein as described in item 10, wherein said functional domain is selected from the group consisting of translation initiation domain, transcriptional repression domain, transactivation domain, epigenetic modification domain, nucleobase editing domain, reverse transcriptase domain, reporter molecular domain and nuclease domain.
12.如项7~11的任一项所述的工程化的CasX效应蛋白,其包含第一多肽和第二多肽,第一多肽包含项1-6中任一项所述的工程化的CasX核酸酶的N末端部分氨基酸残基1至X,第二多肽包含项1-6中任一项所述的工程 化的CasX核酸酶的氨基酸残基X+1至所述CasX核酸酶的C末端,其中所述第一多肽和所述第二多肽能够在包含指导序列的指导RNA的存在下彼此缔合,以形成与靶核酸特异性结合的成簇规律间隔短回文重复序列(CRISPR)复合物,所述靶核酸包含与所述指导序列互补的靶序列。12. The engineered CasX effector protein according to any one of items 7 to 11, comprising a first polypeptide and a second polypeptide, the first polypeptide comprising amino acid residues 1 to X of the N-terminal part of the engineered CasX nuclease described in any one of items 1-6, and the second polypeptide comprising the engineered CasX nuclease described in any one of items 1-6 Amino acid residue X+1 of the CasX nuclease of UL to the C-terminus of the CasX nuclease, wherein the first polypeptide and the second polypeptide are capable of associating with each other in the presence of a guide RNA comprising a guide sequence to form a Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) complex specifically binding to a target nucleic acid comprising a target sequence complementary to the guide sequence.
13.一种工程化的CRISPR-CasX系统,包括:13. An engineered CRISPR-CasX system comprising:
(a)项7-12中任一项所述的工程化的CasX效应蛋白;以及(a) the engineered CasX effector protein of any one of items 7-12; and
(b)包含与靶序列互补的指导序列的指导RNA,或编码所述指导RNA的一种或多种核酸,(b) a guide RNA comprising a guide sequence complementary to a target sequence, or one or more nucleic acids encoding said guide RNA,
其中所述工程化的CasX效应蛋白和所述指导RNA能够形成CRISPR复合物,所述CRISPR复合物特异性结合包含所述靶序列的靶核酸并诱导所述靶核酸的修饰。Wherein the engineered CasX effector protein and the guide RNA are capable of forming a CRISPR complex that specifically binds to a target nucleic acid comprising the target sequence and induces modification of the target nucleic acid.
14.一种检测样品中靶核酸的方法,包括:14. A method for detecting target nucleic acid in a sample, comprising:
(a)使样品与项13中的工程化的CRISPR-CasX系统以及加标签的检测核酸接触,该检测核酸为单链且不与所述指导RNA的指导序列杂交;以及(a) contacting the sample with the engineered CRISPR-CasX system of item 13 and a tagged detection nucleic acid that is single-stranded and does not hybridize to the guide sequence of the guide RNA; and
(b)测量通过所述工程化的CasX效应蛋白切割所述加标签的检测核酸而产生的可检测信号,从而检测所述靶核酸。(b) measuring a detectable signal generated by cleavage of the tagged detection nucleic acid by the engineered CasX effector protein, thereby detecting the target nucleic acid.
15.如项13所述的工程化的CRISPR-CasX系统在制备治疗与个体的细胞中靶核酸相关的疾病或病症的药物中的用途;优选地,所述疾病或病症选自下组:癌症、心血管疾病、遗传性疾病、自身免疫疾病、代谢性疾病、神经退行性疾病、眼病、细菌感染和病毒感染。15. Use of the engineered CRISPR-CasX system as described in item 13 in the preparation of a drug for treating a disease or a disease associated with a target nucleic acid in an individual's cells; preferably, the disease or disease is selected from the group consisting of cancer, cardiovascular disease, hereditary disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection and viral infection.
16.一种修饰包含靶序列的靶核酸的方法,包括使所述靶核酸与项13所述的工程化的CRISPR-CasX系统接触。16. A method of modifying a target nucleic acid comprising a target sequence, comprising contacting said target nucleic acid with the engineered CRISPR-CasX system of item 13.
17.一种试剂盒,其包含:一个或多个AAV载体,其编码根据项1-6的任一项所述工程化的CasX核酸酶、根据项7-12所述的工程化CasX效应蛋白和根据项13所述的工程化CRISPR-CasX系统中任一种。17. A kit comprising: one or more AAV vectors encoding any one of the engineered CasX nuclease according to any one of items 1-6, the engineered CasX effector protein according to items 7-12 and the engineered CRISPR-CasX system according to item 13.
本申请的技术方案取得的有益效果Beneficial effects obtained by the technical solution of the present application
本申请工程化的CasX核酸酶及其效应蛋白具有更高的活性,如切割核酸底物的催化效率及细胞内的基因编辑效率。本申请中的工程化Plm CasX核酸酶具有较现有常规Cas基因编辑工具更卓越的哺乳动物细胞(如人类细胞)内的基因编辑效率;在人类细胞中的基因编辑效率达到了约80%。同时,本申请工程化的CasX核酸酶及其效应蛋白还具有以下优点:蛋白小,crRNA 组分简单,PAM序列简单,并且蛋白自身能加工前体crRNA。这些优点使得本申请高效的工程化的CasX核酸酶及其效应蛋白非常适用于在体内进行基因编辑或者基因调控。The engineered CasX nuclease and its effector proteins in this application have higher activity, such as catalytic efficiency of cutting nucleic acid substrates and gene editing efficiency in cells. The engineered Plm CasX nuclease in this application has superior gene editing efficiency in mammalian cells (such as human cells) than existing conventional Cas gene editing tools; the gene editing efficiency in human cells has reached about 80%. At the same time, the engineered CasX nuclease and its effector protein of the present application also have the following advantages: small protein, crRNA The components are simple, the PAM sequence is simple, and the protein itself can process the precursor crRNA. These advantages make the highly efficient engineered CasX nuclease and its effector protein of the present application very suitable for gene editing or gene regulation in vivo.
图1:在人类细胞中,在两个基因位点(AAVS1-2和CCR5-2)处,14种发生单点突变的工程化CRISPR-PlmCasX的基因编辑效率比较(相互比较以及和野生型PlmCasX酶比较)。Figure 1: Comparison of gene editing efficiencies of 14 engineered CRISPR-PlmCasXs with single point mutations at two loci (AAVS1-2 and CCR5-2) in human cells (with each other and with wild-type PlmCasX enzymes).
图2:在人类细胞中,在四个基因位点(AAVS1-2、AAVS1-7、CCR5-2和CD34-1)处,3种发生单点突变以及两点突变的工程化CRISPR-PlmCasX的基因编辑效率比较(相互比较以及和野生型PlmCasX酶比较)。Figure 2: In human cells, at four gene loci (AAVS1-2, AAVS1-7, CCR5-2 and CD34-1), the gene editing efficiency comparison of three kinds of engineered CRISPR-PlmCasX with single point mutation and two point mutations (compared with each other and compared with wild-type PlmCasX enzyme).
图3:在人类细胞中,在两个基因位点(AAVS1-2和CCR5-2)处,16种发生单点突变的工程化CRISPR-PlmCasX的基因编辑效率比较(相互比较以及和野生型PlmCasX酶比较)。Figure 3: Comparison of gene editing efficiencies of 16 engineered CRISPR-PlmCasX with single point mutations at two loci (AAVS1-2 and CCR5-2) in human cells (compared to each other and to wild-type PlmCasX enzymes).
图4:在人类细胞中,在四个基因位点(AAVS1-2、AAVS1-7、CCR5-2和CD34-1)处,3种发生单点突变以及多点突变的工程化CRISPR-PlmCasX的基因编辑效率比较(相互比较以及和野生型PlmCasX酶比较)。Figure 4: In human cells, at four gene loci (AAVS1-2, AAVS1-7, CCR5-2 and CD34-1), the gene editing efficiency comparison of three kinds of engineered CRISPR-PlmCasX with single point mutation and multiple point mutation (compared with each other and compared with wild-type PlmCasX enzyme).
需要说明的是,在说明书及权利要求当中使用了某些词汇来指称特定组件。本领域技术人员应可以理解,技术人员可能会用不同名词来称呼同一个组件。本说明书及权利要求并不以名词的差异来作为区分组件的方式,而是以组件在功能上的差异来作为区分的准则。如在通篇说明书及权利要求当中所提及的“包含”或“包括”为一开放式用语,故应解释成“包含但不限定于”。说明书后续描述为实施本发明的较佳实施方式,然所述描述乃以说明书的一般原则为目的,并非用以限定本发明的范围。本发明的保护范围当视所附权利要求所界定者为准。除非另有定义,本文中使用的所有技术和科学术语具有与本公开所属领域的普通技术人员通常所理解的相同含义。It should be noted that certain terms are used in the specification and claims to refer to specific components. Those skilled in the art should understand that they may use different terms to refer to the same component. The specification and claims do not use differences in nouns as a way of distinguishing components, but use differences in functions of components as a criterion for distinguishing. "Includes" or "comprises" mentioned throughout the specification and claims is an open term, so it should be interpreted as "including but not limited to". The subsequent description in the specification is a preferred implementation mode for implementing the present invention, but the description is for the purpose of the general principles of the specification, and is not intended to limit the scope of the present invention. The scope of protection of the present invention should be defined by the appended claims. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.
术语the term
如本文所用,“效应蛋白”是指具有活性如位点特异性结合活性、单链DNA切割活性、双链DNA切割活性、单链RNA切割活性或转录调节活性的蛋白。如本文所用,“指导RNA”和“gRNA”在本文中可互换使用,是指能够与CasX效应蛋白和靶核酸(例如,双链DNA)形成复合物的RNA。CasX本文还考虑了可以被加工成多个crRNA的前体指导RNA阵列CasX。“crRNA”或“CRISPR RNA”包含与靶核酸(例如,双链DNA)的靶序列具有足够互补性的指导序列,其指导CRISPR复合物与靶核酸的序列特异性结合。As used herein, "effector protein" refers to a protein having an activity such as site-specific binding activity, single-strand DNA cleavage activity, double-strand DNA cleavage activity, single-strand RNA cleavage activity or transcription regulation activity. As used herein, "guide RNA" and "gRNA" are used interchangeably herein to refer to an RNA capable of forming a complex with a CasX effector protein and a target nucleic acid (eg, double-stranded DNA). CasX This paper also considers CasX, an array of precursor guide RNAs that can be processed into multiple crRNAs. "crRNA" or "CRISPR RNA" comprises a guide sequence with sufficient complementarity to the target sequence of a target nucleic acid (e.g., double-stranded DNA) that directs sequence-specific binding of the CRISPR complex to the target nucleic acid.
如本文所用,术语“核酸”、“多核苷酸”和“核苷酸序列”可互换使用,是指任何长度的核苷酸的聚合形式,包括脱氧核糖核苷酸、核糖核苷酸、其组合及其类似物。“寡核苷酸”和“低聚核苷酸”可互换使用,是指具有不超过约50个核苷酸的短多核苷酸。As used herein, the terms "nucleic acid," "polynucleotide," and "nucleotide sequence" are used interchangeably to refer to a polymeric form of nucleotides of any length, including deoxyribonucleotides, ribonucleotides, combinations thereof, and analogs thereof. "Oligonucleotide" and "oligonucleotide" are used interchangeably to refer to short polynucleotides having no more than about 50 nucleotides.
如本文所用,“互补性”是指核酸通过传统的沃森-克里克(Watson-Crick)碱基配对与另一核酸形成氢键的能力。互补性百分比表示可与第二种核酸形成氢键(即,沃森-克里克碱基配对)的核酸分子中的残基百分比(例如,10分之5、6、7、8、9、10,分别互补约50%、60%、70%、80%、90%和100%)。“完全互补”是指核酸序列的所有连续残基与第二核酸序列中相同数目的连续残基形成氢键。如本文所用,“基本上互补”是指在约40、50、60、70、80、100、150、200、250个或更多个核苷酸的区域上,互补程度为至少约70%、75%、80%、85%、90%、95%、97%、98%、99%或100中任一个,或指在严格条件下杂交的两种核酸。As used herein, "complementarity" refers to the ability of a nucleic acid to form hydrogen bonds with another nucleic acid through conventional Watson-Crick base pairing. Percent complementarity represents the percentage of residues in a nucleic acid molecule that can form hydrogen bonds (i.e., Watson-Crick base pairing) with a second nucleic acid (e.g., 5, 6, 7, 8, 9, 10 out of 10, about 50%, 60%, 70%, 80%, 90% and 100% complementary, respectively). "Perfectly complementary" means that all contiguous residues of a nucleic acid sequence form hydrogen bonds with the same number of contiguous residues in a second nucleic acid sequence. As used herein, "substantially complementary" refers to a degree of complementarity of at least about any of 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100 over a region of about 40, 50, 60, 70, 80, 100, 150, 200, 250 or more nucleotides, or to two nucleic acids that hybridize under stringent conditions.
如本文所用,用于杂交的“严格条件”是指与靶序列具有互补性的核酸主要与靶序列杂交,而基本上不与非靶序列杂交的条件。严格条件通常是序列依赖性的,并且取决于许多因素而变化。通常,序列越长,序列与其靶序列特异性杂交的温度越高。严格条件的非限制性例子详细描述于Tijssen(1993),Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes,第一部分第二章"Overview of principles of hybridization and the strategy of nucleic acid probe assay,”Elsevier,N,Y。As used herein, "stringent conditions" for hybridization refers to conditions under which a nucleic acid having complementarity to a target sequence primarily hybridizes to the target sequence and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent and vary depending on many factors. In general, the longer the sequence, the higher the temperature at which the sequence will specifically hybridize to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nuclear acid probe assay," Else vier, N, Y.
“杂交”是指一种或多种多核苷酸反应形成复合物的反应,所述复合物通过核苷酸残基的碱基之间的氢键而稳定化。氢键可通过沃森克里克碱基配对、霍格斯坦(Hoogstein)结合或以任何其他序列特异性方式发生。能够与给定序列杂交的序列称为所述给定序列的“互补体”。 "Hybridization" refers to a reaction in which one or more polynucleotides react to form a complex that is stabilized by hydrogen bonding between the bases of the nucleotide residues. Hydrogen bonding can occur through Watson-Crick base pairing, Hoogstein bonding, or in any other sequence-specific manner. A sequence that is capable of hybridizing to a given sequence is called the "complement" of that given sequence.
针对核酸序列的“序列同一性百分比(%)”定义为,在通过允许空缺(gaps)来比对序列(如有必要)以实现最大的序列同一性百分比后,候选序列中与特定核酸序列中的核苷酸相同的核苷酸百分比。针对肽、多肽或蛋白质序列的“序列同一性百分比(%)”,是在通过允许空缺来比对序列(如有必要)以实现最大的序列同源性百分比后,候选序列中与特定肽或氨基酸序列中的氨基酸残基相同替换的氨基酸残基的百分比。为了确定氨基酸序列同一性百分比的目的,比对可以以本领域技术范围内的各种方式来实现,例如,使用诸如BLAST、BLAST-2、ALIGN或MEGALIGNTM(DNASTAR)软件之类的公众可获得的计算机软件。本领域技术人员可以确定用于测量比对的合适参数,包括在所比较序列的全长上实现最大比对所需的任何算法。"Percent (%) sequence identity" for a nucleic acid sequence is defined as the percentage of nucleotides in a candidate sequence that are identical to nucleotides in a specific nucleic acid sequence after aligning the sequences by allowing gaps, if necessary, to achieve the maximum percent sequence identity. "Percent sequence identity (%)" for a peptide, polypeptide or protein sequence is the percentage of amino acid residues in a candidate sequence that are identical to those in a particular peptide or amino acid sequence after aligning the sequences by allowing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent amino acid sequence identity can be achieved in various ways that are within the skill in the art, for example, using publicly available computer software such as BLAST, BLAST-2, ALIGN or MEGALIGN ™ (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared.
术语“多肽”和“肽”在本文可互换使用,是指任何长度的氨基酸的聚合物。所述聚合物可以是直链或支链的,它可以包含经修饰的氨基酸,并且可以被非氨基酸中断。蛋白质可以具有一个或多个多肽。该术语还涵盖已经过修饰的氨基酸聚合物;例如,二硫键的形成、糖基化、脂质化、乙酰化、磷酸化或任何其他操作(诸如与标记组分的缀合)。The terms "polypeptide" and "peptide" are used interchangeably herein to refer to a polymer of amino acids of any length. The polymer may be linear or branched, it may contain modified amino acids, and it may be interrupted by non-amino acids. A protein can have one or more polypeptides. The term also encompasses amino acid polymers that have been modified; eg, disulfide bond formation, glycosylation, lipidation, acetylation, phosphorylation, or any other manipulation (such as conjugation with a labeling component).
如本文所用,“变体”解释为分别不同于参比多核苷酸或多肽但保留必要特性的多核苷酸或多肽。多核苷酸的典型变体与另一参比多核苷酸的核酸序列不同。变体核酸序列的变化可以改变或可以不改变参比多核苷酸编码的多肽的氨基酸序列。核苷酸变化可导致参比序列编码的多肽中的氨基酸替换、添加、缺失、融合和截短,如下所述。多肽的典型变体与另一参比多肽在氨基酸序列上不同。通常,差异是有限的,使得参比多肽和变体的序列总体上非常相似,并且在许多区域是相同的。变体和参比多肽的氨基酸序列可以通过一个或多个替换、添加、缺失的任何组合而不同。替换或插入的氨基酸残基可以是或可以不是遗传密码编码的氨基酸残基。多核苷酸或多肽的变体可以是天然存在的(诸如等位基因变体),或者可以是未知天然存在的变体。多核苷酸和多肽的非天然存在的变体可以通过诱变技术,通过直接合成,以及通过本领域技术人员已知的其他重组方法来制备。As used herein, "variant" is interpreted as a polynucleotide or polypeptide that differs from a reference polynucleotide or polypeptide, respectively, but retains essential properties. A typical variant of a polynucleotide differs from the nucleic acid sequence of another reference polynucleotide. Changes in the nucleic acid sequence of a variant may or may not alter the amino acid sequence of the polypeptide encoded by the reference polynucleotide. Nucleotide changes may result in amino acid substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the reference sequence, as described below. A typical variant of a polypeptide differs in amino acid sequence from another reference polypeptide. Usually, the differences are limited such that the sequences of the reference polypeptide and the variant are very similar overall and identical in many regions. The amino acid sequence of a variant and reference polypeptide may differ by any combination of one or more substitutions, additions, deletions. The substituted or inserted amino acid residues may or may not be those encoded by the genetic code. Variants of a polynucleotide or polypeptide may be naturally occurring (such as allelic variants), or may be variants that are not known to occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides can be prepared by mutagenesis techniques, by direct synthesis, and by other recombinant methods known to those skilled in the art.
如本文所用,术语“野生型”具有本领域技术人员通常理解的含义,意指当它存在于大自然中时,将其与突变体或变体区分开的、典型形式的生物体、菌株、基因或特征。它可以与自然界中的资源隔离开来,并没有被刻意修饰。 As used herein, the term "wild-type" has the meaning commonly understood by those skilled in the art, meaning a typical form of an organism, strain, gene or characteristic that distinguishes it from a mutant or variant as it exists in nature. It can be isolated from resources in nature and has not been deliberately modified.
如本文所用,术语“非天然存在”或“工程化的”可互换使用,是指人工参与。当这些术语用于描述核酸分子或多肽时,是指所述核酸分子或多肽至少基本上不含其天然缔合的或天然存在的至少一种其他组分。As used herein, the terms "non-naturally occurring" or "engineered" are used interchangeably to refer to the intervention of man. These terms, when used to describe a nucleic acid molecule or polypeptide, mean that the nucleic acid molecule or polypeptide is at least substantially free of at least one other component with which it is naturally associated or occurs in nature.
如本文所用,术语“直系同源物(orthologue/ortholog)”具有本领域普通技术人员通常理解的含义。作为进一步的指导,本文所指的蛋白质的“直系同源物”是指属于不同物种的蛋白质,其执行与作为其直系同源物的蛋白质相同或相似的功能。As used herein, the term "orthologue/ortholog" has the meaning commonly understood by those of ordinary skill in the art. As a further guidance, an "ortholog" of a protein as referred to herein refers to a protein belonging to a different species that performs the same or a similar function as the protein that is its ortholog.
如本文所用,术语“同一性”用于表示两个多肽之间或两个核酸之间的序列匹配。当两个比较序列中的一个位置被相同的碱基或氨基酸单体亚基占据时(例如,两个DNA分子的每个中的一个位置都被腺嘌呤占据,或者两个多肽的每个中的一个位置被赖氨酸占据),那么在那个位置每个分子均相同。这两个序列之间的“同一性百分比”是两个序列共有的匹配位置数除以要比较的位置数x 100的函数。例如,如果两个序列的10个位置中有6个匹配,则这两个序列具有60%的同一性。例如,DNA序列CTGACT和CAGGTT具有50%的同一性(总共6个位置中有3个匹配)。通常,当两个序列进行比对以产生最大的同一性时,进行这种比较。这种比对可以通过例如Needleman et al.(1970)J.Mol.Biol.48:443-453中的方法来实现,所述方法可方便地通过计算机程序如比对(Align)程序(DNAstar,Inc.)来进行。也可以采用PAM 120权重残基表,使用E.Meyers和W.Miller的算法(Comput.Appl Biosci.,4:11-17(1988))集成到ALIGN程序(2.0版)中。空缺长度罚分12和空缺罚分4,用于确定两个氨基酸序列之间的同一性百分比。此外,可以使用集成到GCG软件包(可从www.gcg.com获得)的GAP程序中的Needleman和Wunsch(J MoI Biol.48:444-453(1970))算法,采用Blossum 62矩阵或PAM250矩阵,空缺权重为16、14、12、10、8、6或4,长度权重为1、2、3、4、5或6,以确定两个氨基酸序列之间的同一性百分比。As used herein, the term "identity" is used to denote a sequence match between two polypeptides or between two nucleic acids. When a position in two compared sequences is occupied by the same base or subunit of an amino acid monomer (for example, a position in each of two DNA molecules is occupied by an adenine, or a position in each of two polypeptides is occupied by a lysine), then each molecule is identical at that position. The "percent identity" between these two sequences is a function of the number of matching positions shared by the two sequences divided by the number of positions being compared x 100. For example, two sequences are 60% identical if 6 out of 10 positions of the two sequences match. For example, the DNA sequences CTGACT and CAGGTT share 50% identity (3 out of a total of 6 positions match). Typically, such comparisons are made when two sequences are aligned to yield maximum identity. Such alignment can be achieved, for example, by the method of Needleman et al. (1970) J. Mol. Biol. 48:443-453, which can be conveniently performed by a computer program such as the Align program (DNAstar, Inc.). The PAM 120 weighted residue table can also be used, integrated into the ALIGN program (version 2.0) using the algorithm of E. Meyers and W. Miller (Comput. Appl Biosci., 4:11-17 (1988)). A gap length penalty of 12 and a gap penalty of 4 are used to determine the percent identity between two amino acid sequences. In addition, the Needleman and Wunsch (J MoI Biol. 48:444-453 (1970)) algorithm integrated into the GAP program of the GCG software package (available from www.gcg.com) can be used to determine the distance between two amino acid sequences using the Blossum 62 matrix or the PAM250 matrix with gap weights of 16, 14, 12, 10, 8, 6, or 4 and length weights of 1, 2, 3, 4, 5, or 6. percent identity.
本文所用的“细胞”应理解为不仅指特定的单个细胞,而且指该细胞的后代或潜在后代。因为由于突变或环境影响,可能在后代中发生某些修饰,所以此类后代可能事实上与亲本细胞不同,但仍包括在本文术语的范围内。"Cell" as used herein is understood not only to refer to a specific single cell, but also to the progeny or potential progeny of that cell. Because certain modifications may have occurred in the progeny, due to mutations or environmental influences, such progeny may in fact differ from the parental cells and still be included within the scope of the term herein.
如本文所用,术语“转导”和“转染”包括本领域已知的使用感染剂(如病毒)或其他方式将DNA引入细胞中以表达目的蛋白质或分子的方法。除了病毒或类似病毒的试剂外,还有基于化学的转染方法,如使用磷酸钙、树状聚合 物,脂质体或阳离子聚合物(例如DEAE-葡聚糖或聚乙烯亚胺)的转染方法;非化学方法,如电穿孔、细胞挤压(cell squeezing)、声致穿孔(sonoporation)、光学转染、穿刺转染(impalefection)、原生质体融合、质粒递送或转座子;基于颗粒的方法,如使用基因枪、磁转染或磁体辅助转染、颗粒轰击;以及杂交方法(诸如核转染)。As used herein, the terms "transduction" and "transfection" include methods known in the art to introduce DNA into cells to express a protein or molecule of interest using an infectious agent such as a virus or otherwise. In addition to viruses or virus-like reagents, there are chemical-based transfection methods such as the use of calcium phosphate, dendrimers non-chemical methods such as electroporation, cell squeezing, sonoporation, optical transfection, impalefection, protoplast fusion, plasmid delivery or transposons; particle-based methods such as the use of gene guns, magnetofection or magnet-assisted transfection, particle bombardment; and hybridization methods such as nucleofection ).
如本文所用,术语“转染的”、“转化的”或“转导的”是指将外源核酸转移或引入宿主细胞的过程。“转染的”、“转化的”或“转导的”细胞是已经用外源核酸转染、转化或转导的细胞。As used herein, the term "transfected", "transformed" or "transduced" refers to the process of transferring or introducing exogenous nucleic acid into a host cell. A "transfected", "transformed" or "transduced" cell is a cell that has been transfected, transformed or transduced with an exogenous nucleic acid.
术语“体内”是指从其中获得细胞的该生物体内。“离体”或“体外”是指从其中获得细胞的该生物体外。The term "in vivo" refers to the organism from which cells are obtained. "Ex vivo" or "in vitro" means outside the organism from which cells are obtained.
如本文所用,“治疗(treatment/treating)”是用于获得有益的或期望的结果(包括临床结果)的方法。为了本发明的目的,有益的或期望的临床结果包括但不限于以下的一种或多种:减轻由疾病引起的一种或多种症状,减轻疾病的程度,稳定疾病(例如预防或延缓疾病的恶化),预防或延缓疾病的扩散(例如转移),预防或延缓疾病的复发,降低疾病的复发率,延缓或减慢疾病的进展,改善疾病状态,提供疾病的(部分或全部)缓解,减少治疗该疾病所需的一种或多种其他药物的剂量,延缓疾病的进展,提高生活质量,和/或延长生存期。“治疗”还包括减少病症、病况或疾病的病理后果。本发明的方法考虑了这些治疗的方面中的任何一个或多个。As used herein, "treatment/treating" is a method used to obtain beneficial or desired results, including clinical results. For the purposes of this invention, a beneficial or desired clinical outcome includes, but is not limited to, one or more of the following: alleviation of one or more symptoms caused by the disease, reduction of the extent of the disease, stabilization of the disease (e.g., prevention or delay of progression of the disease), prevention or delay of spread of the disease (e.g., metastasis), prevention or delay of recurrence of the disease, reduction of the rate of recurrence of the disease, delay or slowing of the progression of the disease, amelioration of the disease state, provision of remission (partial or total) of the disease, reduction in the dose of one or more other drugs required to treat the disease, delay of the progression of the disease, improvement of life quality, and/or prolong survival. "Treatment" also includes reducing the pathological consequences of a disorder, condition or disease. The methods of the invention contemplate any one or more of these aspects of treatment.
如本文所用,术语“有效量”是指足以治疗特定病症、病况或疾病(如改善、缓解、减轻和/或延迟其一种或多种症状)的化合物或组合物的量。如本领域中所理解的,“有效量”可以以一次或多次给药,即,可能需要单次给药或多此给药来达到期望的治疗终点。As used herein, the term "effective amount" refers to an amount of a compound or composition sufficient to treat a particular disorder, condition or disease, eg, ameliorate, alleviate, lessen and/or delay one or more symptoms thereof. As understood in the art, an "effective amount" may be administered in one or more doses, ie, a single dose or multiple doses may be required to achieve the desired therapeutic endpoint.
“受试者”、“个体”或“患者”在本文中可互换使用,以达到治疗目的,是指任何归类为哺乳动物的动物,包括人类、家畜和农场动物,以及动物园、农场或宠物动物如狗、马、猫、牛等。在一些实施方式中,所述个体是人类个体。"Subject," "individual," or "patient" are used interchangeably herein for purposes of treatment and refer to any animal classified as a mammal, including humans, livestock, and farm animals, as well as zoo, farm, or pet animals such as dogs, horses, cats, cows, etc. In some embodiments, the individual is a human individual.
应理解,本文所述的本发明的实施方式包括“由...组成”和/或“基本上由...组成”的实施方式。在本文中对“约”值或参数的提及包括(并描述了)针对该值或参数本身的变化。例如,提及“大约X”的描述,包括对“X”的描述。 It is to be understood that embodiments of the invention described herein include "consisting of" and/or "consisting essentially of" embodiments. Reference herein to "about" a value or parameter includes (and describes) variations for that value or parameter itself. For example, a description referring to "about X" includes description of "X".
如本文所用,对“不”值或参数的提及通常意指并描述了“除…外”值或参数。例如,所述方法不用于治疗X型癌症,意味着所述方法用于治疗除X型以外的癌症。As used herein, a reference to a "not" value or parameter generally means and describes an "except" value or parameter. For example, the method is not used to treat type X cancer, meaning that the method is used to treat cancer other than type X.
如本文所用,术语“大约X-Y”具有与“大约X至大约Y”相同的含义。As used herein, the term "about X-Y" has the same meaning as "about X to about Y".
如本文和所附权利要求书中所使用的,单数形式“一个/一种(a/an)”和“所述”包括复数对象,除非上下文另外明确指出。还应注意,权利要求可以被撰写为排除任何可选的要素。因此此陈述旨在作为与权利要求要素的叙述结合使用诸如“只”、“仅”等排他性术语的先行基础,或使用“否”的限制。As used herein and in the appended claims, the singular forms "a/an" and "the" include plural referents unless the context clearly dictates otherwise. It should also be noted that the claims may be drafted to exclude any optional elements. This statement is therefore intended to serve as an antecedent basis for the use of exclusive terms such as "only," "only," etc., or limitations on the use of "no" in conjunction with the recitation of claim elements.
如本文所用,术语“和/或”在词语诸如“A和/或B”中,旨在既包括A和B;A或B;A(单独);以及B(单独)。同样地,如本文所用,术语“和/或”在词语诸如“A、B和/或C”中,旨在包括以下每个实施方式:A、B和C;A、B或C;A或C;A或B;B或C;A和C;A和B;B和C;A(单独);B(单独);以及C(单独)。As used herein, the term "and/or" in words such as "A and/or B" is intended to include both A and B; A or B; A (alone); and B (alone). Likewise, as used herein, the term "and/or" in words such as "A, B and/or C" is intended to include each of the following: A, B and C; A, B or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).
本申请在第一方面提供了工程化的CasX核酸酶The application provides engineered CasX nuclease in a first aspect
在一些实施方式中,提供了一种工程化的CasX核酸酶;其包含一个或多个基于参比CasX核酸酶的突变,所述突变为:将参比CasX核酸酶中与核酸有相互作用的一个或多个氨基酸替换为带正电的氨基酸;所述参比CasX核酸酶为天然野生型CasX核酸酶。优选地,所述天然野生型CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In some embodiments, an engineered CasX nuclease is provided; it comprises one or more mutations based on the reference CasX nuclease, the mutation is: one or more amino acids in the reference CasX nuclease that interact with nucleic acid are replaced with positively charged amino acids; the reference CasX nuclease is a natural wild-type CasX nuclease. Preferably, the natural wild-type CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
本申请提供了通过引入氨基酸突变来工程化改造酶的方法,所述氨基酸突变基于上述改造原理,这导致体外和体内酶活性的增加。所述工程化CasX核酸酶含有一个或多个如下文所描述的具体突变。在一些实施方式中,本申请中所述的任何一个或多个突变可以与现有的CasX突变组合以提供具有更高活性的工程化CasX核酸酶。The present application provides methods for engineering enzymes by introducing amino acid mutations based on the engineering principles described above, which lead to increased enzyme activity in vitro and in vivo. The engineered CasX nuclease contains one or more specific mutations as described below. In some embodiments, any one or more of the mutations described herein can be combined with existing CasX mutations to provide an engineered CasX nuclease with higher activity.
在一个具体实施方式中,提供了一种工程化的CasX核酸酶;所述工程化的CasX核酸酶包含将参比CasX核酸酶中与核酸相互作用的一个或多个氨基酸替换为带正电的氨基酸的突变。其中,所述与核酸相互作用的一个或多个氨基酸是与核酸在三维结构上距离在9埃以内的氨基酸,尤其是下述位置的一个或多个氨基酸:26、27、29、105、195、198、204、222、230、512、564、565、610、640;其中,氨基酸位置编号如SEQ ID NO.1所定义。In a specific embodiment, an engineered CasX nuclease is provided; the engineered CasX nuclease comprises a mutation that replaces one or more amino acids interacting with nucleic acid in the reference CasX nuclease with a positively charged amino acid. Wherein, the one or more amino acids interacting with the nucleic acid are amino acids within 9 angstroms in the three-dimensional structure, especially one or more amino acids at the following positions: 26, 27, 29, 105, 195, 198, 204, 222, 230, 512, 564, 565, 610, 640; wherein, the amino acid position numbers are as defined in SEQ ID NO.1.
在一个具体实施方式中,所述带正电的氨基酸是R或K;优选带正电的 氨基酸为R。In a specific embodiment, the positively charged amino acid is R or K; preferably positively charged Amino acid is R.
在一个具体实施方式中,所述将参比CasX核酸酶中与核酸相互作用的一个或多个氨基酸替换为带正电的氨基酸是指如下替换中的一种或多种:T26R、K610R、K640、S759R、K808R;优选地,所述工程化CasX核酸酶包含下组的任一项的突变或突变组合:(1)T26R;(2)K610R;(3)K640R;(4)S759R;(5)K808R;(6)T26R和K610R;(7)T26R和K610R以及K808R;其中,所述氨基酸位置编号如SEQ ID NO.1所定义。In a specific embodiment, the replacement of one or more amino acids interacting with nucleic acid in the reference CasX nuclease with a positively charged amino acid refers to one or more of the following replacements: T26R, K610R, K640, S759R, K808R; preferably, the engineered CasX nuclease comprises any mutation or combination of mutations in the following group: (1) T26R; (2) K610R; (3) K640R; (4) S75 9R; (5) K808R; (6) T26R and K610R; (7) T26R and K610R and K808R; wherein, the amino acid position numbering is as defined in SEQ ID NO.1.
在又一具体实施方式中提供了如SEQ ID NO.2~8中任一项所示氨基酸序列的工程化CasX核酸酶。In yet another specific embodiment, the engineered CasX nuclease of the amino acid sequence shown in any one of SEQ ID NO.2~8 is provided.
在本说明书的上下文中,T26R的含义是;在所援引的氨基酸序列中,第26号氨基酸T(苏氨酸)被替换为R(精氨酸);在此,常见的氨基酸及其三字母和单字母缩写列举性地说明如下:
丙氨酸AlaA; 精氨酸Arg R;
天冬氨酸Asp D; 半胱氨酸Cys C;
谷氨酰胺Gln Q; 谷氨酸Glu E;
组氨酸His H; 异亮氨酸Ile I;
甘氨酸Gly G; 天冬酰胺AsnN;
亮氨酸Leu L; 赖氨酸Lys K;
甲硫氨酸MetM; 苯丙氨酸Phe F;
脯氨酸Pro P; 丝氨酸Ser S;
苏氨酸ThrT; 色氨酸Trp W;
酪氨酸TyrY; 缬氨酸ValV。In the context of this specification, the meaning of T26R is; in the cited amino acid sequence, the No. 26 amino acid T (threonine) is replaced by R (arginine); here, common amino acids and their three-letter and single-letter abbreviations are listed as follows:
Alanine AlaA; Arginine Arg R;
Aspartate Asp D; Cysteine Cys C;
Glutamine Gln Q; Glutamate Glu E;
Histidine His H; Isoleucine Ile I;
Glycine Gly G; Asparagine AsnN;
Leucine Leu L; Lysine Lys K;
Methionine MetM; Phenylalanine Phe F;
Proline Pro P; Serine Ser S;
Threonine ThrT; Tryptophan Trp W;
Tyrosine TyrY; Valine ValV.
如本文所用,“该氨基酸在X位置处,其中所述氨基酸位置编号如SEQ ID NO.1所定义”的含义是:该氨基酸残基位于参比酶CasX的某位置处,其相当于在SEQ ID NO:1的X位置处,而且参比酶CasX的氨基酸序列与SEQ ID NO:1的氨基酸序列基于序列同源性相互对齐。例如,图6示出了CasX2(SEQ ID NO.1)与CasX1(SEQ ID NO.13)的氨基酸序列的同源性比对。本领域人员可以用本领域常用的软件,如Clustal Omega,将任一参比CasX核酸酶的氨基酸序列与SEQ ID NO.1进行序列同一性比较和对齐(alignment),进而得到与本申请中所述基于SEQ ID NO.1所定义的氨基酸位点相对应的所述参比CasX核酸酶中的氨基酸位点。 As used herein, "the amino acid is at position X, wherein said amino acid position numbering is as defined in SEQ ID NO.1" means: the amino acid residue is located at a certain position of the reference enzyme CasX, which is equivalent to the position X of SEQ ID NO:1, and the amino acid sequence of the reference enzyme CasX is aligned with the amino acid sequence of SEQ ID NO:1 based on sequence homology. For example, Figure 6 shows the homology alignment of the amino acid sequences of CasX2 (SEQ ID NO.1) and CasX1 (SEQ ID NO.13). Those skilled in the art can use software commonly used in the art, such as Clustal Omega, to compare and align the amino acid sequence of any reference CasX nuclease with SEQ ID NO.1 for sequence identity, and then obtain the amino acid position in the reference CasX nuclease corresponding to the amino acid position defined based on SEQ ID NO.1 described in this application.
在一些实施方式中,所述参比PlmCasX(Planctomycetes CasX)核酸酶为天然PlmCasX。在一些实施方式中,所述参比PlmCasX核酸酶为经过工程改造后的PlmCasX核酸酶。CasX是从宏基因组中挖掘出的一类新型CRISPR-Cas基因编辑系统,于2017年首次被报道(文章:New CRISPR–Cas systems from uncultivated microbes)。根据最新的CRISPR系统分类标准,CasX属于V-E亚类。2019年,有研究(文章:CasX enzymes comprise a distinct family of RNA-guided genome editors)报道来自Deltaproteobacteria的CasX(DpbCasX)以及来自Planctomycetes的CasX(PlmCasX)能够在哺乳动物细胞中进行基因编辑。然而,该研究表明CasX在哺乳动物细胞中的效率很低,并且该文章并未探究PlmCasX在内源基因组位点的编辑效率。In some embodiments, the reference PlmCasX (Planctomycetes CasX) nuclease is native PlmCasX. In some embodiments, the reference PlmCasX nuclease is an engineered PlmCasX nuclease. CasX is a new type of CRISPR-Cas gene editing system excavated from metagenomics, which was first reported in 2017 (article: New CRISPR–Cas systems from uncultivated microbes). According to the latest CRISPR system classification criteria, CasX belongs to the V-E subclass. In 2019, a study (article: CasX enzymes comprise a distinct family of RNA-guided genome editors) reported that CasX (DpbCasX) from Deltaproteobacteria and CasX (PlmCasX) from Planctomycetes can perform gene editing in mammalian cells. However, this study showed that CasX was inefficient in mammalian cells, and the article did not explore the editing efficiency of PlmCasX at endogenous genomic loci.
在一些实施方式中,本申请的工程化的CasX是核酸内切酶,其结合靶序列的特定位点并在指导RNA的指导下切割,并且具有DNA和RNA内切核酸酶活性。在一些实施方式中,所述CasX能够通过加工前体crRNA阵列来进行自主的crRNA生物发生。自主的前体crRNA处理可促进CasX的递送,从而实现双切口应用,因为可以由单个crRNA转录物靶向两个单独的基因组位点。然后,CasX蛋白将CRISPR阵列加工成两个同源的crRNA,从而形成配对的切口复合物。在一些实施方式中,所述指导RNA包含由CRISPR阵列表达的前体crRNA,所述CRISPR阵列由与未加工的DR序列交错的靶序列组成,通过所述效应蛋白的内在前体crRNA加工而重复以使得能够同时靶向一个、两个或多个位点。来自多种生物体的CasX核酸酶可以用作所述参比CasX核酸酶,以提供本申请的工程化的CasX核酸酶及效应蛋白。在一些实施方式中,所述CasX参比CasX核酸酶具有酶活性。在一些实施方式中,所述参比CasX是核酸酶,即切割靶双螺旋核酸(例如,双螺旋DNA)的两条链。在一些实施方式中,所述参比CasX是切口酶,即切割靶双螺旋核酸(例如,双螺旋DNA)的单链。在一些实施方式中,所述CasX参比CasX核酸酶是酶失活的。与CasX或其功能衍生物具有一定序列同一性(例如至少约60%、70%、80%、85%、90%、95%、98%或更多中任一个)的直系同源物可以用作设计本申请的工程化的CasX核酸酶或效应蛋白的基础。In some embodiments, the engineered CasX of the present application is an endonuclease, which binds to a specific site of a target sequence and cuts under the guidance of a guide RNA, and has DNA and RNA endonuclease activity. In some embodiments, the CasX is capable of autonomous crRNA biogenesis by processing a precursor crRNA array. Autonomous pre-crRNA processing facilitates CasX delivery, enabling double-nicking applications, as two separate genomic sites can be targeted by a single crRNA transcript. The CasX protein then processes the CRISPR array into two homologous crRNAs, forming a paired nicking complex. In some embodiments, the guide RNA comprises a precursor crRNA expressed by a CRISPR array consisting of target sequences interleaved with unprocessed DR sequences, repeated by intrinsic precursor crRNA processing of the effector protein to enable simultaneous targeting of one, two or more sites. CasX nucleases from a variety of organisms can be used as the reference CasX nucleases to provide the engineered CasX nucleases and effector proteins of the present application. In some embodiments, the CasX reference CasX nuclease has enzymatic activity. In some embodiments, the reference CasX is a nuclease, ie, cleaves both strands of a target duplex nucleic acid (eg, duplex DNA). In some embodiments, the reference CasX is a nickase, ie, cleaves a single strand of a target duplex nucleic acid (eg, duplex DNA). In some embodiments, the CasX reference CasX nuclease is enzymatically inactive. Orthologs having a certain sequence identity (such as at least about 60%, 70%, 80%, 85%, 90%, 95%, 98% or more) with CasX or its functional derivatives can be used as the basis for designing the engineered CasX nuclease or effector protein of the present application.
在一些实施方式中,所述工程化的CasX核酸酶基于天然存在的CasX核酸酶的功能变体。在一些实施方式中,所述功能变体具有一个或多个突变, 如氨基酸替换、插入和缺失。举例来说,与野生型天然存在的CasX核酸酶相比,所述功能变体可包含1、2、3、4、5、6、7、8、9、10个或更多个氨基酸替换中的任一个。在一些实施方式中,所述一个或多个替换是保守替换。在一些实施方式中,所述功能变体具有天然存在的CasX核酸酶的所有结构域。在一些实施方式中,所述功能变体不具有天然存在的CasX核酸酶的一个或多个结构域。In some embodiments, the engineered CasX nuclease is based on a functional variant of a naturally occurring CasX nuclease. In some embodiments, the functional variant has one or more mutations, such as amino acid substitutions, insertions, and deletions. For example, the functional variant may comprise any of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acid substitutions compared to a wild-type naturally occurring CasX nuclease. In some embodiments, the one or more substitutions are conservative substitutions. In some embodiments, the functional variant has all domains of a naturally occurring CasX nuclease. In some embodiments, the functional variant does not possess one or more domains of a naturally occurring CasX nuclease.
本申请在第二方面提供了工程化的CasX效应蛋白。The present application provides engineered CasX effector proteins in a second aspect.
在一个具体实施方式中,提供了前面所述的工程化的CasX核酸酶或其功能衍生物。In a specific embodiment, the aforementioned engineered CasX nuclease or a functional derivative thereof is provided.
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
在又一具体实施方式中提供了一种工程化的CasX效应蛋白,其中所述效应蛋白能够诱导DNA分子中的双链断裂或单链断裂。In yet another embodiment there is provided an engineered CasX effector protein, wherein the effector protein is capable of inducing double-strand breaks or single-strand breaks in DNA molecules.
在再一具体实施方式中提供了一种工程化的CasX效应蛋白,其中所述工程化的CasX核酸酶的功能衍生物为酶失活突变体。In yet another specific embodiment, an engineered CasX effector protein is provided, wherein the functional derivative of the engineered CasX nuclease is an enzyme inactive mutant.
在再一具体实施方式中提供了一种工程化的CasX效应蛋白,其还包含与所述工程化的CasX核酸酶融合的功能结构域。In yet another specific embodiment, an engineered CasX effector protein is provided, which further comprises a functional domain fused with the engineered CasX nuclease.
在一个具体实施方式中提供了一种工程化的CasX效应蛋白,其中所述功能结构域选自下组:翻译起始结构域、转录阻遏结构域、反式激活结构域、表观遗传修饰结构域、核碱基编辑结构域、逆转录酶结构域、报告分子结构域和核酸酶结构域。An engineered CasX effector protein is provided in a specific embodiment, wherein the functional domain is selected from the group consisting of a translation initiation domain, a transcriptional repression domain, a transactivation domain, an epigenetic modification domain, a nucleobase editing domain, a reverse transcriptase domain, a reporter domain and a nuclease domain.
在又一具体实施方式中提供了一种工程化的CasX效应蛋白,其包含第一多肽和第二多肽,第一多肽包含前述的工程化的CasX核酸酶的N末端部分氨基酸残基1至X,第二多肽包含前述的工程化的CasX核酸酶的氨基酸残基X+1至所述CasX核酸酶的C末端,其中所述第一多肽和所述第二多肽能够在包含指导序列的指导RNA的存在下彼此缔合,以形成与靶核酸特异性结合的成簇规律间隔短回文重复序列(CRISPR)复合物,该靶核酸包含与所述指导序列互补的靶序列。In yet another specific embodiment, an engineered CasX effector protein is provided, comprising a first polypeptide and a second polypeptide, the first polypeptide comprising amino acid residues 1 to X of the N-terminal part of the aforementioned engineered CasX nuclease, and the second polypeptide comprising amino acid residues X+1 of the aforementioned engineered CasX nuclease to the C-terminal of the CasX nuclease, wherein the first polypeptide and the second polypeptide can associate with each other in the presence of a guide RNA comprising a guide sequence to form a clustered regularly interspaced short palindromic repeat sequence (CRISPR) that specifically binds to a target nucleic acid ) complex, the target nucleic acid comprising a target sequence complementary to the guide sequence.
在一个具体实施方式中,提供了基于本文所述工程化的CasX核酸酶中任一种的工程化的CasX效应蛋白。在一些实施方式中,所述工程化的CasX 效应蛋白具有酶活性。在一些实施方式中,所述工程化的CasX效应蛋白是切割靶双螺旋核酸(例如,双螺旋DNA)的两条链的核酸酶。在一些实施方式中,所述工程化的CasX效应蛋白是切口酶,即切割靶双螺旋核酸(例如,双螺旋DNA)的单链。在一些实施方式中,所述工程化的CasX效应蛋白包含所述工程化的CasX核酸酶的酶失活突变体。CasX核酸酶活性位点中一个或多个氨基酸残基的突变会导致失去酶活的CasX。在一些实施方式中,本文提供的工程化的CasX酶可以被修饰以具有减少的核酸酶活性,例如,所述核酸酶与野生型CasX酶相比失活至少50%、至少60%、至少70%、至少80%、至少90%、至少95%,至少97%或100%。所述核酸酶活性可以通过几种方法来降低,例如,将突变引入所述核酸酶或CasX酶的核酸相互作用结构域中。在一些实施方式中,鉴定了核酸酶活性的催化残基,并且这些氨基酸残基可以被不同的氨基酸残基(例如,甘氨酸或丙氨酸)替换以降低所述核酸酶活性。In a specific embodiment, an engineered CasX effector protein based on any of the engineered CasX nucleases described herein is provided. In some embodiments, the engineered CasX Effector proteins have enzymatic activity. In some embodiments, the engineered CasX effector protein is a nuclease that cleaves both strands of a target duplex nucleic acid (eg, duplex DNA). In some embodiments, the engineered CasX effector protein is a nickase, ie, cleaves a single strand of a target duplex nucleic acid (eg, duplex DNA). In some embodiments, the engineered CasX effector protein comprises an enzyme-inactive mutant of the engineered CasX nuclease. Mutations of one or more amino acid residues in the active site of the CasX nuclease result in an inactive CasX. In some embodiments, the engineered CasX enzymes provided herein can be modified to have reduced nuclease activity, e.g., the nuclease is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, at least 97%, or 100% inactive compared to a wild-type CasX enzyme. The nuclease activity can be reduced by several methods, for example, introducing mutations into the nucleic acid interaction domain of the nuclease or CasX enzyme. In some embodiments, catalytic residues for nuclease activity are identified, and these amino acid residues can be replaced with different amino acid residues (eg, glycine or alanine) to reduce the nuclease activity.
所述工程化的CasX核酸酶与所述参比CasX核酸酶相比具有增加的活性。在一些实施方式中,所述活性是靶DNA结合活性。在一些实施方式中,所述活性是位点特异性核酸酶活性。在一些实施方式中,所述活性是双链DNA切割活性。在一些实施方式中,所述活性是单链DNA切割活性,包括例如位点特异性DNA切割活性或非特异性DNA切割活性。在一些实施方式中,所述活性是单链RNA切割活性,例如位点特异性RNA切割活性或非特异性RNA切割活性。在一些实施方式中,所述活性是在体外测量的。在一些实施方式中,所述活性是在细胞如细菌细胞、植物细胞或真核细胞中测量的。在一些实施方式中,所述活性是在哺乳动物细胞如啮齿动物细胞或人细胞中测量的。在一些实施方式中,所述活性是在人细胞诸如293T细胞中测量的。在一些实施方式中,所述活性是在小鼠细胞,例如Hepa1-6细胞中测量的。在一些实施方式中,所述工程化的CasX核酸酶具有与参比CasX核酸酶相比增加至少约20%、30%、40%、60%、70%、80%、90%、1.5倍、2倍、3倍、4倍、5倍、10倍或更多中任一个的位点特异性核酸酶活性。所述工程化的CasX核酸酶的位点特异性核酸酶活性可以使用本领域中已知的方法来测量,包括例如,凝胶迁移测定,如本文提供的实施例中所述的基于琼脂糖凝胶电泳的体外切割测定。 The engineered CasX nuclease has increased activity compared to the reference CasX nuclease. In some embodiments, the activity is target DNA binding activity. In some embodiments, the activity is a site-specific nuclease activity. In some embodiments, the activity is double-stranded DNA cleavage activity. In some embodiments, the activity is single-stranded DNA cleavage activity, including, for example, site-specific DNA cleavage activity or non-specific DNA cleavage activity. In some embodiments, the activity is single-stranded RNA cleavage activity, eg, site-specific RNA cleavage activity or non-specific RNA cleavage activity. In some embodiments, the activity is measured in vitro. In some embodiments, the activity is measured in cells, such as bacterial cells, plant cells, or eukaryotic cells. In some embodiments, the activity is measured in mammalian cells, such as rodent cells or human cells. In some embodiments, the activity is measured in human cells, such as 293T cells. In some embodiments, the activity is measured in mouse cells, eg, Hepal-6 cells. In some embodiments, the engineered CasX nuclease has at least about any of 20%, 30%, 40%, 60%, 70%, 80%, 90%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold or more increased site-specific nuclease activity compared to a reference CasX nuclease. The site-specific nuclease activity of the engineered CasX nuclease can be measured using methods known in the art, including, for example, gel shift assays, agarose gel electrophoresis-based in vitro cleavage assays as described in the Examples provided herein.
在一些实施方式中,所述活性是细胞中的基因编辑活性。在一些实施方式中,所述细胞是细菌细胞、植物细胞或真核细胞。在一些实施方式中,所述细胞是哺乳动物细胞如啮齿动物细胞或人细胞。在一些实施方式中,所述细胞是293T细胞。在一些实施方式中,所述活性是在小鼠细胞,例如Hepa1-6细胞中测量的。在一些实施方式中,所述活性是在细胞中靶基因组位点的插入缺失形成活性,例如通过所述工程化的CasX核酸酶对靶核酸进行位点特异性切割和通过非同源末端连接(NHEJ)机制进行DNA修复。在一些实施方式中,所述活性是在细胞中靶基因组位点插入外源核酸序列,例如通过所述工程化的CasX核酸酶对靶核酸进行位点特异性切割和通过同源重组(HR)机制进行DNA修复。在一些实施方式中该工程化的CasX核酸酶与参比CasX核酸酶相比在细胞(例如人细胞如293T细胞或小鼠Hepa1-6细胞)的基因组位点处增加至少约20%、30%、40%、60%、70%、80%、90%、1.5倍、2倍、3倍、4倍、5倍、10倍或更多中任一个的基因编辑(例如,插入缺失形成)活性。在一些实施方式中,所述工程化的CasX核酸酶与参比CasX2核酸酶相比在细胞(例如人细胞如293T细胞或小鼠Hepa1-6细胞)的多个(例如2、3、4、5、6、7、8、9、10个或更多个)基因组位点处增加至少约20%、30%、40%、60%、70%、80%、90%、1.5倍、2倍、3倍、4倍、5倍、10倍或更多中任一个的基因编辑(例如,插入缺失形成)活性。在一些实施方式中,该工程化的CasX核酸酶与所述参比CasX核酸酶相比,能够编辑更多数目的基因组位点。在一些实施方式中,该工程化的CasX核酸酶的共有核酸序列与所述参比CasX核酸酶相同。In some embodiments, the activity is gene editing activity in the cell. In some embodiments, the cells are bacterial cells, plant cells, or eukaryotic cells. In some embodiments, the cells are mammalian cells such as rodent cells or human cells. In some embodiments, the cells are 293T cells. In some embodiments, the activity is measured in mouse cells, eg, Hepal-6 cells. In some embodiments, the activity is an indel forming activity at a target genomic site in the cell, such as site-specific cleavage of a target nucleic acid by the engineered CasX nuclease and DNA repair by a non-homologous end joining (NHEJ) mechanism. In some embodiments, the activity is insertion of an exogenous nucleic acid sequence at a target genomic site in the cell, such as site-specific cleavage of the target nucleic acid by the engineered CasX nuclease and DNA repair by a homologous recombination (HR) mechanism. In some embodiments the engineered CasX nuclease increases the gene editing (e.g., indel formation) activity of any of at least about 20%, 30%, 40%, 60%, 70%, 80%, 90%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 10-fold or more at a genomic locus of a cell (e.g., a human cell such as a 293T cell or a mouse Hepa1-6 cell) compared to a reference CasX nuclease. In some embodiments, the engineered CasX nuclease is at least about 20%, 30%, 40%, 60%, 70%, 80%, 90%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold increased at multiple (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 or more) genomic sites in a cell (e.g., a human cell such as a 293T cell or a mouse Hepa1-6 cell) compared to a reference CasX2 nuclease The gene editing (eg, indel formation) activity of any of 10-fold, 10-fold, or more. In some embodiments, the engineered CasX nuclease is capable of editing a greater number of genomic sites than the reference CasX nuclease. In some embodiments, the consensus nucleic acid sequence of the engineered CasX nuclease is identical to the reference CasX nuclease.
可使用本领域已知的方法确定细胞中工程化的CasX核酸酶的基因编辑效率,包括例如T7核酸内切酶1(T7E1)测定、靶DNA的测序(包括例如,Sanger序列,以及二代测序)、由分解追踪插入缺失(TIDE)测定或通过扩增子分析进行插入缺失检测(IDAA)测定。参见例如Sentmanat MF et al.,“A survey of validation strategies for CRISPR-Cas9 editing,”Scientific Reports,2018,8,文章编号888,其通过整体引用而并入本文。在一些实施方式中,例如,如本文实施例中所述,使用靶向的二代测序(NGS)来测量细胞中所述工程化的CasX核酸酶的基因编辑效率。用于确定所述工程化的CasX核酸酶的基因编辑效率的示例性基因组位点包括但不限于AAVS1、CCR5、CD34等。在一些实施方式中,所述工程化的CasX核酸酶的基因编辑效率(例如插入缺失率) 达到至少20%、30%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%或更高。The gene editing efficiency of an engineered CasX nuclease in a cell can be determined using methods known in the art, including, for example, a T7 endonuclease 1 (T7E1 ) assay, sequencing of target DNA (including, for example, Sanger sequences, and next-generation sequencing), tracking indels by decomposition (TIDE) assays, or indel detection by amplicon analysis (IDAA) assays. See, eg, Sentmanat MF et al., "A survey of validation strategies for CRISPR-Cas9 editing," Scientific Reports, 2018, 8, article number 888, which is hereby incorporated by reference in its entirety. In some embodiments, the gene editing efficiency of the engineered CasX nuclease in cells is measured using targeted next generation sequencing (NGS), eg, as described in the Examples herein. Exemplary genomic sites for determining the gene editing efficiency of the engineered CasX nuclease include, but are not limited to, AAVS1, CCR5, CD34, and the like. In some embodiments, the gene editing efficiency (eg indel rate) of the engineered CasX nuclease At least 20%, 30%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85% or more.
本申请提供了工程化的CasX效应蛋白,其具有改善的活性,例如靶标结合、双链切割活性、切口酶活性和/或基因编辑活性。在一些实施方式中,提供了工程化的CasX效应蛋白(例如,CasX核酸酶、CasX切口酶、CasX融合效应蛋白或分割型(split)CasX效应蛋白),其包含本文所述工程化的CasX核酸酶或其功能衍生物中的任一种。The present application provides engineered CasX effector proteins with improved activities, such as target binding, double-strand cleavage activity, nickase activity, and/or gene editing activity. In some embodiments, an engineered CasX effector protein (e.g., a CasX nuclease, a CasX nickase, a CasX fusion effector protein, or a split CasX effector protein) comprising any of the engineered CasX nucleases described herein or a functional derivative thereof is provided.
本申请提供了工程化的CasX效应蛋白,其包含本文所述工程化的CasX核酸酶的功能变体。在一些实施方式中,当与相应的工程化的CasX核酸酶的氨基酸序列相比时,功能变体的氨基酸序列具有至少一个氨基酸残基的不同(例如,具有缺失、插入、替换和/或融合)。在一些实施方式中,所述功能变体具有一个或多个突变,如氨基酸替换、插入和/或缺失。举例来说,与工程化的CasX核酸酶相比,功能变体可包含1、2、3、4、5、6、7、8、9、10个或更多个氨基酸替换中的任一个。在一些实施方式中,所述一个或多个替换是保守替换。在一些实施方式中,所述功能变体具有工程化的CasX核酸酶的所有结构域。在一些实施方式中,所述功能变体不具有工程化的CasX核酸酶的一个或多个结构域。The present application provides engineered CasX effector proteins comprising functional variants of the engineered CasX nucleases described herein. In some embodiments, the amino acid sequence of the functional variant differs by at least one amino acid residue (eg, with a deletion, insertion, substitution, and/or fusion) when compared to the amino acid sequence of a corresponding engineered CasX nuclease. In some embodiments, the functional variant has one or more mutations, such as amino acid substitutions, insertions and/or deletions. For example, the functional variant may comprise any of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acid substitutions compared to the engineered CasX nuclease. In some embodiments, the one or more substitutions are conservative substitutions. In some embodiments, the functional variant has all domains of the engineered CasX nuclease. In some embodiments, the functional variant does not have one or more domains of the engineered CasX nuclease.
对于本文所述CasX变体蛋白(例如,切口酶CasX蛋白、失活或催化失活的CasX(dCasX)、融合CasX)中任一种,所述CasX变体可包括具有上述相同参数(例如,存在的结构域、同一性百分比等)的CasX蛋白序列。For any of the CasX variant proteins described herein (e.g., a nickase CasX protein, an inactivated or catalytically inactive CasX (dCasX), a fusion CasX), the CasX variant can include a CasX protein sequence having the same parameters described above (e.g., domains present, percent identity, etc.).
在一些实施方式中,所述功能变体与其工程化的CasX核酸酶的非突变形式相比,具有不同的催化活性。在一些实施方式中,所述突变(例如,氨基酸替换、插入和/或缺失)在CasX效应蛋白的催化结构域(例如,RuvC结构域)中。在一些实施方式中,所述变体包含多个催化结构域中的突变。切割双链靶核酸的一条链而不切割另一条链的CasX效应蛋白,在本文中被称为“切口酶”(例如“CasX切口酶”)。在本文中,基本上不具有核酸酶活性的CasX蛋白称为失活CasX蛋白(“dCasX”)(声明:在融合CasX效应蛋白的情况下,异源多肽(融合伴侣)可以提供核酸酶活性,将在下面进行详细说明)。在一些实施方式中,相对于其非突变形式而言,当突变酶的DNA切割活性小于约25%、10%、5%、1%、0.1%、0.01%或更低时,认为CasX效应蛋白基本上缺乏所有的DNA切割活性。 In some embodiments, the functional variant has a different catalytic activity than the non-mutated form of its engineered CasX nuclease. In some embodiments, the mutation (eg, amino acid substitution, insertion, and/or deletion) is in the catalytic domain (eg, RuvC domain) of the CasX effector protein. In some embodiments, the variant comprises mutations in multiple catalytic domains. A CasX effector protein that cleaves one strand of a double-stranded target nucleic acid but not the other is referred to herein as a "nickase" (eg, a "CasX nickase"). CasX proteins that are substantially devoid of nuclease activity are referred to herein as inactive CasX proteins ("dCasX") (disclaimer: in the case of fusion CasX effector proteins, a heterologous polypeptide (fusion partner) can provide nuclease activity, which will be described in detail below). In some embodiments, a CasX effector protein is considered to lack substantially all DNA cleavage activity when the mutant enzyme has less than about 25%, 10%, 5%, 1%, 0.1%, 0.01% or less DNA cleavage activity relative to its non-mutated form.
本申请还涉及根据前面所述的工程化的CasX效应蛋白,其包含第一多肽和第二多肽,第一多肽包含前述的工程化的CasX核酸酶的N末端部分氨基酸残基1至X,第二多肽包含前述的工程化的CasX核酸酶的氨基酸残基X+1至所述CasX核酸酶的C末端,其中所述第一多肽和所述第二多肽能够在包含指导序列的指导RNA的存在下彼此缔合,以形成与靶核酸特异性结合的成簇规律间隔短回文重复序列(CRISPR)复合物,该靶核酸包含与所述指导序列互补的靶序列。The present application also relates to an engineered CasX effector protein according to the foregoing, which comprises a first polypeptide and a second polypeptide, the first polypeptide comprising amino acid residues 1 to X of the N-terminal part of the aforementioned engineered CasX nuclease, and the second polypeptide comprising amino acid residues X+1 of the aforementioned engineered CasX nuclease to the C-terminus of the CasX nuclease, wherein the first polypeptide and the second polypeptide can associate with each other in the presence of a guide RNA comprising a guide sequence to form a clustered regularly interspaced short palindromic repeat (CRISPR) complex that specifically binds to a target nucleic acid A target nucleic acid comprising a target sequence complementary to said guide sequence.
本申请还提供了基于本文所述工程化的CasX效应蛋白中任一种的分割型CasX效应蛋白。分割型CasX效应蛋白对于递送可能是有利的。在一些实施方式中,所述工程化的CasX效应蛋白被分割成酶的两个部分,可以将它们重构在一起以提供基本起作用的CasX效应蛋白。可利用已知方法提供CasX效应蛋白,例如,Cas12和Cas9蛋白的分割形式已描述于例如WO2016/112242、WO2016/205749和PCT/CN 2020/111057,其通过整体引用而并入本文。The application also provides split CasX effector proteins based on any of the engineered CasX effector proteins described herein. Split CasX effectors may be advantageous for delivery. In some embodiments, the engineered CasX effector protein is split into two parts of the enzyme, which can be reconstituted together to provide a substantially functioning CasX effector protein. CasX effector proteins can be provided using known methods, for example, fragmented forms of the Cas12 and Cas9 proteins have been described in, for example, WO2016/112242, WO2016/205749 and PCT/CN 2020/111057, which are incorporated herein by reference in their entirety.
在一些实施方式中,提供了分割型CasX效应蛋白,其包含第一多肽和第二多肽,所述第一多肽包含本文所述工程化的CasX核酸酶中任一种或其功能衍生物的N末端部分,所述第二多肽包含所述工程化的CasX核酸酶或其功能衍生物的C末端部分,其中所述第一多肽和第二多肽能够在包含指导序列的指导RNA存在下彼此缔合,以形成与靶核酸特异性结合的CRISPR复合物,所述靶核酸包含与所述指导序列互补的靶序列。在一些实施方式中,所述第一多肽和第二多肽各自包含二聚化结构域。在一些实施方式中,所述第一二聚化结构域和第二二聚化结构域在诱导剂(例如雷帕霉素)存在下彼此缔合。在一些实施方式中,所述第一多肽和第二多肽不包含二聚化结构域。在一些实施方式中,所述分割型CasX效应蛋白是自诱导的。In some embodiments, a split-type CasX effector protein is provided, comprising a first polypeptide and a second polypeptide, the first polypeptide comprising an N-terminal portion of any one of the engineered CasX nucleases described herein or a functional derivative thereof, the second polypeptide comprising a C-terminal portion of the engineered CasX nuclease or a functional derivative thereof, wherein the first polypeptide and the second polypeptide can associate with each other in the presence of a guide RNA comprising a guide sequence to form a CRISPR complex specifically binding to a target nucleic acid, the target nucleic acid comprising a target sequence complementary to the guide sequence. In some embodiments, the first polypeptide and the second polypeptide each comprise a dimerization domain. In some embodiments, the first dimerization domain and the second dimerization domain associate with each other in the presence of an inducing agent (eg, rapamycin). In some embodiments, the first polypeptide and the second polypeptide do not comprise a dimerization domain. In some embodiments, the segmented CasX effector protein is autoinducible.
可以以不影响催化结构域的方式进行分割。CasX效应蛋白可以用作核酸酶(包括切口酶)或可以是灭活的酶,其本质上是具有很少或没有催化活性(例如由于其催化结构域中的突变)的RNA指导的DNA结合蛋白。Splitting can be done in a way that does not affect the catalytic domain. CasX effector proteins can act as nucleases (including nicking enzymes) or can be inactivated enzymes, which are essentially RNA-guided DNA-binding proteins with little or no catalytic activity (eg, due to mutations in their catalytic domain).
在一些实施方式中,CasX蛋白的核酸酶叶和α-螺旋叶表达为分开的多肽。尽管所述叶不自行相互作用,但RNA指导序列将它们募集到一个复合物中,所述复合物概括了全长CasX酶的活性并催化位点特异性的DNA切割。在一些实施方式中,经修饰的RNA指导序列可以用于通过防止二聚化 而消除分割型酶的活性,从而允许诱导型二聚化系统的发展。所述分割型酶描述于例如,Wright,Addison V.,et al.“Rational design of a split-Cas9 enzyme complex,”Proc.Nat'l.Acad.Sci.,112.10(2015):2984-2989,其通过整体引用并入本文。In some embodiments, the nuclease lobe and the α-helical lobe of the CasX protein are expressed as separate polypeptides. Although the leaves do not interact on their own, RNA guide sequences recruit them into a complex that recapitulates the activity of the full-length CasX enzyme and catalyzes site-specific DNA cleavage. In some embodiments, modified RNA guide sequences can be used to prevent dimerization by Instead, the activity of the split-type enzyme is eliminated, allowing the development of an inducible dimerization system. Such split-type enzymes are described, for example, in Wright, Addison V., et al. "Rational design of a split-Cas9 enzyme complex," Proc. Nat'l. Acad. Sci., 112.10(2015):2984-2989, which is incorporated herein by reference in its entirety.
本文所述分割型CasX效应蛋白部分可设计为通过将参比工程化的CasX效应蛋白(例如全长工程化的CasX)在一个分割位置处分开(即,分割)成两半,所述位置是所述参比CasX效应蛋白的N端部分与C端部分分开的点。在一些实施方式中,所述N末端部分包含氨基酸残基1至X,而所述C末端部分包含氨基酸残基X+1至所述参比CasX效应蛋白的C末端。在该实例中,编号是连续的,但这并非必需的,因为还考虑了氨基酸(或编码它们的核苷酸)可以从分割的末端和/或突变(例如,插入、删除和替换)中任一个在所述多肽链的内部区域修剪而来,条件是重构的CasX效应蛋白保留了足够的DNA结合活性(如果需要)、DNA切口酶或切割活性,例如与所述参比CasX效应蛋白相比,具有至少40%、50%、60%、70%、80%、90%或95%活性。The split CasX effector protein portion described herein can be designed to be split in half by splitting (i.e., splitting) a reference engineered CasX effector protein (e.g., full-length engineered CasX) at a split position, which is the point at which the N-terminal portion of the reference CasX effector protein is separated from the C-terminal portion. In some embodiments, the N-terminal portion comprises amino acid residues 1 to X, and the C-terminal portion comprises amino acid residues X+1 to the C-terminus of the reference CasX effector protein. In this example, the numbering is sequential, but this is not required, as it is also contemplated that amino acids (or nucleotides encoding them) may be trimmed from either split ends and/or mutations (e.g., insertions, deletions, and substitutions) in the interior region of the polypeptide chain, provided that the reconstituted CasX effector protein retains sufficient DNA binding activity (if desired), DNA nickase or cleavage activity, e.g., at least 40%, 50%, 60%, 70%, 80%, 90% compared to the reference CasX effector protein % or 95% active.
可以通过计算机(in silico)设计分割点并将其克隆到构建体中。在此过程中,可以将突变引入分割型CasX效应蛋白,并且可以去除非功能结构域。在一些实施方式中,所述分割型CasX效应蛋白的两个部分或片段(即,N末端和C末端片段)可以形成完整的CasX效应蛋白,其包含例如完整CasX效应蛋白序列的至少70%、至少80%、至少90%、至少95%或至少99%。Cutpoints can be designed in silico and cloned into constructs. During this process, mutations can be introduced into segmented CasX effector proteins and non-functional domains can be removed. In some embodiments, the two parts or fragments (i.e., N-terminal and C-terminal fragments) of the split CasX effector protein can form a complete CasX effector protein comprising, for example, at least 70%, at least 80%, at least 90%, at least 95%, or at least 99% of the complete CasX effector protein sequence.
所述分割型CasX效应蛋白可各自包含一个或多个二聚化结构域。在一些实施方式中,所述第一多肽包含与第一分割型CasX效应蛋白部分融合的第一二聚结构域,并且所述第二多肽包含与第二分割型CasX效应蛋白部分融合的第二二聚结构域。可通过肽接头(例如,柔性肽接头如GS接头)或化学键将所述二聚化结构域融合至所述分割型CasX效应蛋白部分。在一些实施方式中,所述二聚化结构域与所述分割型CasX效应蛋白部分的N末端融合。在一些实施方式中,所述二聚化结构域与所述分割型CasX效应蛋白部分的C末端融合。在一些实施方式中该分割型CasX效应蛋白不包含任何二聚化结构域。The segmented CasX effector proteins may each comprise one or more dimerization domains. In some embodiments, the first polypeptide comprises a first dimerization domain fused to a first segmented CasX effector protein portion, and the second polypeptide comprises a second dimerization domain fused to a second segmented CasX effector protein portion. The dimerization domain can be fused to the segmented CasX effector protein portion by a peptide linker (eg, a flexible peptide linker such as a GS linker) or a chemical bond. In some embodiments, the dimerization domain is fused to the N-terminus of the segmented CasX effector protein portion. In some embodiments, the dimerization domain is fused to the C-terminus of the segmented CasX effector portion. In some embodiments the segmented CasX effector protein does not comprise any dimerization domains.
在一些实施方式中,所述二聚化结构域促进两个分割型CasX效应蛋白部分的缔合。在一些实施方式中,所述分割型CasX效应蛋白部分被诱导剂诱导而缔合或二聚化为功能性CasX效应蛋白。在一些实施方式中,所述分 割型CasX效应蛋白包含可诱导的二聚化结构域。在一些实施方式中,所述二聚化结构域不是可诱导的二聚化结构域,即,所述二聚化结构域在诱导剂不存在下进行二聚化。In some embodiments, the dimerization domain facilitates the association of two segmented CasX effector protein moieties. In some embodiments, the segmented CasX effector portion is induced by an inducer to associate or dimerize into a functional CasX effector. In some embodiments, the Cut-type CasX effector proteins contain an inducible dimerization domain. In some embodiments, the dimerization domain is not an inducible dimerization domain, ie, the dimerization domain dimerizes in the absence of an inducing agent.
诱导剂可以是除指导RNA(例如sgRNA)以外的诱导能量源或诱导分子。所述诱导剂通过诱导的二聚化结构域的二聚化将两个分割型CasX效应蛋白部分重构为功能性CasX效应蛋白。在一些实施方式中,所述诱导剂通过可诱导的二聚化结构域的诱导缔合的作用将两个分割型CasX效应蛋白部分聚集在一起。在一些实施方式中,在没有诱导剂的情况下,两个分割型CasX效应蛋白部分不彼此缔合以重构为功能性CasX效应蛋白。在一些实施方式中,在没有诱导剂的情况下,两个分开的CasX效应蛋白部分可以在指导RNA(例如crRNA)存在下彼此缔合以重构为功能性CasX效应蛋白。An inducing agent can be an inducing energy source or an inducing molecule other than a guide RNA (eg, sgRNA). The inducer partially remodels the two segmented CasX effector proteins into a functional CasX effector protein through the induced dimerization of the dimerization domain. In some embodiments, the inducing agent brings together the two segmented CasX effector protein moieties by inducing association of the inducible dimerization domain. In some embodiments, the two segmented CasX effector protein moieties do not associate with each other to remodel into a functional CasX effector protein in the absence of an inducing agent. In some embodiments, in the absence of an inducing agent, two separate CasX effector protein moieties can associate with each other in the presence of a guide RNA (eg, crRNA) to reconstitute into a functional CasX effector protein.
本申请的诱导剂可以是热、超声、电磁能或化合物。在一些实施方式中,所述诱导剂是抗生素、小分子、激素、激素衍生物、类固醇或类固醇衍生物。在一些实施方式中,所述诱导剂是脱落酸(ABA)、多西霉素(DOX)、异丙基苯甲酸(cumate)、雷帕霉素、4-羟基他莫昔芬(4OHT)、雌激素或蜕皮激素。在一些实施方式中,所述分割型CasX效应系统是选自下组的诱导剂控制的系统:基于抗生素的诱导系统、基于电磁能的诱导系统、基于小分子的诱导系统、基于核受体的诱导系统和基于激素的诱导系统。在一些实施方式中,所述分割型CasX效应系统是选自下组的诱导剂控制的系统:四环素(Tet)/DOX诱导系统、光诱导系统、ABA诱导系统、异丙基苯甲酸(cumate)阻遏物/操纵子系统、4OHT/雌激素诱导系统、基于蜕皮激素的诱导系统和FKBP12/FRAP(FKBP12-雷帕霉素复合物)诱导系统。这样的诱导剂也讨论于本文和PCT/US2013/051418,其通过整体引用而并入本文。FRB/FKBP/雷帕霉素系统已描述于Paulmurugan and Gambhir,Cancer Res,August 15,2005 65;7413,以及Crabtree et al.,Chemistry & Biology 13,99-107,Jan 2006,其通过整体引用而并入本文。The inducer of the present application may be heat, ultrasound, electromagnetic energy or chemical compounds. In some embodiments, the inducing agent is an antibiotic, small molecule, hormone, hormone derivative, steroid, or steroid derivative. In some embodiments, the inducer is abscisic acid (ABA), doxycycline (DOX), cumate, rapamycin, 4-hydroxytamoxifen (4OHT), estrogen, or ecdysone. In some embodiments, the segmented CasX effector system is an inducer-controlled system selected from the group consisting of an antibiotic-based induction system, an electromagnetic energy-based induction system, a small molecule-based induction system, a nuclear receptor-based induction system, and a hormone-based induction system. In some embodiments, the segmented CasX effector system is an inducer-controlled system selected from the group consisting of a tetracycline (Tet)/DOX inducible system, a light inducible system, an ABA inducible system, a cumate repressor/operator system, a 4OHT/estrogen inducible system, an ecdysone-based inducible system, and a FKBP12/FRAP (FKBP12-rapamycin complex) inducible system. Such inducers are also discussed herein and in PCT/US2013/051418, which is hereby incorporated by reference in its entirety. The FRB/FKBP/rapamycin system has been described in Paulmurugan and Gambhir, Cancer Res, August 15, 2005 65; 7413, and Crabtree et al., Chemistry & Biology 13, 99-107, Jan 2006, which are hereby incorporated by reference in their entirety.
在一些实施方式中,成对的分割型CasX效应蛋白是分开的并且是无活性的,直到诱导了所述二聚化结构域的二聚化(例如,FRB和FKBP),该二聚化导致功能性CasX效应蛋白核酸酶的再组装。在一些实施方式中,包含诱导型二聚体(例如FRB)的第一半部分的第一分割型CasX效应蛋白被分开 递送,和/或处在与包含诱导型二聚体(例如FKBP)的第二半部分的第二分割型CasX效应蛋白分开的位置。In some embodiments, pairs of split CasX effector proteins are separated and inactive until dimerization of the dimerization domains (eg, FRB and FKBP) is induced, which dimerization results in reassembly of functional CasX effector protein nucleases. In some embodiments, the first split CasX effector protein comprising the first half of an inducible dimer (e.g., FRB) is separated delivered, and/or in a position separate from the second split CasX effector protein comprising the second half of the inducible dimer (eg, FKBP).
可用于本文所述的诱导剂控制的分割型CasX效应系统中的其他示例性的基于FKBP的诱导系统包括但不限于:在FK506存在下与钙调神经磷酸酶(CNA)进行二聚化的FKBP;在FKCsA存在下与CyP-Fas进行二聚化的FKBP;在雷帕霉素存在下与FRB进行二聚化的FKBP;在香豆霉素存在下与GryB进行二聚化的GyrB;在赤霉素存在下与GID1进行二聚化的GAI;或在HaXS存在下与HaloTag进行二聚化的Snap-tag。Other exemplary FKBP-based induction systems that can be used in the inducer-controlled split CasX effector systems described herein include, but are not limited to: FKBP that dimerizes with calcineurin (CNA) in the presence of FK506; FKBP that dimerizes with CyP-Fas in the presence of FKCsA; FKBP that dimerizes with FRB in the presence of rapamycin; GyrB that dimerizes with GryB in the presence of coumarycin; GAI that dimerizes with GID1 in the presence of GA; or Snap-tag that dimerizes with HaloTag in the presence of HaXS.
也考虑了FKBP家族本身内的替代方案。例如,在FK1012存在下FKBP进行均二聚(即,一个FKBP与另一种FKBP进行二聚化)。Alternatives within the FKBP family itself were also considered. For example, FKBPs homodimerize (ie, one FKBP dimerizes with another FKBP) in the presence of FK1012.
在一些实施方式中,所述二聚化结构域是FKBP,而诱导剂是FK1012。在一些实施方式中,所述二聚化结构域是GryB,而诱导剂是香豆霉素。在一些实施方式中,所述二聚化结构域是ABA,而诱导剂是赤霉素。In some embodiments, the dimerization domain is FKBP and the inducer is FK1012. In some embodiments, the dimerization domain is GryB and the inducer is Coumarin. In some embodiments, the dimerization domain is ABA and the inducing agent is gibberellin.
在一些实施方式中,可以在不存在诱导剂的情况下自动诱导(即,自动激活或自诱导)所述分割型CasX效应蛋白部分以缔合/二聚化为功能性CasX效应蛋白。不受任何理论或假设的束缚,可以通过与指导RNA如crRNA结合来介导所述分割型CasX效应蛋白部分的自动诱导。在一些实施方式中,所述第一多肽和第二多肽不包含二聚化结构域。在一些实施方式中,所述第一多肽和第二多肽包含二聚化结构域。In some embodiments, the segmented CasX effector portion can be autoinduced (ie, autoactivated or autoinduced) in the absence of an inducing agent to associate/dimerize into a functional CasX effector. Without being bound by any theory or hypothesis, the auto-induction of the segmented CasX effector portion may be mediated by binding to a guide RNA such as crRNA. In some embodiments, the first polypeptide and the second polypeptide do not comprise a dimerization domain. In some embodiments, the first polypeptide and the second polypeptide comprise a dimerization domain.
在一些实施方式中,本文所述分割型CasX效应系统(包括诱导剂控制的和自动诱导的系统)的重构的CasX效应蛋白,相对于参比CasX效应蛋白编辑效率,具有至少70%(如至少约70%、75%、80%、85%、90%、95%、98%、99%或更多效率或100%效率中任一种)的编辑效率。In some embodiments, the reconstituted CasX effector protein of the split CasX effector system described herein (including inducer-controlled and auto-inducible systems) has an editing efficiency of at least 70% (such as at least about any of 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99% or more efficiency, or 100% efficiency) relative to a reference CasX effector protein editing efficiency.
在一些实施方式中,本文所述诱导剂控制的分割型CasX效应系统的重构的CasX效应蛋白,在没有诱导剂存在下(即由于自动诱导),相对于参比CasX效应蛋白编辑效率,其具有不超过50%(如不超过约50%、45%、40%、35%、30%、25%、20%、15%、10%、5%或更低效率,或0%效率中任一个)的编辑效率。In some embodiments, the reconstituted CasX effector protein of the inducer-controlled split CasX effector system described herein has an editing efficiency of no more than 50% (such as no more than any of about 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, 5% or less efficiency, or 0% efficiency) relative to the editing efficiency of a reference CasX effector protein in the absence of the inducer (i.e., due to auto-induction).
本申请还提供了工程化的CasX效应蛋白,其包含另外的蛋白结构域和/或组分,如接头、核定位/输出序列、功能结构域和/或报告蛋白。 The present application also provides engineered CasX effector proteins comprising additional protein domains and/or components, such as linkers, nuclear localization/export sequences, functional domains and/or reporter proteins.
在一些实施方式中,所述工程化的CasX效应蛋白是包含一个或多个异源蛋白结构域(例如,约或大于约1、2、3、4、5、6、7、8、9、10个或更多个结构域)以及所述工程化的CasX核酸酶的核酸靶向结构域或其功能衍生物的蛋白复合物。在一些实施方式中,所述工程化的CasX效应蛋白是包含与所述工程化的CasX核酸酶融合的一个或多个异源蛋白结构域(例如约或多于约1、2、3、4、5、6、7、8、9、10个或更多个结构域)的融合蛋白。In some embodiments, the engineered CasX effector protein is a protein complex comprising one or more heterologous protein domains (e.g., about or greater than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains) and the nucleic acid targeting domain of the engineered CasX nuclease or a functional derivative thereof. In some embodiments, the engineered CasX effector protein is a fusion protein comprising one or more heterologous protein domains (e.g., about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more domains) fused to the engineered CasX nuclease.
在一些实施方式中,本申请的工程化的CasX效应蛋白可包含(例如,通过融合蛋白,如通过一个或多个肽接头,例如GS肽接头等)一个或多个功能结构域或缔合(例如,通过多种蛋白的共表达)于它。在一些实施方式中,所述一个或多个功能结构域是酶结构域。这些功能结构域可以具有多种活性,例如DNA和/或RNA甲基化酶活性、脱甲基酶活性、转录激活活性、转录抑制活性、转录释放因子活性、组蛋白修饰活性、RNA切割活性、DNA切割活性、核酸结合活性和开关活性(例如,光诱导的)。在一些实施方式中,所述一个或多个功能结构域是转录激活结构域(即,反式激活结构域)或阻遏物结构域。在一些实施方式中,所述一个或多个功能结构域是组蛋白修饰结构域。在一些实施方式中,所述一个或多个功能结构域是转座酶结构域、HR(同源重组)机构结构域、重组酶结构域和/或整合酶结构域。在一些实施方式中,所述功能结构域是Krüppel相关盒(KRAB)、VP64、VP16、Fok1、P65、HSF1、MyoD1、生物素-APEX、APOBEC1、AID、PmCDA1、Tad1和M-MLV逆转录酶。在一些实施方式中,所述功能结构域选自下组:翻译起始结构域、转录阻遏结构域、反式激活结构域、表观遗传修饰结构域、核碱基编辑结构域(例如,CBE或ABE结构域)、逆转录酶结构域、报告分子结构域(例如,荧光结构域)和核酸酶结构域。In some embodiments, the engineered CasX effector protein of the present application may comprise (for example, through a fusion protein, such as through one or more peptide linkers, such as a GS peptide linker, etc.) one or more functional domains or associate (for example, through co-expression of multiple proteins) with it. In some embodiments, the one or more functional domains are enzymatic domains. These functional domains can have various activities, such as DNA and/or RNA methylase activity, demethylase activity, transcriptional activation activity, transcriptional repression activity, transcriptional release factor activity, histone modification activity, RNA cleavage activity, DNA cleavage activity, nucleic acid binding activity, and switching activity (e.g., light-induced). In some embodiments, the one or more functional domains are transcriptional activation domains (ie, transactivation domains) or repressor domains. In some embodiments, the one or more functional domains are histone modification domains. In some embodiments, the one or more functional domains are transposase domains, HR (homologous recombination) machinery domains, recombinase domains, and/or integrase domains. In some embodiments, the functional domain is Krüppel-associated box (KRAB), VP64, VP16, Fok1, P65, HSF1, MyoD1, Biotin-APEX, APOBEC1, AID, PmCDA1, Tad1, and M-MLV reverse transcriptase. In some embodiments, the functional domain is selected from the group consisting of a translation initiation domain, a transcriptional repression domain, a transactivation domain, an epigenetic modification domain, a nucleobase editing domain (eg, a CBE or ABE domain), a reverse transcriptase domain, a reporter domain (eg, a fluorescent domain), and a nuclease domain.
在一些实施方式中,所述工程化的CasX效应蛋白中一个或多个功能结构域的定位允许功能结构域的正确空间定向以影响具有所赋予功能作用的靶标。例如,如果所述功能结构域是转录激活子(例如,VP16、VP64或p65),则将所述转录激活子放置在使其能够影响靶标转录的空间取向上。同样地,定位转录阻遏物以影响靶标的转录,并且将核酸酶(例如,Fok1)定位以切割或部分切割所述靶标。在一些实施方式中,所述功能结构域位于所述工程化的CasX效应蛋白的N末端。在一些实施方式中,所述功能结构域位于工程化的CasX效应蛋白的C末端。在一些实施方式中,所述工程化的CasX效 应蛋白在N末端包含第一功能结构域,并在C末端包含第二功能结构域。在一些实施方式中,所述工程化CasX效应蛋白包含与一个或多个功能结构域融合的本文所述工程化的CasX核酸酶中任一种的催化失活突变体。In some embodiments, the positioning of one or more functional domains in the engineered CasX effector protein allows correct spatial orientation of the functional domains to affect the target with the conferred functional effect. For example, if the functional domain is a transcriptional activator (eg, VP16, VP64, or p65), the transcriptional activator is placed in a spatial orientation that enables it to affect the transcription of the target. Likewise, a transcriptional repressor is positioned to affect the transcription of a target, and a nuclease (eg, Fok1 ) is positioned to cleave or partially cleave the target. In some embodiments, the functional domain is located at the N-terminus of the engineered CasX effector protein. In some embodiments, the functional domain is located at the C-terminus of the engineered CasX effector protein. In some embodiments, the engineered CasX effector The response protein contains a first functional domain at the N-terminus and a second functional domain at the C-terminus. In some embodiments, the engineered CasX effector protein comprises a catalytically inactive mutant of any of the engineered CasX nucleases described herein fused to one or more functional domains.
在一些实施方式中,所述工程化的CasX效应蛋白是转录激活子。在一些实施方式中,所述工程化的CasX效应蛋白包含与反式激活结构域融合的本文所述工程化的CasX核酸酶中任一种的酶失活变体。在一些实施方式中,所述反式激活域选自下组:VP64、p65、HSF1、VP16、MyoD1、HSF1、RTA、SET7/9及其组合。在一些实施方式中,所述反式激活结构域包含VP64、p65和HSF1。在一些实施方式中,所述工程化的CasX效应蛋白包含两个分割型CasX效应多肽,每个都与反式激活结构域融合。In some embodiments, the engineered CasX effector protein is a transcriptional activator. In some embodiments, the engineered CasX effector protein comprises an enzyme-inactive variant of any of the engineered CasX nucleases described herein fused to a transactivation domain. In some embodiments, the transactivation domain is selected from the group consisting of VP64, p65, HSF1, VP16, MyoD1, HSF1, RTA, SET7/9, and combinations thereof. In some embodiments, the transactivation domain comprises VP64, p65, and HSF1. In some embodiments, the engineered CasX effector protein comprises two split CasX effector polypeptides, each fused to a transactivation domain.
在一些实施方式中,所述工程化的CasX效应蛋白是转录阻遏物。在一些实施方式中,所述工程化的CasX效应蛋白包含与转录阻遏结构域融合的本文所述工程化的CasX核酸酶中任一种的酶失活变体。在一些实施方式中,所述转录阻遏物结构域选自下组:Krüppel相关盒(KRAB)、EnR、NuE、NcoR、SID、SID4X及其组合。在一些实施方式中,所述工程化的CasX效应蛋白包含两个分割型CasX效应多肽,每个都与转录阻遏结构域融合。In some embodiments, the engineered CasX effector protein is a transcriptional repressor. In some embodiments, the engineered CasX effector protein comprises an enzyme-inactive variant of any of the engineered CasX nucleases described herein fused to a transcriptional repression domain. In some embodiments, the transcriptional repressor domain is selected from the group consisting of Krüppel-associated box (KRAB), EnR, NuE, NcoR, SID, SID4X, and combinations thereof. In some embodiments, the engineered CasX effector protein comprises two split CasX effector polypeptides, each fused to a transcriptional repression domain.
在一些实施方式中,所述工程化的CasX效应蛋白是碱基编辑器,如胞嘧啶编辑器或腺苷编辑器。在一些实施方式中,所述工程化的CasX效应蛋白包含与核碱基编辑结构域融合的本文所述任何工程化的CasX核酸酶中任一种的酶失活变体,所述核碱基编辑结构域诸如为胞嘧啶碱基编辑(CBE)结构域或腺苷碱基编辑(ABE)域。在一些实施方式中,所述核碱基编辑结构域是DNA编辑结构域。在一些实施方式中,所述核碱基编辑结构域具有脱氨酶活性。在一些实施方式中,所述核碱基编辑结构域是胞嘧啶脱氨酶结构域。在一些实施方式中,所述核碱基编辑结构域是腺苷脱氨酶结构域。基于Cas核酸酶的示例性碱基编辑器描述于例如,WO2018/165629A1和WO2019/226953A1,其通过整体引用而并入本文。示例性CBE结构域包括但不限于:活化诱导的胞苷脱氨酶或AID(例如hAID)、载脂蛋白B mRNA编辑复合物或APOBEC(例如大鼠APOBEC1、hAPOBEC3 A/B/C/D/E/F/G)和PmCDA1。示例性ABE结构域包括但不限于:TadA、ABE8及其变体(参见例如Gaudelli et al.,2017,Nature 551:464-471;and Richter et al.,2020,Nature Biotechnology 38:883-891)。在一些实施方式中,所述功能性结构域是 APOBEC1结构域,例如大鼠APOBEC1结构域。在一些实施方式中,所述功能性结构域是TadA结构域,例如大肠杆菌(E.coli)TadA结构域。在一些实施方式中所述工程化的CasX效应蛋白还包含一个或多个核定位序列。In some embodiments, the engineered CasX effector protein is a base editor, such as a cytosine editor or an adenosine editor. In some embodiments, the engineered CasX effector protein comprises an enzyme-inactive variant of any of the engineered CasX nucleases described herein fused to a nucleobase editing domain, such as a cytosine base editing (CBE) domain or an adenosine base editing (ABE) domain. In some embodiments, the nucleobase editing domain is a DNA editing domain. In some embodiments, the nucleobase editing domain has deaminase activity. In some embodiments, the nucleobase editing domain is a cytosine deaminase domain. In some embodiments, the nucleobase editing domain is an adenosine deaminase domain. Exemplary base editors based on Cas nucleases are described, eg, in WO2018/165629A1 and WO2019/226953A1, which are incorporated herein by reference in their entirety. Exemplary CBE domains include, but are not limited to, activation-induced cytidine deaminase or AID (e.g., hAID), apolipoprotein B mRNA editing complex or APOBEC (e.g., rat APOBEC1, hAPOBEC3 A/B/C/D/E/F/G), and PmCDA1. Exemplary ABE domains include, but are not limited to: TadA, ABE8, and variants thereof (see, eg, Gaudelli et al., 2017, Nature 551:464-471; and Richter et al., 2020, Nature Biotechnology 38:883-891). In some embodiments, the functional domain is APOBEC1 domain, such as a rat APOBEC1 domain. In some embodiments, the functional domain is a TadA domain, such as an E. coli TadA domain. In some embodiments the engineered CasX effector protein further comprises one or more nuclear localization sequences.
在一些实施方式中,所述工程化的CasX效应蛋白是主编辑器。基于Cas9的主编辑器描述于例如,A.Anzalone et al.,Nature,2019,576(7785):149-157,其通过整体引用而并入本文。在一些实施方式中,所述工程化的CasX效应蛋白包含与逆转录酶结构域融合的本文所述工程化的CasX核酸酶中任一种的切口酶变体。在一些实施方式中,所述功能结构域是逆转录酶结构域。在一些实施方式中,所述逆转录酶结构域是M-MLV逆转录酶或其变体,例如具有D200N、T306K、W313F、T330P和L603W的一个或多个突变的M-MLV逆转录酶。在一些实施方式中,提供了包含所述主编辑器的工程化CRISPR/CasX系统。在一些实施方式中,所述工程化的CRISPR/CasX系统还包含第二CasX切口酶,例如基于与主编辑器的相同的工程化的CasX核酸酶。在一些实施方式中,所述工程化的CRISPR/CasX系统包含主编辑器指导RNA(pegRNA),其包含引物结合位点和逆转录酶(RT)模板序列。In some embodiments, the engineered CasX effector protein is a master editor. Cas9-based master editors are described, e.g., in A. Anzalone et al., Nature, 2019, 576(7785):149-157, which is hereby incorporated by reference in its entirety. In some embodiments, the engineered CasX effector protein comprises a nickase variant of any one of the engineered CasX nucleases described herein fused to a reverse transcriptase domain. In some embodiments, the functional domain is a reverse transcriptase domain. In some embodiments, the reverse transcriptase domain is M-MLV reverse transcriptase or a variant thereof, such as an M-MLV reverse transcriptase having one or more mutations of D200N, T306K, W313F, T330P, and L603W. In some embodiments, an engineered CRISPR/CasX system comprising said master editor is provided. In some embodiments, the engineered CRISPR/CasX system further comprises a second CasX nickase, eg, based on the same engineered CasX nuclease as the primary editor. In some embodiments, the engineered CRISPR/CasX system comprises a master editor guide RNA (pegRNA) comprising a primer binding site and a reverse transcriptase (RT) template sequence.
在一些实施方式中,本申请提供了具有与分割型CasX效应蛋白部分之一或两者缔合(即,结合或融合)的一个或多个(例如,1、2、3、4、5、6个或更多个)功能结构域的分割型CasX效应系统。所述功能结构域可以作为所第一和/或第二分割型CasX效应蛋白的一部分提供,作为该构建体内的融合物。所述功能结构域通常通过肽接头(诸如GS接头)与分割型CasX效应蛋白中其他部分融合(例如,分割型CasX效应蛋白部分)。这些功能结构域可用于基于催化失活CasX效应蛋白重新改换该分割型CasX效应系统的功能。In some embodiments, the application provides segmented CasX effector systems having one or more (e.g., 1, 2, 3, 4, 5, 6 or more) functional domains associated with (i.e., bound or fused to) one or both of the segmented CasX effector protein moieties. Said functional domain may be provided as part of said first and/or second segmented CasX effector protein, as a fusion within the construct. The functional domain is usually fused to other parts of the segmented CasX effector protein (eg, segmented CasX effector protein portion) via a peptide linker (such as a GS linker). These functional domains can be used to reprogram the function of this segmented CasX effector system based on catalytically inactive CasX effector proteins.
在一些实施方式中,所述工程化的CasX效应蛋白包含一个或多个核定位序列(NLS)和/或一个或多个核输出序列(NES)。示例性的NLS序列包括例如PKKKRKVPG和ASPKKKRKV。NLS和/或NES可以可操作地连接至所述工程化的CasX效应蛋白的N末端和/或C末端或所述工程化的CasX效应蛋白中的多肽链。In some embodiments, the engineered CasX effector protein comprises one or more nuclear localization sequences (NLS) and/or one or more nuclear export sequences (NES). Exemplary NLS sequences include, eg, PKKKRKVPG and ASPKKKRKV. NLS and/or NES can be operably linked to the N-terminus and/or C-terminus of the engineered CasX effector protein or a polypeptide chain in the engineered CasX effector protein.
在一些实施方式中,所述工程化的CasX效应蛋白可以编码另外的组分,例如报告蛋白。在一些实施方式中,所述工程化的CasX效应蛋白包含荧光蛋白,例如GFP。这样的系统可以允许对基因组位点进行成像(例如,参见“Dynamic Imaging of Genomic Loci in Living Human Cells by an Optimized CRISPR/Cas System”Chen B et al.Cell 2013)。在一些实施方式中所述工程化的CasX效应蛋白是可用于对基因组位点成像的可诱导分割型CasX效应系统。In some embodiments, the engineered CasX effector protein can encode additional components, such as a reporter protein. In some embodiments, the engineered CasX effector protein comprises a fluorescent protein, such as GFP. Such systems may allow imaging of genomic loci (see, for example, "Dynamic Imaging of Genomic Loci in Living Human Cells by an Optimized CRISPR/Cas System" Chen B et al. Cell 2013). In some embodiments the engineered CasX effector protein is an inducible segmentable CasX effector system that can be used to image genomic loci.
在又一具体实施方式中,提供了一种工程化的CasX效应蛋白,其中所述效应蛋白能够诱导DNA分子中的双链断裂或单链断裂。In yet another specific embodiment, an engineered CasX effector protein is provided, wherein the effector protein is capable of inducing double-strand breaks or single-strand breaks in DNA molecules.
在再一具体实施方式中,提供了一种工程化的CasX效应蛋白,其中所述工程化的CasX核酸酶的功能衍生物为酶失活突变体。In yet another specific embodiment, an engineered CasX effector protein is provided, wherein the functional derivative of the engineered CasX nuclease is an enzyme inactive mutant.
在又一具体实施方式中,提供了一种工程化的CasX效应蛋白,其还包含与所述工程化的CasX核酸酶融合的功能结构域。In yet another specific embodiment, an engineered CasX effector protein is provided, which further comprises a functional domain fused with the engineered CasX nuclease.
在再一具体实施方式中,提供了一种工程化的CasX效应蛋白,其中所述功能结构域选自下组:翻译起始结构域、转录阻遏结构域、反式激活结构域、表观遗传修饰结构域、核碱基编辑结构域、逆转录酶结构域、报告分子结构域和核酸酶结构域。In yet another specific embodiment, an engineered CasX effector protein is provided, wherein the functional domain is selected from the group consisting of a translation initiation domain, a transcriptional repression domain, a transactivation domain, an epigenetic modification domain, a nucleobase editing domain, a reverse transcriptase domain, a reporter domain and a nuclease domain.
本申请在第三方面提供了工程化的CRISPR-CasX系统The present application provides an engineered CRISPR-CasX system in a third aspect
在一个具体实施方式中,提供了一种工程化的CRISPR-CasX系统,包括:(a)前述工程化CasX效应蛋白;以及(b)包含与靶序列互补的指导序列的指导RNA,或编码所述指导RNA的一种或多种核酸,其中工程化的CasX效应蛋白和所述指导RNA能够形成CRISPR复合物,所述CRISPR复合物特异性结合包含所述靶序列的靶核酸并诱导所述靶核酸的修饰。In a specific embodiment, an engineered CRISPR-CasX system is provided, comprising: (a) the aforementioned engineered CasX effector protein; and (b) a guide RNA comprising a guide sequence complementary to a target sequence, or one or more nucleic acids encoding the guide RNA, wherein the engineered CasX effector protein and the guide RNA can form a CRISPR complex, and the CRISPR complex specifically binds to the target nucleic acid comprising the target sequence and induces modification of the target nucleic acid.
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
在一些实施方式中,所述工程化的CRISPR-CasX系统包括:(a)本文所述工程化的CasX效应蛋白中任一种(例如,工程化的CasX核酸酶、切口酶、分割型CasX、转录阻遏物、转录激活子、碱基编辑器或主编辑器);以及(b)包含与靶序列互补的指导序列的指导RNA,或编码所述指导RNA的一种或多种核酸,其中所述工程化的CasX效应蛋白和所述指导RNA能够形成CRISPR复合物,所述CRISPR复合物特异性结合于包含所述靶序列的靶核酸并诱导所述靶核酸的修饰。在一些实施方式中,所述工程化的CRISPR-CasX系统包含编码所述工程化的CasX效应蛋白和/或所述指导RNA的一种或多种核酸。在一些实施方式中,所述工程化的CRISPR-CasX 系统包含前体指导RNA阵列,其可以例如通过所述工程化的CasX效应蛋白加工成多个crRNA。在一些实施方式中,所述工程化的CRISPR-CasX系统包含一种或多种编码所述工程化的CasX效应蛋白和/或所述指导RNA的载体。在一些实施方式中,所述工程化的CRISPR-CasX系统包含核糖核蛋白(RNP)复合物,其包含结合至所述指导RNA的所述工程化的CasX效应蛋白。In some embodiments, the engineered CRISPR-CasX system comprises: (a) any of the engineered CasX effector proteins described herein (e.g., an engineered CasX nuclease, nickase, segmented CasX, transcription repressor, transcription activator, base editor, or master editor); and (b) a guide RNA comprising a guide sequence complementary to a target sequence, or one or more nucleic acids encoding the guide RNA, wherein the engineered CasX effector protein and the guide RNA are capable of forming a CRISPR complex, the The CRISPR complex specifically binds to a target nucleic acid comprising the target sequence and induces modification of the target nucleic acid. In some embodiments, the engineered CRISPR-CasX system comprises one or more nucleic acids encoding the engineered CasX effector protein and/or the guide RNA. In some embodiments, the engineered CRISPR-CasX The system comprises an array of precursor guide RNAs that can be processed into multiple crRNAs, eg, by the engineered CasX effector proteins. In some embodiments, the engineered CRISPR-CasX system comprises one or more vectors encoding the engineered CasX effector protein and/or the guide RNA. In some embodiments, the engineered CRISPR-CasX system comprises a ribonucleoprotein (RNP) complex comprising the engineered CasX effector protein bound to the guide RNA.
本申请的工程化的CRISPR-CasX系统可包含任何合适的指导RNA。指导RNA(gRNA)可以包含能够与目标靶核酸(诸如细胞中的目标基因组位点)中的靶序列杂交的指导序列。在一些实施方式中,所述gRNA包含含有所述指导序列的CRISPR RNA(crRNA)序列。The engineered CRISPR-CasX system of the present application can comprise any suitable guide RNA. A guide RNA (gRNA) may comprise a guide sequence capable of hybridizing to a target sequence in a target nucleic acid of interest, such as a genomic site of interest in a cell. In some embodiments, the gRNA comprises a CRISPR RNA (crRNA) sequence comprising the guide sequence.
通常,本文所述的crRNA包括直接重复序列和间隔区序列。在某些实施方式中,所述crRNA包括与指导序列或间隔区序列连接的直接重复序列、基本由或由它组成。在一些实施方式中,所述crRNA包括直接重复序列、间隔区序列和直接重复序列(DR-间隔区序列-DR),其是前体crRNA(pre-crRNA)构型的典型特征。在一些实施方式中,所述crRNA包括截短的直接重复序列和间隔区序列,其是经加工的或成熟的crRNA的典型特征。在一些实施方式中,所述CRISPR-CasX效应蛋白与RNA指导序列形成复合物,并且所述间隔区序列将所述复合物引导至与靶核酸进行序列特异性结合,所述靶核酸与间隔区序列互补。Typically, the crRNAs described herein include direct repeat sequences and spacer sequences. In certain embodiments, the crRNA comprises, consists essentially of, or consists of a direct repeat sequence linked to a guide sequence or a spacer sequence. In some embodiments, the crRNA includes a direct repeat sequence, a spacer sequence, and a direct repeat sequence (DR-spacer sequence-DR), which are typical features of a precursor crRNA (pre-crRNA) configuration. In some embodiments, the crRNA includes truncated direct repeat and spacer sequences, which are typical features of processed or mature crRNA. In some embodiments, the CRISPR-CasX effector protein forms a complex with an RNA guide sequence, and the spacer sequence directs the complex to sequence-specific binding to a target nucleic acid that is complementary to the spacer sequence.
在一些实施方式中,所述指导RNA是包含指导序列的crRNA。在一些实施方式中,所述工程化的CRISPR-CasX系统包含编码多个crRNA的前体指导RNA阵列。在一些实施方式中,所述CasX效应蛋白切割所述前体指导RNA阵列以产生多个crRNA。在一些实施方式中,所述工程化的CRISPR-CasX系统包含编码多个crRNA的前体指导RNA阵列,其中每个crRNA包含不同的指导序列。In some embodiments, the guide RNA is a crRNA comprising a guide sequence. In some embodiments, the engineered CRISPR-CasX system comprises an array of precursor guide RNAs encoding a plurality of crRNAs. In some embodiments, the CasX effector protein cleaves the array of precursor guide RNAs to generate a plurality of crRNAs. In some embodiments, the engineered CRISPR-CasX system comprises an array of precursor guide RNAs encoding multiple crRNAs, wherein each crRNA comprises a different guide sequence.
所述指导序列可以具有合适的长度。在一些实施方式中,所述指导序列在约18至约35个核苷酸之间,包括例如18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34或35个核苷酸。所述指导序列可以与靶核酸的靶序列具有至少约70%、75%、80%、85%、90%、95%、98%、99%或100%的互补性。 The guide sequence can be of suitable length. In some embodiments, the guide sequence is between about 18 and about 35 nucleotides, including, for example, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, or 35 nucleotides. The guide sequence may be at least about 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% complementary to the target sequence of the target nucleic acid.
本文还提供了编码本文所述工程化的CasX效应蛋白(例如工程化CasX核酸酶)中任一种的构建体、载体和表达系统。在一些实施方式中,所述构建体、载体或表达系统还包含一种或多种gRNA或crRNA阵列。Also provided herein are constructs, vectors, and expression systems encoding any of the engineered CasX effector proteins (eg, engineered CasX nucleases) described herein. In some embodiments, the construct, vector or expression system further comprises one or more gRNA or crRNA arrays.
“载体”是包含分离的核酸并且可以用于将所述分离的核酸递送至细胞内部的物质组合物。许多载体是本领域已知的,包括但不限于:线性多核苷酸、与离子或两亲性化合物缔合的多核苷酸、质粒和病毒。通常,合适的载体包含在至少一种生物中起作用的复制起点、启动子序列、方便的限制性核酸内切酶位点和一种或多种选择性标记物。术语“载体”也应被解释为包括非质粒和非病毒化合物,其促进核酸转移到细胞中,诸如例如,聚赖氨酸化合物、脂质体等。A "vector" is a composition of matter comprising an isolated nucleic acid and which can be used to deliver the isolated nucleic acid to the interior of a cell. Many vectors are known in the art including, but not limited to, linear polynucleotides, polynucleotides associated with ionic or amphiphilic compounds, plasmids, and viruses. In general, suitable vectors contain an origin of replication functional in at least one organism, a promoter sequence, convenient restriction endonuclease sites and one or more selectable markers. The term "vector" should also be construed to include non-plasmid and non-viral compounds that facilitate the transfer of nucleic acids into cells, such as, for example, polylysine compounds, liposomes, and the like.
在一些实施方式中,所述载体是病毒载体。病毒载体的实例包括但不限于:腺病毒载体、腺相关病毒载体、慢病毒载体、逆转录病毒载体、牛痘载体、单纯疱疹病毒载体及其衍生物。在一些实施方式中,所述载体是噬菌体载体。病毒载体技术是本领域众所周知的,并且例如描述于Sambrook等人(2001,Molecular Cloning:A Laboratory Manual,Cold Spring Harbor Laboratory,New York),以及其他病毒学和分子生物学手册。In some embodiments, the vector is a viral vector. Examples of viral vectors include, but are not limited to, adenoviral vectors, adeno-associated viral vectors, lentiviral vectors, retroviral vectors, vaccinia vectors, herpes simplex virus vectors, and derivatives thereof. In some embodiments, the vector is a phage vector. Viral vector technology is well known in the art and is described, for example, in Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York), among other handbooks of virology and molecular biology.
已经开发了许多基于病毒的系统,用于将基因转移到哺乳动物细胞中。例如,逆转录病毒为基因递送系统提供了便利的平台。可以使用本领域已知的技术将异源核酸插入载体,并包装在逆转录病毒颗粒中。然后可以分离重组病毒,并在体外或离体递送至所述工程化的哺乳动物细胞。许多逆转录病毒系统是本领域已知的。在一些实施方式中,使用腺病毒载体。许多腺病毒载体是本领域已知的。在一些实施方式中,使用慢病毒载体。在一些实施方式中,使用自灭活的慢病毒载体。A number of virus-based systems have been developed for gene transfer into mammalian cells. For example, retroviruses provide a convenient platform for gene delivery systems. The heterologous nucleic acid can be inserted into a vector and packaged in retroviral particles using techniques known in the art. Recombinant virus can then be isolated and delivered to the engineered mammalian cells in vitro or ex vivo. Many retroviral systems are known in the art. In some embodiments, an adenoviral vector is used. Many adenoviral vectors are known in the art. In some embodiments, lentiviral vectors are used. In some embodiments, a self-inactivating lentiviral vector is used.
在某些实施方式中,所述载体是腺相关病毒(AAV)载体,例如AAV2、AAV8或AAV9,其可以按包含至少1×105个颗粒(也称为颗粒单位,pu)的单次给药施用腺病毒或腺相关病毒。在一些实施方式中,所述给药量是至少约1×106个颗粒,至少约1×107个颗粒,至少约1×108个颗粒,或至少约1×109个颗粒的腺相关病毒。递送方法和给药量描述于例如WO 2016205764和美国专利No.8,454,972,其通过整体引用而并入本文。In certain embodiments, the vector is an adeno-associated virus (AAV) vector, such as AAV2, AAV8, or AAV9, which can administer the adenovirus or adeno-associated virus in a single dose comprising at least 1 x 105 particles (also known as particle units, pu). In some embodiments, the administered amount is at least about 1×10 6 particles, at least about 1×10 7 particles, at least about 1×10 8 particles, or at least about 1×10 9 particles of adeno-associated virus. Delivery methods and dosages are described, for example, in WO 2016205764 and US Patent No. 8,454,972, which are incorporated herein by reference in their entirety.
在一些实施方式中,所述载体是重组腺相关病毒(rAAV)载体。例如,在一些实施方式中,可以使用经修饰的AAV载体递送。经修饰的AAV载体可 以基于几种衣壳类型中的一种或多种,包括AAV1、AV2、AAV5、AAV6、AAV8、AAV8.2、AAV9、AAV rh10,经修饰的AAV载体(例如经修饰的AAV2,经修饰的AAV3,经修饰的AAV6)和假型AAV(例如AAV2/8,AAV2/5和AAV2/6)。可用于产生rAAV颗粒的示例性AAV载体和技术是本领域已知的(参见例如Aponte-Ubillus et al.(2018)Appl.Microbiol.Biotechnol.102(3):1045-54;Zhong et al.(2012)J.Genet.Syndr.Gene Ther.S1:008;West et al.(1987)Virology 160:38-47(1987);Tratschin et al.(1985)Mol.Cell.Biol.5:3251-60;美国专利Nos.4,797,368和5,173,414;国际公开第WO2015/054653号和第WO93/24641号,其各自通过引用并入本文)。In some embodiments, the vector is a recombinant adeno-associated virus (rAAV) vector. For example, in some embodiments, modified AAV vectors can be used for delivery. Modified AAV vectors can to be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, AAV8.2, AAV9, AAV rh10, modified AAV vectors (e.g. modified AAV2, modified AAV3, modified AAV6) and pseudotyped AAV (e.g. AAV2/8, AAV2/5 and AAV2/6). Exemplary AAV vectors and techniques that can be used to generate rAAV particles are known in the art (see, e.g., Aponte-Ubillus et al. (2018) Appl. Microbiol. Biotechnol. 102(3):1045-54; Zhong et al. (2012) J. Genet. Syndr. Gene Ther. 160:38-47 (1987); Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-60; US Patent Nos. 4,797,368 and 5,173,414; International Publication Nos. WO2015/054653 and WO93/24641, each of which is incorporated herein by reference).
用于递送Cas9和其他Cas蛋白的任何已知的AAV载体都可以用于递送本申请的工程化的CasX系统。Any known AAV vector used to deliver Cas9 and other Cas proteins can be used to deliver the engineered CasX system of the present application.
将载体引入哺乳动物细胞的方法是本领域已知的。可以通过物理、化学或生物学方法将载体转移到宿主细胞中。Methods for introducing vectors into mammalian cells are known in the art. Vectors can be transferred into host cells by physical, chemical or biological means.
用于将载体引入宿主细胞的物理方法包括:磷酸钙沉淀、脂质转染、粒子轰击、显微注射、电穿孔等。产生包含载体和/或外源核酸的细胞的方法是本领域众所周知的。参见,例如Sambrook et al.(2001)Molecular Cloning:A Laboratory Manual,Cold Spring Harbor Laboratory,New York。在一些实施方式中,通过电穿孔将所述载体引入所述细胞。Physical methods used to introduce vectors into host cells include: calcium phosphate precipitation, lipofection, particle bombardment, microinjection, electroporation, and the like. Methods for producing cells comprising vectors and/or exogenous nucleic acids are well known in the art. See, e.g., Sambrook et al. (2001) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York. In some embodiments, the vector is introduced into the cells by electroporation.
用于将异源核酸引入宿主细胞的生物学方法包括使用DNA和RNA载体。病毒载体已成为将基因插入哺乳动物例如人细胞的最广泛使用的方法。Biological methods for introducing heterologous nucleic acids into host cells include the use of DNA and RNA vectors. Viral vectors have become the most widely used method for inserting genes into mammalian, eg human, cells.
用于将载体引入宿主细胞的化学方法包括胶体分散系统如大分子复合物、纳米胶囊、微球、珠子和基于脂质的系统,所述基于脂质的系统包括水包油乳液、胶束、混合胶束和脂质体。用作体外递送载体的示例性胶体系统是脂质体(例如,人工膜囊泡)。在一些实施方式中,所述工程化的CRISPR-CasX系统以RNP的形式在纳米颗粒中递送。Chemical methods for introducing vectors into host cells include colloidal dispersion systems such as macromolecular complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. An exemplary colloidal system for use as an in vitro delivery vehicle is a liposome (eg, an artificial membrane vesicle). In some embodiments, the engineered CRISPR-CasX system is delivered in nanoparticles as RNPs.
在一些实施方式中,编码所述CRISPR-CasX系统或其组分的载体或表达系统包含一种或多种可选择或可检测的标记物,该标记物提供了分离或有效选择含有和/或已被CRISPR-CasX系统(例如在早期和大规模)修饰的细胞的手段。In some embodiments, the vector or expression system encoding the CRISPR-CasX system or components thereof comprises one or more selectable or detectable markers that provide a means to isolate or efficiently select cells that contain and/or have been modified (e.g., at an early stage and at a large scale) by the CRISPR-CasX system.
报告基因可用于鉴定潜在转染的细胞和评估调节序列的功能。通常,报告基因是在受体生物体或组织中不存在或不表达的基因,其编码的多肽的表 达是通过某些易于检测的性质(例如酶活性)来证明。在将DNA引入受体细胞后的合适时间测定报告基因的表达。合适的报告基因可以包括编码荧光素酶、β-半乳糖苷酶、氯霉素乙酰转移酶、分泌的碱性磷酸酶的基因或绿色荧光蛋白基因(例如Ui-Tei et al.FEBS Letters 479:79-82(2000))。Reporter genes can be used to identify potentially transfected cells and assess the function of regulatory sequences. Typically, a reporter gene is a gene that is absent or not expressed in the recipient organism or tissue, and the expression of the polypeptide it encodes Reaching is evidenced by some readily detectable property, such as enzymatic activity. Expression of the reporter gene is measured at an appropriate time after introduction of the DNA into the recipient cells. Suitable reporter genes may include genes encoding luciferase, β-galactosidase, chloramphenicol acetyltransferase, secreted alkaline phosphatase, or the green fluorescent protein gene (eg, Ui-Tei et al. FEBS Letters 479:79-82 (2000)).
确认宿主细胞中异源核酸的存在的其他方法包括例如本领域技术人员众所周知的分子生物学测定,诸如Southern和Northern印迹、RT-PCR和PCR;生化测定,例如通过免疫学方法(如ELISA和Western印迹)检测特定肽的存在或不存在。Other methods of confirming the presence of heterologous nucleic acid in host cells include, for example, molecular biological assays, such as Southern and Northern blots, RT-PCR and PCR, well known to those skilled in the art; biochemical assays, such as detection of the presence or absence of specific peptides by immunological methods such as ELISA and Western blotting.
在一些实施方式中,编码所述工程化的CasX效应蛋白和/或所述指导RNA的核酸序列与启动子可操作地连接。在一些实施方式中,所述启动子是相对于使用所述工程化的CRISPR-CasX系统进行工程化改造的细胞的内源启动子。例如,可以使用本领域已知的任何方法将编码所述工程化的CasX效应蛋白的核酸敲入工程化的哺乳动物细胞的基因组中位于内源性启动子下游处。在一些实施方式中,所述内源启动子是丰富蛋白(如β-肌动蛋白)的启动子。在一些实施方式中,所述内源启动子是可诱导的启动子,例如,可通过工程化的哺乳动物细胞的内源激活信号来诱导。在一些实施方式中,其中所述工程化的哺乳动物细胞是T细胞,所述启动子是T细胞活化依赖性启动子(诸如IL-2启动子、NFAT启动子或NFκB启动子)。In some embodiments, the nucleic acid sequence encoding the engineered CasX effector protein and/or the guide RNA is operably linked to a promoter. In some embodiments, the promoter is an endogenous promoter relative to the cell engineered using the engineered CRISPR-CasX system. For example, the nucleic acid encoding the engineered CasX effector protein can be knocked into the genome of an engineered mammalian cell downstream of an endogenous promoter using any method known in the art. In some embodiments, the endogenous promoter is the promoter of an abundant protein such as β-actin. In some embodiments, the endogenous promoter is an inducible promoter, eg, inducible by an endogenous activation signal of the engineered mammalian cell. In some embodiments, wherein the engineered mammalian cell is a T cell, the promoter is a T cell activation dependent promoter (such as the IL-2 promoter, NFAT promoter or NFκB promoter).
在一些实施方式中,所述启动子是相对于使用所述工程化的CRISPR-CasX系统进行工程化改造的细胞的异源启动子。已经探索了多种启动子以在哺乳动物细胞中表达基因,并且本领域中已知的任何启动子都可以用于本申请中。启动子可大致分为组成型启动子或受调节的启动子,例如诱导型启动子。In some embodiments, the promoter is a heterologous promoter relative to the cell engineered using the engineered CRISPR-CasX system. A variety of promoters have been explored for expression of genes in mammalian cells, and any promoter known in the art may be used in this application. Promoters can be broadly classified as constitutive promoters or regulated promoters, such as inducible promoters.
在一些实施方式中,编码所述工程化的CasX效应蛋白和/或所述指导RNA的核酸序列与组成型启动子可操作地连接。组成型启动子允许异源基因(也称为转基因)在宿主细胞中组成型表达。本文考虑的示例性组成型启动子包括但不限于:巨细胞病毒(CMV)启动子、人延伸因子-1α(hEF1α)、泛素C启动子(UbiC)、磷酸甘油激酶启动子(PGK)、猿猴病毒40早期启动子(SV40)以及鸡β-肌动蛋白启动子与CMV早期增强子(CAG)耦联。在一些实施方式中,所述启动子是CAG启动子,其包含巨细胞病毒(CMV)早期增强子元件、 启动子、鸡β-肌动蛋白基因的第一外显子和第一内含子,以及兔β-珠蛋白基因的剪接受体。In some embodiments, the nucleic acid sequence encoding the engineered CasX effector protein and/or the guide RNA is operably linked to a constitutive promoter. Constitutive promoters allow the constitutive expression of heterologous genes (also known as transgenes) in host cells. Exemplary constitutive promoters contemplated herein include, but are not limited to: cytomegalovirus (CMV) promoter, human elongation factor-1α (hEF1α), ubiquitin C promoter (UbiC), phosphoglycerol kinase promoter (PGK), Simian virus 40 early promoter (SV40), and chicken β-actin promoter coupled to CMV early enhancer (CAG). In some embodiments, the promoter is a CAG promoter comprising a cytomegalovirus (CMV) early enhancer element, Promoter, first exon and first intron of chicken β-actin gene, and splice acceptor of rabbit β-globin gene.
在一些实施方式中,编码所述工程化的CRISPR-CasX效应蛋白和/或所述指导RNA的核酸序列可操作地连接至诱导型启动子。诱导型启动子属于受调控的启动子类型。所述诱导型启动子可以通过一种或多种条件如物理条件、微环境或宿主细胞的生理状态、诱导物(即,诱导剂)或其组合来诱导。在一些实施方式中,所述诱导条件选自下组:诱导剂、辐照(如电离辐射、光)、温度(如热)、氧化还原状态、肿瘤环境和待通过工程化的CRISPR-CasX系统进行工程化改造的细胞的活化状态。在一些实施方式中,所述启动子可被小分子诱导剂诸如化合物诱导。在一些实施方式中,所述小分子选自下组:强力霉素、四环素、醇、金属或类固醇。化学诱导的启动子已被最广泛地研究。这样的启动子包括其转录活性受小分子化学物质例如强力霉素、四环素、醇、类固醇、金属和其他化合物的存在或不存在调节的启动子。具有逆四环素受控反式激活因子(rtTA)和四环素响应元件启动子(TRE)的强力霉素诱导系统是目前最成熟的系统。WO9429442描述了通过四环素响应性启动子严格控制真核细胞中基因表达。WO9601313公开了四环素调节的转录调节剂。另外,诸如Tet-on系统之类的Tet技术已描述于例如TetSystems.com网站。在本申请中,任何已知的化学调节的启动子均可用于驱动编码所述工程化的CRISPR-CasX蛋白和/或所述指导RNA的表达。In some embodiments, the nucleic acid sequence encoding the engineered CRISPR-CasX effector protein and/or the guide RNA is operably linked to an inducible promoter. Inducible promoters are a type of regulated promoter. The inducible promoter can be induced by one or more conditions, such as physical conditions, the microenvironment or the physiological state of the host cell, an inducer (ie, an inducer), or a combination thereof. In some embodiments, the inducing conditions are selected from the group consisting of inducing agent, irradiation (e.g., ionizing radiation, light), temperature (e.g., heat), redox state, tumor environment, and activation state of the cells to be engineered by the engineered CRISPR-CasX system. In some embodiments, the promoter is inducible by a small molecule inducing agent such as a chemical compound. In some embodiments, the small molecule is selected from the group consisting of doxycycline, tetracycline, alcohol, metal, or steroid. Chemically inducible promoters have been most extensively studied. Such promoters include promoters whose transcriptional activity is regulated by the presence or absence of small molecule chemicals such as doxycycline, tetracycline, alcohols, steroids, metals and other compounds. The doxycycline-inducible system with retrotetracycline-controlled transactivator (rtTA) and tetracycline-responsive element promoter (TRE) is currently the most mature system. WO9429442 describes the tight control of gene expression in eukaryotic cells by tetracycline-responsive promoters. WO9601313 discloses tetracycline regulated transcriptional regulators. Additionally, Tet technologies such as the Tet-on system have been described at, for example, the TetSystems.com website. In this application, any known chemically regulated promoter can be used to drive the expression encoding the engineered CRISPR-CasX protein and/or the guide RNA.
在一些实施方式中,编码所述工程化的CasX效应蛋白的核酸序列是经过密码子优化的。In some embodiments, the nucleic acid sequence encoding the engineered CasX effector protein is codon optimized.
在一些实施方式中,提供了表达构建体,其包含经密码子优化的序列,编码所述工程化的CasX效应蛋白的该序列连接到BPK2104-ccdB载体上。在一些实施方式中,所述表达构建体编码与所述工程化的CasX效应蛋白的C末端可操作连接的标签(例如10xHis标签)。In some embodiments, an expression construct is provided comprising a codon-optimized sequence encoding the engineered CasX effector protein ligated to a BPK2104-ccdB vector. In some embodiments, the expression construct encodes a tag (eg, a 10xHis tag) operably linked to the C-terminus of the engineered CasX effector protein.
在一些实施方式中,每种工程化的分割型CasX构建体编码荧光蛋白诸如GFP或RFP。所述报告蛋白可以用于评估所述工程化的CasX蛋白的共定位和/或二聚化,例如使用显微镜。可以使用编码自切割肽诸如T2A、P2A、E2A或F2A肽的序列,将编码工程化的CasX效应蛋白的核酸序列与编码另外的组分的核酸序列融合。 In some embodiments, each engineered split CasX construct encodes a fluorescent protein such as GFP or RFP. The reporter protein can be used to assess colocalization and/or dimerization of the engineered CasX protein, for example using microscopy. Nucleic acid sequences encoding engineered CasX effector proteins can be fused to nucleic acid sequences encoding additional components using sequences encoding self-cleaving peptides such as T2A, P2A, E2A or F2A peptides.
在一些实施方式中,提供了用于哺乳动物细胞(例如人细胞)的表达构建体,所述构建体包含编码所述工程化的CasX效应蛋白的核酸序列。在一些实施方式中,所述表达构建体包含经密码子优化的序列,所述序列编码插入pCAG-2A-eGFP载体中的所述工程化的CasX效应蛋白,从而使CasX蛋白可操作地连接至eGFP。在一些实施方式中,提供第二载体用于在哺乳动物细胞(例如人细胞)中表达指导RNA(例如crRNA或前体crRNA阵列)。在一些实施方式中,编码所述指导RNA的序列在pUC19-U6-i2-cr RNA载体主链中表达。In some embodiments, expression constructs for use in mammalian cells (eg, human cells) are provided, said constructs comprising a nucleic acid sequence encoding said engineered CasX effector protein. In some embodiments, the expression construct comprises a codon optimized sequence encoding the engineered CasX effector protein inserted into the pCAG-2A-eGFP vector such that the CasX protein is operably linked to eGFP. In some embodiments, a second vector is provided for expression of a guide RNA (eg, crRNA or pre-crRNA array) in a mammalian cell (eg, a human cell). In some embodiments, the sequence encoding the guide RNA is expressed in the pUC19-U6-i2-cr RNA vector backbone.
本申请在第四方面提供了检测样品中靶核酸的方法The present application provides a method for detecting target nucleic acid in a sample in a fourth aspect
在一个具体实施方式中,提供了一种检测样品中靶核酸的方法,包括:(a)使样品与前述工程化的CRISPR-CasX系统以及加标签的检测核酸接触,该检测核酸为单链且不与所述指导RNA的指导序列杂交;以及(b)测量通过所述工程化的CasX效应蛋白切割所述加标签的检测核酸而产生的可检测信号,从而检测所述靶核酸。In a specific embodiment, a method for detecting a target nucleic acid in a sample is provided, comprising: (a) contacting the sample with the aforementioned engineered CRISPR-CasX system and a tagged detection nucleic acid, the detection nucleic acid is single-stranded and does not hybridize with the guide sequence of the guide RNA; and (b) measuring a detectable signal generated by cutting the tagged detection nucleic acid with the engineered CasX effector protein, thereby detecting the target nucleic acid.
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
本申请还提供了使用具有改善的活性的所述工程化的CasX效应蛋白或CRISPR-CasX系统中任一种来检测靶核酸的方法。使用CasX效应蛋白作为检测试剂可利用以下发现:一旦通过检测到靶DNA而被激活,V型CRISPR/Cas蛋白(例如,CasX)可混杂地切割非靶向单链DNA(ssDNA或RNA,即指导RNA的指导序列不与之杂交的单链核酸)。因此,当样品中存在靶DNA(双链或单链)时(例如,在某些情况下超过阈值量),结果是样品中单链核酸的切割,这可使用任何方便的检测方法进行检测(例如,使用加标签的单链检测核酸如DNA或RNA)。CasX可以切割ssDNA和ssRNA。例如使用Cas蛋白作为检测试剂的方法描述于US10253365和WO2020/056924,其通过整体引用而并入本文。The present application also provides methods for detecting target nucleic acids using any of the engineered CasX effector proteins or CRISPR-CasX systems with improved activity. The use of CasX effector proteins as detection reagents can take advantage of the discovery that, once activated by detection of target DNA, type V CRISPR/Cas proteins (e.g., CasX) can promiscuously cleave non-targeted single-stranded DNA (ssDNA or RNA, i.e., single-stranded nucleic acid to which the guide sequence of a guide RNA does not hybridize). Thus, when target DNA (double-stranded or single-stranded) is present in the sample (e.g., in some cases above a threshold amount), the result is cleavage of the single-stranded nucleic acid in the sample, which can be detected using any convenient detection method (e.g., using tagged single-stranded detection nucleic acids such as DNA or RNA). CasX can cleave ssDNA and ssRNA. For example methods using Cas proteins as detection reagents are described in US10253365 and WO2020/056924, which are hereby incorporated by reference in their entirety.
在一些实施方式中,提供了检测样品中的靶DNA(例如,双链或单链)的方法,其包括:(a)使所述样品与以下项接触:(i)本文所述工程化的CasX效应蛋白中任一种;(ii)指导RNA,其包含与所述靶DNA杂交的指导序列;和(iii)检测核酸,其为单链的(即“单链检测核酸”)并且不与所述指导RNA的指 导序列杂交;以及(b)测量通过所述工程化的CasX效应蛋白切割所述单链检测核酸而产生的可检测信号。在某些情况下,所述单链检测核酸包括发射荧光的染料对(例如,发射荧光的染料对是荧光共振能量转移(FRET)对、猝灭剂/荧光对)。在某些情况下,所述靶DNA是病毒DNA(例如,乳头状病毒、肝DNA病毒、疱疹病毒、腺病毒、痘病毒、细小病毒等)。在一些实施方式中,该单链检测核酸是DNA。在一些实施方式中,所述单链检测核酸是RNA。In some embodiments, there is provided a method of detecting target DNA (e.g., double-stranded or single-stranded) in a sample comprising: (a) contacting the sample with: (i) any of the engineered CasX effector proteins described herein; (ii) a guide RNA comprising a guide sequence that hybridizes to the target DNA; and (b) measuring a detectable signal produced by cleavage of the single-stranded detection nucleic acid by the engineered CasX effector protein. In some cases, the single-stranded detection nucleic acid includes a fluorescence emitting dye pair (eg, the fluorescence emitting dye pair is a fluorescence resonance energy transfer (FRET) pair, a quencher/fluorescent pair). In certain instances, the target DNA is viral DNA (eg, papillomavirus, hepadnavirus, herpesvirus, adenovirus, poxvirus, parvovirus, etc.). In some embodiments, the single-stranded detection nucleic acid is DNA. In some embodiments, the single-stranded detection nucleic acid is RNA.
本公开的用于检测样品中的靶DNA(单链或双链)的方法,可以以高灵敏度检测靶DNA。在一些情况下,本公开的方法可以用于检测存在于包含多个DNA(包括所述靶DNA和多个非靶DNA)的样品中的靶DNA,其中所述靶DNA以每107个非靶DNA中的一个或更多个拷贝存在(例如,每106个非靶DNA中的一个或更多个拷贝,每105个非靶DNA中的一个或更多个拷贝,每104个非靶DNA中的一个或更多个拷贝,每103个非靶DNA中的一个或更多个拷贝,每102个非靶DNA中的一个或更多个拷贝,每50个非靶DNA中的一个或更多个拷贝,每20个非靶DNA中的一个或更多个拷贝,每10个非靶DNA中的一个或更多个拷贝或每5个非靶DNA中的一个或更多个拷贝)。在一些实施方式中,与所述参比CasX核酸酶相比,本文所述工程化的CasX效应蛋白可以以更高的灵敏度检测靶DNA。在一些实施方式中,与所述参比CasX核酸酶相比,所述工程化的CasX效应蛋白可以以10%、15%、20%、25%、30%、40%、50%、60%、70%、80%、90%或更高的灵敏度检测靶DNA。The method for detecting target DNA (single-stranded or double-stranded) in a sample of the present disclosure can detect target DNA with high sensitivity.在一些情况下,本公开的方法可以用于检测存在于包含多个DNA(包括所述靶DNA和多个非靶DNA)的样品中的靶DNA,其中所述靶DNA以每10 7个非靶DNA中的一个或更多个拷贝存在(例如,每10 6个非靶DNA中的一个或更多个拷贝,每10 5个非靶DNA中的一个或更多个拷贝,每10 4个非靶DNA中的一个或更多个拷贝,每10 3个非靶DNA中的一个或更多个拷贝,每10 2个非靶DNA中的一个或更多个拷贝,每50个非靶DNA中的一个或更多个拷贝,每20个非靶DNA中的一个或更多个拷贝,每10个非靶DNA中的一个或更多个拷贝或每5个非靶DNA中的一个或更多个拷贝)。 In some embodiments, the engineered CasX effector protein described herein can detect target DNA with higher sensitivity than the reference CasX nuclease. In some embodiments, the engineered CasX effector protein can detect target DNA with a sensitivity of 10%, 15%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or higher compared to the reference CasX nuclease.
本申请在第五方面涉及一种修饰包含靶序列的靶核酸的方法In a fifth aspect, the present application relates to a method of modifying a target nucleic acid comprising a target sequence
在一个具体实施方式中,提供了一种修饰包含靶序列的靶核酸的方法,包括使所述靶核酸与前述的工程化的CRISPR-CasX系统接触。In a specific embodiment, there is provided a method of modifying a target nucleic acid comprising a target sequence, comprising contacting the target nucleic acid with the aforementioned engineered CRISPR-CasX system.
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
在一些实施方式中,本申请提供了修饰包含靶序列的靶核酸的方法,所述方法包括使所述靶核酸与本文所述工程化的CRISPR-CasX系统中任一种接触。在一些实施方式中,所述方法在体外进行。在一些实施方式中,所述靶核酸存在于细胞中。在一些实施方式中,所述细胞是细菌细胞、酵母细胞、 哺乳动物细胞、植物细胞或动物细胞。在一些实施方式中,所述方法离体进行。在一些实施方式中,所述方法在体内进行。In some embodiments, the present application provides methods of modifying a target nucleic acid comprising a target sequence comprising contacting the target nucleic acid with any of the engineered CRISPR-CasX systems described herein. In some embodiments, the method is performed in vitro. In some embodiments, the target nucleic acid is present in a cell. In some embodiments, the cells are bacterial cells, yeast cells, Mammalian cells, plant cells or animal cells. In some embodiments, the method is performed ex vivo. In some embodiments, the method is performed in vivo.
在一些实施方式中,通过所述工程化的CRISPR-CasX系统切割所述靶核酸或改变所述靶核酸中的靶序列。在一些实施方式中,通过所述工程化的CRISPR-CasX系统改变所述靶核酸的表达。在一些实施方式中,所述靶核酸是基因组DNA。在一些实施方式中,所述靶序列与疾病或病况相关。在一些实施方式中,所述工程化的CRISPR-CasX系统包含编码多个crRNA的前体指导RNA阵列,其中每个crRNA包含不同的指导序列。In some embodiments, the target nucleic acid is cleaved or a target sequence in the target nucleic acid is altered by the engineered CRISPR-CasX system. In some embodiments, expression of the target nucleic acid is altered by the engineered CRISPR-CasX system. In some embodiments, the target nucleic acid is genomic DNA. In some embodiments, the target sequence is associated with a disease or condition. In some embodiments, the engineered CRISPR-CasX system comprises an array of precursor guide RNAs encoding multiple crRNAs, wherein each crRNA comprises a different guide sequence.
在一些实施方式中,本申请提供了处理与个体细胞中的靶核酸相关的疾病或病况的方法,其包括使用本文所述方法中任一种来修饰所述个体细胞中的所述靶核酸,从而处理所述疾病或病况。在一些实施方式中,所述疾病或病况选自下组:癌症、心血管疾病、遗传性疾病、自身免疫疾病、代谢性疾病、神经退行性疾病、眼病、细菌感染和病毒感染。In some embodiments, the present application provides methods of treating a disease or condition associated with a target nucleic acid in a cell of an individual comprising using any of the methods described herein to modify the target nucleic acid in the cell of the individual, thereby treating the disease or condition. In some embodiments, the disease or condition is selected from the group consisting of cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection.
本文所述工程化的CRISPR-CasX系统可以用多种方式修饰细胞中的靶核酸,这取决于所述CRISPR-CasX系统中工程化的CasX效应蛋白的类型。在一些实施方式中,所述方法诱导靶核酸中的位点特异性切割。在一些实施方式中,所述方法在细胞如细菌细胞、植物细胞或动物细胞(例如哺乳动物细胞)中切割基因组DNA。在一些实施方式中,所述方法通过切割细胞中的基因组DNA来杀死细胞。在一些实施方式中所述方法在细胞中切割病毒核酸。The engineered CRISPR-CasX systems described herein can modify target nucleic acids in cells in a variety of ways, depending on the type of CasX effector protein engineered in the CRISPR-CasX system. In some embodiments, the methods induce site-specific cleavage in the target nucleic acid. In some embodiments, the methods cleave genomic DNA in cells such as bacterial cells, plant cells, or animal cells (eg, mammalian cells). In some embodiments, the method kills the cell by cleavage of genomic DNA in the cell. In some embodiments the methods cleave viral nucleic acid in the cell.
在一些实施方式中,所述方法改变(如增加或减少)细胞中所述靶核酸的表达水平。在一些实施方式中,所述方法例如基于融合至反式激活结构域的无酶活性的CasX蛋白,使用工程化的CasX效应蛋白来提高细胞中所述靶核酸的表达水平。在一些实施方式中,所述方法例如基于融合至转录阻遏结构域的无酶活性的CasX蛋白,使用工程化的CasX效应蛋白来降低细胞中所述靶核酸的表达水平。在一些实施方式中,所述方法例如基于融合至表观遗传修饰结构域的无酶活性的CasX蛋白,使用工程化的CasX效应蛋白将表观遗传修饰引入细胞中的所述靶核酸。本文所述工程化的CasX系统可用于将其他修饰引入所述靶核酸,这取决于所述工程化的CasX效应蛋白所包含的功能结构域。 In some embodiments, the methods alter (eg, increase or decrease) the expression level of the target nucleic acid in a cell. In some embodiments, the method uses an engineered CasX effector protein to increase the expression level of the target nucleic acid in a cell, eg, based on an enzymatically inactive CasX protein fused to a transactivation domain. In some embodiments, the method uses an engineered CasX effector protein to reduce the expression level of the target nucleic acid in a cell, eg, based on an enzymatically inactive CasX protein fused to a transcriptional repression domain. In some embodiments, the method uses an engineered CasX effector protein to introduce an epigenetic modification into the target nucleic acid in a cell, eg, based on an enzymatically inactive CasX protein fused to an epigenetic modification domain. The engineered CasX system described herein can be used to introduce other modifications into the target nucleic acid, depending on the functional domains comprised by the engineered CasX effector protein.
在一些实施方式中,所述方法改变细胞中所述靶核酸中的靶序列。在一些实施方式中,所述方法将突变引入细胞中的所述靶核酸。在一些实施方式中,所述方法使用一种或多种内源性DNA修复途径,如非同源末端连接(NHEJ)或同源定向重组(HDR),在细胞中修复靶DNA中诱导的双链断裂,作为由CRISPR复合物进行序列特异性切割的结果。示例性的突变包括但不限于:插入、缺失、替换和移码。在一些实施方式中,所述方法在靶位点处插入供体DNA。在一些实施方式中,供体DNA的插入导致将选择性标记物或报告蛋白引入所述细胞。在一些实施方式中,供体DNA的插入导致基因的敲入。在一些实施方式中,供体DNA的插入导致敲除突变。在一些实施方式中,供体DNA的插入导致替换突变诸如单核苷酸替换。在一些实施方式中,所述方法诱导细胞的表型变化。In some embodiments, the method alters a target sequence in the target nucleic acid in a cell. In some embodiments, the method introduces a mutation into the target nucleic acid in a cell. In some embodiments, the methods use one or more endogenous DNA repair pathways, such as non-homologous end joining (NHEJ) or homology-directed recombination (HDR), to repair double-strand breaks induced in the target DNA in the cell as a result of sequence-specific cleavage by the CRISPR complex. Exemplary mutations include, but are not limited to, insertions, deletions, substitutions, and frameshifts. In some embodiments, the method inserts donor DNA at the target site. In some embodiments, insertion of donor DNA results in the introduction of a selectable marker or reporter protein into the cell. In some embodiments, insertion of donor DNA results in knock-in of the gene. In some embodiments, insertion of donor DNA results in a knockout mutation. In some embodiments, insertion of donor DNA results in substitutional mutations such as single nucleotide substitutions. In some embodiments, the method induces a phenotypic change in the cell.
在一些实施方式中,所述工程化的CRISPR-CasX系统用于基因电路(genetic circuit)的一部分,或用于将基因电路插入细胞的基因组DNA中。本文所述的诱导剂控制的工程化的分割型CasX效应蛋白尤其可用作基因电路的组分。基因电路可用于基因疗法。设计和使用基因电路的方法和技术是本领域已知的。可以进一步参考例如Brophy,Jennifer AN,and Christopher A.Voigt."Principles of genetic circuit design."Nature methods 11.5(2014):508。In some embodiments, the engineered CRISPR-CasX system is used as part of a genetic circuit, or to insert a genetic circuit into the genomic DNA of a cell. The inducer-controlled engineered segmented CasX effector proteins described herein are particularly useful as components of genetic circuits. Genetic circuits can be used in gene therapy. Methods and techniques for designing and using genetic circuits are known in the art. Further reference can be made to eg Brophy, Jennifer AN, and Christopher A. Voigt."Principles of genetic circuit design." Nature methods 11.5(2014):508.
本文所述工程化的CRISPR-CasX系统可用于修饰多种靶核酸。在一些实施方式中,所述靶核酸在细胞中。在一些实施方式中,所述靶核酸是基因组DNA。在一些实施方式中,所述靶核酸是染色体外DNA。在一些实施方式中,所述靶核酸对于细胞是外源的。在一些实施方式中,所述靶核酸是病毒核酸诸如病毒DNA。在一些实施方式中,所述靶核酸是细胞中的质粒。在一些实施方式中,所述靶核酸是经水平转移(horizontally transferred)的质粒。在一些实施方式中,所述靶核酸是RNA。The engineered CRISPR-CasX systems described herein can be used to modify a variety of target nucleic acids. In some embodiments, the target nucleic acid is in a cell. In some embodiments, the target nucleic acid is genomic DNA. In some embodiments, the target nucleic acid is extrachromosomal DNA. In some embodiments, the target nucleic acid is foreign to the cell. In some embodiments, the target nucleic acid is viral nucleic acid such as viral DNA. In some embodiments, the target nucleic acid is a plasmid in a cell. In some embodiments, the target nucleic acid is a horizontally transferred plasmid. In some embodiments, the target nucleic acid is RNA.
在一些实施方式中,所述靶核酸是经分离的核酸诸如经分离的DNA。在一些实施方式中,所述靶核酸存在于无细胞的环境中。在一些实施方式中,所述靶核酸是经分离的载体诸如质粒。在一些实施方式中,所述靶核酸是经分离的线性DNA片段。In some embodiments, the target nucleic acid is an isolated nucleic acid such as isolated DNA. In some embodiments, the target nucleic acid is present in a cell-free environment. In some embodiments, the target nucleic acid is an isolated vector such as a plasmid. In some embodiments, the target nucleic acid is an isolated linear DNA fragment.
本文描述的方法适用于任何合适的细胞类型。在一些实施方式中,所述细胞是细菌、酵母细胞、真菌细胞、藻类细胞、植物细胞或动物细胞(例如,哺乳动物细胞,如人细胞)。在一些实施方式中,所述细胞是自然来源的诸 如由组织活检分离出的细胞。在一些实施方式中,所述细胞是从体外培养的细胞系分离的细胞。在一些实施方式中,所述细胞来自原代细胞系。在一些实施方式中,所述细胞来自永生化细胞系。在一些实施方式中,所述细胞是基因工程化的细胞。The methods described herein are applicable to any suitable cell type. In some embodiments, the cells are bacteria, yeast cells, fungal cells, algal cells, plant cells, or animal cells (eg, mammalian cells, such as human cells). In some embodiments, the cells are naturally derived from Such as cells isolated from a tissue biopsy. In some embodiments, the cells are cells isolated from cell lines cultured in vitro. In some embodiments, the cells are from a primary cell line. In some embodiments, the cells are from an immortalized cell line. In some embodiments, the cells are genetically engineered cells.
在一些实施方式中,所述细胞是选自下组的生物的动物细胞:牛、绵羊、山羊、马、猪、鹿、鸡、鸭、鹅、兔和鱼。In some embodiments, the cell is an animal cell of an organism selected from the group consisting of cattle, sheep, goats, horses, pigs, deer, chickens, ducks, geese, rabbits, and fish.
在一些实施方式中,所述细胞是选自下组的生物的植物细胞:玉米、小麦、大麦、燕麦、水稻、大豆、油棕、红花、芝麻、烟草、亚麻、棉花、向日葵、珍珠小米、粟米、高粱、油菜、大麻、蔬菜作物、饲料作物、工业作物、木本作物和生物质作物。In some embodiments, the cell is a plant cell of an organism selected from the group consisting of corn, wheat, barley, oat, rice, soybean, oil palm, safflower, sesame, tobacco, flax, cotton, sunflower, pearl millet, millet, sorghum, canola, hemp, vegetable crops, forage crops, industrial crops, tree crops, and biomass crops.
在一些实施方式中,所述细胞是哺乳动物细胞。在一些实施方式中,所述细胞是人细胞。在一些实施方式中,所述人细胞是人胚胎肾293T(HEK293T或293T)细胞或HeLa细胞。在一些实施方式中,所述细胞是人胚肾(HEK293T)细胞。在一些实施方式中,所述细胞是小鼠Hepa1-6细胞。在一些实施方式中,所述哺乳动物细胞选自下组:免疫细胞、肝细胞、肿瘤细胞、干细胞、合子、肌肉细胞和皮肤细胞。In some embodiments, the cells are mammalian cells. In some embodiments, the cells are human cells. In some embodiments, the human cells are human embryonic kidney 293T (HEK293T or 293T) cells or HeLa cells. In some embodiments, the cells are human embryonic kidney (HEK293T) cells. In some embodiments, the cells are mouse Hepal-6 cells. In some embodiments, the mammalian cells are selected from the group consisting of immune cells, liver cells, tumor cells, stem cells, zygotes, muscle cells, and skin cells.
在一些实施方式中,所述细胞是选自下组的免疫细胞:细胞毒性T细胞、辅助T细胞、天然杀伤(NK)T细胞、iNK-T细胞、NK-T样细胞、γδT细胞、肿瘤浸润性T细胞和树突状细胞(DC)激活的T细胞。在一些实施方式中,所述方法产生经修饰的免疫细胞,诸如CAR-T细胞或TCR-T细胞。In some embodiments, the cell is an immune cell selected from the group consisting of cytotoxic T cells, helper T cells, natural killer (NK) T cells, iNK-T cells, NK-T-like cells, γδ T cells, tumor infiltrating T cells, and dendritic cell (DC) activated T cells. In some embodiments, the methods generate modified immune cells, such as CAR-T cells or TCR-T cells.
在一些实施方式中,所述细胞是胚胎干(ES)细胞、诱导性多能干(iPS)细胞、配子的祖细胞、配子、合子或胚胎中的细胞。In some embodiments, the cells are embryonic stem (ES) cells, induced pluripotent stem (iPS) cells, progenitor cells of gametes, cells in gametes, zygotes, or embryos.
本文所述的方法可以用于在体内、离体或体外修饰靶细胞,并且可以用改变所述细胞的方式进行,以使得一旦被修饰,所述经修饰的细胞的后代或细胞系就保留经改变的表型。所述经修饰的细胞和后代可以是多细胞生物的一部分,诸如具有离体或体内应用(如基因组编辑和基因疗法)的植物或动物。The methods described herein can be used to modify target cells in vivo, ex vivo, or in vitro, and can be done in such a way that the cells are altered such that, once modified, progeny or cell lines of the modified cells retain the altered phenotype. The modified cells and progeny may be part of a multicellular organism such as a plant or animal with ex vivo or in vivo applications such as genome editing and gene therapy.
在一些实施方式中,所述方法离体进行。在一些实施方式中,将所述工程化的CRISPR-CasX系统引入细胞后,所述经修饰的细胞(例如,哺乳动物细胞)离体繁殖。在一些实施方式中,将所述经修饰的细胞培养以繁殖至少约1天、2天、3天、4天、5天、6天、7天、10天、12天或14天中任一个。在一些实施方式中,将所述经修饰的细胞培养不超过约1天、2天、3 天、4天、5天、6天、7天、10天、12天或14天中任一个。在一些实施方式中,进一步评价或筛选所述经修饰的细胞以选择具有一种或多种所需表型或特性的细胞。In some embodiments, the method is performed ex vivo. In some embodiments, after introducing the engineered CRISPR-CasX system into the cells, the modified cells (eg, mammalian cells) are propagated ex vivo. In some embodiments, the modified cells are cultured to propagate for at least about any of 1 day, 2 days, 3 days, 4 days, 5 days, 6 days, 7 days, 10 days, 12 days, or 14 days. In some embodiments, the modified cells are cultured for no more than about 1 day, 2 days, 3 days Any of days, 4 days, 5 days, 6 days, 7 days, 10 days, 12 days or 14 days. In some embodiments, the modified cells are further evaluated or screened to select cells with one or more desired phenotypes or properties.
在一些实施方式中,所述靶序列是与疾病或病况相关的序列。示例性的疾病或病况包括但不限于:癌症、心血管疾病、遗传性疾病、自身免疫性疾病、代谢性疾病、神经退行性疾病、眼病、细菌感染和病毒感染。在一些实施方式中,所述疾病或病况是遗传疾病。在一些实施方式中,所述疾病或病况是单基因疾病或病况。在一些实施方式中,所述疾病或病况是多基因疾病或病况。In some embodiments, the target sequence is a sequence associated with a disease or condition. Exemplary diseases or conditions include, but are not limited to, cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection. In some embodiments, the disease or condition is a genetic disease. In some embodiments, the disease or condition is a monogenic disease or condition. In some embodiments, the disease or condition is a polygenic disease or condition.
在一些实施方式中,与野生型序列相比,所述靶序列具有突变。在一些实施方式中,所述靶序列具有与疾病或病况相关的单核苷酸多态性(SNP)。In some embodiments, the target sequence has a mutation compared to the wild-type sequence. In some embodiments, the target sequence has a single nucleotide polymorphism (SNP) associated with a disease or condition.
在一些实施方式中,插入所述靶核酸中的供体DNA编码选自下组的生物产物:报告蛋白、抗原特异性受体、治疗性蛋白、抗生素抗性蛋白、RNAi分子、细胞因子、激酶、抗原、抗原特异性受体、细胞因子受体和自杀多肽。在一些实施方式中,所述供体DNA编码治疗性蛋白。在一些实施方式中,所述供体DNA编码可用于基因疗法的治疗性蛋白。在一些实施方式中,所述供体DNA编码治疗性抗体。在一些实施方式中,所述供体DNA编码工程化的受体,诸如嵌合抗原受体(CAR)或工程化的TCR。在一些实施方式中,所述供体DNA编码治疗性RNA,诸如小RNA(例如,siRNA、shRNA或miRNA)或长的非编码RNA(lincRNA)。In some embodiments, the donor DNA inserted into the target nucleic acid encodes a biological product selected from the group consisting of reporter proteins, antigen-specific receptors, therapeutic proteins, antibiotic resistance proteins, RNAi molecules, cytokines, kinases, antigens, antigen-specific receptors, cytokine receptors, and suicide polypeptides. In some embodiments, the donor DNA encodes a therapeutic protein. In some embodiments, the donor DNA encodes a therapeutic protein that can be used in gene therapy. In some embodiments, the donor DNA encodes a therapeutic antibody. In some embodiments, the donor DNA encodes an engineered receptor, such as a chimeric antigen receptor (CAR) or an engineered TCR. In some embodiments, the donor DNA encodes a therapeutic RNA, such as a small RNA (eg, siRNA, shRNA, or miRNA) or a long non-coding RNA (lincRNA).
本文所述的方法可以用于在两个或更多个(例如2、3、4、5、6、8、10个或更多)不同的靶位点处进行多重基因编辑或调节。在一些实施方式中,所述方法检测或修饰多个靶核酸或靶核酸序列。在一些实施方式中,所述方法包括使靶核酸与包含多个(例如2、3、4、5、6、8、10个或更多)crRNA序列的指导RNA接触,其中每个crRNA均包含不同的靶序列。The methods described herein can be used for multiplex gene editing or modulation at two or more (eg, 2, 3, 4, 5, 6, 8, 10 or more) different target sites. In some embodiments, the methods detect or modify multiple target nucleic acids or target nucleic acid sequences. In some embodiments, the method comprises contacting a target nucleic acid with a guide RNA comprising a plurality (eg, 2, 3, 4, 5, 6, 8, 10 or more) of crRNA sequences, wherein each crRNA comprises a different target sequence.
还提供了包含经修饰的靶核酸的工程化的细胞,所述细胞使用本文所述方法中任一种来产生。所述工程化的细胞可以用于细胞疗法。可以使用本文所述用于细胞疗法的方法将自体或同种异体细胞用于制备工程化的细胞。Also provided are engineered cells comprising a modified target nucleic acid produced using any of the methods described herein. The engineered cells can be used in cell therapy. Autologous or allogeneic cells can be used to prepare engineered cells using the methods described herein for cell therapy.
本文所述的方法还可用于生成细胞(例如哺乳动物细胞)的等基因系以研究遗传变体。 The methods described herein can also be used to generate isogenic lines of cells (eg, mammalian cells) to study genetic variants.
还提供了包含本文所述工程化的细胞的工程化非人类动物。在一些实施方式中,所述工程化的非人类动物是经过基因组编辑的非人类动物。所述工程化的非人类动物可以用作疾病模型。Also provided are engineered non-human animals comprising engineered cells described herein. In some embodiments, the engineered non-human animal is a genome-edited non-human animal. The engineered non-human animals can be used as disease models.
产生非人类基因组编辑或转基因动物的技术是本领域众所周知的,包括但不限于:原核显微注射、病毒感染、对胚胎干细胞和诱导性多能干(iPS)细胞的转化。可以使用的详细方法包括但不限于Sundberg和Ichiki描述的方法(2006,Genetically Engineered Mice Handbook,CRC Press)和Gibson描述的方法(2004,A Primer Of Genome Science 2nd ed.Sunderland,Mass.:Sinauer)。Techniques for generating non-human genome edited or transgenic animals are well known in the art and include, but are not limited to: pronuclear microinjection, viral infection, transformation of embryonic stem cells and induced pluripotent stem (iPS) cells. Detailed methods that can be used include, but are not limited to, those described by Sundberg and Ichiki (2006, Genetically Engineered Mice Handbook, CRC Press) and those described by Gibson (2004, A Primer Of Genome Science 2nd ed. Sunderland, Mass.: Sinauer).
所述工程化的动物可以是任何合适的物种,包括但不限于:牛、马、羊、犬、鹿、猫科动物,山羊、猪、灵长类,以及了解较少的哺乳动物诸如大象、鹿、斑马或骆驼。The engineered animal can be of any suitable species including, but not limited to: cattle, horses, sheep, dogs, deer, felines, goats, pigs, primates, and lesser known mammals such as elephants, deer, zebras, or camels.
本申请在第六方面涉及CRISPR-CasX系统在制备治疗与个体的细胞中靶核In a sixth aspect, the present application relates to the use of CRISPR-CasX system to target nuclear cells in cells for the preparation of treatments and individuals. 酸相关的疾病或病症的药物中的用途Use in medicine for an acid-related disease or condition
在一些实施方式中,提供了前述工程化的CRISPR-CasX系统在制备治疗与个体的细胞中靶核酸相关的疾病或病症的药物中的用途;优选地,所述疾病或病症选自下组:癌症、心血管疾病、遗传性疾病、自身免疫疾病、代谢性疾病、神经退行性疾病、眼病、细菌感染和病毒感染。In some embodiments, a use of the aforementioned engineered CRISPR-CasX system in the preparation of a medicament for treating a disease or disorder associated with a target nucleic acid in a cell of an individual is provided; preferably, the disease or disorder is selected from the group consisting of cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection.
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
进一步提供了使用根据本文所述的修饰细胞中的靶核酸的方法中的任一种的治疗方法。在一些实施方式中,本申请提供了治疗与个体细胞中的靶核酸相关的疾病或病况的方法,其包括使所述靶核酸与本文所述工程化的CRISPR-CasX系统中任一种接触,其中所述指导RNA的指导序列与所述靶核酸的靶序列互补,其中所述工程化的CasX效应蛋白和所述指导RNA彼此缔合以结合至所述靶核酸以修饰所述靶核酸,从而使所述疾病或病况得到治疗。在一些实施方式中,将突变(例如,敲除或敲入突变)引入所述靶核酸。在一些实施方式中,所述靶核酸的表达被增强。在一些实施方式中,所述靶核酸的表达被抑制。在一些实施方式中,本申请提供了治疗个体的疾病或病况的方法,其包括向所述个体施用有效量的本文所述工程化的CRISPR-CasX系统中任一种以及编码治疗剂的供体DNA,其中所述指导RNA的指导序列 与所述个体的靶核酸的靶序列互补,其中所述工程化的CasX效应蛋白和所述指导RNA彼此结合以结合至所述靶核酸并将供体DNA插入所述靶序列中,从而使所述疾病或病况得到治疗。Further provided is a method of treatment using any of the methods of modifying a target nucleic acid in a cell according to the herein described. In some embodiments, the present application provides a method of treating a disease or condition associated with a target nucleic acid in a cell of an individual comprising contacting the target nucleic acid with any of the engineered CRISPR-CasX systems described herein, wherein the guide sequence of the guide RNA is complementary to the target sequence of the target nucleic acid, wherein the engineered CasX effector protein and the guide RNA associate with each other to bind to the target nucleic acid to modify the target nucleic acid such that the disease or condition is treated. In some embodiments, mutations (eg, knockout or knockin mutations) are introduced into the target nucleic acid. In some embodiments, expression of the target nucleic acid is enhanced. In some embodiments, expression of the target nucleic acid is inhibited. In some embodiments, the present application provides methods of treating a disease or condition in an individual comprising administering to the individual an effective amount of any of the engineered CRISPR-CasX systems described herein and donor DNA encoding a therapeutic agent, wherein the guide sequence of the guide RNA is Complementary to a target sequence of a target nucleic acid of the individual, wherein the engineered CasX effector protein and the guide RNA bind to each other to bind to the target nucleic acid and insert donor DNA into the target sequence, thereby allowing the disease or condition to be treated.
在一些实施方式中,本申请提供了治疗个体的疾病或病况的方法,其包括向所述个体施用有效量的包含经修饰的靶核酸的工程化的细胞,其中所述工程化的细胞是通过使所述细胞与本文所述工程化的CRISPR-CasX系统中任一种接触而制备的,其中所述指导RNA的指导序列与所述靶核酸的靶序列互补,其中所述工程化的CasX效应蛋白和所述指导RNA相互缔合以结合至靶核酸以修饰所述靶核酸。在一些实施方式中,所述工程化的细胞是免疫细胞。在一些实施方式中,所述个体是人。在一些实施方式中,所述个体是动物,例如模型动物诸如啮齿动物、宠物或农场动物。在一些实施方式中,所述个体是哺乳动物。In some embodiments, the present application provides a method of treating a disease or condition in an individual comprising administering to the individual an effective amount of an engineered cell comprising a modified target nucleic acid, wherein the engineered cell is prepared by contacting the cell with any of the engineered CRISPR-CasX systems described herein, wherein the guide sequence of the guide RNA is complementary to the target sequence of the target nucleic acid, wherein the engineered CasX effector protein and the guide RNA associate with each other to bind to the target nucleic acid to modify the target nucleic acid. In some embodiments, the engineered cells are immune cells. In some embodiments, the individual is a human. In some embodiments, the individual is an animal, eg, a model animal such as a rodent, pet, or farm animal. In some embodiments, the individual is a mammal.
在一些实施方式中,所述疾病或病况选自下组:癌症、心血管疾病、遗传性疾病、自身免疫性疾病、代谢性疾病、神经退行性疾病、眼病、细菌感染和病毒感染。在一些实施方式中,所述靶核酸是PCSK9。在一些实施方式中,所述疾病或病况是心血管疾病。在一些实施方式中,所述疾病或病况是冠状动脉疾病。在一些实施方式中,所述方法降低个体的胆固醇水平。在一些实施方式中,所述方法治疗个体的糖尿病。In some embodiments, the disease or condition is selected from the group consisting of cancer, cardiovascular disease, genetic disease, autoimmune disease, metabolic disease, neurodegenerative disease, eye disease, bacterial infection, and viral infection. In some embodiments, the target nucleic acid is PCSK9. In some embodiments, the disease or condition is cardiovascular disease. In some embodiments, the disease or condition is coronary artery disease. In some embodiments, the method reduces cholesterol levels in the individual. In some embodiments, the method treats diabetes in the individual.
本申请在第七方面涉及一种递送方法The present application relates to a delivery method in the seventh aspect
在一些实施方式中,本文所述工程化的CRISPR-CasX系统或其组分、其核酸分子、或编码或提供其组分的核酸分子可通过多种递送系统诸如质粒或病毒递送至宿主细胞(例如,上面“构建体和载体”小节中描述的载体中的任一种)。在一些实施方式或方法中,所述工程化的CRISPR-CasX系统可以通过其他方法递送,如由所述工程化的CasX效应蛋白及其一个或多个同源RNA指导序列组成的核糖核蛋白复合物的核转染或电穿孔。In some embodiments, the engineered CRISPR-CasX system described herein, or components thereof, nucleic acid molecules thereof, or nucleic acid molecules encoding or providing components thereof, can be delivered to a host cell by a variety of delivery systems such as plasmids or viruses (e.g., any of the vectors described in the "Constructs and Vectors" section above). In some embodiments or methods, the engineered CRISPR-CasX system can be delivered by other methods, such as nucleofection or electroporation of a ribonucleoprotein complex consisting of the engineered CasX effector protein and its one or more cognate RNA guide sequences.
在一些实施方式中,通过纳米颗粒或外泌体来递送。In some embodiments, delivery is via nanoparticles or exosomes.
在一些实施方式中,成对的CasX切口酶复合物可以使用纳米颗粒或其他直接蛋白递送方法直接递送,使得包含两个成对的crRNA元件的复合物被共同递送。此外,蛋白可以通过病毒载体或直接地递送至细胞,然后直接递送包含针对双切口的两个成对间隔区的CRISPR阵列。在某些情况下,对 于直接RNA递送,可以将RNA缀合于至少一个糖部分诸如N-乙酰基半乳糖胺(GalNAc)(特别是三触角GalNAc)。In some embodiments, paired CasX nickase complexes can be delivered directly using nanoparticles or other direct protein delivery methods such that complexes comprising two paired crRNA elements are co-delivered. In addition, proteins can be delivered to cells via viral vectors or directly followed by CRISPR arrays containing two paired spacers for double nicks. In some cases, the For direct RNA delivery, the RNA can be conjugated to at least one sugar moiety such as N-acetylgalactosamine (GalNAc) (particularly triantennary GalNAc).
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
本申请在第八方面涉及试剂盒和制品The application relates to kits and articles of manufacture in an eighth aspect
在一个具体实施方式中提供了一种试剂盒,其包含:一个或多个AAV载体,其编码前述工程化的CasX核酸酶、前述的工程化CasX效应蛋白和前述的工程化CRISPR-CasX系统中任一种。A kit is provided in a specific embodiment, comprising: one or more AAV vectors encoding any of the aforementioned engineered CasX nuclease, the aforementioned engineered CasX effector protein, and the aforementioned engineered CRISPR-CasX system.
在一个具体的实施方式中,所述CasX核酸酶选自下组的任一项:PlmCasX、DpbCasX。In a specific embodiment, the CasX nuclease is selected from any one of the following groups: PlmCasX, DpbCasX.
“载体”是包含分离的核酸并且可以用于将所述分离的核酸递送至细胞内部的物质组合物。许多载体是本领域已知的,包括但不限于:线性多核苷酸、与离子或两亲性化合物缔合的多核苷酸、质粒和病毒。通常,合适的载体包含在至少一种生物中起作用的复制起点、启动子序列、方便的限制性核酸内切酶位点和一种或多种选择性标记物。术语“载体”也应被解释为包括非质粒和非病毒化合物,其促进核酸转移到细胞中,诸如例如,聚赖氨酸化合物、脂质体等。A "vector" is a composition of matter comprising an isolated nucleic acid and which can be used to deliver the isolated nucleic acid to the interior of a cell. Many vectors are known in the art including, but not limited to, linear polynucleotides, polynucleotides associated with ionic or amphiphilic compounds, plasmids, and viruses. In general, suitable vectors contain an origin of replication functional in at least one organism, a promoter sequence, convenient restriction endonuclease sites and one or more selectable markers. The term "vector" should also be construed to include non-plasmid and non-viral compounds that facilitate the transfer of nucleic acids into cells, such as, for example, polylysine compounds, liposomes, and the like.
在一些实施方式中,所述载体是病毒载体。病毒载体的实例包括但不限于:腺病毒载体、腺相关病毒载体、慢病毒载体、逆转录病毒载体、牛痘载体、单纯疱疹病毒载体及其衍生物。在一些实施方式中,所述载体是噬菌体载体。病毒载体技术是本领域众所周知的,并且例如描述于Sambrook等人(2001,Molecular Cloning:A Laboratory Manual,Cold Spring Harbor Laboratory,New York),以及其他病毒学和分子生物学手册。In some embodiments, the vector is a viral vector. Examples of viral vectors include, but are not limited to, adenoviral vectors, adeno-associated viral vectors, lentiviral vectors, retroviral vectors, vaccinia vectors, herpes simplex virus vectors, and derivatives thereof. In some embodiments, the vector is a phage vector. Viral vector technology is well known in the art and is described, for example, in Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York), among other handbooks of virology and molecular biology.
已经开发了许多基于病毒的系统,用于将基因转移到哺乳动物细胞中。例如,逆转录病毒为基因递送系统提供了便利的平台。可以使用本领域已知的技术将异源核酸插入载体,并包装在逆转录病毒颗粒中。然后可以分离重组病毒,并在体外或离体递送至所述工程化的哺乳动物细胞。许多逆转录病毒系统是本领域已知的。在一些实施方式中,使用腺病毒载体。许多腺病毒载体是本领域已知的。在一些实施方式中,使用慢病毒载体。在一些实施方式中,使用自灭活的慢病毒载体。 A number of virus-based systems have been developed for gene transfer into mammalian cells. For example, retroviruses provide a convenient platform for gene delivery systems. The heterologous nucleic acid can be inserted into a vector and packaged in retroviral particles using techniques known in the art. Recombinant virus can then be isolated and delivered to the engineered mammalian cells in vitro or ex vivo. Many retroviral systems are known in the art. In some embodiments, an adenoviral vector is used. Many adenoviral vectors are known in the art. In some embodiments, lentiviral vectors are used. In some embodiments, a self-inactivating lentiviral vector is used.
在某些实施方式中,所述载体是腺相关病毒(AAV)载体,例如AAV2、AAV8或AAV9,其可以按包含至少1×105个颗粒(也称为颗粒单位,pu)的单次给药施用腺病毒或腺相关病毒。在一些实施方式中,所述给药量是至少约1×106个颗粒,至少约1×107个颗粒,至少约1×108个颗粒,或至少约1×109个颗粒的腺相关病毒。递送方法和给药量描述于例如WO 2016205764和美国专利No.8,454,972,其通过整体引用而并入本文。In certain embodiments, the vector is an adeno-associated virus (AAV) vector, such as AAV2, AAV8, or AAV9, which can administer the adenovirus or adeno-associated virus in a single dose comprising at least 1 x 105 particles (also known as particle units, pu). In some embodiments, the administered amount is at least about 1×10 6 particles, at least about 1×10 7 particles, at least about 1×10 8 particles, or at least about 1×10 9 particles of adeno-associated virus. Delivery methods and dosages are described, for example, in WO 2016205764 and US Patent No. 8,454,972, which are incorporated herein by reference in their entirety.
在一些实施方式中,所述载体是重组腺相关病毒(rAAV)载体。例如,在一些实施方式中,可以使用经修饰的AAV载体递送。经修饰的AAV载体可以基于几种衣壳类型中的一种或多种,包括AAV1、AV2、AAV5、AAV6、AAV8、AAV8.2、AAV9、AAV rh10,经修饰的AAV载体(例如经修饰的AAV2,经修饰的AAV3,经修饰的AAV6)和假型AAV(例如AAV2/8,AAV2/5和AAV2/6)。可用于产生rAAV颗粒的示例性AAV载体和技术是本领域已知的(参见例如Aponte-Ubillus et al.(2018)Appl.Microbiol.Biotechnol.102(3):1045-54;Zhong et al.(2012)J.Genet.Syndr.Gene Ther.S1:008;West et al.(1987)Virology 160:38-47(1987);Tratschin et al.(1985)Mol.Cell.Biol.5:3251-60;美国专利Nos.4,797,368和5,173,414;国际公开第WO2015/054653号和第WO93/24641号,其各自通过引用并入本文)。In some embodiments, the vector is a recombinant adeno-associated virus (rAAV) vector. For example, in some embodiments, modified AAV vectors can be used for delivery. Modified AAV vectors can be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, AAV8.2, AAV9, AAV rh10, modified AAV vectors (e.g., modified AAV2, modified AAV3, modified AAV6) and pseudotyped AAV (e.g., AAV2/8, AAV2/5, and AAV2/6). Exemplary AAV vectors and techniques that can be used to generate rAAV particles are known in the art (see, e.g., Aponte-Ubillus et al. (2018) Appl. Microbiol. Biotechnol. 102(3): 1045-54; Zhong et al. (2012) J. Genet. Syndr. Gene Ther. 160:38-47 (1987); Tratschin et al. (1985) Mol. Cell. Biol. 5:3251-60; U.S. Patent Nos. 4,797,368 and 5,173,414; International Publication Nos. WO2015/054653 and WO93/24641, each of which is incorporated herein by reference).
用于递送Cas9和其他Cas蛋白的任何已知的AAV载体都可以用于递送本申请的工程化的CasX系统。Any known AAV vector used to deliver Cas9 and other Cas proteins can be used to deliver the engineered CasX system of the present application.
还提供了包含本文所述工程化的CasX核酸酶、工程化的CasX效应蛋白或工程化的CRISPR-CasX系统中任一种的一种或多种组分的组合物、试剂盒、单位药剂和制品。Also provided are compositions, kits, unit doses, and articles of manufacture comprising one or more components of any of the engineered CasX nucleases, engineered CasX effector proteins, or engineered CRISPR-CasX systems described herein.
在一些实施方式中,提供了试剂盒,其包含:一个或多个AAV载体,其编码本文所述工程化的CasX核酸酶、工程化CasX效应蛋白或工程化CRISPR-CasX系统中任一种。在一些实施方式中,所述试剂盒还包含一种或多种指导RNA。在一些实施方式中,所述试剂盒还包含供体DNA。在一些实施方式中,所述试剂盒还包含细胞诸如人细胞。In some embodiments, kits are provided comprising: one or more AAV vectors encoding any of the engineered CasX nucleases, engineered CasX effector proteins, or engineered CRISPR-CasX systems described herein. In some embodiments, the kit further comprises one or more guide RNAs. In some embodiments, the kit further comprises donor DNA. In some embodiments, the kit further comprises cells such as human cells.
所述试剂盒可包含一种或多种额外组分,诸如容器、试剂、培养基、细胞因子、缓冲液、抗体等,以允许工程化的细胞的繁殖。所述试剂盒还可包含用于施用组合物的装置。 The kit may comprise one or more additional components, such as containers, reagents, media, cytokines, buffers, antibodies, etc., to allow propagation of the engineered cells. The kit may also comprise a device for administering the composition.
所述试剂盒还可包含使用本文所述工程化的CRISPR-CasX系统的说明书,诸如检测或修饰靶核酸的方法。在一些实施方式中,所述试剂盒包含用于处理或诊断疾病或病况的说明书。与所述试剂盒组分使用有关的说明书通常包括针对该刻意处理的给药量、给药时间表和给药途径的信息。所述容器可以是单位剂量、散装包装(例如多剂量包装)或亚单位剂量。例如,可以提供这样的试剂盒(其包含足够剂量的本文所公开的组合物)以在延长的时期内提供对个体的有效处理。试剂盒还可以包括多个单位剂量的组合物和使用说明书,其包装数量足以在药房(例如医院药房和复方药房)中存储和使用。The kit may also comprise instructions for using the engineered CRISPR-CasX system described herein, such as methods for detecting or modifying target nucleic acids. In some embodiments, the kit comprises instructions for treating or diagnosing the disease or condition. The instructions pertaining to the use of the kit components will generally include information on the amount to be administered, the schedule of administration and the route of administration for such deliberate treatment. The container can be a unit dose, bulk package (eg, a multi-dose package), or a subunit dose. For example, kits comprising sufficient doses of a composition disclosed herein can be provided to provide effective treatment of an individual over an extended period of time. The kit can also include a plurality of unit doses of the composition and instructions for use packaged in quantities sufficient for storage and use in a pharmacy (eg, hospital pharmacy and compounding pharmacy).
本发明的试剂盒处于合适的包装中。合适的包装包括但不限于:小瓶、瓶子、广口瓶、软包装(例如密封的聚酯薄膜或塑料袋)等。试剂盒可以任选地提供额外的组分,如缓冲液和解释性信息。因此,本申请还提供了一种制品,其包括小瓶(如密封的小瓶)、瓶子、广口瓶、柔性包装等。The kits of the invention are in suitable packaging. Suitable packaging includes, but is not limited to, vials, bottles, jars, flexible packaging (eg, sealed mylar or plastic bags), and the like. Kits may optionally provide additional components such as buffers and explanatory information. Accordingly, the present application also provides an article of manufacture including vials (eg, sealed vials), bottles, jars, flexible packaging, and the like.
所述制品可以包括容器以及在容器上或与容器粘合的标签或包装插页。合适的容器包括例如瓶子、小瓶、注射器等。所述容器可以由多种材料(诸如玻璃或塑料)形成。通常,所述容器容纳有效处理本文所述的疾病或病症的组合物,并且可以具有无菌进入口(例如,容器可以是静脉注射溶液袋或具有可通过皮下注射针刺穿的塞子的小瓶)。所述标签或包装插页表明该组合物用于处理个体中的特定病况。所述标签或包装插页将进一步包括用于将组合物向个体施用的说明书。The article of manufacture may comprise a container and a label or package insert on or adhered to the container. Suitable containers include, for example, bottles, vials, syringes, and the like. The container can be formed from a variety of materials such as glass or plastic. Typically, the container contains a composition effective to treat a disease or condition described herein, and may have a sterile access port (eg, the container may be a bag of intravenous solution or a vial with a stopper pierceable by a hypodermic needle). The label or package insert indicates that the composition is used to treat a particular condition in an individual. The label or package insert will further include instructions for administering the composition to the individual.
包装插页是指通常包含在治疗产品的商业包装中的说明,所述说明包含关于使用此类治疗产品的适应症、用法、给药量、施用、禁忌症和/或警告的信息。此外,所述制品还可包括第二容器,其包含药学上可接受的缓冲液,如抑菌注射用水(BWFI)、磷酸盐缓冲盐水、林格氏溶液和葡萄糖溶液。从商业和用户的角度来看,它还可以包括其他材料,包括其他缓冲液、稀释剂、过滤器、针头和注射器。Package Insert means the instructions commonly included in commercial packages of therapeutic products that contain information on the indications, usage, dosage, administration, contraindications and/or warnings regarding the use of such therapeutic products. Additionally, the article of manufacture may also include a second container comprising a pharmaceutically acceptable buffer, such as bacteriostatic water for injection (BWFI), phosphate buffered saline, Ringer's solution, and dextrose solution. From a commercial and user perspective, it can also include other materials, including other buffers, diluents, filters, needles, and syringes.
实施例部分Example part
下面将参照附图更详细地描述本发明的具体实施例。虽然附图中显示了本发明的具体实施例,然而应当理解,可以以各种形式实现本发明而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本发明,并且能够将本发明的范围完整的传达给本领域的技术人员。 Specific embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. Although specific embodiments of the invention are shown in the drawings, it should be understood that the invention may be embodied in various forms and is not limited to the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present invention and to fully convey the scope of the present invention to those skilled in the art.
实施例1:将参比PlmCasX酶中与PAM相互作用的氨基酸替换为带正电的氨基酸,并验证其基因编辑效率。Example 1: Replace the amino acid that interacts with PAM in the reference PlmCasX enzyme with a positively charged amino acid, and verify its gene editing efficiency.
质粒构建Plasmid construction
我们将编码野生型PlmCasX以及突变体的DNA序列装载到pCAG-2A-EGFP质粒中,从而构建了表达CasX质粒。通过将含有靶序列的退火寡核苷酸连接到BasI消化的pUC19-U6-crRNA/sgRNA骨架中构建了在293T中表达Cas蛋白crRNA或者sgRNA的载体。We loaded the DNA sequences encoding wild-type PlmCasX and mutants into the pCAG-2A-EGFP plasmid to construct a CasX expression plasmid. A vector expressing Cas protein crRNA or sgRNA in 293T was constructed by ligating the annealed oligonucleotide containing the target sequence into the BasI-digested pUC19-U6-crRNA/sgRNA backbone.
细胞培养、转染以及荧光激活细胞分选法(FACS)Cell culture, transfection, and fluorescence-activated cell sorting (FACS)
将HEK293T细胞在含1%青霉素-链霉素(Gibco)和10%胎牛血清(Gibco)的DMEM(Gibco)中培养。将细胞接种在24-细胞培养皿(Corning)中16小时,直到细胞密度达到70%。通过使用Lipofectamine 3000(Invitrogen),将600ng编码Cas蛋白的质粒和300ng编码crRNA的质粒,转染到24孔细胞培养皿培养的细胞中。转染120h后,用胰蛋白酶-EDTA(0.05%)(Gibco)消化细胞,然后进行荧光激活细胞分选(FACS)。HEK293T cells were cultured in DMEM (Gibco) containing 1% penicillin-streptomycin (Gibco) and 10% fetal bovine serum (Gibco). Cells were seeded in 24-cell culture dishes (Corning) for 16 hours until the cell density reached 70%. By using Lipofectamine 3000 (Invitrogen), 600ng of a plasmid encoding a Cas protein and 300ng of a plasmid encoding a crRNA were transfected into cells cultured in a 24-well cell culture dish. 120 h after transfection, cells were digested with trypsin-EDTA (0.05%) (Gibco), and then subjected to fluorescence-activated cell sorting (FACS).
靶向深度测序分析以进行基因组修饰Targeted Deep Sequencing Analysis for Genome Modifications
将FACS分选的GFP-阳性293FT细胞用缓冲液L裂解,并在55℃下孵育3小时,然后在95℃下孵育10分钟。使用相应的引物对在不同基因组位点中包含靶位点的dsDNA片段进行PCR扩增。对于靶向的深度测序,直接使用细胞裂解液作为模板,通过条形码(barcoded)PCR直接扩增靶位点。纯化PCR产物并归集到几个文库中以进行高通量测序。通过计算包含插入或缺失的读取(reads)的比率,采用CRISPResso2软件分析插入缺失的频率(%)。数量少于完整读取的0.05%的读取被废弃。FACS-sorted GFP-positive 293FT cells were lysed with buffer L and incubated at 55 °C for 3 h, then at 95 °C for 10 min. dsDNA fragments containing target sites in different genomic loci were PCR amplified using the corresponding primers. For targeted deep sequencing, target loci are directly amplified by barcoded PCR using cell lysates as templates. PCR products were purified and pooled into several libraries for high-throughput sequencing. The frequency (%) of indels was analyzed using CRISPResso2 software by calculating the ratio of reads containing indels or indels. Reads with an amount less than 0.05% of complete reads were discarded.
实施例1-A选出了三个具有单个氨基酸替换的工程化PlmCasXExample 1-A selects three engineered PlmCasXs with single amino acid substitutions
按照实施例1描述的方法分别表达氨基酸序列具有单个突变的工程化PlmCasX酶,优选的氨基酸替换方式及其相应的基因编辑效率的见于图1和表1。在图1中,我们首先选取了PlmCasX距离PAM DNA以内的14个氨基酸:T26,G27,M29,P105,D195,S198,E204,S222,S230,Q512,S564,G565,K610,E640进行精氨酸(R)的点突变测试。通过在293T细胞中比较这些突变体与野生型PlmCasX在2个基因组位点:AAVS1-2,CCR5-2 的基因编辑效率,我们发现,带有下述氨基酸替换的突变体:T26R,K610R以及E640R能够有效地提高基因编辑效率,尤其是T26R,能够将效率提高36倍(此为针对AAVS1-2的参比酶的基因编辑效率)和37倍(此为针对CCR5-2的参比酶的基因编辑效率)。S230R,Q512R,S564R这几个突变体对PlmCasX效率的提高具有严重的负面影响。According to the method described in Example 1, engineered PlmCasX enzymes with a single mutation in the amino acid sequence were respectively expressed, and the preferred amino acid replacement method and its corresponding gene editing efficiency are shown in Figure 1 and Table 1. In Figure 1, we first selected the distance between PlmCasX and PAM DNA Within 14 amino acids: T26, G27, M29, P105, D195, S198, E204, S222, S230, Q512, S564, G565, K610, E640 for arginine (R) point mutation test. By comparing these mutants with wild-type PlmCasX in 293T cells at 2 genomic loci: AAVS1-2, CCR5-2 We found that mutants with the following amino acid substitutions: T26R, K610R and E640R can effectively improve the gene editing efficiency, especially T26R, which can increase the efficiency by 36 times (this is the gene editing efficiency of the reference enzyme for AAVS1-2) and 37 times (this is the gene editing efficiency of the reference enzyme for CCR5-2). The mutants S230R, Q512R, and S564R have serious negative effects on the improvement of PlmCasX efficiency.
表1
Table 1
实施例1-B选出了具有两个氨基酸替换的工程化PlmCasXExample 1-B selects engineered PlmCasX with two amino acid substitutions
按照实施例1描述的方法分别表达氨基酸序列具有单个突变的工程化PlmCasX酶,优选的氨基酸替换方式及其相应的基因编辑效率的见于图2和表2。在图2中,我们将实施例1-A的中描述的T26R与K610R两个突变体进行叠加,构建了新型的PlmCasX突变体:T26R+K610R。通过在293T细胞中比较这些突变体与野生型PlmCasX在2个基因组位点:AAVS1-2, CCR5-2的基因编辑效率,我们发现,相对于具有单个点突变的工程化PlmCasX,具有双点突变的工程化PlmCasX(T26R+K610R)在4个测试的基因组位点都具有最高的基因编辑效率。According to the method described in Example 1, engineered PlmCasX enzymes with a single mutation in the amino acid sequence were respectively expressed, and the preferred amino acid replacement method and its corresponding gene editing efficiency are shown in Figure 2 and Table 2. In Figure 2, we superimposed the T26R and K610R mutants described in Example 1-A to construct a new PlmCasX mutant: T26R+K610R. By comparing these mutants with wild-type PlmCasX in 293T cells at 2 genomic loci: AAVS1-2, For the gene editing efficiency of CCR5-2, we found that the engineered PlmCasX with a double point mutation (T26R+K610R) had the highest gene editing efficiency at all four tested genomic loci relative to the engineered PlmCasX with a single point mutation.
表2:
Table 2:
实施例2:将参比PlmCasX酶中与ssDNA底物相互作用的氨基酸替换为带正电的氨基酸,并验证其基因编辑效率。Example 2: Replace the amino acids in the reference PlmCasX enzyme that interact with the ssDNA substrate with positively charged amino acids, and verify its gene editing efficiency.
质粒构建Plasmid construction
我们将编码野生型PlmCasX以及突变体的DNA序列装载到pCAG-2A-EGFP质粒中,从而构建了表达CasX质粒。通过将含有靶序列的退火寡核苷酸连接到BasI消化的pUC19-U6-crRNA/sgRNA骨架中构建了在293T中表达Cas蛋白crRNA或者sgRNA的载体。We loaded the DNA sequences encoding wild-type PlmCasX and mutants into the pCAG-2A-EGFP plasmid to construct a CasX expression plasmid. A vector expressing Cas protein crRNA or sgRNA in 293T was constructed by ligating the annealed oligonucleotide containing the target sequence into the BasI-digested pUC19-U6-crRNA/sgRNA backbone.
细胞培养、转染以及荧光激活细胞分选法(FACS)Cell culture, transfection, and fluorescence-activated cell sorting (FACS)
将HEK293T细胞在含1%青霉素-链霉素(Gibco)和10%胎牛血清(Gibco)的DMEM(Gibco)中培养。将细胞接种在24-细胞培养皿(Corning)中16小时,直到细胞密度达到70%。通过使用Lipofectamine 3000(Invitrogen),将600ng编码Cas蛋白的质粒和300ng编码crRNA的质粒,转染到24孔细胞培养皿培养的细胞中。转染120h后,用胰蛋白酶-EDTA(0.05%)(Gibco)消化细胞,然后进行荧光激活细胞分选(FACS)。HEK293T cells were cultured in DMEM (Gibco) containing 1% penicillin-streptomycin (Gibco) and 10% fetal bovine serum (Gibco). Cells were seeded in 24-cell culture dishes (Corning) for 16 hours until the cell density reached 70%. By using Lipofectamine 3000 (Invitrogen), 600ng of a plasmid encoding a Cas protein and 300ng of a plasmid encoding a crRNA were transfected into cells cultured in a 24-well cell culture dish. 120 h after transfection, cells were digested with trypsin-EDTA (0.05%) (Gibco), and then subjected to fluorescence-activated cell sorting (FACS).
靶向深度测序分析以进行基因组修饰Targeted Deep Sequencing Analysis for Genome Modifications
将FACS分选的GFP-阳性293T细胞用缓冲液L裂解,并在55℃下孵育3小时,然后在95℃下孵育10分钟。使用相应的引物对在不同基因组位点中包含靶位点的dsDNA片段进行PCR扩增。对于靶向的深度测序,直接使用细胞裂解液作为模板,通过条形码(barcoded)PCR直接扩增靶位点。 纯化PCR产物并归集到几个文库中以进行高通量测序。通过计算包含插入或缺失的读取(reads)的比率,采用CRISPResso2软件分析插入缺失的频率(%)。数量少于完整读取的0.05%的读取被废弃。FACS-sorted GFP-positive 293T cells were lysed with buffer L and incubated at 55 °C for 3 h, then at 95 °C for 10 min. dsDNA fragments containing target sites in different genomic loci were PCR amplified using the corresponding primers. For targeted deep sequencing, target loci are directly amplified by barcoded PCR using cell lysates as templates. PCR products were purified and pooled into several libraries for high-throughput sequencing. The frequency (%) of indels was analyzed using CRISPResso2 software by calculating the ratio of reads containing indels or indels. Reads with an amount less than 0.05% of complete reads were discarded.
选出了具有单个氨基酸替换的工程化PlmCasXAn engineered PlmCasX with a single amino acid substitution was selected
按照实施例2描述的方法分别表达氨基酸序列具有单个突变的工程化PlmCasX酶,优选的氨基酸替换方式及其相应的基因编辑效率的见于图3和表3。在图3中,我们首先选取了PlmCasX距离ssDNA以内的16个氨基酸:G661,E662,N663,K699,N757,L758,S759,Q765,G766,E773,T806,K808,H903,Q907,E908,F958进行精氨酸(R)的点突变测试。通过在293T细胞中比较这些突变体与野生型PlmCasX在2个基因组位点:AAVS1-2,CCR5-2的基因编辑效率,我们发现,带有下述氨基酸替换的突变体:S759R以及K808R能够有效地提高基因编辑效率。然而,多个突变体对PlmCasX效率的提高具有严重的负面影响。According to the method described in Example 2, engineered PlmCasX enzymes with a single mutation in the amino acid sequence were respectively expressed, and the preferred amino acid replacement mode and its corresponding gene editing efficiency are shown in Figure 3 and Table 3. In Figure 3, we first selected the PlmCasX distance ssDNA The 16 amino acids within: G661, E662, N663, K699, N757, L758, S759, Q765, G766, E773, T806, K808, H903, Q907, E908, F958 were tested for point mutations of arginine (R). By comparing the gene editing efficiency of these mutants and wild-type PlmCasX at two genomic loci: AAVS1-2, CCR5-2 in 293T cells, we found that the mutants with the following amino acid substitutions: S759R and K808R can effectively improve gene editing efficiency. However, multiple mutants had severe negative effects on the improvement of PlmCasX efficiency.
表3
table 3
实施例3:将实施例1-2中的筛选得到的部分能够提高学野生型PlmCasX基因编辑效率的氨基酸突变体进行组合,并验证它们的基因编辑效率。Example 3: Combine some of the amino acid mutants obtained from the screening in Examples 1-2 that can improve the gene editing efficiency of the wild-type PlmCasX, and verify their gene editing efficiency.
质粒构建Plasmid construction
我们将编码野生型PlmCasX以及突变体的DNA序列装载到pCAG-2A-EGFP质粒中,从而构建了表达CasX质粒。通过将含有靶序列的退火寡核苷酸连接到BasI消化的pUC19-U6-crRNA/sgRNA骨架中构建了在293T中表达Cas蛋白crRNA或者sgRNA的载体。We loaded the DNA sequences encoding wild-type PlmCasX and mutants into the pCAG-2A-EGFP plasmid to construct a CasX expression plasmid. A vector expressing Cas protein crRNA or sgRNA in 293T was constructed by ligating the annealed oligonucleotide containing the target sequence into the BasI-digested pUC19-U6-crRNA/sgRNA backbone.
细胞培养、转染以及荧光激活细胞分选法(FACS)Cell culture, transfection, and fluorescence-activated cell sorting (FACS)
将HEK293T细胞在含1%青霉素-链霉素(Gibco)和10%胎牛血清(Gibco)的DMEM(Gibco)中培养。将细胞接种在24-细胞培养皿(Corning)中16小时,直到细胞密度达到70%。通过使用Lipofectamine 3000(Invitrogen),将600ng编码Cas蛋白的质粒和300ng编码crRNA的质粒,转染到24孔细胞培养皿培养的细胞中。转染120h后,用胰蛋白酶-EDTA(0.05%)(Gibco)消化细胞,然后进行荧光激活细胞分选(FACS)。HEK293T cells were cultured in DMEM (Gibco) containing 1% penicillin-streptomycin (Gibco) and 10% fetal bovine serum (Gibco). Cells were seeded in 24-cell culture dishes (Corning) for 16 hours until the cell density reached 70%. By using Lipofectamine 3000 (Invitrogen), 600ng of a plasmid encoding a Cas protein and 300ng of a plasmid encoding a crRNA were transfected into cells cultured in a 24-well cell culture dish. 120 h after transfection, cells were digested with trypsin-EDTA (0.05%) (Gibco), and then subjected to fluorescence-activated cell sorting (FACS).
靶向深度测序分析以进行基因组修饰Targeted Deep Sequencing Analysis for Genome Modifications
将FACS分选的GFP-阳性293FT细胞用缓冲液L裂解,并在55℃下孵育3小时,然后在95℃下孵育10分钟。使用相应的引物对在不同基因组位点中包含靶位点的dsDNA片段进行PCR扩增。对于靶向的深度测序,直接使用细胞裂解液作为模板,通过条形码(barcoded)PCR直接扩增靶位点。纯化PCR产物并归集到几个文库中以进行高通量测序。通过计算包含插入或缺失的读取(reads)的比率,采用CRISPResso2软件分析插入缺失的频率(%)。数量少于完整读取的0.05%的读取被废弃。FACS-sorted GFP-positive 293FT cells were lysed with buffer L and incubated at 55 °C for 3 h, then at 95 °C for 10 min. dsDNA fragments containing target sites in different genomic loci were PCR amplified using the corresponding primers. For targeted deep sequencing, target loci are directly amplified by barcoded PCR using cell lysates as templates. PCR products were purified and pooled into several libraries for high-throughput sequencing. The frequency (%) of indels was analyzed using CRISPResso2 software by calculating the ratio of reads containing indels or indels. Reads with an amount less than 0.05% of complete reads were discarded.
选出了具有最优氨基酸替换的工程化PlmCasXAn engineered PlmCasX with optimal amino acid substitutions was selected
按照实施例3描述的方法分别表达氨基酸序列具有单个以及多个突变的工程化PlmCasX酶,优选的氨基酸替换方式及其相应的基因编辑效率的见于图4和表4。在图4中,我们将实施例1-B中得到的最优点突变组合T26R+K610R与实施例2中的最优点突变K808R进一步组合到一个蛋白中, 通过在293T细胞中比较这个新突变体(T26R+K610R+K808R)与野生型PlmCasX在4个基因组位点:AAVS1-2,AAVS1-7,CCR5-2以及CD34-1的基因编辑效率,我们发现,突变体T26R+K610R+K808R在4个测试的基因组位点都具有最高的基因编辑效率。相对于野生型的PlmCasX,T26R+K610R+K808R的基因编辑效率分别提高了474倍(此为针对AAVS1-2的参比酶的基因编辑效率),16倍(此为针对AAVS1-7的参比酶的基因编辑效率),152倍(此为针对CCR5-2的参比酶的基因编辑效率)以及287倍(此为针对CD34-1的参比酶的基因编辑效率)。According to the method described in Example 3, the engineered PlmCasX enzymes with single and multiple mutations in the amino acid sequence were respectively expressed, and the preferred amino acid replacement method and its corresponding gene editing efficiency are shown in Figure 4 and Table 4. In Figure 4, we further combined the optimal point mutation combination T26R+K610R obtained in Example 1-B and the optimal point mutation K808R in Example 2 into a protein, By comparing the gene editing efficiency of this new mutant (T26R+K610R+K808R) and wild-type PlmCasX at four genomic loci: AAVS1-2, AAVS1-7, CCR5-2 and CD34-1 in 293T cells, we found that the mutant T26R+K610R+K808R had the highest gene editing efficiency at all four tested genomic loci. Compared with wild-type PlmCasX, the gene editing efficiency of T26R+K610R+K808R was increased by 474 times (this is the gene editing efficiency of the reference enzyme for AAVS1-2), 16 times (this is the gene editing efficiency of the reference enzyme for AAVS1-7), 152 times (this is the gene editing efficiency of the reference enzyme for CCR5-2) and 287 times (this is the gene editing efficiency of the reference enzyme for CD34-1 efficiency).
表4
Table 4
尽管以上结合附图对本发明的实施方式进行了描述,但本发明并不局限于上述的具体实施方式和应用领域,上述的具体实施方式仅仅是示意性的、指导性的,而不是限制性的。本领域的普通技术人员在本说明书的启示下和在不脱离本发明权利要求所保护的范围的情况下,还可以做出很多种的形式,这些均属于本发明保护之列。Although the embodiments of the present invention have been described above with reference to the accompanying drawings, the present invention is not limited to the above-mentioned specific embodiments and application fields, and the above-mentioned specific embodiments are only illustrative and instructive, rather than restrictive. Under the enlightenment of this specification and without departing from the protection scope of the claims of the present invention, those skilled in the art can also make many forms, which all belong to the protection of the present invention.
序列表
sequence listing
Claims (17)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202380017557.3A CN119013395A (en) | 2022-01-19 | 2023-01-18 | Engineered CasX nucleases, effector proteins and uses thereof |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202210061995.7 | 2022-01-19 | ||
| CN202210061995 | 2022-01-19 | ||
| CN202210232743.6 | 2022-03-09 | ||
| CN202210232743.6A CN116497002A (en) | 2022-01-19 | 2022-03-09 | Engineered CasX nucleases, effector proteins and uses thereof |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2023138617A1 true WO2023138617A1 (en) | 2023-07-27 |
Family
ID=87323594
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/072900 Ceased WO2023138617A1 (en) | 2022-01-19 | 2023-01-18 | Engineered casx nuclease, effector protein and use thereof |
Country Status (2)
| Country | Link |
|---|---|
| CN (2) | CN116497002A (en) |
| WO (1) | WO2023138617A1 (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2020247882A1 (en) * | 2019-06-07 | 2020-12-10 | Scribe Therapeutics Inc. | Engineered casx systems |
| CN112195164A (en) * | 2020-12-07 | 2021-01-08 | 中国科学院动物研究所 | Engineered Cas effector proteins and methods of use thereof |
| CN112867792A (en) * | 2018-08-23 | 2021-05-28 | 桑格摩生物治疗股份有限公司 | Engineered target-specific base editor |
| CN113151215A (en) * | 2021-05-27 | 2021-07-23 | 中国科学院动物研究所 | Engineered Cas12i nuclease, effector protein thereof and uses thereof |
-
2022
- 2022-03-09 CN CN202210232743.6A patent/CN116497002A/en active Pending
-
2023
- 2023-01-18 WO PCT/CN2023/072900 patent/WO2023138617A1/en not_active Ceased
- 2023-01-18 CN CN202380017557.3A patent/CN119013395A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112867792A (en) * | 2018-08-23 | 2021-05-28 | 桑格摩生物治疗股份有限公司 | Engineered target-specific base editor |
| WO2020247882A1 (en) * | 2019-06-07 | 2020-12-10 | Scribe Therapeutics Inc. | Engineered casx systems |
| CN112195164A (en) * | 2020-12-07 | 2021-01-08 | 中国科学院动物研究所 | Engineered Cas effector proteins and methods of use thereof |
| CN113151215A (en) * | 2021-05-27 | 2021-07-23 | 中国科学院动物研究所 | Engineered Cas12i nuclease, effector protein thereof and uses thereof |
Non-Patent Citations (2)
| Title |
|---|
| CHEN YANGCAN, HU YANPING, WANG XINGE, LUO SHENGQIU, YANG NING, CHEN YI, LI ZHIKUN, ZHOU QI, LI WEI: "Synergistic engineering of CRISPR-Cas nucleases enables robust mammalian genome editing", THE INNOVATION, vol. 3, no. 4, 1 July 2022 (2022-07-01), pages 100264, XP055973274, ISSN: 2666-6758, DOI: 10.1016/j.xinn.2022.100264 * |
| LIU JUN-JIE; ORLOVA NATALIA; OAKES BENJAMIN L.; MA ENBO; SPINNER HANNAH B.; BANEY KATHERINE L.; CHUCK JONATHAN; TAN DAN; KNOTT GAV: "CasX enzymes comprise a distinct family of RNA-guided genome editors", NATURE, NATURE PUBLISHING GROUP UK, LONDON, vol. 566, no. 7743, 4 February 2019 (2019-02-04), London, pages 218 - 223, XP036746431, ISSN: 0028-0836, DOI: 10.1038/s41586-019-0908-x * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN119013395A (en) | 2024-11-22 |
| CN116497002A (en) | 2023-07-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113151215B (en) | Engineered Cas12i nuclease, effector protein thereof and application thereof | |
| CN113308451B (en) | Engineered Cas effector proteins and methods of use thereof | |
| KR102494449B1 (en) | Engineered cas9 systems for eukaryotic genome modification | |
| US20240425830A1 (en) | Engineered cas12i nuclease, effector protein and use thereof | |
| EP4421167A2 (en) | Enhanced hat family transposon-mediated gene transfer and associated compositions, systems, and methods | |
| CN111712569A (en) | Cpf1-related methods and compositions for gene editing | |
| JP2024041866A (en) | Enhanced hAT family transposon-mediated gene transfer and related compositions, systems, and methods | |
| WO2016014837A1 (en) | Gene editing for hiv gene therapy | |
| JP6956416B2 (en) | Transposon system, kits containing it and their use | |
| WO2023104185A1 (en) | Engineered cas12b effector proteins and methods of use thereof | |
| WO2022040909A1 (en) | Split cas12 systems and methods of use thereof | |
| WO2022120520A1 (en) | Engineered cas effector proteins and methods of use thereof | |
| KR102683424B1 (en) | CRISPR/Cas dropout screening platform to identify genetic vulnerabilities associated with tau aggregation | |
| WO2023138617A1 (en) | Engineered casx nuclease, effector protein and use thereof | |
| HK40054104B (en) | Engineered cas12i nuclease as well as effector protein and use thereof | |
| HK40054104A (en) | Engineered cas12i nuclease as well as effector protein and use thereof | |
| HK40109303A (en) | Engineered cas12i nuclease, effector protein and use thereof | |
| US20230272355A1 (en) | ENHANCED hAT FAMILY MEMBER SPIN TRANSPOSON-MEDIATED GENE TRANSFER AND ASSOCIATED COMPOSITIONS, SYSTEMS, AND METHODS | |
| WO2025119363A1 (en) | Cas protein, crispr-cas system containing cas protein, and use of cas protein |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23742932 Country of ref document: EP Kind code of ref document: A1 |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 202380017557.3 Country of ref document: CN |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 23742932 Country of ref document: EP Kind code of ref document: A1 |