[go: up one dir, main page]

WO2025049903A2 - Novel regulators of t cells - Google Patents

Novel regulators of t cells Download PDF

Info

Publication number
WO2025049903A2
WO2025049903A2 PCT/US2024/044668 US2024044668W WO2025049903A2 WO 2025049903 A2 WO2025049903 A2 WO 2025049903A2 US 2024044668 W US2024044668 W US 2024044668W WO 2025049903 A2 WO2025049903 A2 WO 2025049903A2
Authority
WO
WIPO (PCT)
Prior art keywords
cell
gene
cells
vector
hnf4a
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/US2024/044668
Other languages
French (fr)
Other versions
WO2025049903A3 (en
Inventor
Charles A. Gersbach
Tomer Rotstein
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Duke University
Original Assignee
Duke University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Duke University filed Critical Duke University
Publication of WO2025049903A2 publication Critical patent/WO2025049903A2/en
Publication of WO2025049903A3 publication Critical patent/WO2025049903A3/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K31/00Medicinal preparations containing organic active ingredients
    • A61K31/70Carbohydrates; Sugars; Derivatives thereof
    • A61K31/7088Compounds having three or more nucleosides or nucleotides
    • A61K31/7105Natural ribonucleic acids, i.e. containing only riboses attached to adenine, guanine, cytosine or uracil and having 3'-5' phosphodiester links
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K45/00Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
    • A61K45/06Mixtures of active ingredients without chemical characterisation, e.g. antiphlogistics and cardiaca

Definitions

  • T cells are part of the immune system’s adaptive defense. They specifically target and kill both virally infected and cancerous cells through antigen recognition. Unfortunately, cancer can exploit intrinsic T cell mechanisms to survive attack by the immune system. T cell state and function are largely regulated by specific transcription factors (TFs) and epigenetic modifiers that process intrinsic and extrinsic signals into complex and tightly controlled gene expression programs. T cell exhaustion arises from chronic antigen stimulation, which shifts a portion of the T cell population to the exhausted state (T EX ), resulting in diminished T cell proliferation and tumor/viral clearance.
  • the transcription factor TOX drives and maintains the T EX cell state through epigenetic regulation of exhaustion-associated genetic programs.
  • Adoptive T cell therapy holds tremendous potential for cancer treatment by redirecting T cells to cancer cells via expression of engineered receptors that recognize and bind to tumor-associated antigens. Receptor-antigen interactions can initiate complex transcriptional networks that drive multipotent T cell response and lead to cancer cell death.
  • the potency and duration of T cell response are associated with defined T cell subsets, and cell products enriched in stem or memory T cells, provide superior tumor control in animal models and in the clinic. Given the association between defined T cell subsets and clinical outcomes, precise regulation or programming of T cell state may be one approach to improve the therapeutic potential of ACT.
  • T cell exhaustion drives dysfunction and impaired immune response to cancer and chronic viral infections.
  • Modern genome engineering technologies have the potential to dramatically advance T cell therapy by programming exhausted T cells to desirable phenotypes.
  • the disclosure relates to an isolated polynucleotide encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • the disclosure relates to an isolated polynucleotide encoding a transcription factor selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof.
  • the isolated polynucleotide comprises a sequence selected from SEQ ID NOs: 75-97.
  • the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
  • the disclosure relates to a vector encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • the disclosure relates to a vector encoding a transcription factor selected fromTGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof.
  • a transcription factor selected fromTGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLA
  • the vector comprises a promoter operably linked to a polynucleotide sequence encoding the transcription factor.
  • the promoter is non-endogenous to the transcription factor.
  • the promoter is a constitutive promoter, or a ubiquitous promoter, or an inducible promoter, or a cell-specific promoter, or a tissue-specific promoter.
  • the vector comprises an open reading frame (ORF) of the transcription factor.
  • the vector comprises a sequence selected from SEQ ID NOs: 75-97 or encodes a polypeptide comprising a sequence selected from SEQ ID NOs: 98-120.
  • the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
  • the vector is a viral vector.
  • the vector is a lentiviral vector.
  • the vector is an adeno-associated virus (AAV) vector.
  • the AAV vector is selected from AAV1, AAV2, AAV5, AAV6, AAV8, AAV9, and an engineered AAV vector.
  • the method may include administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof.
  • Another aspect of the disclosure provides a method of increasing T cells.
  • the method may include administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • Another aspect of the disclosure provides a method of enhancing adoptive T cell therapy (ACT) in a subject.
  • ACT adoptive T cell therapy
  • the method may include administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • the method may include administering to a T cell or the subject an activator of a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • an activator of a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • the gene is selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof.
  • the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
  • the activator modulates T cells, and modulating T cells comprises increasing T cells, or increasing T cell distribution, or increasing tissue infiltration, or increasing memory T cells, or increasing the lifetime of a T cell, or preventing T cell exhaustions, or reversing T cell exhaustions, or reducing T cell exhaustion, or enhancing the therapeutic potential of T cells, or a combination thereof.
  • administration of the activator to the T cell results in a modified T cell.
  • the modified T cell is administered to a subject.
  • the T cell is autologous or allogenic.
  • the activator modulates gene expression within the T cell.
  • the activator increases expression of CD103 or IL7Ra, or a combination thereof, in the T cell.
  • the activator comprises a polypeptide, or a polynucleotide, or a small molecule, or a combination thereof.
  • the activator comprises a polynucleotide encoding the gene.
  • the activator comprises a polynucleotide comprising the open reading frame of the gene or a polynucleotide encoding a protein encoded by the gene.
  • the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polynucleotide encoding a polypeptide comprising an amino acid sequence selected from SEQ ID NOs: 98-120. In some embodiments, the activator comprises a polypeptide comprising a protein encoded by the gene. In some embodiments, the activator comprises a polypeptide selected from SEQ ID NOs: 98-120. In some embodiments, the activator comprises a vector as detailed herein. In some embodiments, the activator or a polynucleotide encoding the activator is encapsulated within a lipid nanoparticle or polymeric carrier.
  • the method further includes administering at least one cancer therapy or at least one antiviral therapy.
  • Another aspect of the disclosure provides a vector comprising an isolated polynucleotide as detailed herein.
  • Another aspect of the disclosure provides a cell comprising an isolated polynucleotide as detailed herein, or a vector as detailed herein.
  • the cell is a CD8+ T cell or a CD4+ T cell.
  • Another aspect of the disclosure provides a pharmaceutical composition.
  • the pharmaceutical composition may include an isolated polynucleotide as detailed herein, or a vector as detailed herein, or a combination thereof.
  • the pharmaceutical composition further includes at least one cancer therapy or at least one antiviral therapy.
  • compositions for increasing T cells may include an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
  • the activator comprises a polynucleotide encoding the gene, or a polynucleotide encoding the open reading frame of the gene, or a polypeptide encoded by the gene, or a combination thereof. In some embodiments, the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polypeptide selected from SEQ ID NOs: 98-120. In some embodiments, the composition further includes at least one cancer therapy or at least one antiviral therapy. [00017] The disclosure provides for other aspects and embodiments that will be apparent in light of the following detailed description and accompanying figures.
  • FIG.1 is a diagram showing a tissue-resident memory T cell (T RM ) and that the T RM cell displays distinct functional properties. T RM cells may be used for adoptive cell therapy.
  • FIG.2 is a diagram showing a hypothesis that transcription factors may be used to reprogram blood-derived T cells towards a tissue-resident state.
  • FIGS.3A-3B show diagrams of protocols for identifying T cell fate decision mediators with pooled cDNA overexpression screens.
  • FIG.3A is a diagram showing a protocol for identifying T cell fate decision mediators using pooled cDNA overexpression screening.
  • FIG.3B is a diagram showing a protocol for screening of pooled cDNA overexpression to identify fate decision mediators.
  • FIGS.4A-4C show that IL7Ra and CD103 screens uncover synthetic drivers of the tissue resident state.
  • FIG.4A is a diagram showing T cell fates and phenotypes.
  • FIG. 4B is a graph showing transcription factors that mediate upregulation of CD103 and IL7Ra.
  • FIG.4C is a graph showing that TGIF2LX significantly upregulates both CD103 and IL7Ra.
  • FIGS.5A-5C show TGIF family structure and expression.
  • FIG.5A is a diagram showing the domains of TGIF family proteins. TGIF family proteins have unique functional domains.
  • FIG.5B is a graph showing the expression in transcript per million (TPM) of TGIF family members in human pan T cells from blood, spleen, bone marrow, ileum, jejunum, lung, and skin.
  • FIG.5C is a graph showing the expression ratio of Tgif1 and Tgif2 of murine spleen T cells and intestinal intraepithelial lymphocytes (IEL). Data was analyzed from PMID:29211713 (Milner et al. Nature 2017, 552 (7684), 253-225, incorporated herein by reference). [00023]
  • FIGS.6A-6E show that Tgif1 is upregulated in early resident memory. Data was analyzed from PMID:32414833 (Kurd et al. Sci.
  • FIG.6A is a diagram showing a protocol for generating tissue-resident memory T (Trm) cells using an infectious agent.
  • FIG.6B is a uniform manifold approximation and projection (UMAP) embedding of sequencing of T cells from the spleen and gut following LCMV infection where Tgif1 expression is shown in FIG.6E.
  • FIG.6C is a UMAP embedding of sequencing of T cells from the spleen and gut where the expression of Tgif1 is shown in FIG.6E, and the UMAP is colored coded by days post-infection.
  • UMAP uniform manifold approximation and projection
  • FIG.6D is a chart showing the expression of Tgif1 and percentage of cells expressing Tgif1 in T cells from the spleen and gut.
  • FIG.6E is a chart showing the expression of Tgif2 and percentage of cells expressing Tgif2 in T cells from the spleen and gut.
  • FIGS.7A-7E show a molecular characterization of transcription factor-engineered chimeric antigen receptor T-cells (CAR Ts).
  • FIG.7A is a diagram showing a protocol for transduction, expansion, and characterization of CAR Ts.
  • FIG.7B is a graph showing flow cytometry data measuring forward scatter (FSC) and the CD103 marker on engineered CAR Ts expressing a control (Thy1.1) or TGIF2LX.
  • FIG.7C is a graph showing that CD103 is upregulated in CAR Ts expressing TGIF2LX.
  • FIG.7D is a graph showing flow cytometry data measuring IL7Ra for engineered CAR Ts expressing Thy1.1 or TGIF2LX.
  • FIG.7E is a graph showing that IL7Ra is upregulated in CAR Ts expressing TGIF2LX.
  • FIGS.8A-8J show that TGIF2LX reprograms T cells towards a tissue resident- like state.
  • FIG.8A is a volcano plot showing genes enriched in CAR Ts expressing TGIF2LX compared to CAR Ts expressing Thy1.1.
  • FIG.8B is a clustered heatmap showing expression of markers in circulatory T cells, T RM cells, CAR Ts expressing TGIF2LX, and CAR Ts expressing Thy1.1.
  • FIG.8C is a heatmap showing similarities in gene expression between perturbed samples overexpressing the indicated transcription factor or Thy1.1 control.
  • FIG.8D is a graph showing principal component (PC) 1 and 2 variance between the perturbed transcription factors.
  • FIG.8E is a graph showing PC1 and PC2 variance between the culture media.
  • FIG.8F is a graph showing PC1 and PC2 variance between the donors.
  • FIG.8G is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for KLF2.
  • FIG.8H is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for CD103.
  • FIG.8I is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for P2RX7.
  • FIG.8J is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for ITGB1.
  • FIGS.9A-9G show that single cell-omics reveals CAR Ts heterogeneity.
  • FIG.9A is a diagram showing a protocol for CITE-Seq (Cellular Indexing of Transcriptomes and Epitopes by Sequencing) of CAR Ts.
  • FIG.9B is a graph showing a UMAP embedding of sequencing of TGIF2LX CAR Ts with clusters numbered.
  • FIG.9C is a UMAP embedding of sequencing of donors (each donor represented by a different color) of TGIF2LX CAR Ts.
  • FIG.9D is a UMAP embedding of sequencing of perturbation of TGIF2LX, RUNX3, and Thy1.1 CAR Ts.
  • FIG.9E is a graph showing expression of differentially expressed genes TGIF2LX and RUNX3 vs Thy1.1 expression.
  • FIG.9F is a UMAP embedding showing identification of multiple subpopulations of CD8 TGIF2LX T cells.
  • FIG.9G is a heatmap showing expression of markers for the subpopulations of CD8 TGIF2LX T cells.
  • FIGS.10A-10F show that TGIF2LX increases tissue-residence associated surface markers.
  • FIG.10A is UMAP embeddings showing expression of CD8, CD103, CD69, and CD49a for CD8 TGIF2LX T cells.
  • FIG.10B is a UMAP embeddings showing expression of CD8, CD103, CD69, and CD49a for CD8Thy1.1 T cells.
  • FIG.10C is a graph showing the expression level of CD8 for CD8 TGIF2LX T cells and CD8 Thy1.1 T cells.
  • FIG.10D is a graph showing the expression level of CD103 for CD8 TGIF2LX T cells and CD8 Thy1.1 T cells.
  • FIG.10E is a graph showing the expression level of CD69 for CD8 TGIF2LX T cells and CD8 Thy1.1 T cells.
  • FIG.10F is a graph showing the expression level of CD49a for CD8 TGIF2LX T cells and CD8 Thy1.1 T cells.
  • FIGS.11A-11C show that CD8 subpopulations mirror memory to effector hierarchy.
  • FIG.11A is a heatmap showing expression of markers in the CD8 T cell subpopulations numbered in FIG.9F.
  • FIG.11B is a graph showing the average expression and percentage of cells expressing each marker.
  • FIG.11C is a diagram showing that the CD8 T cell subpopulations mirror memory to effector cell hierarchy.
  • FIGS.12A-12E show that TGIF2LX CAR Ts maintained cytotoxic capabilities in vitro.
  • FIG.12A is a graph showing the percent of target cells left over time after incubation with CAR T TGIF2LX or CAR T Thy1.1 .
  • FIG.12B is graphs showing concentrations of secreted granzyme A, perforin, and granulysin in media by CAR T TGIF2LX or CAR T Thy1.1 after incubation with target cells.
  • FIG.12C is graphs showing concentrations of secreted IL-17A, IL-4, and IL-2 in media by CAR T TGIF2LX or CAR T Thy1.1 after incubation with target cells.
  • FIG. 12D is graphs showing concentrations of secreted granzyme B, IFN- ⁇ , and sFas in media by CAR T TGIF2LX or CAR T Thy1.1 after incubation with target cells.
  • FIG.12E is graphs showing secreted concentrations of IL-10, IL-6, and TNF- ⁇ in media by CAR T TGIF2LX or CAR T Thy1.1 after incubation with target cells.
  • FIGS.13A-13B show that CAR T TGIFL2X improved tumor control in a HER2+ breast cancer xenograft.
  • FIG.13A is a diagram showing a protocol for inducing a breast cancer model in vivo and treating the cancer with CAR T cells.
  • FIG.13B is a graph showing tumor volume overtime following treatment with various CAR T cells.
  • FIG.14 is a graph showing expression of CD103. The data shows that TGIF2LX boosts CD103 expression in context of TGF ⁇ .
  • FIGS.15A-15B show the TGIF2LX domains that may be responsible for residency reprogramming.
  • FIG.15A is a diagram showing the 3D structure of TGIF2LX.
  • FIG.15B is a diagram showing the domains of TGIF family proteins.
  • DETAILED DESCRIPTION [00033] Provided herein are compositions and methods for increasing or enhancing T cells, which may be used to enhance antiviral therapies and cancer therapies such as ACT.
  • the compositions and methods may include increasing gene expression or a gene product thereof that encodes a regulator of T cell phenotypes, such as transcription factors, and may include, for example, delivery of a vector encoding the gene or increasing gene expression or a gene product thereof.
  • Transcriptions factors are central mediators of cellular reprogramming and differentiation.
  • the gene may be selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1.
  • ORFs open reading frames
  • Overexpression of one or more of these genes resulted in widespread transcriptional reprogramming, including activation of genes associated with increased tissue residency programs (for example, ITGAE, CD69, and CXCR6) and decreases in genes associated with circulation programs (for example, KLF2 and S1PR1).
  • Increased expression of one or more of the genes detailed herein may impact the “resident memory” phenotype of a T cell, which relates to durability of the T cell and/or the distribution and infiltration of the T cell into tissues.
  • the genes disclosed herein could be used to support specific features of memory T cells, counter T cell exhaustion, improve tumor control, enhance T cell infiltration into tissues, and/or engineer T cells with enhanced durability and therapeutic potential.
  • the term “about” refers to a range of values that fall within 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value).
  • “about” can mean within 3 or more than 3 standard deviations, per the practice in the art.
  • the term “about” can mean within an order of magnitude, preferably within 5-fold, and more preferably within 2- fold, of a value.
  • Adeno-associated virus or “AAV” as used interchangeably herein refers to a small virus belonging to the genus Dependovirus of the Parvoviridae family that infects humans and some other primate species. AAV is not currently known to cause disease and consequently the virus causes a very mild immune response.
  • Allogeneic refers to any material derived from another subject of the same species. Allogeneic cells are genetically distinct and immunologically incompatible yet belong to the same species. Typically, “allogeneic” is used to define cells, such as stem cells, that are transplanted from a donor to a recipient of the same species.
  • amino acid refers to naturally occurring and non-natural synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids.
  • Naturally occurring amino acids are those encoded by the genetic code.
  • Amino acids can be referred to herein by either their commonly known three-letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Amino acids include the side chain and polypeptide backbone portions.
  • Autologous refers to any material derived from a subject and re-introduced to the same subject.
  • “Binding region” as used herein refers to the region within a target region that is recognized and bound by the CRISPR/Cas-based gene editing system.
  • the terms “cancer”, “cancer cell”, “tumor”, and “tumor cell” are used interchangeably herein and refer generally to a group of diseases characterized by uncontrolled, abnormal growth of cells (e.g., a neoplasia). In some forms of cancer, the cancer cells can spread locally or through the bloodstream and lymphatic system to other parts of the body (“metastatic cancer”).
  • Cancer refers to all types of cancer or neoplasm or malignant tumors found in animals, including carcinoma, adenoma, melanoma, sarcoma, lymphoma, leukemia, blastoma, glioma, astrocytoma, mesothelioma, or a germ cell tumor.
  • Cancer may include cancer of, for example, the colon, rectum, stomach, bladder, cervix, uterus, skin, epithelium, muscle, kidney, liver, lymph, bone, blood, ovary, prostate, lung, brain, head and neck, and/or breast. Cancer may include medullablastoma, non-small cell lung cancer, and/or mesothelioma.
  • the cancer includes leukemia.
  • leukemia refers to broadly progressive, malignant diseases of the hematopoietic organs/systems and is generally characterized by a distorted proliferation and development of leukocytes and their precursors in the blood and bone marrow.
  • Leukemia diseases include, for example, acute nonlymphocytic leukemia, chronic lymphocytic leukemia, acute granulocytic leukemia, chronic granulocytic leukemia, acute promyelocytic leukemia, adult T-cell leukemia, aleukemic leukemia, a leukocythemic leukemia, basophilic leukemia, blast cell leukemia, bovine leukemia, chronic myelocytic leukemia, leukemia cutis, embryonal leukemia, eosinophilic leukemia, Gross' leukemia, Rieder cell leukemia, Schilling's leukemia, stem cell leukemia, subleukemic leukemia, undifferentiated cell leukemia, hairy-cell leukemia, hemoblastic leukemia, hemocytoblastic leukemia, histiocytic leukemia, stem cell leukemia, acute monocytic leukemia, leukopenic leukemia, lymphatic leukemia,
  • the leukemia is chronic myeloid leukemia (CML). In some embodiments, the leukemia is acute myeloid leukemia (AML).
  • CML chronic myeloid leukemia
  • AML acute myeloid leukemia
  • CRISPRs Clustered Regularly Interspaced Short Palindromic Repeats” and “CRISPRs”, as used interchangeably herein, refers to loci containing multiple short direct repeats that are found in the genomes of approximately 40% of sequenced bacteria and 90% of sequenced archaea.
  • Coding sequence or “encoding nucleic acid” as used herein means the nucleic acids (RNA or DNA molecule) that comprise a nucleotide sequence which encodes a protein.
  • the coding sequence can further include initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of an individual or mammal to which the nucleic acid is administered.
  • the regulatory elements may include, for example, a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal.
  • the coding sequence may be codon optimized.
  • “Complement” or “complementary” as used herein means a nucleic acid can mean Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules.
  • “Complementarity” refers to a property shared between two nucleic acid sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position will be complementary.
  • the terms “control,” “reference level,” and “reference” are used herein interchangeably.
  • the reference level may be a predetermined value or range, which is employed as a benchmark against which to assess the measured result.
  • Control group refers to a group of control subjects.
  • the predetermined level may be a cutoff value from a control group.
  • AIM Adaptive Index Model
  • a cutoff value may be determined by selecting a value that corresponds to any value in the 25th-75th percentile range, preferably a value that corresponds to the 25th percentile, the 50th percentile or the 75th percentile, and more preferably the 75th percentile.
  • Such statistical analyses may be performed using any method known in the art and can be implemented through any number of commercially available software packages (e.g., from Analyse-it Software Ltd., Leeds, UK; StataCorp LP, College Station, TX; SAS Institute Inc., Cary, NC.).
  • the healthy or normal levels or ranges for a target or for a protein activity may be defined in accordance with standard practice.
  • a control may be a subject or cell without a composition as detailed herein.
  • a control may be a subject, or a sample therefrom, whose disease state is known.
  • the subject, or sample therefrom may be healthy, diseased, diseased prior to treatment, diseased during treatment, or diseased after treatment, or a combination thereof.
  • “Correcting”, “gene editing,” and “restoring” as used herein refers to changing a mutant gene that encodes a dysfunctional protein or truncated protein or no protein at all, such that a full-length functional or partially full-length functional protein expression is obtained.
  • Correcting or restoring a mutant gene may include replacing the region of the gene that has the mutation or replacing the entire mutant gene with a copy of the gene that does not have the mutation with a repair mechanism such as homology-directed repair (HDR).
  • HDR homology-directed repair
  • Correcting or restoring a mutant gene may also include repairing a frameshift mutation that causes a premature stop codon, an aberrant splice acceptor site or an aberrant splice donor site, by generating a double stranded break in the gene that is then repaired using non-homologous end joining (NHEJ). NHEJ may add or delete at least one base pair during repair which may restore the proper reading frame and eliminate the premature stop codon. Correcting or restoring a mutant gene may also include disrupting an aberrant splice acceptor site or splice donor sequence.
  • NHEJ non-homologous end joining
  • Correcting or restoring a mutant gene may also include deleting a non-essential gene segment by the simultaneous action of two nucleases on the same DNA strand in order to restore the proper reading frame by removing the DNA between the two nuclease target sites and repairing the DNA break by NHEJ.
  • Donor DNA “donor template,” and “repair template” as used interchangeably herein refers to a double-stranded DNA fragment or molecule that includes at least a portion of the gene of interest. The donor DNA may encode a full-functional protein or a partially functional protein.
  • “Enhancer” as used herein refers to non-coding DNA sequences containing multiple activator and repressor binding sites.
  • Enhancers range from 200 bp to 1 kb in length and may be either proximal, 5’ upstream to the promoter or within the first intron of the regulated gene, or distal, in introns of neighboring genes or intergenic regions far away from the locus.
  • active enhancers contact the promoter dependently of the core DNA binding motif promoter specificity. 4 to 5 enhancers may interact with a promoter.
  • enhancers may regulate more than one gene without linkage restriction and may “skip” neighboring genes to regulate more distant ones.
  • Transcriptional regulation may involve elements located in a chromosome different to one where the promoter resides.
  • Proximal enhancers or promoters of neighboring genes may serve as platforms to recruit more distal elements.
  • “Frameshift” or “frameshift mutation” as used interchangeably herein refers to a type of gene mutation wherein the addition or deletion of one or more nucleotides causes a shift in the reading frame of the codons in the mRNA. The shift in reading frame may lead to the alteration in the amino acid sequence at protein translation, such as a missense mutation or a premature stop codon.
  • “Functional” and “full-functional” as used herein describes protein that has biological activity.
  • a “functional gene” refers to a gene transcribed to mRNA, which is translated to a functional protein.
  • Fusion protein refers to a chimeric protein created through the joining of two or more genes that originally coded for separate proteins. The translation of the fusion gene results in a single polypeptide with functional properties derived from each of the original proteins.
  • Genetic construct refers to the DNA or RNA molecules that comprise a polynucleotide that encodes a protein. The coding sequence includes initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of the individual to whom the nucleic acid molecule is administered.
  • the term “expressible form” refers to gene constructs that contain the necessary regulatory elements operable linked to a coding sequence that encodes a protein such that when present in the cell of the individual, the coding sequence will be expressed.
  • the regulatory elements may include, for example, a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal.
  • “Genome editing” or “gene editing” as used herein refers to changing the DNA sequence of a gene. Genome editing may include correcting or restoring a mutant gene or adding additional mutations. Genome editing may include knocking out a gene, such as a mutant gene or a normal gene.
  • Genome editing may be used to treat disease or, for example, enhance muscle repair, by changing the gene of interest.
  • the compositions and methods detailed herein are for use in somatic cells and not germ line cells.
  • heterologous refers to nucleic acid comprising two or more subsequences that are not found in the same relationship to each other in nature.
  • a nucleic acid that is recombinantly produced typically has two or more sequences from unrelated genes synthetically arranged to make a new functional nucleic acid, for example, a promoter from one source and a coding region from another source. The two nucleic acids are thus heterologous to each other in this context.
  • a heterologous nucleic acid When added to a cell, the recombinant nucleic acids would also be heterologous to the endogenous genes of the cell.
  • a heterologous nucleic acid would include a non-native (non- naturally occurring) nucleic acid that has integrated into the chromosome, or a non-native (non-naturally occurring) extrachromosomal nucleic acid.
  • a heterologous protein indicates that the protein comprises two or more subsequences that are not found in the same relationship to each other in nature (for example, a “fusion protein,” where the two subsequences are encoded by a single nucleic acid sequence).
  • HDR Homology-directed repair
  • a homologous piece of DNA is present in the nucleus, mostly in G2 and S phase of the cell cycle.
  • HDR uses a donor DNA template to guide repair and may be used to create specific sequence changes to the genome, including the targeted addition of whole genes. If a donor template is provided along with the CRISPR/Cas9-based gene editing system, then the cellular machinery will repair the break by homologous recombination, which is enhanced several orders of magnitude in the presence of DNA cleavage. When the homologous DNA piece is absent, non-homologous end joining may take place instead.
  • “Identical” or “identity” as used herein in the context of two or more polynucleotide or polypeptide sequences means that the sequences have a specified percentage of residues that are the same over a specified region. The percentage may be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity.
  • thymine (T) and uracil (U) may be considered equivalent.
  • Identity may be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0.
  • “Mutant gene” or “mutated gene” as used interchangeably herein refers to a gene that has undergone a detectable mutation. A mutant gene has undergone a change, such as the loss, gain, or exchange of genetic material, which affects the normal transmission and expression of the gene.
  • a “disrupted gene” as used herein refers to a mutant gene that has a mutation that causes a premature stop codon.
  • the disrupted gene product is truncated relative to a full-length undisrupted gene product.
  • “Non-homologous end joining (NHEJ) pathway” as used herein refers to a pathway that repairs double-strand breaks in DNA by directly ligating the break ends without the need for a homologous template.
  • the template-independent re-ligation of DNA ends by NHEJ is a stochastic, error-prone repair process that introduces random micro-insertions and micro-deletions (indels) at the DNA breakpoint.
  • NHEJ typically uses short homologous DNA sequences called microhomologies to guide repair. These microhomologies are often present in single-stranded overhangs on the end of double-strand breaks. When the overhangs are perfectly compatible, NHEJ usually repairs the break accurately, yet imprecise repair leading to loss of nucleotides may also occur, but is much more common when the overhangs are not compatible. “Nuclease mediated NHEJ” as used herein refers to NHEJ that is initiated after a nuclease cuts double stranded DNA.
  • Normal gene refers to a gene that has not undergone a change, such as a loss, gain, or exchange of genetic material. The normal gene undergoes normal gene transmission and gene expression. For example, a normal gene may be a wild-type gene.
  • Nucleic acid or “oligonucleotide” or “polynucleotide” as used herein means at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a polynucleotide also encompasses the complementary strand of a depicted single strand.
  • polynucleotide may be used for the same purpose as a given polynucleotide.
  • a polynucleotide also encompasses substantially identical polynucleotides and complements thereof.
  • a single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions.
  • a polynucleotide also encompasses a probe that hybridizes under stringent hybridization conditions.
  • Polynucleotides may be single stranded or double stranded or may contain portions of both double stranded and single stranded sequence.
  • the polynucleotide can be nucleic acid, natural or synthetic, DNA, genomic DNA, cDNA, RNA, mRNA, or a hybrid, where the polynucleotide can contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including, for example, uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine, and isoguanine.
  • Polynucleotides can be obtained by chemical synthesis methods or by recombinant methods.
  • Open reading frame refers to a stretch of codons that begins with a start codon and ends at a stop codon. In eukaryotic genes with multiple exons, introns are removed, and exons are then joined together after transcription to yield the final mRNA for protein translation.
  • An open reading frame may be a continuous stretch of codons. In some embodiments, the open reading frame only applies to spliced mRNAs, not genomic DNA, for expression of a protein.
  • “Operably linked” as used herein means that expression of a gene is under the control of a promoter with which it is spatially connected. A promoter may be positioned 5' (upstream) or 3' (downstream) of a gene under its control.
  • the distance between the promoter and a gene may be approximately the same as the distance between that promoter and the gene it controls in the gene from which the promoter is derived. As is known in the art, variation in this distance may be accommodated without loss of promoter function.
  • Nucleic acid or amino acid sequences are “operably linked” (or “operatively linked”) when placed into a functional relationship with one another. For instance, a promoter or enhancer is operably linked to a coding sequence if it regulates, or contributes to the modulation of, the transcription of the coding sequence. Operably linked DNA sequences are typically contiguous, and operably linked amino acid sequences are typically contiguous and in the same reading frame.
  • enhancers generally function when separated from the promoter by up to several kilobases or more and intronic sequences may be of variable lengths
  • some polynucleotide elements may be operably linked but not contiguous.
  • certain amino acid sequences that are non-contiguous in a primary polypeptide sequence may nonetheless be operably linked due to, for example folding of a polypeptide chain.
  • operatively linked and “operably linked” can refer to the fact that each of the components performs the same function in linkage to the other component as it would if it were not so linked.
  • Partially-functional as used herein describes a protein that is encoded by a mutant gene and has less biological activity than a functional protein but more than a non- functional protein.
  • a “peptide” or “polypeptide” is a linked sequence of two or more amino acids linked by peptide bonds.
  • the polypeptide can be natural, synthetic, or a modification or combination of natural and synthetic.
  • Peptides and polypeptides include proteins such as binding proteins, receptors, and antibodies.
  • the terms “polypeptide”, “protein,” and “peptide” are used interchangeably herein.
  • Primary structure refers to the amino acid sequence of a particular peptide.
  • “Secondary structure” refers to locally ordered, three dimensional structures within a polypeptide. These structures are commonly known as domains, for example, enzymatic domains, extracellular domains, transmembrane domains, pore domains, and cytoplasmic tail domains. “Domains” are portions of a polypeptide that form a compact unit of the polypeptide and are typically 15 to 350 amino acids long. Exemplary domains include domains with enzymatic activity or ligand binding activity. Typical domains are made up of sections of lesser organization such as stretches of beta-sheet and alpha- helices. “Tertiary structure” refers to the complete three-dimensional structure of a polypeptide monomer.
  • “Quaternary structure” refers to the three-dimensional structure formed by the noncovalent association of independent tertiary units.
  • a “motif” is a portion of a polypeptide sequence and includes at least two amino acids.
  • a motif may be 2 to 20, 2 to 15, or 2 to 10 amino acids in length. In some embodiments, a motif includes 3, 4, 5, 6, or 7 sequential amino acids.
  • a domain may be comprised of a series of the same type of motif.
  • Premature stop codon” or “out-of-frame stop codon” as used interchangeably herein refers to nonsense mutation in a sequence of DNA, which results in a stop codon at location not normally found in the wild-type gene.
  • a premature stop codon may cause a protein to be truncated or shorter compared to the full-length version of the protein.
  • “Promoter” as used herein means a synthetic or naturally derived molecule which is capable of conferring, activating or enhancing expression of a nucleic acid in a cell.
  • a promoter may comprise one or more specific transcriptional regulatory sequences to further enhance expression and/or to alter the spatial expression and/or temporal expression of same.
  • a promoter may also comprise distal enhancer or repressor elements, which may be located as much as several thousand base pairs from the start site of transcription.
  • a promoter may be derived from sources including viral, bacterial, fungal, plants, insects, and animals.
  • a promoter may regulate the expression of a gene component constitutively, or differentially with respect to cell, the tissue or organ in which expression occurs or, with respect to the developmental stage at which expression occurs, or in response to external stimuli such as physiological stresses, pathogens, metal ions, or inducing agents.
  • promoters include the bacteriophage T7 promoter, bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, SV40 early promoter or SV40 late promoter, human U6 (hU6) promoter, and CMV IE promoter.
  • Promoters that target muscle-specific stem cells may include the CK8 promoter, the Spc5-12 promoter, and the MHCK7 promoter.
  • the term “recombinant” when used with reference to, for example, a cell, nucleic acid, protein, or vector indicates that the cell, nucleic acid, protein, or vector, has been modified by the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic acid or protein, or that the cell is derived from a cell so modified.
  • recombinant cells express genes that are not found within the native (naturally occurring) form of the cell or express a second copy of a native gene that is otherwise normally or abnormally expressed, under expressed, or not expressed at all.
  • shRNA stands for short hairpin RNA or small hairpin RNA.
  • a shRNA is an artificial RNA molecule with a tight hairpin turn that can be used to silence target gene expression via RNA interference (RNAi). Expression of shRNA in cells may be facilitated by delivery of plasmids or viral or bacterial vectors. The shRNA is processed by Dicer into siRNA.
  • siRNA stands for small interfering RNA siRNA, sometimes also known as short interfering RNA or silencing RNA.
  • siRNA is a class of double-stranded RNA molecule. The siRNA may be natural or artificial. The siRNA forms a complex with the RNA-induced silencing complex (RISC).
  • RISC RNA-induced silencing complex
  • sample or “test sample” as used herein can mean any sample in which the presence and/or level of a target is to be detected or determined or any sample comprising a DNA targeting or gene editing system or component thereof as detailed herein. Samples may include liquids, solutions, emulsions, or suspensions. Samples may include a medical sample.
  • Samples may include any biological fluid or tissue, such as blood, whole blood, fractions of blood such as plasma and serum, muscle, interstitial fluid, sweat, saliva, urine, tears, synovial fluid, bone marrow, cerebrospinal fluid, nasal secretions, sputum, amniotic fluid, bronchoalveolar lavage fluid, gastric lavage, emesis, fecal matter, lung tissue, peripheral blood mononuclear cells, total white blood cells, lymph node cells, spleen cells, tonsil cells, cancer cells, tumor cells, bile, digestive fluid, skin, or combinations thereof.
  • the sample comprises an aliquot.
  • the sample comprises a biological fluid. Samples can be obtained by any means known in the art.
  • the sample can be used directly as obtained from a patient or can be pre-treated, such as by filtration, distillation, extraction, concentration, centrifugation, inactivation of interfering components, addition of reagents, and the like, to modify the character of the sample in some manner as discussed herein or otherwise as is known in the art.
  • the subject may be a human or a non-human.
  • the subject may be a vertebrate.
  • the subject may be a mammal.
  • the mammal may be a primate or a non- primate.
  • the mammal can be a non-primate such as, for example, cow, pig, camel, llama, hedgehog, anteater, platypus, elephant, alpaca, horse, goat, rabbit, sheep, hamster, guinea pig, cat, dog, rat, and mouse.
  • the mammal can be a primate such as a human.
  • the mammal can be a non-human primate such as, for example, monkey, cynomolgous monkey, rhesus monkey, chimpanzee, gorilla, orangutan, and gibbon.
  • the subject may be of any age or stage of development, such as, for example, an adult, an adolescent, a child, such as age 0-2, 2-4, 2-6, or 6-12 years, or an infant, such as age 0-1 years.
  • the subject may be male.
  • the subject may be female.
  • the subject has a specific genetic marker.
  • the subject may be undergoing other forms of treatment.
  • the subject may have a disease or condition.
  • the subject has cancer.
  • the subject is human.
  • “Substantially identical” can mean that a first and second amino acid or polynucleotide sequence are at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99%, or less than 100% over a region of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100 amino acids or nucleotides, respectively.
  • Target gene refers to any nucleotide sequence encoding a known or putative gene product.
  • the target gene may be a mutated gene involved in a genetic disease.
  • the target gene may encode a known or putative gene product that is intended to be corrected or for which its expression is intended to be modulated.
  • the target gene is a gene detailed herein as a modulator of T cells.
  • “Target region” as used herein refers to the region of the target gene to which the CRISPR/Cas9-based gene editing or targeting system is designed to bind.
  • T cells are a type of white blood cell of the immune system and play a central role in the adaptive immune response.
  • T cells express a T-cell receptor (TCR) on their cell surface.
  • T cell receptor (TCR) of a T cell is able to interact with immunogenic peptides (epitopes) bound to major histocompatibility complex (MHC) molecules and presented on the surface of target cells. Specific binding of the TCR triggers a signal cascade inside the T cell leading to proliferation and differentiation into a maturated effector T cell.
  • T cells may differentiate into different types of T cells. T cells may include, for example, CD8+ T cells (“killer T cells” or “cytotoxic T cells) and CD4+ T cells (“helper T cells”).
  • CD8+ T cells and CD4+ T cells may further differentiate into other types of T cells including, for example, regulatory T cells (“suppressor T cells”) and memory T cells.
  • the T cell is a memory T cell.
  • An antigen-na ⁇ ve T cell expands and differentiates into a memory T cell after encountering the cognate antigen within the context of a major histocompatibility complex (MHC) molecule on the surface of an antigen presenting cell.
  • MHC major histocompatibility complex
  • Memory T cells may be CD8+ or CD4+. Memory T cells are long-lived and can quickly expand to large numbers of effector T cells upon re-exposure to their cognate antigen.
  • Tissue-resident memory T cells are a subset of a long-lived memory T cells that occupy epithelial, mucosal, and other tissues such as skin, mucosa, lung, brain, pancreas, and gastrointestinal tract, without recirculating.
  • T RM cells may be transcriptionally, phenotypically, and functionally different from central memory (T CM ) and effector memory (T EM ) T cells that recirculate between blood, the T cell zones of secondary lymphoid organ, lymph tissues, and nonlymphoid tissues.
  • T RM cells can develop from circulating effector memory T cell precursors in response to an antigen.
  • TRM cells may be CD103+.
  • TRM cells may provide superior protection against infection in extralymphoid tissues.
  • a T cell detailed herein may be a T RM cell.
  • Transgene refers to a gene or genetic material containing a gene sequence that has been isolated from one organism and is introduced into a different organism. This non-native segment of DNA may retain the ability to produce RNA or protein in the transgenic organism, or it may alter the normal function of the transgenic organism's genetic code. The introduction of a transgene has the potential to change the phenotype of an organism.
  • Transcriptional regulatory elements or “regulatory elements” refers to a genetic element which can control the expression of nucleic acid sequences, such as activate, enhancer, or decrease expression, or alter the spatial and/or temporal expression of a nucleic acid sequence.
  • regulatory elements include, for example, promoters, enhancers, splicing signals, polyadenylation signals, and termination signals.
  • a regulatory element can be “endogenous,” “exogenous,” or “heterologous” with respect to the gene to which it is operably linked.
  • An “endogenous” regulatory element is one which is naturally linked with a given gene in the genome.
  • an “exogenous” or “heterologous” regulatory element is one which is not normally linked with a given gene but is placed in operable linkage with a gene by genetic manipulation.
  • “Treatment” or “treating” or “therapy” when referring to protection of a subject from a disease means suppressing, repressing, reversing, alleviating, ameliorating, or inhibiting the progress of disease, or completely eliminating a disease.
  • a treatment may be either performed in an acute or chronic way. The term also refers to reducing the severity of a disease or symptoms associated with such disease prior to affliction with the disease. Treatment may result in a reduction in the incidence, frequency, severity, and/or duration of symptoms of the disease.
  • Preventing the disease involves administering a composition of the present invention to a subject prior to onset of the disease.
  • Suppressing the disease involves administering a composition of the present invention to a subject after induction of the disease but before its clinical appearance.
  • Repressing or ameliorating the disease involves administering a composition of the present invention to a subject after clinical appearance of the disease.
  • the term “gene therapy” refers to a method of treating a patient wherein polypeptides or nucleic acid sequences are transferred into cells of a patient such that activity and/or the expression of a particular gene is modulated. In certain embodiments, the expression of the gene is suppressed. In certain embodiments, the expression of the gene is enhanced.
  • the temporal or spatial pattern of the expression of the gene is modulated.
  • “Variant” used herein with respect to a polynucleotide means (i) a portion or fragment of a referenced nucleotide sequence; (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequence substantially identical thereto.
  • a variant can be a polynucleotide sequence that is substantially identical over the full length of the full polynucleotide sequence or a fragment thereof.
  • the polynucleotide sequence can be 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, or less than 100% identical over the full length of the polynucleotide sequence or a fragment thereof.
  • “Variant” with respect to a peptide or polypeptide that differs in amino acid sequence by the insertion, deletion, or conservative substitution of amino acids, but retain at least one biological activity.
  • Variant may also mean a protein with an amino acid sequence that is substantially identical to a referenced protein with an amino acid sequence that retains at least one biological activity.
  • biological activity include the ability to be bound by a specific antibody or polypeptide or to promote an immune response.
  • Variant can mean a functional fragment thereof.
  • Variant can also mean multiple copies of a polypeptide. The multiple copies can be in tandem or separated by a linker. A conservative substitution of an amino acid, for example, replacing an amino acid with a different amino acid of similar properties (for example, hydrophilicity, degree and distribution of charged regions) is recognized in the art as typically involving a minor change.
  • hydropathic index of amino acids is based on a consideration of its hydrophobicity and charge. It is known in the art that amino acids of similar hydropathic indexes may be substituted and still retain protein function. In one aspect, amino acids having hydropathic indexes of ⁇ 2 are substituted.
  • the hydrophilicity of amino acids may also be used to reveal substitutions that would result in proteins retaining biological function. A consideration of the hydrophilicity of amino acids in the context of a peptide permits calculation of the greatest local average hydrophilicity of that peptide.
  • Substitutions may be performed with amino acids having hydrophilicity values within ⁇ 2 of each other. Both the hydrophobicity index and the hydrophilicity value of amino acids are influenced by the particular side chain of that amino acid. Consistent with that observation, amino acid substitutions that are compatible with biological function are understood to depend on the relative similarity of the amino acids, and particularly the side chains of those amino acids, as revealed by the hydrophobicity, hydrophilicity, charge, size, and other properties.
  • a variant can be an amino acid sequence that is substantially identical over the full length of the amino acid sequence or fragment thereof.
  • the amino acid sequence can be 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, or less than 100% identical over the full length of the amino acid sequence or a fragment thereof.
  • Vector as used herein means a nucleic acid sequence containing an origin of replication.
  • a vector may be capable of directing the delivery or transfer of a polynucleotide sequence to target cells, where it can be replicated or expressed.
  • a vector may contain an origin of replication, one or more regulatory elements, and/or one or more coding sequences.
  • a vector may be a viral vector, bacteriophage, bacterial artificial chromosome, plasmid, cosmid, or yeast artificial chromosome.
  • a vector may be a DNA or RNA vector.
  • a vector may be a self-replicating extrachromosomal vector.
  • Viral vectors include, but are not limited to, adenovirus vector, adeno-associated virus (AAV) vector, retrovirus vector, or lentivirus vector.
  • a vector may be an adeno-associated virus (AAV) vector.
  • the vector may encode a Cas9 protein and at least one gRNA molecule.
  • any nomenclatures used in connection with, and techniques of, cell and tissue culture, molecular biology, immunology, microbiology, genetics, and protein and nucleic acid chemistry and hybridization described herein are those that are well known and commonly used in the art. The meaning and scope of the terms should be clear; in the event however of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. 2.
  • Modulators of T Cells [00086] Provided herein are modulators of T cells. Modifying or modulating may include increasing or decreasing, for example.
  • the compositions and methods comprise an agent that increases T cells.
  • Increasing T cells may include increasing the number of T cells and/or increasing the number of memory T cells and/or increasing the lifetime of a T cell and/or preventing T cell exhaustion and/or reducing T cell exhaustion and/or reversing T cell exhaustion and/or enhancing the therapeutic potential of T cells.
  • Modifying a T cell may include modifying the expression of a target gene within the T cell. Modifying a T cell may include delivery of a gene to T cells.
  • Modifying a T cell may include enhancing or increasing specific features of memory T cells, countering T cell exhaustion, reversing T cell exhaustion, inhibiting or preventing T cell exhaustion, improving tumor control, increasing T cell biodistribution, enhancing T cell infiltration into tissues, increasing infiltration of T cells into tissues, increasing T cell durability, engineering T cells with enhanced durability and therapeutic potential, increasing T cells, increasing T cell numbers, increasing memory T cells, increasing memory T cell numbers, or a combination thereof.
  • the compositions and methods detailed herein may engineer or modify the gene expression programs within T cells by engineering the T cells directly.
  • the compositions and methods comprise an agent that increases expression or activity of CD103 or IL7Ra, or a combination thereof, in T cells.
  • IL7Ra may comprise an amino acid sequence of SEQ ID NO: 193, encoded by a polynucleotide comprising the sequence of SEQ ID NO: 194.
  • CD103 may comprise an amino acid sequence of SEQ ID NO: 195, encoded by a polynucleotide comprising the sequence of SEQ ID NO: 196.
  • Expression or activity of CD103 or IL7Ra, or a combination thereof may each be increased by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • Expression or activity of CD103 or IL7Ra, or a combination thereof may each be increased by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • Expression or activity of CD103 or IL7Ra, or a combination thereof may each be increased by about 5- 95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • compositions and methods comprise an agent that decreases expression or activity of CCR7 and/or TOX in T cells.
  • Expression of a marker, such as IL7Ra or CD103 may be done by any suitable means in the art, including, for example, ELISA, immunohistochemistry, flow cytometry, FACS, DNA or RNA sequencing, and hybridization of reporters or probes to RNA transcripts.
  • the modulator of T cells may target a gene or a regulatory element thereof. Regulatory elements include, for example, promoters and enhancers. Regulatory elements may be within 1000 base pairs of the transcription start site. Regulatory elements may be within 600 base pairs of the transcription start site.
  • the agent, or the composition or the method comprising the agent may modify the expression of a gene.
  • the agent, or the composition or the method comprising the agent may reduce, inhibit, decrease, activate, increase, or enhance the expression or activity of a gene or its gene protein product.
  • the agent, or the composition or the method comprising the agent may directly or indirectly modulate the activity of the gene’s protein product.
  • the modulator of T cells may increase or decrease the binding or enzymatic activity of the gene’s protein product, or inhibit the binding of the gene’s protein product to another molecule or ligand, or increase the binding of the gene’s protein product to another molecule or ligand, or increase or decrease the degradation of the gene’s protein product, or a combination thereof.
  • the modulator of T cells may be an activator of a gene.
  • the activator may comprise or be the gene or coding sequence thereof itself.
  • an activator of a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a regulatory element thereof, or a region thereof, or a combination thereof.
  • the gene may exist in several isoforms.
  • the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a regulatory element thereof, or a region thereof, or a combination thereof.
  • the gene may encode a transcription factor.
  • an activator of a transcription factor selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a regulatory element thereof, or a region thereof, or a combination thereof.
  • an agent may activate or enhance expression or activity of a gene or gene protein product to increase T cells.
  • the activator may increase the level of polynucleotide encoding the gene or encoding the gene product.
  • the activator may increase the level of transcription of a polynucleotide encoding the gene or encoding the gene product.
  • the activator may increase the level of translation of a mRNA encoding the gene product.
  • the activator may increase the expression of protein encoded by the gene or the open reading frame thereof.
  • the activator may increase the level or amount of protein expressed from the gene.
  • the activator may increase the level or amount of protein expressed from the open reading frame of the gene.
  • the agent may comprise, for example, a polynucleotide, a polypeptide, a small molecule, a lipid, a carbohydrate, or a combination thereof.
  • the agent comprises a polynucleotide.
  • the agent may comprise a polynucleotide encoding the gene or a fragment thereof.
  • the agent may comprise a polynucleotide comprising a cDNA of the gene or a fragment thereof.
  • the polynucleotide may comprise a sequence selected from SEQ ID NOs: 75-97, or a fragment thereof.
  • the polynucleotide may comprise a sequence having at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%, or at least 98%, or greater identity to a sequence selected from SEQ ID NOs: 75- 97, or a fragment thereof.
  • the polynucleotide may comprise a sequence having one, two, three, four, five or more changes selected from nucleotide substitutions, insertions, or deletions, relative to a sequence selected from SEQ ID NOs: 75-97, or a fragment thereof.
  • the agent comprises a protein.
  • the agent may comprise a polypeptide comprising a protein product of the gene or a fragment thereof.
  • the polypeptide may comprise a sequence selected from SEQ ID NOs: 98-120, or a fragment thereof.
  • the polypeptide may comprise a sequence having at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%, or at least 98%, or greater identity to a sequence selected from SEQ ID NOs: 98-120, or a fragment thereof.
  • the polypeptide may comprise a sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to a sequence selected from SEQ ID NOs: 98-120, or a fragment thereof.
  • the agent comprises a DNA targeting composition as detailed herein or at least one component thereof. [00092] Examples of genes for modulating T cells are shown in TABLE 1 and TABLE 2.
  • T cells may be modulated by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • T cells may be modulated by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • T cells may be modulated by about 5- 95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • T cells may be reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • T cells may be reduced by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • T cells may be reduced by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • T cells may be increased by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • T cells may be increased by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • T cells may be increased by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • the modulator of T cells is administered with or as a therapy such as cancer therapy.
  • the modulator of T cells is used to modify a T cell that is then or later administered with or used as a cancer therapy.
  • a T cell modified by an activator as detailed herein The T cell may be modified in vitro or ex vivo or in vivo.
  • a T cell modified by an activator as detailed herein may be administered to a subject.
  • the cancer therapy may include chemotherapy or immunotherapy.
  • the cancer therapy may include adoptive T cell therapy (ACT) therapy.
  • the cancer therapy may include a chimeric antigen receptor (CAR).
  • a modulator of T cells as detailed herein may be used to modify a CAR T cell to generate a modified or engineered CAR T cell.
  • a T cell modified by an activator as detailed herein, which is then or later modified to generate a CAR T cell.
  • a chimeric antigen receptor may also be known as chimeric immunoreceptor, chimeric T cell receptor, or artificial T cell receptor.
  • CARs are receptor proteins that have been engineered to give T cells the new ability to target a specific antigen.
  • CARs are chimeric in that they may combine both antigen-binding and T cell activating functions into a single receptor.
  • CARs may include an antigen binding domain specific for an antigen on a cancer cell. The premise of CAR T immunotherapy is to modify T cells to recognize cancer cells in order to target and destroy them.
  • T cells are harvested from a subject, the T cells are genetically altered to add a chimeric antigen receptor (CAR) that specifically recognizes cancer cells, and the resulting CAR T cells may be administered to the subject to attack their tumors.
  • CAR T cell therapy and modification to T cells are described in, for example, WO2012/079000 and WO2012/129514 and WO2018/005712, each of which is incorporated herein by reference in its entirety.
  • the cancer therapy may include a T cell receptor (TCR) therapy.
  • TCR T cell receptor
  • a modulator of T cells as detailed herein may be used to modify a T cell to generate a modified or engineered T cell for use in TCR therapy.
  • TCR therapy involves the modification of T cells.
  • TCR therapy capitalizes on the natural mechanisms of T cells.
  • solid tumors lack cell surface antigens to be recognized by typical CARs.
  • Intracellular antigens from solid tumors may be presented by HLA molecules as HLA-antigen epitope complexes on the cell surface, which may be specifically recognized by TCRs on the surface of tumor-specific T cells, thereby eliciting anti-tumor cytotoxicity by the tumor-specific T cells.
  • TCR therapies may include transformed T cells expressing at least one vector encoding a TCR that is capable of binding to an HLA molecule and antigen on tumor cells.
  • the modulator of T cells, or the resulting modified T cell is administered concurrently with a cancer therapy, or subsequent to a cancer therapy, or prior to a cancer therapy, or as a cancer therapy.
  • the agent comprises a DNA targeting composition or at least one component thereof.
  • a “DNA Targeting System” as used herein is a system capable of specifically targeting a particular region of DNA and modulating gene expression by binding to that region.
  • Non-limiting examples of these systems are CRISPR-Cas-based systems, meganucleases, zinc finger (ZF)-based systems, and/or transcription activator-like effector (TALE)-based systems.
  • the DNA Targeting System may be a nuclease system that acts through mutating or editing the target region (such as by insertion, deletion or substitution) or it may be a system that delivers a functional second polypeptide domain, such as an activator or repressor, to the target region.
  • Each of these systems comprises a DNA-binding portion or domain, such as a Cas protein and guide RNA, or a meganuclease, or a ZF, or a TALE, that specifically recognizes and binds to a particular target region of a target DNA.
  • the DNA-binding portion (for example, Cas protein, ZF, or TALE) can be linked to a second protein domain, such as a polypeptide with transcription activation activity, transcription repression activity, transcription release factor activity, histone modification activity, nuclease activity, nucleic acid association activity, methylase activity, demethylase activity, acetylation activity, or deacetylation activity, to form a fusion protein.
  • the DNA-binding portion can be linked to an activator and thus guide the activator to a specific target region of the target DNA.
  • the DNA-binding portion can be linked to a repressor and thus guide the repressor to a specific target region of the target DNA.
  • the DNA targeting composition comprises a meganuclease.
  • a meganuclease is an endodeoxyribonuclease characterized by a large recognition site, such as double-stranded DNA sequences of 12 to 40 base pairs. The recognition site may occur only once in any given genome.
  • a meganuclease may be a homing endonuclease selected from an intron endonuclease or an intein endonuclease. Meganucleases may include, for example, the LAGLIDADG family of homing endonucleases.
  • the DNA-binding portion comprises a Cas protein, such as a Cas9 protein.
  • a nuclease-null Cas9 can act as a repressor on its own, a nuclease-active Cas9 can act as a repressor on its own, or a nuclease-active Cas9 can act as an activator when paired with an inactive (dead) guide RNA.
  • RNA or DNA that hybridizes to a particular target region of the target DNA can be directly linked (covalently or non-covalently) to an activator or repressor.
  • Some CRISPR-Cas-based systems can operate to activate or repress expression using the Cas protein linked to a second protein domain, such as, for example, an activator or repressor.
  • the DNA Targeting System may include a DNA binding protein.
  • the DNA binding protein may comprise, for example, a zinc finger protein or a transcription activator- like effector (TALE).
  • TALE transcription activator- like effector
  • the zinc finger protein or TALE may target a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a regulatory element thereof.
  • a zinc finger protein is a protein that includes one or more zinc finger domains.
  • Zinc finger domains are relatively small protein motifs that contain multiple finger-like protrusions that make tandem contacts with their target molecule such as a DNA target molecule.
  • a zinc finger domain may bind one or more zinc ions or other metal ion such as iron, or in some cases a zinc finger domain forms salt bridges to stabilize the finger-like folds.
  • the zinc binding portion of a zinc finger protein may include one or more cysteine residues and/or one or more histidine residues to coordinate the zinc or other metal ion.
  • a zinc finger protein recognizes and binds to a particular DNA sequence via the zinc finger domain.
  • a zinc finger protein is fused to or includes a nuclease domain and may be referred to as a zinc finger nuclease (ZFN).
  • the nuclease domain may include, for example, the endonuclease FokI.
  • ZFNs may recognize target sites that consist of two zinc-finger binding sites that flank a 5- to 7-base pair (bp) spacer sequence recognized by the endonuclease FokI cleavage domain.
  • TALE Transcription Activator-like Effector
  • a TALE is another type of protein that recognizes and binds to a particular DNA sequence.
  • the DNA-binding domain of a TALE includes an array of tandem 33-35 amino acid repeats, also known as RVD modules. Each RVD module specifically recognizes a single base pair of DNA. RVD modules may be arranged in any order to assemble an array that recognizes a defined DNA sequence.
  • the binding specificity of a TALE DNA-binding domain is determined by the RVD array followed by a single truncated repeat of, for example, 20 amino acids.
  • a TALE DNA-binding domain may have an array of 12 to 27 RVD modules, each RVD module recognizing a single base pair of DNA. Specific RVDs have been identified that recognize each of the four possible DNA nucleotides (A, T, C, and G). Because the TALE DNA-binding domains are modular, repeats that recognize the four different DNA nucleotides may be linked together to recognize any particular DNA sequence. These targeted DNA-binding domains may then be combined with catalytic domains to create functional enzymes, including artificial transcription factors and/or nucleases.
  • a TALE is fused to or includes a nuclease domain and may be referred to as a TALE nuclease (TALEN).
  • the nuclease domain may include, for example, the endonuclease FokI.
  • TALENs may recognize target sites that consist of two TALE DNA- binding sites that flank a 12-bp to 20-bp spacer sequence recognized by the FokI cleavage domain.
  • TALENs may recognize target sites that consist of two TALE DNA- binding sites that flank a 12-bp to 20-bp spacer sequence recognized by the FokI cleavage domain.
  • a zinc finger protein or TALE can be fused to a polypeptide domain and referred to as a DNA binding fusion protein or fusion protein.
  • the fusion protein may act as a synthetic transcription factor.
  • the fusion protein comprises two heterologous polypeptide domains, including a first polypeptide domain comprising the zinc finger protein or the TALE or a Cas9 protein as further detailed below, and a second polypeptide domain having an activity selected from transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, methylase activity, and demethylase activity.
  • a zinc finger protein or TALE can be fused to a polypeptide domain having epigenetic modifying activity to mediate targeted gene regulation.
  • a fusion protein comprising a zinc finger protein or TALE, and a second polypeptide domain having transcription repression activity may mediate targeted gene repression.
  • a fusion protein comprising a zinc finger protein or TALE, and a second polypeptide domain having transcription activation activity, may mediate targeted gene activation.
  • the second polypeptide domain is further detailed below (see “Cas Fusion Protein”).
  • CRISPR/Cas-based Gene Editing System Provided herein are CRISPR/Cas-based gene editing systems.
  • the CRISPR/Cas-based gene editing system may be used to modulate T cells and/or enhance ACT.
  • the CRISPR/Cas-based gene editing system may include a Cas protein or a fusion protein, and at least one gRNA, and may also be referred to as a “CRISPR-Cas system.”
  • CRISPR-Cas system may include a Cas protein or a fusion protein, and at least one gRNA, and may also be referred to as a “CRISPR-Cas system.”
  • CRISPR-Cas system refers to loci containing multiple short direct repeats that are found in the genomes of approximately 40% of sequenced bacteria and 90% of sequenced archaea.
  • the CRISPR system is a microbial nuclease system involved in defense against invading phages and plasmids that provides a form of acquired immunity.
  • the CRISPR loci in microbial hosts contain a combination of CRISPR-associated (Cas) genes as well as non- coding RNA elements capable of programming the specificity of the CRISPR-mediated nucleic acid cleavage.
  • Short segments of foreign DNA, called spacers are incorporated into the genome between CRISPR repeats, and serve as a “memory” of past exposures.
  • Cas proteins include, for example, Cas12 such as Cas12a, Cas9, Cas13, Cascade proteins, and IscB/TnpB proteins.
  • Cas12a may also be referred to as “Cpf1.” Cas12a causes a staggered cut in double stranded DNA, while Cas9 produces a blunt cut.
  • the Cas protein comprises Cas12a.
  • Cas12a is described in, for example, WO 2018/017754, which is incorporated herein by reference.
  • Cas13 is an RNA-guided RNA endonuclease. Cas13 cleaves single-stranded RNA, and it does not cleave DNA.
  • the Cas protein comprises Cas13. In some embodiments, the Cas protein comprises Cas9.
  • Cas9 forms a complex with the 3’ end of the sgRNA (which may be referred interchangeably herein as “gRNA”), and the protein-RNA pair recognizes its genomic target by complementary base pairing between the 5’ end of the gRNA sequence and a predefined 20 bp DNA sequence, known as the protospacer.
  • This complex is directed to homologous loci of pathogen DNA via regions encoded within the crRNA, i.e., the protospacers, and protospacer-adjacent motifs (PAMs) within the pathogen genome.
  • the non-coding CRISPR array is transcribed and cleaved within direct repeats into short crRNAs containing individual spacer sequences, which direct Cas nucleases to the target site (protospacer).
  • CRISPR spacers are used to recognize and silence exogenous genetic elements in a manner analogous to RNAi in eukaryotic organisms.
  • CRISPR systems Three classes of CRISPR systems (Types I, II, and III effector systems) are known.
  • the Type II effector system carries out targeted DNA double-strand break in four sequential steps, using a single effector enzyme, Cas9, to cleave dsDNA.
  • Cas9 a single effector enzyme
  • the Type II effector system may function in alternative contexts such as eukaryotic cells.
  • the Type II effector system consists of a long pre ⁇ crRNA, which is transcribed from the spacer ⁇ containing CRISPR locus, the Cas9 protein, and a tracrRNA, which is involved in pre-crRNA processing.
  • the tracrRNAs hybridize to the repeat regions separating the spacers of the pre ⁇ crRNA, thus initiating dsRNA cleavage by endogenous RNase III. This cleavage is followed by a second cleavage event within each spacer by Cas9, producing mature crRNAs that remain associated with the tracrRNA and Cas9, forming a Cas9:crRNA- tracrRNA complex.
  • Cas12a systems include crRNA for successful targeting, whereas Cas9 systems include both crRNA and tracrRNA.
  • the Cas9:crRNA-tracrRNA complex unwinds the DNA duplex and searches for sequences matching the crRNA to cleave. Target recognition occurs upon detection of complementarity between a “protospacer” sequence in the target DNA and the remaining spacer sequence in the crRNA. Cas9 mediates cleavage of target DNA if a correct protospacer-adjacent motif (PAM) is also present at the 3’ end of the protospacer.
  • PAM protospacer-adjacent motif
  • the sequence must be immediately followed by the protospacer- adjacent motif (PAM), a short sequence recognized by the Cas9 nuclease that is required for DNA cleavage.
  • PAM protospacer- adjacent motif
  • Cas12a may function with PAM sequences rich in thymine “T.”
  • T thymine
  • the Cas9 protein was directed to genomic target sites by a synthetically reconstituted “guide RNA” (“gRNA”, also used interchangeably herein as a chimeric single guide RNA (“sgRNA”)), which is a crRNA- tracrRNA fusion that obviates the need for RNase III and crRNA processing in general.
  • gRNA guide RNA
  • sgRNA chimeric single guide RNA
  • CRISPR/Cas9-based engineered systems for use in gene editing and treating genetic diseases.
  • the CRISPR/Cas9-based engineered systems can be designed to target any gene, including genes involved in, for example, a genetic disease, aging, tissue regeneration, or wound healing.
  • the CRISPR/Cas9-based gene editing system can include a Cas9 protein or a Cas9 fusion protein.
  • Cas9 protein is an endonuclease that cleaves nucleic acid and is encoded by the CRISPR loci and is involved in the Type II CRISPR system.
  • the Cas9 protein can be from any bacterial or archaea species, including, but not limited to, Streptococcus pyogenes, Staphylococcus aureus (S.
  • the Cas9 molecule is a Streptococcus pyogenes Cas9 molecule (also referred herein as “SpCas9”).
  • SpCas9 may comprise an amino acid sequence of SEQ ID NO: 26.
  • the Cas9 molecule is a Staphylococcus aureus Cas9 molecule (also referred herein as “SaCas9”).
  • SaCas9 may comprise an amino acid sequence of SEQ ID NO: 27.
  • a Cas9 molecule or a Cas9 fusion protein can interact with one or more gRNA molecule(s) and, in concert with the gRNA molecule(s), can localize to a site which comprises a target domain, and in certain embodiments, a PAM sequence.
  • the Cas9 protein forms a complex with the 3’ end of a gRNA.
  • the ability of a Cas9 molecule or a Cas9 fusion protein to recognize a PAM sequence can be determined, for example, by using a transformation assay as known in the art.
  • the specificity of the CRISPR-based system may depend on two factors: the target sequence and the protospacer-adjacent motif (PAM).
  • the target sequence is located on the 5’ end of the gRNA and is designed to bond with base pairs on the host DNA at the correct DNA sequence known as the protospacer.
  • the Cas9 protein can be directed to new genomic targets.
  • the PAM sequence is located on the DNA to be altered and is recognized by a Cas9 protein.
  • PAM recognition sequences of the Cas9 protein can be species specific.
  • the ability of a Cas9 molecule or a Cas9 fusion protein to interact with and cleave a target nucleic acid is PAM sequence dependent.
  • a PAM sequence is a sequence in the target nucleic acid.
  • cleavage of the target nucleic acid occurs upstream from the PAM sequence.
  • Cas9 molecules from different bacterial species can recognize different sequence motifs (for example, PAM sequences).
  • a Cas9 molecule of S. pyogenes may recognize the PAM sequence of NRG (5’-NRG-3’, where R is any nucleotide residue, and in some embodiments, R is either A or G, SEQ ID NO: 1).
  • a Cas9 molecule of S. pyogenes may naturally prefer and recognize the sequence motif NGG (SEQ ID NO: 2) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from that sequence.
  • a Cas9 molecule of S. pyogenes accepts other PAM sequences, such as NAG (SEQ ID NO: 3) in engineered systems (Hsu et al., Nature Biotechnology 2013 doi:10.1038/nbt.2647, incorporated herein by reference).
  • NNGRRV N or G
  • V A or C or G
  • SEQ ID NO: 10 A Cas9 molecule derived from Neisseria meningitidis
  • NmCas9 normally has a native PAM of NNNNGATT (SEQ ID NO: 11), but may have activity across a variety of PAMs, including a highly degenerate NNNNGNNN PAM (SEQ ID NO: 12) (Esvelt et al. Nature Methods 2013 doi:10.1038/nmeth.2681, incorporated herein by reference).
  • N can be any nucleotide residue, for example, any of A, G, C, or T.
  • Cas9 molecules can be engineered to alter the PAM specificity of the Cas9 molecule.
  • the Cas9 protein is a Cas9 protein of S.
  • N can be any nucleotide residue, for example, any of A, G, C, or T.
  • a nucleic acid encoding a Cas9 molecule or Cas9 polypeptide may comprise a nuclear localization sequence (NLS).
  • the at least one Cas9 molecule is a mutant Cas9 molecule.
  • the Cas9 protein can be mutated so that the nuclease activity is inactivated.
  • An inactivated Cas9 protein (“iCas9”, also referred to as “dCas9”) with no endonuclease activity has been targeted to genes in bacteria, yeast, and human cells by gRNAs to silence gene expression through steric hindrance. Exemplary mutations with reference to the S.
  • a S. pyogenes Cas9 sequence to inactivate the nuclease activity include: D10A, E762A, H840A, N854A, N863A and/or D986A.
  • a S. pyogenes Cas9 protein with the D10A mutation may comprise an amino acid sequence of SEQ ID NO: 28.
  • a S. pyogenes Cas9 protein with D10A and H849A mutations may comprise an amino acid sequence of SEQ ID NO: 29.
  • Exemplary mutations with reference to the S. aureus Cas9 sequence to inactivate the nuclease activity include D10A and N580A.
  • the mutant S. aureus Cas9 molecule comprises a D10A mutation.
  • the nucleotide sequence encoding this mutant S. aureus Cas9 is set forth in SEQ ID NO: 30.
  • the mutant S. aureus Cas9 molecule comprises a N580A mutation.
  • the nucleotide sequence encoding this mutant S. aureus Cas9 molecule is set forth in SEQ ID NO: 31.
  • the Cas9 protein is a VQR variant.
  • the VQR variant of Cas9 is a mutant with a different PAM recognition, as detailed in Kleinstiver, et al. (Nature 2015, 523, 481–485, incorporated herein by reference).
  • a polynucleotide encoding a Cas9 molecule can be a synthetic polynucleotide.
  • the synthetic polynucleotide can be chemically modified.
  • the synthetic polynucleotide can be codon optimized, for example, at least one non-common codon or less-common codon has been replaced by a common codon.
  • the synthetic polynucleotide can direct the synthesis of an optimized messenger mRNA, for example, optimized for expression in a mammalian expression system, as described herein.
  • An exemplary codon optimized nucleic acid sequence encoding a Cas9 molecule of S. pyogenes is set forth in SEQ ID NO: 32.
  • Exemplary codon optimized nucleic acid sequences encoding a Cas9 molecule of S. aureus, and optionally containing nuclear localization sequences (NLSs), are set forth in SEQ ID NOs: 33-39.
  • Another exemplary codon optimized nucleic acid sequence encoding a Cas9 molecule of S. aureus comprises the nucleotides 1293-4451 of SEQ ID NO: 40.
  • Cas Fusion Protein [000119]
  • the CRISPR/Cas-based gene editing system can include a fusion protein.
  • the fusion protein can comprise two heterologous polypeptide domains.
  • the first polypeptide domain comprises a Cas protein or a mutated Cas protein.
  • the first polypeptide domain is fused to at least one second polypeptide domain.
  • the second polypeptide domain has a different activity that what is endogenous to Cas protein.
  • the second polypeptide domain may have any DNA editing activity.
  • the second polypeptide domain may have an activity such as transcription activation activity, transcription repression activity, transcription release factor activity, histone modification activity, nuclease activity, nucleic acid association activity, histone methylase activity, DNA methylase activity, histone demethylase activity, DNA demethylase activity, acetylation activity, and/or deacetylation activity.
  • the activity of the second polypeptide domain may be direct or indirect.
  • the second polypeptide domain may have this activity itself (direct), or it may recruit and/or interact with a polypeptide domain that has this activity (indirect). In some embodiments, the second polypeptide domain has transcription activation activity. In some embodiments, the second polypeptide domain has transcription repression activity. In some embodiments, the second polypeptide domain comprises a synthetic transcription factor. The second polypeptide domain may be at the C-terminal end of the first polypeptide domain, or at the N-terminal end of the first polypeptide domain, or a combination thereof.
  • the fusion protein may include one second polypeptide domain. In some embodiments, the fusion protein comprises more than one second polypeptide domain. The fusion protein may include two of the second polypeptide domains.
  • the fusion protein may include a second polypeptide domain at the N-terminal end of the first polypeptide domain as well as a second polypeptide domain at the C-terminal end of the first polypeptide domain.
  • the fusion protein may include a single first polypeptide domain and more than one (for example, two or three) second polypeptide domains in tandem.
  • the linkage from the first polypeptide domain to the second polypeptide domain can be through reversible or irreversible covalent linkage or through a non-covalent linkage, as long as the linker does not interfere with the function of the second polypeptide domain.
  • a Cas polypeptide can be linked to a second polypeptide domain as part of a fusion protein.
  • the fusion protein includes at least one linker.
  • a linker may be included anywhere in the polypeptide sequence of the fusion protein, for example, between the first and second polypeptide domains.
  • a linker may be of any length and design to promote or restrict the mobility of components in the fusion protein.
  • a linker may comprise any amino acid sequence of about 2 to about 100, about 5 to about 80, about 10 to about 60, or about 20 to about 50 amino acids.
  • a linker may comprise an amino acid sequence of at least about 2, 3, 4, 5, 10, 15, 20, 25, or 30 amino acids.
  • a linker may comprise an amino acid sequence of less than about 100, 90, 80, 70, 60, 50, or 40 amino acids.
  • a linker may include sequential or tandem repeats of an amino acid sequence that is 2 to 20 amino acids in length.
  • Linkers may include, for example, a GS linker (Gly-Gly-Gly- Gly-Ser) n , wherein n is an integer between 0 and 10 (SEQ ID NO: 21).
  • n can be adjusted to optimize the linker length and achieve appropriate separation of the functional domains.
  • linkers may include, for example, Gly-Gly-Gly-Gly-Gly-Gly (SEQ ID NO: 22), Gly-Gly-Ala-Gly-Gly (SEQ ID NO: 23), Gly/Ser rich linkers such as Gly-Gly-Gly-Gly- Ser-Ser-Ser (SEQ ID NO: 24), or Gly/Ala rich linkers such as Gly-Gly-Gly-Gly-Ala-Ala-Ala (SEQ ID NO: 25).
  • the agent and/or Cas protein and/or the Cas fusion protein and/or gRNAs detailed herein may be used in compositions and methods for modulating expression of gene.
  • Modulating may include, for example, increasing or enhancing expression of the gene, or reducing or inhibiting expression of the gene.
  • the expression of the gene may be modulated by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • the expression of the gene may be modulated by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • the expression of the gene may be modulated by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • the expression of the gene may be reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • the expression of the gene may be reduced by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • the expression of the gene may be reduced by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • the expression of the gene may be increased by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6- fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • the expression of the gene may be increased by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold, relative to a control.
  • the expression of the gene may be increased by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control.
  • the expression of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1 is increased.
  • the expression of a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1 is increased.
  • the level of a transcription factor protein selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1 is increased.
  • the second polypeptide domain can have transcription activation activity, for example, a transactivation domain.
  • gene expression of endogenous mammalian genes can be achieved by targeting a fusion protein of a first polypeptide domain, such as dCas9, and a transactivation domain to mammalian promoters via combinations of gRNAs.
  • the transactivation domain can include a VP16 protein, multiple VP16 proteins, such as a VP48 domain or VP64 domain, p65 domain of NF kappa B transcription activator activity, TET1, VPR, VPH, Rta, and/or p300.
  • the fusion protein may comprise dCas9-p300.
  • p300 comprises a polypeptide having the amino acid sequence of SEQ ID NO: 41 or SEQ ID NO: 42.
  • the fusion protein comprises dCas9-VP64.
  • the fusion protein comprises VP64-dCas9-VP64.
  • VP64-dCas9-VP64 may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 43, encoded by the polynucleotide of SEQ ID NO: 44.
  • VPH may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 53, encoded by the polynucleotide of SEQ ID NO: 54.
  • VPR may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 55, encoded by the polynucleotide of SEQ ID NO: 56.
  • the second polypeptide domain can have transcription repression activity.
  • Non- limiting examples of repressors include Kruppel associated box activity such as a KRAB domain or KRAB, MECP2, EED, ERF repressor domain (ERD), Mad mSIN3 interaction domain (SID) or Mad-SID repressor domain, SID4X repressor domain, Mxil repressor domain, SUV39H1, SUV39H2, G9A, ESET/SETBD1, Cir4, Su(var)3-9, Pr-SET7/8, SUV4- 20H1, PR-set7, Suv4-20, Set9, EZH2, RIZ1, JMJD2A/JHDM3A, JMJD2B, JMJ2D2C/GASC1, JMJD2D, Rph1, JARID1A/RBP2, JARID1B/PLU-1, JARID1C/SMCX, JARID1D/SMCY, Lid, Jhn2, Jmj2, HDAC1, HDAC2, HDAC3, HDAC8, Rpd3, Hos
  • the second polypeptide domain has a KRAB domain activity, ERF repressor domain activity, Mxil repressor domain activity, SID4X repressor domain activity, Mad-SID repressor domain activity, DNMT3A or DNMT3L or fusion thereof activity, LSD1 histone demethylase activity, or TATA box binding protein activity.
  • the polypeptide domain comprises KRAB.
  • KRAB may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 45, encoded by a polynucleotide comprising the sequence of SEQ ID NO: 46.
  • the fusion protein may be S.
  • the second polypeptide domain can have transcription release factor activity.
  • the second polypeptide domain can have eukaryotic release factor 1 (ERF1) activity or eukaryotic release factor 3 (ERF3) activity.
  • EEF1 eukaryotic release factor 1
  • EEF3 eukaryotic release factor 3
  • the second polypeptide domain can have histone deacetylase, histone acetyltransferase, histone demethylase, or histone methyltransferase activity.
  • the histone acetyltransferase may be p300 or CREB-binding protein (CBP) protein, or fragments thereof.
  • the fusion protein may be dCas9-p300.
  • p300 comprises a polypeptide of SEQ ID NO: 41 or SEQ ID NO: 42.
  • Nuclease Activity [000127]
  • the second polypeptide domain can have nuclease activity that is different from the nuclease activity of the Cas9 protein.
  • a nuclease, or a protein having nuclease activity is an enzyme capable of cleaving the phosphodiester bonds between the nucleotide subunits of nucleic acids.
  • Nucleases are usually further divided into endonucleases and exonucleases, although some of the enzymes may fall in both categories.
  • Well known nucleases include deoxyribonuclease and ribonuclease.
  • the second polypeptide domain includes a meganuclease, as detailed above.
  • the polypeptide domain having nuclease activity comprises FokI.
  • the second polypeptide domain can have nucleic acid association activity or nucleic acid binding protein-DNA-binding domain (DBD).
  • a DBD is an independently folded protein domain that contains at least one motif that recognizes double- or single-stranded DNA.
  • a DBD can recognize a specific DNA sequence (a recognition sequence) or have a general affinity to DNA.
  • a nucleic acid association region may be selected from helix-turn- helix region, leucine zipper region, winged helix region, winged helix-turn-helix region, helix- loop-helix region, immunoglobulin fold, B3 domain, Zinc finger, HMG-box, Wor3 domain, and TAL effector DNA-binding domain.
  • the second polypeptide domain may have base editing activity.
  • Base editing enables the direct, irreversible conversion of a specific DNA base into another base at a targeted genomic locus without requiring double-stranded DNA breaks (DSB).
  • a base editing domain has sequence requirements for activity.
  • the target base may be within 4-8 nucleotides from the PAM-distal end.
  • An exemplary splice acceptor is an “AG” immediately before the exon, and an exemplary splice donor is a “GT” immediately following the exon.
  • Cas9 molecules from different species may use different PAMs, and thereby provide some flexibility in selecting the base to edit.
  • adenine and cytosine base editors may be capable of disrupting an “AG” splice acceptor, converting it to either a “GG” or “AA”, respectively.
  • the base-editing domain includes an adenine base editor (ABE).
  • Adenine base editors may include, for example, ecTadA, including wild-type and mutants thereof.
  • the adenine base editor may be as described in Gaudelli et al. (Nature 2017, 551, 464–471), Koblan et al. (Nature Biotech. 2018, 36, 843–846), Richter et al.
  • the ABE may comprise a polypeptide selected from SEQ ID NOs: 57-64 and/or be encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 65-72, respectively.
  • the base-editing domain includes a cytidine deaminase domain.
  • a cytidine deaminase domain can convert the DNA base cytosine to uracil.
  • the cytidine deaminase domain can include an apolipoprotein B mRNA- editing enzyme, catalytic polypeptide-like (APOBEC) family deaminase.
  • the cytidine deaminase domain can include an APOBEC 1 deaminase, APOBEC2 deaminase, APOBEC3A deaminase, APOBEC3B deaminase, APOBEC3C deaminase, APOBEC3D deaminase, APOBEC3F deaminase, APOBEC3G deaminase, APOBEC3H deaminase, or a combination thereof.
  • APOBEC catalytic polypeptide-like
  • the second polypeptide domain can have methylase activity, which involves transferring a methyl group to DNA, RNA, protein, small molecule, cytosine, or adenine.
  • the second polypeptide domain includes a DNA methyltransferase.
  • Demethylase Activity [000131]
  • the second polypeptide domain can have demethylase activity.
  • the second polypeptide domain can include an enzyme that removes methyl (CH3-) groups from nucleic acids, proteins (in particular histones), and other molecules.
  • the second polypeptide can convert the methyl group to hydroxymethylcytosine in a mechanism for demethylating DNA.
  • the second polypeptide can catalyze this reaction.
  • the second polypeptide that catalyzes this reaction can be Tet1, also known as Tet1CD (Ten- eleven translocation methylcytosine dioxygenase 1; amino acid sequence comprising SEQ ID NO: 51; polynucleotide sequence comprising SEQ ID NO: 52).
  • Tet1CD Teten- eleven translocation methylcytosine dioxygenase 1; amino acid sequence comprising SEQ ID NO: 51; polynucleotide sequence comprising SEQ ID NO: 52).
  • the second polypeptide domain has histone demethylase activity.
  • the second polypeptide domain has DNA demethylase activity.
  • the CRISPR/Cas-based gene editing system includes at least one gRNA molecule.
  • the CRISPR/Cas-based gene editing system may include two gRNA molecules.
  • the at least one gRNA molecule can bind and recognize a target region.
  • the gRNA is the part of the CRISPR-Cas system that provides DNA targeting specificity to the CRISPR/Cas-based gene editing system.
  • the gRNA is a fusion of two noncoding RNAs: a crRNA and a tracrRNA. gRNA mimics the naturally occurring crRNA:tracrRNA duplex involved in the Type II Effector system.
  • This duplex which may include, for example, a 42- nucleotide crRNA and a 75-nucleotide tracrRNA, acts as a guide for the Cas9 to bind, and in some cases, cleave the target nucleic acid.
  • the gRNA may target any desired DNA sequence by exchanging the sequence encoding a 20 bp protospacer which confers targeting specificity through complementary base pairing with the desired DNA target.
  • the “target region” or “target sequence” or “protospacer” refers to the region of the target gene to which the CRISPR/Cas9-based gene editing system targets and binds.
  • the portion of the gRNA that targets the target sequence in the genome may be referred to as the “targeting sequence” or “targeting portion” or “targeting domain.”
  • “Protospacer” or “gRNA spacer” may refer to the region of the target gene to which the CRISPR/Cas9-based gene editing system targets and binds; “protospacer” or “gRNA spacer” may also refer to the portion of the gRNA that is complementary to the targeted sequence in the genome.
  • the gRNA may include a gRNA scaffold.
  • a gRNA scaffold facilitates Cas9 binding to the gRNA and may facilitate endonuclease activity.
  • the gRNA scaffold is a polynucleotide sequence that follows the portion of the gRNA corresponding to sequence that the gRNA targets. Together, the gRNA targeting portion and gRNA scaffold form one polynucleotide.
  • the constant region of the gRNA may include the sequence of SEQ ID NO: 19 (RNA), which is encoded by a sequence comprising SEQ ID NO: 18 (DNA).
  • the CRISPR/Cas9-based gene editing system may include at least one gRNA, wherein the gRNAs target different DNA sequences. The target DNA sequences may be overlapping.
  • the gRNA may comprise at its 5’ end the targeting domain that is sufficiently complementary to the target region to be able to hybridize to, for example, about 10 to about 20 nucleotides of the target region of the target gene, when it is followed by an appropriate Protospacer Adjacent Motif (PAM).
  • PAM Protospacer Adjacent Motif
  • the target region or protospacer is followed by a PAM sequence at the 3’ end of the protospacer in the genome.
  • Different Type II systems have differing PAM requirements, as detailed above.
  • the targeting domain of the gRNA does not need to be perfectly complementary to the target region of the target DNA.
  • the targeting domain of the gRNA is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or at least 99% complementary to (or has 1, 2 or 3 mismatches compared to) the target region over a length of, such as, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides.
  • the DNA-targeting domain of the gRNA may be at least 80% complementary over at least 18 nucleotides of the target region.
  • the target region may be on either strand of the target DNA.
  • the gRNA may target the Cas9 protein or fusion protein to a gene or a regulatory element thereof.
  • the gRNA may target the Cas protein or fusion protein to a non-open chromatin region, an open chromatin region, a transcribed region of the target gene, a region upstream of a transcription start site of the target gene, a regulatory element of the target gene, an intron of the target gene, or an exon of the target gene, or a combination thereof.
  • the gRNA targets the Cas9 protein or fusion protein to a promoter of a gene.
  • the target region is located between about 1 to about 1000 base pairs upstream of a transcription start site of a target gene.
  • the DNA targeting composition comprises two or more gRNAs, each gRNA binding to a different target region.
  • the gRNA may target a region of a gene that modulates T cells.
  • the gRNA may target a region of a gene encoding a transcription factor that modulates T cells.
  • the gRNA may target a region of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, or a regulatory element thereof.
  • the gRNA may target a region of a gene selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof, or a regulatory element thereof.
  • the gRNA may target a region of TGIF2LX, or a regulatory element thereof.
  • the gRNA targets a gene and is used in combination with a Cas9 fusion protein wherein the second polypeptide domain has transcription activation activity, to activate or enhance expression of the gene to increase T cells.
  • the gRNA targets a gene and is used in combination with a Cas9 fusion protein wherein the second polypeptide domain has transcription repression activity, to inhibit or reduce or decrease expression of the gene to increase T cells.
  • the gRNA may comprise a polynucleotide selected from at least one of SEQ ID NOs: 157-192, or a complement thereof, or a variant thereof, or a truncation thereof.
  • the gRNA may be encoded by a polynucleotide sequence comprising at least one of SEQ ID NOs: 121-156, or a complement thereof, or a variant thereof, or a truncation thereof.
  • the gRNA may bind and target a polynucleotide sequence comprising at least one of SEQ ID NOs: 121-156, or a complement thereof, or a variant thereof, or a truncation thereof.
  • a truncation may be 1, 2, 3, 4, 5, 6, 7, 8, or 9 nucleotides shorter than the sequence of any one of SEQ ID NOs: 121-192.
  • the targeting domain of a gRNA molecule may comprise at least a 10 base pair, at least a 11 base pair, at least a 12 base pair, at least a 13 base pair, at least a 14 base pair, at least a 15 base pair, at least a 16 base pair, at least a 17 base pair, at least a 18 base pair, at least a 19 base pair, at least a 20 base pair, at least a 21 base pair, at least a 22 base pair, at least a 23 base pair, at least a 24 base pair, at least a 25 base pair, at least a 30 base pair, or at least a 35 base pair complementary polynucleotide sequence of the target DNA sequence followed by a PAM sequence.
  • the targeting domain of a gRNA molecule has 19-25 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 20 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 21 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 22 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 23 nucleotides in length.
  • the number of gRNA molecules that may be included in the CRISPR/Cas9- based gene editing system can be at least 1 gRNA, at least 2 different gRNAs, at least 3 different gRNAs, at least 4 different gRNAs, at least 5 different gRNAs, at least 6 different gRNAs, at least 7 different gRNAs, at least 8 different gRNAs, at least 9 different gRNAs, at least 10 different gRNAs, at least 11 different gRNAs, at least 12 different gRNAs, at least 13 different gRNAs, at least 14 different gRNAs, at least 15 different gRNAs, at least 16 different gRNAs, at least 17 different gRNAs, at least 18 different gRNAs, at least 18 different gRNAs, at least 20 different gRNAs, at least 25 different gRNAs, at least 30 different gRNAs, at least 35 different gRNAs, at least 40 different gRNAs, at least 45 different gRNAs
  • the number of gRNA molecules that may be included in the CRISPR/Cas9-based gene editing system can be less than 50 different gRNAs, less than 45 different gRNAs, less than 40 different gRNAs, less than 35 different gRNAs, less than 30 different gRNAs, less than 25 different gRNAs, less than 20 different gRNAs, less than 19 different gRNAs, less than 18 different gRNAs, less than 17 different gRNAs, less than 16 different gRNAs, less than 15 different gRNAs, less than 14 different gRNAs, less than 13 different gRNAs, less than 12 different gRNAs, less than 11 different gRNAs, less than 10 different gRNAs, less than 9 different gRNAs, less than 8 different gRNAs, less than 7 different gRNAs, less than 6 different gRNAs, less than 5 different gRNAs, less than 4 different gRNAs, less than 3 different gRNAs, or less than 2 different gRNAs.
  • the number of gRNAs that may be included in the CRISPR/Cas9-based gene editing system can be between at least 1 gRNA to at least 50 different gRNAs, at least 1 gRNA to at least 45 different gRNAs, at least 1 gRNA to at least 40 different gRNAs, at least 1 gRNA to at least 35 different gRNAs, at least 1 gRNA to at least 30 different gRNAs, at least 1 gRNA to at least 25 different gRNAs, at least 1 gRNA to at least 20 different gRNAs, at least 1 gRNA to at least 16 different gRNAs, at least 1 gRNA to at least 12 different gRNAs, at least 1 gRNA to at least 8 different gRNAs, at least 1 gRNA to at least 4 different gRNAs, at least 4 gRNAs to at least 50 different gRNAs, at least 4 different gRNAs to at least 45 different gRNAs, at least 4 different gRNAs to at least 40 different
  • the CRISPR/Cas9-based gene editing system may be used to introduce site- specific double strand breaks at targeted genomic loci, such as a gene for modulating T cells as detailed herein.
  • Site-specific double-strand breaks are created when the CRISPR/Cas9- based gene editing system binds to a target DNA sequences, thereby permitting cleavage of the target DNA.
  • This DNA cleavage may stimulate the natural DNA-repair machinery, leading to one of two possible repair pathways: homology-directed repair (HDR) or the non- homologous end joining (NHEJ) pathway.
  • HDR homology-directed repair
  • NHEJ non- homologous end joining
  • Restoration of protein expression from a gene may involve homology-directed repair (HDR).
  • a donor template may be administered to a cell.
  • a donor sequence comprises a polynucleotide sequence to be inserted into a genome.
  • the donor template may include a nucleotide sequence encoding a full-functional protein or a partially functional protein.
  • the donor template may include fully functional gene construct for restoring a mutant gene, or a fragment of the gene that after homology-directed repair, leads to restoration of the mutant gene.
  • the donor template may include a nucleotide sequence encoding a mutated version of an inhibitory regulatory element of a gene. Mutations may include, for example, nucleotide substitutions, insertions, deletions, or a combination thereof.
  • NHEJ Non-Homologous End Joining
  • Restoration of protein expression from gene may be through template-free NHEJ- mediated DNA repair.
  • NHEJ is a nuclease mediated NHEJ, which in certain embodiments, refers to NHEJ that is initiated a Cas9 molecule that cuts double stranded DNA.
  • the method comprises administering a presently disclosed CRISPR/Cas9- based gene editing system or a composition comprising thereof to a subject for gene editing.
  • Nuclease mediated NHEJ may correct a mutated target gene and offer several potential advantages over the HDR pathway.
  • NHEJ does not require a donor template, which may cause nonspecific insertional mutagenesis.
  • NHEJ operates efficiently in all stages of the cell cycle and therefore may be effectively exploited in both cycling and post-mitotic cells, such as muscle fibers. This provides a robust, permanent gene restoration alternative to oligonucleotide-based exon skipping or pharmacologic forced read-through of stop codons and could theoretically require as few as one drug treatment. 3.
  • the transcription factor such as one selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, may be encoded by one or more genetic constructs.
  • a polynucleotide encoding a transcription factor such as one selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, may be comprised within one or more genetic constructs.
  • the genetic construct may comprise an open reading frame (ORF) of the transcription factor.
  • the genetic construct comprises a sequence selected from SEQ ID NOs: 75-97 or encodes a polypeptide comprising a sequence selected from SEQ ID NOs: 98-120.
  • the CRISPR/Cas9-based gene editing system may be encoded by or comprised within one or more genetic constructs.
  • the CRISPR/Cas9-based gene editing system may comprise one or more genetic constructs.
  • the genetic construct such as a plasmid or expression vector, may comprise a nucleic acid that encodes the transcription factor or the CRISPR/Cas9-based gene editing system and/or at least one of the gRNAs.
  • a genetic construct encodes one gRNA molecule, i.e., a first gRNA molecule, and optionally a Cas9 molecule or fusion protein.
  • a genetic construct encodes two gRNA molecules, i.e., a first gRNA molecule and a second gRNA molecule, and optionally a Cas9 molecule or fusion protein.
  • a first genetic construct encodes one gRNA molecule, i.e., a first gRNA molecule, and optionally a Cas9 molecule or fusion protein
  • a second genetic construct encodes one gRNA molecule, i.e., a second gRNA molecule, and optionally a Cas9 molecule or fusion protein.
  • a first genetic construct encodes one gRNA molecule and one donor sequence
  • a second genetic construct encodes a Cas9 molecule or fusion protein.
  • a first genetic construct encodes one gRNA molecule and a Cas9 molecule or fusion protein
  • a second genetic construct encodes one donor sequence.
  • Genetic constructs may include polynucleotides such as vectors and plasmids.
  • the genetic construct may be a linear minichromosome including centromere, telomeres, or plasmids or cosmids.
  • the vector may be an expression vectors or system to produce protein by routine techniques and readily available starting materials including Sambrook et al., Molecular Cloning and Laboratory Manual, Second Ed., Cold Spring Harbor (1989), which is incorporated fully by reference.
  • the construct may be recombinant.
  • the genetic construct may be part of a genome of a recombinant viral vector, including recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus. Viral vectors are further detailed below.
  • the genetic construct may comprise regulatory elements for gene expression of the coding sequences of the nucleic acid.
  • the regulatory elements may be a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal.
  • the genetic construct may comprise heterologous nucleic acid encoding the transcription factor or the CRISPR/Cas-based gene editing system and may further comprise an initiation codon, which may be upstream of the transcription factor or the CRISPR/Cas- based gene editing system coding sequence, and a stop codon, which may be downstream of the transcription factor or the CRISPR/Cas-based gene editing system coding sequence.
  • the genetic construct may include more than one stop codon, which may be downstream of the transcription factor or the CRISPR/Cas-based gene editing system coding sequence. In some embodiments, the genetic construct includes 1, 2, 3, 4, or 5 stop codons.
  • the genetic construct includes 1, 2, 3, 4, or 5 stop codons downstream of the sequence encoding the donor sequence.
  • a stop codon may be in-frame with a coding sequence in the transcription factor or the CRISPR/Cas-based gene editing system.
  • one or more stop codons may be in-frame with the donor sequence.
  • the genetic construct may include one or more stop codons that are out of frame of a coding sequence in the transcription factor or the CRISPR/Cas-based gene editing system.
  • one stop codon may be in-frame with the donor sequence, and two other stop codons may be included that are in the other two possible reading frames.
  • a genetic construct may include a stop codon for all three potential reading frames.
  • the initiation and termination codon may be in frame with the transcription factor or the CRISPR/Cas-based gene editing system coding sequence.
  • the vector may also comprise a promoter that is operably linked to the transcription factor coding sequence or the CRISPR/Cas-based gene editing system coding sequence.
  • the vector comprises a promoter operably linked to a polynucleotide sequence encoding the transcription factor.
  • the promoter may be a constitutive promoter, a ubiquitous promoter, an inducible promoter, a cell-specific promoter, a tissue-specific promoter, a repressible promoter, or a regulatable promoter.
  • the promoter is a ubiquitous promoter.
  • the promoter may be non- endogenous to the transcription factor.
  • the promoter may be non-native to the transcription factor.
  • the promoter may be a cell-specific promoter.
  • the promoter may be a promoter specific for T cells.
  • the promoter may be a tissue-specific promoter.
  • the tissue specific promoter may be a muscle specific promoter.
  • the tissue specific promoter may be a skin specific promoter.
  • the transcription factor coding sequence or the CRISPR/Cas- based gene editing system may be under the light-inducible or chemically inducible control to enable the dynamic control of gene/genome editing in space and time.
  • the promoter operably linked to the transcription factor coding sequence or the CRISPR/Cas-based gene editing system coding sequence may be a promoter from simian virus 40 (SV40), a mouse mammary tumor virus (MMTV) promoter, a human immunodeficiency virus (HIV) promoter such as the bovine immunodeficiency virus (BIV) long terminal repeat (LTR) promoter, a Moloney virus promoter, an avian leukosis virus (ALV) promoter, a cytomegalovirus (CMV) promoter such as the CMV immediate early promoter, Epstein Barr virus (EBV) promoter, or a Rous sarcoma virus (RSV) promoter.
  • SV40 simian virus 40
  • MMTV mouse mammary tumor virus
  • HSV human immunodeficiency virus
  • HSV human immunodeficiency virus
  • BIV bovine immunodeficiency virus
  • LTR long terminal repeat
  • Moloney virus promoter an avi
  • the promoter may also be a promoter from a human gene such as human ubiquitin C (hUbC), human actin, human myosin, human hemoglobin, human muscle creatine, or human metalothionein.
  • a tissue specific promoter such as a muscle or skin specific promoter, natural or synthetic, are described in U.S. Patent Application Publication No. US20040175727, the contents of which are incorporated herein in its entirety.
  • the promoter may be a CK8 promoter, a Spc512 promoter, a MHCK7 promoter, for example.
  • the genetic construct may also comprise a polyadenylation signal, which may be downstream of the transcription factor coding sequence or the CRISPR/Cas-based gene editing system.
  • the polyadenylation signal may be a SV40 polyadenylation signal, LTR polyadenylation signal, bovine growth hormone (bGH) polyadenylation signal, human growth hormone (hGH) polyadenylation signal, or human ⁇ -globin polyadenylation signal.
  • the SV40 polyadenylation signal may be a polyadenylation signal from a pCEP4 vector (Invitrogen, San Diego, CA).
  • Coding sequences in the genetic construct may be optimized for stability and high levels of expression. In some instances, codons are selected to reduce secondary structure formation of the RNA such as that formed due to intramolecular bonding.
  • the genetic construct may also comprise an enhancer upstream of the transcription factor coding sequence or the CRISPR/Cas-based gene editing system or gRNAs.
  • the enhancer may be necessary for DNA expression.
  • the enhancer may be human actin, human myosin, human hemoglobin, human muscle creatine or a viral enhancer such as one from CMV, HA, RSV, or EBV.
  • Polynucleotide function enhancers are described in U.S. Patent Nos.5,593,972, 5,962,428, and WO94/016737, the contents of each are fully incorporated by reference.
  • the genetic construct may also comprise a mammalian origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell.
  • the genetic construct may also comprise a regulatory sequence, which may be well suited for gene expression in a mammalian or human cell into which the vector is administered.
  • the genetic construct may also comprise a reporter gene, such as polynucleotide encoding a reporter protein and/or a selectable marker, such as hygromycin (“Hygro”).
  • the reporter protein may include any protein or peptide that is suitably detectable, such as, by fluorescence, chemiluminescence, enzyme activity such as beta galactosidase or alkaline phosphatase, and/or antibody binding detection.
  • the reporter protein may comprise a fluorescent protein.
  • the reporter protein may comprise a protein or peptide detectable with an antibody.
  • the reporter protein may comprise green fluorescent protein (“GFP”), YFP, RFP, CFP, DsRed, luciferase, and/or Thy1.
  • GFP green fluorescent protein
  • the genetic construct may be useful for transfecting cells with nucleic acid encoding the transcription factor coding sequence or the CRISPR/Cas-based gene editing system, which the transformed host cell is cultured and maintained under conditions wherein expression of the transcription factor coding sequence or the CRISPR/Cas-based gene editing system takes place.
  • the genetic construct may be transformed or transduced into a cell.
  • the genetic construct may be formulated into any suitable type of delivery vehicle including, for example, a viral vector, lentiviral expression, mRNA electroporation, and lipid- mediated transfection for delivery into a cell.
  • the genetic construct may be part of the genetic material in attenuated live microorganisms or recombinant microbial vectors which live in cells.
  • the genetic construct may be present in the cell as a functioning extrachromosomal molecule.
  • the cell is an immune cell.
  • the cell is a T cell.
  • the cell may be a CD8+ T cell.
  • the cell may be a CD4+ T cell.
  • the cell is a stem cell.
  • the stem cell may be a human stem cell.
  • the cell is an embryonic stem cell.
  • the stem cell may be a human pluripotent stem cell (iPSCs).
  • iPSCs human pluripotent stem cell
  • a genetic construct may be a viral vector. Further provided herein is a viral delivery system. Viral delivery systems may include, for example, lentivirus, retrovirus, adenovirus, mRNA electroporation, or nanoparticles.
  • the vector is a lentiviral vector. Lentiviruses are a subclass of Retroviruses.
  • Lentiviruses resemble ⁇ - retroviruses ( ⁇ -RV) in their ability to stably integrate into the target cell genome, resulting in persistent expression of the gene of interest.
  • Species of lentivirus include, for example, human immunodeficiency virus 1 (HIV-1), human immunodeficiency virus 2 (HIV-2), simian immunodeficiency virus (SIV), bovine immunodeficiency virus (BIV), and feline immunodeficiency virus (FIV).
  • the vector is a modified lentiviral vector.
  • the vector is an engineered lentiviral vector.
  • Lentiviruses may include, for example, pseudo-type lentivirus, integrase-deficient lentivirus, and virus-like particles.
  • the viral vector is an adeno-associated virus (AAV) vector.
  • AAV vector is a small virus belonging to the genus Dependovirus of the Parvoviridae family that infects humans and some other primate species.
  • Viral vectors may be used to deliver the transcription factor coding sequence or the CRISPR/Cas9-based gene editing systems using various construct configurations.
  • AAV vectors may deliver Cas9 or fusion protein and gRNA expression cassettes on separate vectors or on the same vector.
  • the AAV vector has a 4.7 kb packaging limit.
  • the AAV vector is a modified AAV vector.
  • the AAV vector is an engineered AAV vector.
  • the AAV vector may include an engineered AAV capsid.
  • the AAV vector may be an engineered AAV vector for a specific cell type.
  • the AAV vector may be an engineered AAV vector for T cells.
  • the modified AAV vector may have enhanced cell type tropism.
  • the modified AAV vector may have enhanced cardiac and/or skeletal muscle tissue tropism.
  • the modified AAV vector may be capable of delivering and expressing the transcription factor coding sequence or the CRISPR/Cas9-based gene editing system in the cell of a mammal.
  • the modified AAV vector may be an AAV-SASTG vector (Piacentino et al. Human Gene Therapy 2012, 23, 635–646, incorporated herein by reference).
  • the modified AAV vector may be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, and AAV9.
  • the modified AAV vector may be based on AAV2 pseudotype with alternative muscle-tropic AAV capsids, such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5, and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery (Seto et al. Current Gene Therapy 2012, 12, 139-151, incorporated herein by reference).
  • the modified AAV vector may be AAV2i8G9 (Shen et al. J. Biol. Chem.2013, 288, 28814-28823, incorporated herein by reference). 4.
  • compositions and methods detailed herein may further include at least one additional therapy, such as at least one cancer therapy or at least one antiviral therapy, or a combination thereof.
  • additional therapy such as at least one cancer therapy or at least one antiviral therapy, or a combination thereof.
  • standard of care treatment or “additional therapy” or “additional treatment” are used interchangeably and refer to any other standard treatments/additional treatments that do not include the specific compositions detailed herein for modifying a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1.
  • Additional therapies may comprise a small molecule, peptide, polypeptide, antibody, nucleotide, polynucleotide, lipid, or carbohydrate, or a combination thereof. Additional therapies may be synthesized and/or extracted and/or purified by any suitable means known in the art. Additional therapies may be commercially available. An effective amount of the additional therapy may be administered. a. Cancer Therapies [000155] The compositions and methods detailed herein may further include at least one cancer therapy.
  • Standard of care treatment or “additional therapy” or “additional treatment” are used interchangeably and refer to any other standard cancer treatments/additional cancer treatments that do not include the specific compositions detailed herein for modifying a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1.
  • Additional cancer therapies may comprise a small molecule, peptide, polypeptide, antibody, nucleotide, polynucleotide, lipid, or carbohydrate, or a combination thereof.
  • Additional cancer therapies may be synthesized and/or extracted and/or purified by any suitable means known in the art. Additional cancer therapies may be commercially available. Additional cancer therapies may include, for example, chemotherapy, immunotherapy, radiation therapy, hormone therapy, targeted drug therapy, cryoablation, antibody drug conjugates, and surgery, or a combination thereof. Hormone therapy, for example, may block hormone synthesis such as blocking estrogen synthesis. An effective amount of the additional cancer therapy may be administered.
  • Chemotherapy may include, for example, an antimitotic agent, an alkylating agent, an antimetabolite, an antimicrotubule agent, a topoisomerase inhibitor, a cytotoxic agent, a cell cycle inhibitor, a growth factor inhibitor, a histone deacetylase (HDAC) inhibitor, and an inhibitor of a pathway that cross-talks with and activates ER transcriptional activity, or a combination thereof.
  • Alkylating agents may include, for example, cisplatin (PLATINOL®), oxaliplatin (ELOXATIN®), chlorambucil (LEUKERAN®), procarbazine (MATULANE®; NATULAN®), or carmustine (BiCNU®), or a combination thereof.
  • Antimetabolites may include, for example, methotrexate (also known as amethopterin), 5-fluorouracil, cytarabine (also known as cytosine arabinoside or ara-C; CYTOSAR®), or gemcitabine (GEMZAR®), or a combination thereof.
  • Antimicrotubule agents may include, for example, vinblastine (VELBAN®; VELBE®), or paclitaxel (TAXOL®), or a combination thereof.
  • Topoisomerase inhibitors may include, for example, etoposide (VEPESID®), or doxorubicin (ADRIAMYCIN®; MYOCET®), or a combination thereof.
  • Cytotoxic agents may include, for example, bleomycin (BLENOXANE®).
  • Growth factor inhibitors may include, for example, human epidermal growth factor receptor 2 (HER2) inhibitors.
  • HER2 inhibitors include, for example, trastuzumab (HERCEPTIN®), deruxtecan, sacitizumab, and/or ado-trastuzumab emtansine (KADCYLA®).
  • HDAC inhibitors may include, for example, vorinostat (ZOLINZA®), romidepsin (ISTODAX®), chidamide (also known as tucidinostat; EPIDAZA®; HIYASTATM), panobinostat (FARYDAK®), belinostat (also known as BELEODAQ® or PXD101), valproic acid (DEPAKOTE®; DEPAKENE®; STAVZOR®)), mocetinostat (also known as MGCD0103), abexinostat (also known as PCI-24781), entinostat (also known as SNDX-275 or MS-275), pracinostat (also known as SB939), resminostat (also known as 4SC-
  • Inhibitors of a pathway that cross-talks with and activates ER transcriptional activity may include, for example, a phosphoinositide 3-kinase (PI3K) inhibitor, a heat shock protein 90 (HSP90) inhibitor, or a mammalian target of rapamycin (mTOR) inhibitor.
  • mTOR inhibitors include, for example, everolimus (AFINITOR®; VOTUBIA®; ZORTRESS®).
  • the HDAC inhibitor comprises vorinostat (ZOLINZA®) and /or romidepsin (ISTODAX®).
  • Immunotherapies may include, for example, a checkpoint inhibitor, or denosumab (PROLIA®; XGEVA®), or a combination thereof.
  • Checkpoint inhibitor or “immune checkpoint inhibitor” may also be referred to as an immune checkpoint blockade (ICB) therapy.
  • Checkpoint inhibitors may comprise an antibody.
  • Checkpoint inhibitors may include, for example, an antibody to programmed cell death protein 1 (PD1) (anti-PD1), or an antibody to cytotoxic T-lymphocyte-associated protein 4 (CTLA4) (anti-CTLA4), or an antibody to programmed death-ligand 1 (PDL1) (anti-PDL1), or DMXAA (sting agonist; also known as ASA404, vadimezan, or dimethylxanthone acetic acid) or a combination thereof.
  • PD1 refers to an antibody that binds PD1
  • anti-CTLA4 refers to an antibody that binds CTLA4
  • anti-PDL1 refers to an antibody that binds PDL1.
  • the PD-1 antibody comprises pembrolizumab (KEYTRUDA®) or nivolumab (OPDIVOo®).
  • the CTLA-4 antibody comprises ipilimumab (YERVOY®).
  • Antibody drug conjugates may include, for example, gemtuzumab ozogamicin (MYLOTARGTM), brentuximab vedotin (ADCETRIS®), ado-trastuzumab emtansine (KADCYLA®), inotuzumab ozogamicin (BESPONSA®), polatuzumab vedotin (POLIVY®), enfortumab vedotin (PADCEV®), fam-trastuzumab deruxtecan (ENHERTU®), sacituzumab govitecan (TRODELVY®), loncastuximab tesirine (ZYNLONTA®), tisotumab vedotin (TIVDAK®), mirvetuximab soravtansinegynx (ELAHERETM), moxetumomab pasudotox (LUMOX
  • compositions and methods detailed herein may further include at least one antiviral therapy.
  • standard of care treatment or “additional therapy” or “additional treatment” are used interchangeably and refer to any other standard antiviral treatments/additional antiviral treatments that do not include the specific compositions detailed herein for modifying a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1.
  • Additional antiviral therapies may comprise a small molecule, peptide, polypeptide, antibody, nucleotide, polynucleotide, lipid, or carbohydrate, or a combination thereof. Additional antiviral therapies may be synthesized and/or extracted and/or purified by any suitable means known in the art. Additional antiviral therapies may be commercially available. An effective amount of the additional antiviral therapy may be administered. Antiviral therapies and viruses targeted are described in, for example, US 20200165594, and De Clercq et al. (Clin. Microbiol. Rev.2016, 29, 695-747), each of which is incorporated herein by reference. Antiviral therapies may include, for example, the antiviral therapies listed in TABLE 4 below. p p
  • the antivirals described herein may target one or more viruses.
  • the virus may be a DNA virus (single or double stranded, positive or negative sense, or ambisense) or an RNA virus (single or double stranded, positive or negative sense, or ambisense).
  • the virus may be Ebola, measles, SARS, Chikungunya, hepatitis, Marburg, yellow fever, MERS, Dengue, Lassa, influenza, rhabdovirus, COVID-19, or HIV.
  • a hepatitis virus may include hepatitis A, hepatitis B, or hepatitis C.
  • An influenza virus may include, for example, influenza A or influenza B.
  • An HIV may include HIV 1 or HIV 2.
  • the virus may be a human respiratory syncytial virus, Sudan Ebola virus, Bundibugyo virus, Tai Forest Ebola virus, Reston Ebola virus, Achimota, Aedes flavivirus, Aguacate virus, Akabane virus, Alethinophid reptarenavirus, Allpahuayo mammarenavirus, Amapari mmarenavirus, Andes virus, acea virus, Aravan virus, Aroa virus, Arumwot virus, Atlantic salmon paramyoxivirus, Australian bat lyssavirus, Avian bornavirus, Avian metapneumovirus, Avian paramyoxviruses, penguin or Falkland Islandsvirus, BK polyomavirus, Bagaza virus, Banna virus, Bat hepevirus, Bat sapovirus, Bear Canon mammarenavirus, Beilong virus, Betacoronoavirus, Betapapillomavirus 1-6, Bhanja
  • RNA viruses include one or more of (or any combination of) Coronaviridae virus, a Picornaviridae virus, a Caliciviridae virus, a Flaviviridae virus, a Togaviridae virus, a Bornaviridae, a Filoviridae, a Paramyxoviridae, a Pneumoviridae, a Rhabdoviridae, an Arenaviridae, a Bunyaviridae, an Orthomyxoviridae, or a Deltavirus.
  • the virus is Coronavirus, SARS, Poliovirus, Rhinovirus, Hepatitis A, Norwalk virus, Yellow fever virus, West Nile virus, Hepatitis C virus, Dengue fever virus, Zika virus, Rubella virus, Ross River virus, Sindbis virus, Chikungunya virus, Borna disease virus, Ebola virus, Marburg virus, Measles virus, Mumps virus, Nipah virus, Hendra virus, Newcastle disease virus, Human respiratory syncytial virus, Rabies virus, Lassa virus, Hantavirus, Crimean-Congo hemorrhagic fever virus, Influenza, or Hepatitis D virus. 5.
  • compositions comprising the above- described modulator of T cells or genetic constructs or gene editing systems.
  • the composition further includes at least one cancer therapy such as a chimeric antigen receptor (CAR).
  • CAR chimeric antigen receptor
  • the pharmaceutical composition may comprise about 1 ng to about 10 mg of DNA encoding the transcription factor or the CRISPR/Cas-based gene editing system.
  • the systems or genetic constructs as detailed herein, or at least one component thereof, may be formulated into pharmaceutical compositions in accordance with standard techniques well known to those skilled in the pharmaceutical art.
  • the pharmaceutical compositions can be formulated according to the mode of administration to be used.
  • compositions are injectable pharmaceutical compositions, they are sterile, pyrogen free, and particulate free.
  • An isotonic formulation is preferably used.
  • additives for isotonicity may include sodium chloride, dextrose, mannitol, sorbitol and lactose.
  • isotonic solutions such as phosphate buffered saline are preferred.
  • Stabilizers include gelatin and albumin.
  • a vasoconstriction agent is added to the formulation.
  • the composition may further comprise a pharmaceutically acceptable excipient.
  • the pharmaceutically acceptable excipient may be functional molecules as vehicles, adjuvants, carriers, or diluents.
  • pharmaceutically acceptable carrier may be a non-toxic, inert solid, semi-solid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type.
  • Pharmaceutically acceptable carriers include, for example, diluents, lubricants, binders, disintegrants, colorants, flavors, sweeteners, antioxidants, preservatives, glidants, solvents, suspending agents, wetting agents, surfactants, emollients, propellants, humectants, powders, pH adjusting agents, and combinations thereof.
  • the pharmaceutically acceptable excipient may be a transfection facilitating agent, which may include surface active agents, such as immune-stimulating complexes (ISCOMS), Freunds incomplete adjuvant, LPS analog including monophosphoryl lipid A, muramyl peptides, quinone analogs, vesicles such as squalene and squalene, hyaluronic acid, lipids, liposomes, calcium ions, viral proteins, polyanions, polycations, or nanoparticles, or other known transfection facilitating agents.
  • the transfection facilitating agent may be a polyanion, polycation, including poly-L-glutamate (LGS), or lipid.
  • the transfection facilitating agent may be poly-L- glutamate, and more preferably, the poly-L-glutamate may be present in the composition for gene editing in skeletal muscle or cardiac muscle at a concentration less than 6 mg/mL. 6.
  • the systems or genetic constructs as detailed herein, or at least one component thereof, may be administered or delivered to a cell. Methods of introducing a nucleic acid into a host cell are known in the art, and any known method can be used to introduce a nucleic acid (e.g., an expression construct) into a cell.
  • Suitable methods include, for example, viral or bacteriophage infection, transfection, conjugation, protoplast fusion, polycation or lipid:nucleic acid conjugates, lipofection, electroporation, nucleofection, immunoliposomes, calcium phosphate precipitation, polyethyleneimine (PEI)-mediated transfection, DEAE-dextran mediated transfection, liposome-mediated transfection, particle gun technology, calcium phosphate precipitation, direct micro injection, nanoparticle- mediated nucleic acid delivery, and the like.
  • the composition may be delivered by mRNA delivery and ribonucleoprotein (RNP) complex delivery.
  • the system, genetic construct, or composition comprising the same may be electroporated using BioRad Gene Pulser Xcell or Amaxa Nucleofector IIb devices or other electroporation device.
  • Several different buffers may be used, including BioRad electroporation solution, Sigma phosphate-buffered saline product #D8537 (PBS), Invitrogen OptiMEM I (OM), or Amaxa Nucleofector solution V (N.V.).
  • Transfections may include a transfection reagent, such as Lipofectamine 2000.
  • the activators detailed herein may be delivered or administered, for example, to a cell ex vivo or to a subject in vivo, by a method including viral delivery (such as, for example, lentivirus, retrovirus, or AAV vectors as detailed above), virus-like particles (VLPs), and integrase-defective lentivirus, non-viral integrating methods (such as, for example, transposons, integrases, and gene editing), and non-viral transient methods (such as, for example, mRNA, plasmids, and minicircles as detailed above).
  • the activators detailed herein may be delivered by any delivery method suitable for ex vivo engineered cell therapy.
  • the activators detailed herein may be delivered by any suitable delivery method, including, for example, lipid nanoparticles and micelles.
  • the activator or a polynucleotide encoding the activator is encapsulated within a lipid nanoparticle or polymeric carrier.
  • the systems or genetic constructs as detailed herein, or at least one component thereof, or the pharmaceutical compositions comprising the same, may be administered to a subject or a cell. Such compositions can be administered in dosages and by techniques well known to those skilled in the medical arts taking into consideration such factors as the age, sex, weight, and condition of the particular subject, and the route of administration.
  • the presently disclosed systems, or at least one component thereof, genetic constructs, or compositions comprising the same may be administered to a subject by different routes including orally, parenterally, sublingually, transdermally, rectally, transmucosally, topically, intranasal, intravaginal, via inhalation, via buccal administration, intrapleurally, intravenous, intraarterial, intraperitoneal, subcutaneous, intradermally, epidermally, intramuscular, intranasal, intrathecal, intracranial, and intraarticular or combinations thereof.
  • the system, genetic construct, or composition comprising the same is administered to a subject intramuscularly, intravenously, or a combination thereof.
  • the systems, genetic constructs, or compositions comprising the same may be delivered to a subject by several technologies including DNA injection (also referred to as DNA vaccination) with and without in vivo electroporation, liposome mediated, nanoparticle facilitated, recombinant vectors such as recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus.
  • the composition may be injected into the brain or other component of the central nervous system.
  • the composition may be injected into the skeletal muscle or cardiac muscle.
  • the composition may be injected into the tibialis anterior muscle or tail.
  • the systems, genetic constructs, or compositions comprising the same may be administered as a suitably acceptable formulation in accordance with normal veterinary practice.
  • the veterinarian may readily determine the dosing regimen and route of administration that is most appropriate for a particular animal.
  • the systems, genetic constructs, or compositions comprising the same may be administered by traditional syringes, needleless injection devices, “microprojectile bombardment gone guns,” or other physical methods such as electroporation (“EP”), “hydrodynamic method”, or ultrasound.
  • transient in vivo delivery of CRISPR/Cas-based systems by non-viral or non-integrating viral gene transfer, or by direct delivery of purified proteins and gRNAs containing cell-penetrating motifs may enable highly specific correction and/or restoration in situ with minimal or no risk of exogenous DNA integration.
  • T cells may be increased, T cell numbers may be increased, memory T cells may be increased, T cell biodistribution may be increased, infiltration into tissues may be increased, T cell durability may be increased, T cell exhaustion may be inhibited or prevented, T cell exhaustion may be reversed, cancer therapy may be enhanced or its effectiveness increased, antiviral therapy may be enhanced or its effectiveness increased, or a combination thereof.
  • the transfected cells may express the gene or gene product thereof, or the gRNA molecule(s) and the Cas9 molecule or fusion protein.
  • the transfected cells may express the gene or gene product thereof, or the gRNA molecule(s) and the Cas9 molecule or fusion protein.
  • a. Cell Types Any of the delivery methods and/or routes of administration detailed herein can be utilized with a myriad of cell types.
  • a cell comprising an isolated polynucleotide encoding the protein product from a gene as detailed herein.
  • a cell comprising an isolated polynucleotide encoding a transcription factor or a CRISPR/Cas9 system as detailed herein. Suitable cell types are detailed herein.
  • the cell may be isolated, as from a subject or tissue, or ex vivo, or in vivo.
  • the cell may be autologous to the subject.
  • the cell may be allogenic to the subject.
  • the cell is an immune cell.
  • Immune cells may include, for example, lymphocytes such as T cells and B cells and natural killer (NK) cells.
  • the cell is a T cell.
  • T cells may be divided into cytotoxic T cells and helper T cells, which are in turn categorized as TH1 or TH2 helper T cells.
  • Immune cells may further include innate immune cells, adaptive immune cells, tumor-primed T cells, NKT cells, IFN- ⁇ producing killer dendritic cells (IKDC), memory T cells (TCMs), and effector T cells (TEs).
  • the cell may be a stem cell such as a human stem cell.
  • the cell is an embryonic stem cell or a hematopoietic stem cell.
  • the stem cell may be a human induced pluripotent stem cell (iPSCs).
  • the cell is a T cell.
  • the cell is a CD8+ T cell.
  • the cell is a CD4+ T cell. 7.
  • Kits [000168] Provided herein is a kit, which may be used to modulate, such as increase, T cells.
  • the kit may be used in conjunction with ACT to enhance the ACT.
  • the kit comprises an activator as detailed herein, or genetic constructs or a composition comprising the same, as described above, and instructions for using said composition.
  • the kit includes an activator of a gene, or a genetic construct encoding the activator of a gene, wherein the gene is selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • the kit comprises at least one polynucleotide sequence selected from SEQ ID NOs: 75-97 or 121-192, a complement thereof, a variant thereof, or fragment thereof.
  • the kit comprises at least one polypeptide sequence selected from SEQ ID NOs: 98-120, a variant thereof, or fragment thereof.
  • the kit comprises at least one gRNA comprising a polynucleotide sequence selected from SEQ ID NOs: 157-192, a complement thereof, a variant thereof, or fragment thereof, or at least one gRNA targeting or encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 121-156, a complement thereof, a variant thereof, or fragment thereof.
  • the kit may further include instructions for using the activator or the CRISPR/Cas-based gene editing system.
  • kits may be affixed to packaging material or may be included as a package insert. While the instructions are typically written on printed materials they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this disclosure. Such media include, but are not limited to, electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. As used herein, the term “instructions” may include the address of an internet site that provides the instructions.
  • the genetic constructs or a composition comprising thereof for modulating T cells may include a vector such as a modified lentiviral or AAV vector that comprises a cDNA encoding a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, and/or includes a gRNA molecule(s) and a Cas9 protein or fusion protein, as described above, that specifically binds a region of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or
  • modulating T cells comprises administering to a cell or a subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof.
  • modulating T cells comprises increasing T cells, or increasing memory T cells, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof.
  • the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell.
  • the activators of T cells detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo.
  • the activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification.
  • an activator as detailed herein is administered to a cell isolated from a subject.
  • the cell may be autologous.
  • the cell may be allogenic.
  • a T cell modified by an activator as detailed herein may be administered to a subject.
  • Methods of Increasing T Cells Provided herein are methods of increasing T cells.
  • the methods may include administering to a cell or a subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof.
  • the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell.
  • the activators detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo.
  • the activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification.
  • an activator as detailed herein is administered to a cell isolated from a subject.
  • the cell may be autologous.
  • the cell may be allogenic.
  • a T cell modified by an activator as detailed herein may be administered to a subject.
  • Methods of Enhancing Adoptive T Cell Therapy (ACT) [000173]
  • the methods may include administering to the subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof.
  • the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell.
  • the activators detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo.
  • the activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification.
  • an activator as detailed herein is administered to a cell isolated from a subject.
  • the cell may be autologous.
  • the cell may be allogenic.
  • a T cell modified by an activator as detailed herein may be administered to a subject. d. Methods of Treating Cancer [000174] Provided herein are methods of treating cancer in a subject.
  • the methods may include administering to the subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof.
  • the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell.
  • the activators detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo.
  • the activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification.
  • an activator as detailed herein is administered to a cell isolated from a subject.
  • the cell may be autologous.
  • the cell may be allogenic.
  • a T cell modified by an activator as detailed herein may be administered to a subject. 9. Examples [000175] The foregoing may be better understood by reference to the following examples, which are presented for purposes of illustration and are not intended to limit the scope of the invention. The present disclosure has multiple aspects and embodiments, illustrated by the appended non-limiting examples.
  • Opti-MEMTM I Reduced Serum Medium supplemented with 1x Glutamax, 5% FBS, 1 mM Sodium Pyruvate, and 1x MEM Non-Essential Amino Acids.
  • HEK293T cells were transfected with 3.25 ⁇ g pMD2.G, 9.75 ⁇ g psPAX2, and 4.3 ⁇ g transgene using Lipofectamine 3000. Media was exchanged 6 hours after transfection, and lentiviral supernatant was collected and pooled at 24 hours and 48 hours after transfection. Collected lentivirus was centrifuged at 500g for 10 min to remove cellular debris. Supernatant was concentrated 100X using Lenti-X concentrator. Concentrated lentivirus was snap frozen and stored at -80°C until use. Human primary T cells were transduced 24 hours post activation with 10% v/v concentrated lentivirus. [000178] cDNA overexpression screens.
  • Human primary CD8+ T cells were activated using dynabeads as described above. 24 hours post-activation, T cells were transduced with lentivirus encoding a library of human transcription factors (PMID: 36608654; Joung et al. Cell 2023, 186, 209-229, incorporated herein by reference). Cells were expanded for at least 7 days at which point they were collected for sorting. For the IL7Ra screen, the top and bottom 10% of cells were sorted based on IL7Ra protein levels. For the CD103 screen, CD103+ and CD103- negative populations were sorted. Genomic DNA was isolated using Qiagen’s (Hilden, Germany) DNeasy Blood and Tissue Kit from the sorted populations.
  • Transcription factor barcodes were amplified and sequenced as previously described (PMID: 36608654; Joung et al. Cell 2023, 186, 209-229, incorporated herein by reference). Transcription factor barcode read counts were compared across the different groups using DESeq2. Hits were determined as transcription factors with DESeq2 adjusted P values ⁇ .05. [000179] Flow cytometry. For antibody staining, cells were spun down at 300g for 5 minutes and resuspended in Cell Staining Buffer (Biolegend, San Diego, CA) with the appropriate antibody dilutions. Cells were stained for 30 minutes on ice in the dark and washed once before sorting or analysis. [000180] RNA sequencing.
  • CAR T cells were incubated with target cells at a 2:1 ratio in Immunocult-XF media unless otherwise indicated. After complete target cell elimination (2 days), cells were washed in fresh media. RNA was isolated using Norgen Biotek’s (Thorold, Ontario, Canada) Total RNA purification plus kit and submitted to Azenta for standard RNA- seq with polyA selection. Gene expression matrices were generated using salmon (v1.10.2) with the GRCh38 genome. Differential gene expression analysis was performed using DESeq2. [000181] scRNA-sequencing. CAR T cells generated from pan T cells were expanded for 7 days. Cells were spun down and resuspended in 49.5 ⁇ L cell staining buffer.
  • Fc receptors were blocked by adding 0.5 ⁇ L TruStainFcX PLUS and incubating on ice for 10 minutes. 50 ⁇ L of additional cell staining buffer was then added. 1 ⁇ g of cell hashing antibodies was added to each CAR T population based on perturbation. Cells were incubated on ice for 30 minutes. Afterwards, cells were washed 3X in cell staining buffer and resuspended in 300 ⁇ L. Equal cell numbers were then pooled, pelleted, and resuspended at 2e7 cells/mL in cell staining buffer.
  • Cleaned matrices were loaded into Seurat (v5) and subset to include cells with nFeature_RNA > 1800, percent.mt ⁇ 10, and nCount_RNA ⁇ 50,000. High quality cells were assigned their perturbation (based on cell hashing antibody) using HTODemux.
  • Multiplex ELISA HER2-targeting CAR T cells were generated from pan T cells. CAR T cells were incubated with target cells (SKBR3) at a ratio of 4:1 in Immunocult-XF media without rh-IL2 supplementation. 20 hours after incubation, cells were centrifuged to pellet. Supernatant was collected and stored at -80°C.
  • CD8 T cells were transduced with lentiviruses encoding either aHER2 CAR T2A-TGIF2LX or aHER2 CAR T2A-THY1.1 and expanded for at least 7 days.
  • 625 target cells (SKBR3-GFP) were plated into a 96-well plate and allowed to attach overnight.
  • CAR T cells were collected and diluted to 10 cells/ ⁇ L in Immunocult-XF with 100 IU/rh-IL2.
  • CD8 T cells were transduced with lentivirus encoding one of the following constructs: aHER2 CAR-T2A-THY1.1, aHER2 CAR-T2A- TGIFL2X, or aHER2 CAR-T2A-RUNX3, and expanded for 9 days. 500,000 CAR T cells were infused intravenously 21 days after tumor implantation. Immediately before CAR T administration, tumors were measured and mice were randomized into groups. Tumor volumes were tracked based on caliper measurements every 4-6 days. [000185] TGF-b1 incubation.
  • CD8 T cells were activated and transduced 24 hours later with lentivirus encoding either aHER2 CAR T2A-TGIF2LX or aHER2 CAR T2A-THY1.1.
  • media was replaced with fresh Immunocult-XF supplemented with 100 IU/rhIL2 and 2 ng/mL TGF-b1.
  • Five days later cells were collected and prepared for flow cytometry.
  • Example 2 Resident Memory T Cells for Adoptive Cell Therapy [000186] Memory T cells exhibit distinct functional characteristics depending on their location within the body. Recent advancements have unveiled a significant population of memory T cells residing in non-lymphoid tissues.
  • the intersection of these screens identified the transcription factor TGIF2LX, capable of upregulating both markers (FIGS.4A-4C).
  • the initial activation of na ⁇ ve T cells by antigen is driven by transcription factors including BATF, IRF-4, and NFAT-AP1.
  • transcription factors including BATF, IRF-4, and NFAT-AP1.
  • MPECs KLRG-1 lo CD127 hi memory precursor T cells
  • MPECs can be differentiated into various types of cells depending on antigen levels and disease setting.
  • Effector T cells (Teff cells) that form from MPECs are characterized by high KLRG-1 and effector functions, such as cytokine production, driven in part by the transcription factors T-bet and Blimp-1.
  • T mem cells Differentiation of MPECs to memory T cells (T mem cells) occurs following antigen clearance in acute infections and vaccination, and self- renewing T mem cells use the transcription factors Tcf-1 and Eomes.
  • T RM cells do not circulate in the body but are rather retained in tissues and are characterized by high CD103 and CD69 expression and low CD127 expression. Additionally, resident memory cells are associated with low KLF-2 and expression of Hobit.
  • Early exhausted cells form during chronic infections and cancer, where an antigen persists.
  • Early exhausted T cells (T ex cells) are characterized by intermediate expression of PD-1 and low Eomes, with a role for Tcf-1 in a progenitor population.
  • FIGS.6A-6E show that TGIF1 is upregulated in early resident memory.
  • FIGS.15A-15B show the TGIF2LX domains that may be responsible for residency reprogramming. [000188] Hits were obtained from two separate marker-based screens (CD103 and IL7Ra). CD103 is a tissue-resident memory marker and IL7Ra is another memory marker.
  • FIGS.7A-7E show a molecular characterization of transcription factor-engineered chimeric antigen receptor T cells (CAR Ts). Bulk RNA-sequencing revealed that TGIF2LX overexpression in T cells lead to widespread transcriptional reprogramming, including activation of genes associated with increased tissue residency programs (for example, ITGAE (i.e.
  • TGIF2LX-overexpressing T cells were characterized using a multimodal single cell readout, pairing scRNA-seq with a CITE-seq library of 134 unique cell surface markers (FIGS.9A-9G). This demonstrated a distinct cell state with TGIFL2X-overexpressing T cells existing in clusters unique from those containing unenhanced cells. Of note, these cells also clustered separately from T cells overexpressing RUNX3, a transcription factor previously identified to enhance tissue residency and improve tumor control, suggesting a novel axis for controlling tissue residency (FIG.9D).
  • FIGS.10A- 10F show that TGIF2LX increased tissue-residence associated surface markers.
  • FIGS. 11A-11C show that CD8 subpopulations mirrored memory to effector hierarchy.
  • FIGS.12A- 12E show that TGIF2LX CAR Ts maintained cytotoxic capabilities in vitro.
  • a proliferation-based cDNA overexpression screen in CD8 T cells was completed. A library of about 3,500 transcription factors was introduced into HER2-targeting CAR Ts. The CAR Ts were repeatedly stimulated with cancer cells to induce exhaustion. CAR Ts were collected before and after repeated stimulation to identify transcription factors that modulate CAR T abundance with the hypothesis that transcription factors that increase CAR T durability will be more highly represented at the end of the screen.
  • a marker-based cDNA overexpression screen in CD8 T cells was completed.
  • the same library as above was introduced into HER2-targeting CAR Ts.
  • the CAR Ts were expanded for 10 days at which point, CD103 high cells were collected as well as an unsorted population.
  • the goal was to identify transcription factors that increase expression of CD103.
  • CD103 is a tissue-resident memory T cell marker, and these cells are associated with improved survival in the context of checkpoint inhibitors.
  • Example 4 TGIF2LX Overexpression Modulates CAR T Efficacy in a Human Breast Cancer Model [000192] The effect of TGIF2LX overexpression in modulating CAR T efficacy in a human breast cancer xenograft model was also tested. The protocol shown in FIG.13A was used.
  • HCC1954 breast cancer cells were implanted orthotopically into the mammary fat of Nod/scid mice. After 10 days, HER2-targeting CAR T cells were administered intravenously at a sub-curative dose (5x10 5 cells). TGIF2LX-overexpression dramatically improved tumor control by CAR T cells relative to the unmodified control and RUNX3-overexpressing CAR Ts (FIG.13B). These data provided proof-of-concept of a strategy for enhancing adoptive cell therapies for solid tumors. [000193] TGIF2LX overexpression was found to give circulating T cells a tissue-resident- like phenotype.
  • CRISPRa will be used to increase the expression of a gene selected from TGIF2LX, TGIF1, TGIF2, CARF, EBF3, FOS, HMX3, HNF4A, KLF8, LHX4, LMX1A, NFKBIZ, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1.
  • a dCas9 protein (such as dSpCas9) will be fused to an activation domain, such as at least one domain of VP64, p300, or p300 core.
  • the resulting fusion protein may be VP64-dCas9-VP64 (SEQ ID NO: 43).
  • T cells will be transduced with a lentiviral vector encoding the fusion protein such as VP64- dCas9-VP64 and a gRNA targeting a gene selected from TGIF2LX, TGIF1, TGIF2, CARF, EBF3, FOS, HMX3, HNF4A, KLF8, LHX4, LMX1A, NFKBIZ, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1.
  • Exemplary gRNAs are shown in TABLE 3.
  • the transduced T cells will be assayed for CD103 and IL7Ra protein levels (via FACS) and transcriptomic changes (via bulk RNA-seq), as described in Example 5.
  • transduced cells will show increased CD103 and IL7Ra expression, decreased circulatory marker expression, and increased tissue residency marker expression. *** [000195]
  • the foregoing description of the specific aspects will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the art, readily modify and/or adapt for various applications such specific aspects, without undue experimentation, without departing from the general concept of the present disclosure. Therefore, such adaptations and modifications are intended to be within the meaning and range of equivalents of the disclosed aspects, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance.
  • Clause 4. The isolated polynucleotide of any one of clauses 1-3, wherein the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
  • Clause 8. The vector of clause 7, wherein the promoter is non-endogenous to the transcription factor.
  • Clause 9. The vector of clause 7 or 8, wherein the promoter is a constitutive promoter, or a ubiquitous promoter, or an inducible promoter, or a cell-specific promoter, or a tissue-specific promoter.
  • Clause 10. The vector of any one of clauses 5-9, wherein the vector comprises an open reading frame (ORF) of the transcription factor. [000209] Clause 11.
  • AAV adeno-associated virus
  • Clause 16 The vector of clause 15, wherein the AAV vector is selected from AAV1, AAV2, AAV5, AAV6, AAV8, AAV9, and an engineered AAV vector. [000215] Clause 17.
  • a method of modulating T cells comprising administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof.
  • a method of increasing T cells comprising administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • a method of enhancing adoptive T cell therapy (ACT) in a subject comprising administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of
  • a method of treating cancer in a subject comprising administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
  • an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene
  • a cell comprising the isolated polynucleotide of any one of clauses 1- 4, or the vector of any one of clauses 5-16, or the vector of clause 38.
  • Clause 40 The cell of clause 39, wherein the cell is a CD8+ T cell or a CD4+ T cell.
  • Clause 41 A pharmaceutical composition comprising: the isolated polynucleotide of any one of clauses 1-4, or the vector of any one of clauses 5-16, or the vector of clause 38, or a combination thereof.
  • Clause 42 The pharmaceutical composition of clause 41, further comprising at least one cancer therapy or at least one antiviral therapy.
  • Clause 43 Clause 43.
  • a composition for increasing T cells comprising an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
  • Clause 46 The composition of clause 45, wherein the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polypeptide selected from SEQ ID NOs: 98-120.
  • Clause 47 The composition of any one of clauses 43-46, further comprising at least one cancer therapy or at least one antiviral therapy. [000246] Clause 48.
  • the activator comprises a DNA targeting composition
  • the DNA targeting composition comprising: (a) a Cas9 protein and at least one guide RNA (gRNA) that targets the Cas9 protein to the gene or a regulatory element thereof; or (b) a meganuclease, or (c) a fusion protein, wherein the fusion protein comprises two heterologous polypeptide domains, wherein the first polypeptide domain comprises a zinc finger protein or a TALE or a Cas12 protein or a Cas13 protein or a Cas9 protein, and wherein the second polypeptide domain has an activity selected from transcription activation activity, transcription repression activity, nuclease activity, base editing activity, prime editing activity, transcription release factor activity, histone modification activity, nucleic acid association activity, methylase activity, and demethylase activity, wherein when the first polypeptide domain comprises a Cas9 protein the DNA targeting composition further comprises at least one guide RNA (gRNA) that
  • a DNA targeting composition comprising: a Cas9 protein or a fusion protein, wherein the fusion protein comprises two heterologous polypeptide domains, wherein the first polypeptide domain comprises a zinc finger protein or a TALE or a Cas12 protein or a Cas13 protein or a Cas9 protein, and wherein the second polypeptide domain has an activity selected from transcription activation activity, transcription repression activity, nuclease activity, base editing activity, prime editing activity, transcription release factor activity, histone modification activity, nucleic acid association activity, methylase activity, and demethylase activity; and at least one guide RNA (gRNA) that targets the Cas9 protein to a target gene or a regulatory element thereof when the DNA targeting composition comprises a Cas9 protein, wherein the target gene is selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4,
  • Clause 50 The composition of any one of clauses 48-49, wherein the gene is selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5. [000249] Clause 51.
  • composition of any one of clauses 48-50, wherein the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
  • Clause 52 The composition of any one of clauses 48-51, wherein the gRNA is encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 121-156, or comprises a sequence selected from SEQ ID NOs: 157-192.
  • Clause 53 Clause 53.
  • the Cas protein comprises a Streptococcus pyogenes Cas9 protein, or a Staphylococcus aureus Cas9 protein, or any fragment thereof.
  • Clause 55 The composition of any one of clauses 48-54, wherein the fusion protein comprises more than one second polypeptide domain.
  • Clause 56 The composition of any one of clauses 48-55, wherein the second polypeptide domain has transcription activation activity.
  • Clause 57 The composition of clause 56, wherein the second polypeptide domain comprises a polypeptide selected from VP16, VP64, p65, TET1, VPR, VPH, Rta, and p300, or a fragment thereof.
  • Clause 58 The composition of clause 57, wherein the second polypeptide domain comprises VP64, p300, VPH, or VPR, or a fragment thereof.
  • Clause 59 The composition of one of clauses 48-58, wherein the second polypeptide domain comprises the amino acid sequence of SEQ ID NO: 41, 42, 53, or 55, or any fragment thereof, and/or wherein the second polypeptide domain is encoded by a polynucleotide comprising the sequence of SEQ ID NO: 54 or 56, and/or wherein the second polypeptide domain comprises an amino acid sequence having at least 90% or greater identity to SEQ ID NO: 41, 42, 53, or 55, or any fragment thereof, and/or wherein the second polypeptide domain is encoded by a polynucleotide comprising a sequence having at least 90% or greater identity to SEQ ID NO: 54 or 56, or any fragment thereof, and/or wherein the second polypeptide domain comprises an amino acid sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to SEQ ID NO: 41, 42, 53, or 55, or any fragment thereof, and/or wherein the second polypeptid
  • Clause 60 The composition of any one of clauses 48-59, wherein the fusion protein comprises the amino acid sequence of SEQ ID NO: 43, or any fragment thereof, and/or wherein the fusion protein is encoded by a polynucleotide comprising the sequence of SEQ ID NO: 44, and/or wherein the fusion protein comprises an amino acid sequence having at least 90% or greater identity to SEQ ID NO: 43, or any fragment thereof, and/or wherein the fusion protein is encoded by a polynucleotide comprising a sequence having at least 90% or greater identity to SEQ ID NO: 44, or any fragment thereof, and/or wherein the fusion protein comprises an amino acid sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to SEQ ID NO: 43, or any fragment thereof, and/or wherein the fusion protein is encoded by a polynucleotide comprising a sequence having one, two, three, four, five or more
  • Clause 61 The composition of any one of clauses 48-60, further comprising at least one cancer therapy or at least one antiviral therapy.
  • Clause 62 An isolated polynucleotide sequence encoding the composition of any one of clauses 48-61.
  • Clause 63 A vector comprising the isolated polynucleotide sequence of clause 62.
  • Clause 64 A cell comprising the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or a combination thereof.
  • Clause 65 The cell of clause 64, wherein the cell is a CD8+ T cell or a CD4+ T cell.
  • Clause 66 A pharmaceutical composition comprising: the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or a combination thereof.
  • Clause 67 A method of modulating T cells, the method comprising administering to a T cell or a subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof.
  • Clause 68 Clause 68.
  • modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof.
  • Clause 69 The method of clause 67 or 68, wherein the composition or isolated polynucleotide sequence or vector is administered to a T cell, and wherein the T cell thereby increases expression of CD103 or IL7Ra, or a combination thereof.
  • a method of increasing T cells comprising administering to a T cell or a subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof.
  • Clause 71 A method of enhancing adoptive T cell therapy (ACT) in a subject, the method comprising administering to a T cell or the subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof.
  • ACT adoptive T cell therapy
  • a method of treating cancer in a subject comprising administering to a T cell or the subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof.
  • NRG A or G
  • N can be any nucleotide residue, e.g., any of A, G, C, or T
  • SEQ ID NO: 2 NGG N can be any nucleotide residue, e.g., any of A, G, C, or T
  • SEQ ID NO: 3 NAG N can be any nucleotide residue, e.g., any of A, G, C, or T
  • SEQ ID NO: 4 NGGNG N can be any nucleotide residue, e.g., any of A, G, C, or T
  • N can be any nucleotide residue, e.g., any of A, G, C, or T
  • N can be any nucleotide residue, e.g., any of A, G, C, or T
  • aureus Cas9 aagcggaactacatcctgggcctggacatcggcatcaccagcgtgggctacggcatcatcatcgactacga gacacgggacgtgatcgatgccggcgtgcggctgttcaaagaggccaacgtggaaaacaacgagggca ggcggagcaagagaggcgccagaaggctgaagcggcggaggcggcatagaatccagagagtgaagaag ctgcttcgactacaacctgctgaccgaccacagcgagctgagcggcatcaacccctacgaggccag agtgaagggcctgagccagagtgaagggcctgagccagaaagggcctgagccagaagctgagaggctg
  • aureus Cas9 ctaaattgtaagcgttaatattttgttaaaattcgcgttaaatttttgttaaatcagctcatttttta accaataggccgaaatcggcaaaatcccttataaatcaaaagaatagaccgagatagggttgagtgttt gttccactattaaagaacgtggactccaacgtcaaagggcgaaaaccgt ctatcagggcgatggcccactacgtgaaccatcaccctaatcaagttttttggggtcgaggtgccgta aagcactaaatcggaacccaccctaatcaagttttttggggtcgaggtgccgta agcactaaatcggaacccta

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Medicinal Chemistry (AREA)
  • Epidemiology (AREA)
  • Animal Behavior & Ethology (AREA)
  • General Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Medicines Containing Material From Animals Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)

Abstract

Disclosed herein are compositions and methods for modulating T cells. For example, the compositions and methods may be used to increase memory T cells. The compositions and methods may increase the expression or protein level of a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. The compositions and method may be used in combination with Adoptive T Cell Therapy (ACT) to enhance the ACT.

Description

NOVEL REGULATORS OF T CELLS CROSS-REFERENCE TO RELATED APPLICATIONS [0001] This application claims priority to U.S. Provisional Patent Application No. 63/579,780 filed August 30, 2023, U.S. Provisional Patent Application No.63/636,706 filed April 19, 2024, and U.S. Provisional Patent Application No.63/636,859 filed April 21, 2024, the entire contents of each of which are hereby incorporated by reference. FIELD [0002] This disclosure relates to compositions and methods for delivering, increasing, or activating genes to enhance properties of T cells for improving immunotherapies for viral infections and cancer, including adoptive T cell therapy (ACT). INTRODUCTION [0003] T cells are part of the immune system’s adaptive defense. They specifically target and kill both virally infected and cancerous cells through antigen recognition. Unfortunately, cancer can exploit intrinsic T cell mechanisms to survive attack by the immune system. T cell state and function are largely regulated by specific transcription factors (TFs) and epigenetic modifiers that process intrinsic and extrinsic signals into complex and tightly controlled gene expression programs. T cell exhaustion arises from chronic antigen stimulation, which shifts a portion of the T cell population to the exhausted state (TEX), resulting in diminished T cell proliferation and tumor/viral clearance. The transcription factor TOX, for example, drives and maintains the TEX cell state through epigenetic regulation of exhaustion-associated genetic programs. [0004] Adoptive T cell therapy (ACT) holds tremendous potential for cancer treatment by redirecting T cells to cancer cells via expression of engineered receptors that recognize and bind to tumor-associated antigens. Receptor-antigen interactions can initiate complex transcriptional networks that drive multipotent T cell response and lead to cancer cell death. The potency and duration of T cell response are associated with defined T cell subsets, and cell products enriched in stem or memory T cells, provide superior tumor control in animal models and in the clinic. Given the association between defined T cell subsets and clinical outcomes, precise regulation or programming of T cell state may be one approach to improve the therapeutic potential of ACT. [0005] T cell exhaustion drives dysfunction and impaired immune response to cancer and chronic viral infections. Modern genome engineering technologies have the potential to dramatically advance T cell therapy by programming exhausted T cells to desirable phenotypes. There is a need to find and develop regulators of T cell state and discover T cell gene networks and their corresponding phenotypes, in order to enhance T cell phenotype and improve the efficacy of T cell therapies to help treat viral infections, kill cancer cells, and control solid tumors. SUMMARY [0006] In an aspect, the disclosure relates to an isolated polynucleotide encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. In a further aspect, the disclosure relates to an isolated polynucleotide encoding a transcription factor selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof. In some embodiments, the isolated polynucleotide comprises a sequence selected from SEQ ID NOs: 75-97. In some embodiments, the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. [0007] In a further aspect, the disclosure relates to a vector encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. In a further aspect, the disclosure relates to a vector encoding a transcription factor selected fromTGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof. In some embodiments, the vector comprises a promoter operably linked to a polynucleotide sequence encoding the transcription factor. In some embodiments, the promoter is non-endogenous to the transcription factor. In some embodiments, the promoter is a constitutive promoter, or a ubiquitous promoter, or an inducible promoter, or a cell-specific promoter, or a tissue-specific promoter. In some embodiments, the vector comprises an open reading frame (ORF) of the transcription factor. In some embodiments, the vector comprises a sequence selected from SEQ ID NOs: 75-97 or encodes a polypeptide comprising a sequence selected from SEQ ID NOs: 98-120. In some embodiments, the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. In some embodiments, the vector is a viral vector. In some embodiments, the vector is a lentiviral vector. In some embodiments, the vector is an adeno-associated virus (AAV) vector. In some embodiments, the AAV vector is selected from AAV1, AAV2, AAV5, AAV6, AAV8, AAV9, and an engineered AAV vector. [0008] Another aspect of the disclosure provides a method of modulating T cells. The method may include administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. In some embodiments, modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof. [0009] Another aspect of the disclosure provides a method of increasing T cells. The method may include administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [00010] Another aspect of the disclosure provides a method of enhancing adoptive T cell therapy (ACT) in a subject. The method may include administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [00011] Another aspect of the disclosure provides a method of treating cancer in a subject. The method may include administering to a T cell or the subject an activator of a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [00012] In some embodiments, the gene is selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof. In some embodiments, the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. In some embodiments, the activator modulates T cells, and modulating T cells comprises increasing T cells, or increasing T cell distribution, or increasing tissue infiltration, or increasing memory T cells, or increasing the lifetime of a T cell, or preventing T cell exhaustions, or reversing T cell exhaustions, or reducing T cell exhaustion, or enhancing the therapeutic potential of T cells, or a combination thereof. In some embodiments, administration of the activator to the T cell results in a modified T cell. In some embodiments, the modified T cell is administered to a subject. In some embodiments, the T cell is autologous or allogenic. In some embodiments, the activator modulates gene expression within the T cell. In some embodiments, the activator increases expression of CD103 or IL7Ra, or a combination thereof, in the T cell. In some embodiments, the activator comprises a polypeptide, or a polynucleotide, or a small molecule, or a combination thereof. In some embodiments, the activator comprises a polynucleotide encoding the gene. In some embodiments, the activator comprises a polynucleotide comprising the open reading frame of the gene or a polynucleotide encoding a protein encoded by the gene. In some embodiments, the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polynucleotide encoding a polypeptide comprising an amino acid sequence selected from SEQ ID NOs: 98-120. In some embodiments, the activator comprises a polypeptide comprising a protein encoded by the gene. In some embodiments, the activator comprises a polypeptide selected from SEQ ID NOs: 98-120. In some embodiments, the activator comprises a vector as detailed herein. In some embodiments, the activator or a polynucleotide encoding the activator is encapsulated within a lipid nanoparticle or polymeric carrier. In some embodiments, the method further includes administering at least one cancer therapy or at least one antiviral therapy. [00013] Another aspect of the disclosure provides a vector comprising an isolated polynucleotide as detailed herein. [00014] Another aspect of the disclosure provides a cell comprising an isolated polynucleotide as detailed herein, or a vector as detailed herein. In some embodiments, the cell is a CD8+ T cell or a CD4+ T cell. [00015] Another aspect of the disclosure provides a pharmaceutical composition. The pharmaceutical composition may include an isolated polynucleotide as detailed herein, or a vector as detailed herein, or a combination thereof. In some embodiments, the pharmaceutical composition further includes at least one cancer therapy or at least one antiviral therapy. [00016] Another aspect of the disclosure provides a composition for increasing T cells. The composition may include an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. In some embodiments, the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. In some embodiments, the activator comprises a polynucleotide encoding the gene, or a polynucleotide encoding the open reading frame of the gene, or a polypeptide encoded by the gene, or a combination thereof. In some embodiments, the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polypeptide selected from SEQ ID NOs: 98-120. In some embodiments, the composition further includes at least one cancer therapy or at least one antiviral therapy. [00017] The disclosure provides for other aspects and embodiments that will be apparent in light of the following detailed description and accompanying figures. BRIEF DESCRIPTION OF THE DRAWINGS [00018] FIG.1 is a diagram showing a tissue-resident memory T cell (TRM) and that the TRM cell displays distinct functional properties. TRM cells may be used for adoptive cell therapy. [00019] FIG.2 is a diagram showing a hypothesis that transcription factors may be used to reprogram blood-derived T cells towards a tissue-resident state. [00020] FIGS.3A-3B show diagrams of protocols for identifying T cell fate decision mediators with pooled cDNA overexpression screens. FIG.3A is a diagram showing a protocol for identifying T cell fate decision mediators using pooled cDNA overexpression screening. FIG.3B is a diagram showing a protocol for screening of pooled cDNA overexpression to identify fate decision mediators. [00021] FIGS.4A-4C show that IL7Ra and CD103 screens uncover synthetic drivers of the tissue resident state. FIG.4A is a diagram showing T cell fates and phenotypes. FIG. 4B is a graph showing transcription factors that mediate upregulation of CD103 and IL7Ra. FIG.4C is a graph showing that TGIF2LX significantly upregulates both CD103 and IL7Ra. [00022] FIGS.5A-5C show TGIF family structure and expression. FIG.5A is a diagram showing the domains of TGIF family proteins. TGIF family proteins have unique functional domains. FIG.5B is a graph showing the expression in transcript per million (TPM) of TGIF family members in human pan T cells from blood, spleen, bone marrow, ileum, jejunum, lung, and skin. FIG.5C is a graph showing the expression ratio of Tgif1 and Tgif2 of murine spleen T cells and intestinal intraepithelial lymphocytes (IEL). Data was analyzed from PMID:29211713 (Milner et al. Nature 2017, 552 (7684), 253-225, incorporated herein by reference). [00023] FIGS.6A-6E show that Tgif1 is upregulated in early resident memory. Data was analyzed from PMID:32414833 (Kurd et al. Sci. Immunol.2020, 5, 1-16, incorporated herein by reference). FIG.6A is a diagram showing a protocol for generating tissue-resident memory T (Trm) cells using an infectious agent. FIG.6B is a uniform manifold approximation and projection (UMAP) embedding of sequencing of T cells from the spleen and gut following LCMV infection where Tgif1 expression is shown in FIG.6E. FIG.6C is a UMAP embedding of sequencing of T cells from the spleen and gut where the expression of Tgif1 is shown in FIG.6E, and the UMAP is colored coded by days post-infection. FIG.6D is a chart showing the expression of Tgif1 and percentage of cells expressing Tgif1 in T cells from the spleen and gut. FIG.6E is a chart showing the expression of Tgif2 and percentage of cells expressing Tgif2 in T cells from the spleen and gut. [00024] FIGS.7A-7E show a molecular characterization of transcription factor-engineered chimeric antigen receptor T-cells (CAR Ts). FIG.7A is a diagram showing a protocol for transduction, expansion, and characterization of CAR Ts. FIG.7B is a graph showing flow cytometry data measuring forward scatter (FSC) and the CD103 marker on engineered CAR Ts expressing a control (Thy1.1) or TGIF2LX. FIG.7C is a graph showing that CD103 is upregulated in CAR Ts expressing TGIF2LX. FIG.7D is a graph showing flow cytometry data measuring IL7Ra for engineered CAR Ts expressing Thy1.1 or TGIF2LX. FIG.7E is a graph showing that IL7Ra is upregulated in CAR Ts expressing TGIF2LX. [00025] FIGS.8A-8J show that TGIF2LX reprograms T cells towards a tissue resident- like state. FIG.8A is a volcano plot showing genes enriched in CAR Ts expressing TGIF2LX compared to CAR Ts expressing Thy1.1. FIG.8B is a clustered heatmap showing expression of markers in circulatory T cells, TRM cells, CAR Ts expressing TGIF2LX, and CAR Ts expressing Thy1.1. FIG.8C is a heatmap showing similarities in gene expression between perturbed samples overexpressing the indicated transcription factor or Thy1.1 control. FIG.8D is a graph showing principal component (PC) 1 and 2 variance between the perturbed transcription factors. FIG.8E is a graph showing PC1 and PC2 variance between the culture media. FIG.8F is a graph showing PC1 and PC2 variance between the donors. FIG.8G is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for KLF2. FIG.8H is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for CD103. FIG.8I is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for P2RX7. FIG.8J is a graph showing normalized RNA read counts of CAR Ts expressing TGIF2LX and CAR Ts expressing Thy1.1 for ITGB1. [00026] FIGS.9A-9G show that single cell-omics reveals CAR Ts heterogeneity. FIG.9A is a diagram showing a protocol for CITE-Seq (Cellular Indexing of Transcriptomes and Epitopes by Sequencing) of CAR Ts. FIG.9B is a graph showing a UMAP embedding of sequencing of TGIF2LX CAR Ts with clusters numbered. FIG.9C is a UMAP embedding of sequencing of donors (each donor represented by a different color) of TGIF2LX CAR Ts. FIG.9D is a UMAP embedding of sequencing of perturbation of TGIF2LX, RUNX3, and Thy1.1 CAR Ts. FIG.9E is a graph showing expression of differentially expressed genes TGIF2LX and RUNX3 vs Thy1.1 expression. FIG.9F is a UMAP embedding showing identification of multiple subpopulations of CD8TGIF2LX T cells. FIG.9G is a heatmap showing expression of markers for the subpopulations of CD8TGIF2LX T cells. [00027] FIGS.10A-10F show that TGIF2LX increases tissue-residence associated surface markers. FIG.10A is UMAP embeddings showing expression of CD8, CD103, CD69, and CD49a for CD8TGIF2LX T cells. FIG.10B is a UMAP embeddings showing expression of CD8, CD103, CD69, and CD49a for CD8Thy1.1 T cells. FIG.10C is a graph showing the expression level of CD8 for CD8TGIF2LX T cells and CD8Thy1.1 T cells. FIG.10D is a graph showing the expression level of CD103 for CD8TGIF2LX T cells and CD8Thy1.1 T cells. FIG.10E is a graph showing the expression level of CD69 for CD8TGIF2LX T cells and CD8Thy1.1 T cells. FIG.10F is a graph showing the expression level of CD49a for CD8TGIF2LX T cells and CD8Thy1.1 T cells. [00028] FIGS.11A-11C show that CD8 subpopulations mirror memory to effector hierarchy. FIG.11A is a heatmap showing expression of markers in the CD8 T cell subpopulations numbered in FIG.9F. FIG.11B is a graph showing the average expression and percentage of cells expressing each marker. FIG.11C is a diagram showing that the CD8 T cell subpopulations mirror memory to effector cell hierarchy. [00029] FIGS.12A-12E show that TGIF2LX CAR Ts maintained cytotoxic capabilities in vitro. FIG.12A is a graph showing the percent of target cells left over time after incubation with CAR TTGIF2LX or CAR TThy1.1. FIG.12B is graphs showing concentrations of secreted granzyme A, perforin, and granulysin in media by CAR TTGIF2LX or CAR TThy1.1 after incubation with target cells. FIG.12C is graphs showing concentrations of secreted IL-17A, IL-4, and IL-2 in media by CAR TTGIF2LX or CAR TThy1.1 after incubation with target cells. FIG. 12D is graphs showing concentrations of secreted granzyme B, IFN-Ȗ, and sFas in media by CAR TTGIF2LX or CAR TThy1.1 after incubation with target cells. FIG.12E is graphs showing secreted concentrations of IL-10, IL-6, and TNF-Į in media by CAR TTGIF2LX or CAR TThy1.1 after incubation with target cells. [00030] FIGS.13A-13B show that CAR TTGIFL2X improved tumor control in a HER2+ breast cancer xenograft. FIG.13A is a diagram showing a protocol for inducing a breast cancer model in vivo and treating the cancer with CAR T cells. FIG.13B is a graph showing tumor volume overtime following treatment with various CAR T cells. [00031] FIG.14 is a graph showing expression of CD103. The data shows that TGIF2LX boosts CD103 expression in context of TGFȕ. [00032] FIGS.15A-15B show the TGIF2LX domains that may be responsible for residency reprogramming. FIG.15A is a diagram showing the 3D structure of TGIF2LX. FIG.15B is a diagram showing the domains of TGIF family proteins. DETAILED DESCRIPTION [00033] Provided herein are compositions and methods for increasing or enhancing T cells, which may be used to enhance antiviral therapies and cancer therapies such as ACT. The compositions and methods may include increasing gene expression or a gene product thereof that encodes a regulator of T cell phenotypes, such as transcription factors, and may include, for example, delivery of a vector encoding the gene or increasing gene expression or a gene product thereof. Transcriptions factors (TFs) are central mediators of cellular reprogramming and differentiation. The gene may be selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. As detailed herein, cDNA overexpression screens were conducted, using open reading frames (ORFs) encoding all known transcription factors, in primary CD8 T-cells to identify genetic regulators of CD103 (a residency marker) and IL7Ra (a memory marker). Overexpression of one or more of these genes resulted in widespread transcriptional reprogramming, including activation of genes associated with increased tissue residency programs (for example, ITGAE, CD69, and CXCR6) and decreases in genes associated with circulation programs (for example, KLF2 and S1PR1). Increased expression of one or more of the genes detailed herein may impact the “resident memory” phenotype of a T cell, which relates to durability of the T cell and/or the distribution and infiltration of the T cell into tissues. The genes disclosed herein could be used to support specific features of memory T cells, counter T cell exhaustion, improve tumor control, enhance T cell infiltration into tissues, and/or engineer T cells with enhanced durability and therapeutic potential. The gene targets identified and described herein may be used, for example, to improve the efficacy of ACT. 1. Definitions [00034] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art. In case of conflict, the present document, including definitions, will control. Preferred methods and materials are described below, although methods and materials similar or equivalent to those described herein can be used in practice or testing of the present invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety. The materials, methods, and examples disclosed herein are illustrative only and not intended to be limiting. [00035] The terms “comprise(s),” “include(s),” “having,” “has,” “can,” “contain(s),” and variants thereof, as used herein, are intended to be open-ended transitional phrases, terms, or words that do not preclude the possibility of additional acts or structures. The singular forms “a,” “and,” and “the” include plural references unless the context clearly dictates otherwise. The present disclosure also contemplates other embodiments “comprising,” “consisting of,” and “consisting essentially of,” the embodiments or elements presented herein, whether explicitly set forth or not. [00036] For the recitation of numeric ranges herein, each intervening number there between with the same degree of precision is explicitly contemplated. For example, for the range of 6-9, the numbers 7 and 8 are contemplated in addition to 6 and 9, and for the range 6.0-7.0, the number 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, and 7.0 are explicitly contemplated. [00037] The term “about” or “approximately” as used herein as applied to one or more values of interest, refers to a value that is similar to a stated reference value, or within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, such as the limitations of the measurement system. In certain aspects, the term “about” refers to a range of values that fall within 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value). Alternatively, “about” can mean within 3 or more than 3 standard deviations, per the practice in the art. Alternatively, such as with respect to biological systems or processes, the term “about” can mean within an order of magnitude, preferably within 5-fold, and more preferably within 2- fold, of a value. [00038] “Adeno-associated virus” or “AAV” as used interchangeably herein refers to a small virus belonging to the genus Dependovirus of the Parvoviridae family that infects humans and some other primate species. AAV is not currently known to cause disease and consequently the virus causes a very mild immune response. [00039] “Allogeneic” refers to any material derived from another subject of the same species. Allogeneic cells are genetically distinct and immunologically incompatible yet belong to the same species. Typically, “allogeneic” is used to define cells, such as stem cells, that are transplanted from a donor to a recipient of the same species. [00040] “Amino acid” as used herein refers to naturally occurring and non-natural synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code. Amino acids can be referred to herein by either their commonly known three-letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical Nomenclature Commission. Amino acids include the side chain and polypeptide backbone portions. [00041] “Autologous" refers to any material derived from a subject and re-introduced to the same subject. [00042] “Binding region” as used herein refers to the region within a target region that is recognized and bound by the CRISPR/Cas-based gene editing system. [00043] The terms “cancer”, “cancer cell”, “tumor”, and “tumor cell” are used interchangeably herein and refer generally to a group of diseases characterized by uncontrolled, abnormal growth of cells (e.g., a neoplasia). In some forms of cancer, the cancer cells can spread locally or through the bloodstream and lymphatic system to other parts of the body (“metastatic cancer”). “Cancer” refers to all types of cancer or neoplasm or malignant tumors found in animals, including carcinoma, adenoma, melanoma, sarcoma, lymphoma, leukemia, blastoma, glioma, astrocytoma, mesothelioma, or a germ cell tumor. Cancer may include cancer of, for example, the colon, rectum, stomach, bladder, cervix, uterus, skin, epithelium, muscle, kidney, liver, lymph, bone, blood, ovary, prostate, lung, brain, head and neck, and/or breast. Cancer may include medullablastoma, non-small cell lung cancer, and/or mesothelioma. In embodiments detailed herein, the cancer includes leukemia. The term “leukemia” refers to broadly progressive, malignant diseases of the hematopoietic organs/systems and is generally characterized by a distorted proliferation and development of leukocytes and their precursors in the blood and bone marrow. Leukemia diseases include, for example, acute nonlymphocytic leukemia, chronic lymphocytic leukemia, acute granulocytic leukemia, chronic granulocytic leukemia, acute promyelocytic leukemia, adult T-cell leukemia, aleukemic leukemia, a leukocythemic leukemia, basophilic leukemia, blast cell leukemia, bovine leukemia, chronic myelocytic leukemia, leukemia cutis, embryonal leukemia, eosinophilic leukemia, Gross' leukemia, Rieder cell leukemia, Schilling's leukemia, stem cell leukemia, subleukemic leukemia, undifferentiated cell leukemia, hairy-cell leukemia, hemoblastic leukemia, hemocytoblastic leukemia, histiocytic leukemia, stem cell leukemia, acute monocytic leukemia, leukopenic leukemia, lymphatic leukemia, lymphoblastic leukemia, lymphocytic leukemia, lymphogenous leukemia, lymphoid leukemia, lymphosarcoma cell leukemia, mast cell leukemia, megakaryocytic leukemia, micromyeloblastic leukemia, monocytic leukemia, myeloblastic leukemia, myelocytic leukemia, myeloid leukemia, myeloid granulocytic leukemia, myelomonocytic leukemia, Naegeli leukemia, plasma cell leukemia, plasmacytic leukemia, and promyelocytic leukemia. In some embodiments, the leukemia is chronic myeloid leukemia (CML). In some embodiments, the leukemia is acute myeloid leukemia (AML). [00044] “Clustered Regularly Interspaced Short Palindromic Repeats” and “CRISPRs”, as used interchangeably herein, refers to loci containing multiple short direct repeats that are found in the genomes of approximately 40% of sequenced bacteria and 90% of sequenced archaea. [00045] “Coding sequence” or “encoding nucleic acid” as used herein means the nucleic acids (RNA or DNA molecule) that comprise a nucleotide sequence which encodes a protein. The coding sequence can further include initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of an individual or mammal to which the nucleic acid is administered. The regulatory elements may include, for example, a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal. The coding sequence may be codon optimized. [00046] “Complement” or “complementary” as used herein means a nucleic acid can mean Watson-Crick (e.g., A-T/U and C-G) or Hoogsteen base pairing between nucleotides or nucleotide analogs of nucleic acid molecules. “Complementarity” refers to a property shared between two nucleic acid sequences, such that when they are aligned antiparallel to each other, the nucleotide bases at each position will be complementary. [00047] The terms “control,” “reference level,” and “reference” are used herein interchangeably. The reference level may be a predetermined value or range, which is employed as a benchmark against which to assess the measured result. “Control group” as used herein refers to a group of control subjects. The predetermined level may be a cutoff value from a control group. The predetermined level may be an average from a control group. Cutoff values (or predetermined cutoff values) may be determined by Adaptive Index Model (AIM) methodology. Cutoff values (or predetermined cutoff values) may be determined by a receiver operating curve (ROC) analysis from biological samples of the patient group. ROC analysis, as generally known in the biological arts, is a determination of the ability of a test to discriminate one condition from another, e.g., to determine the performance of each marker in identifying a patient having CRC. A description of ROC analysis is provided in P.J. Heagerty et al. (Biometrics 2000, 56, 337-44), the disclosure of which is hereby incorporated by reference in its entirety. Alternatively, cutoff values may be determined by a quartile analysis of biological samples of a patient group. For example, a cutoff value may be determined by selecting a value that corresponds to any value in the 25th-75th percentile range, preferably a value that corresponds to the 25th percentile, the 50th percentile or the 75th percentile, and more preferably the 75th percentile. Such statistical analyses may be performed using any method known in the art and can be implemented through any number of commercially available software packages (e.g., from Analyse-it Software Ltd., Leeds, UK; StataCorp LP, College Station, TX; SAS Institute Inc., Cary, NC.). The healthy or normal levels or ranges for a target or for a protein activity may be defined in accordance with standard practice. A control may be a subject or cell without a composition as detailed herein. A control may be a subject, or a sample therefrom, whose disease state is known. The subject, or sample therefrom, may be healthy, diseased, diseased prior to treatment, diseased during treatment, or diseased after treatment, or a combination thereof. [00048] “Correcting”, “gene editing,” and “restoring” as used herein refers to changing a mutant gene that encodes a dysfunctional protein or truncated protein or no protein at all, such that a full-length functional or partially full-length functional protein expression is obtained. Correcting or restoring a mutant gene may include replacing the region of the gene that has the mutation or replacing the entire mutant gene with a copy of the gene that does not have the mutation with a repair mechanism such as homology-directed repair (HDR). Correcting or restoring a mutant gene may also include repairing a frameshift mutation that causes a premature stop codon, an aberrant splice acceptor site or an aberrant splice donor site, by generating a double stranded break in the gene that is then repaired using non-homologous end joining (NHEJ). NHEJ may add or delete at least one base pair during repair which may restore the proper reading frame and eliminate the premature stop codon. Correcting or restoring a mutant gene may also include disrupting an aberrant splice acceptor site or splice donor sequence. Correcting or restoring a mutant gene may also include deleting a non-essential gene segment by the simultaneous action of two nucleases on the same DNA strand in order to restore the proper reading frame by removing the DNA between the two nuclease target sites and repairing the DNA break by NHEJ. [00049] “Donor DNA”, “donor template,” and “repair template” as used interchangeably herein refers to a double-stranded DNA fragment or molecule that includes at least a portion of the gene of interest. The donor DNA may encode a full-functional protein or a partially functional protein. [00050] “Enhancer” as used herein refers to non-coding DNA sequences containing multiple activator and repressor binding sites. Enhancers range from 200 bp to 1 kb in length and may be either proximal, 5’ upstream to the promoter or within the first intron of the regulated gene, or distal, in introns of neighboring genes or intergenic regions far away from the locus. Through DNA looping, active enhancers contact the promoter dependently of the core DNA binding motif promoter specificity. 4 to 5 enhancers may interact with a promoter. Similarly, enhancers may regulate more than one gene without linkage restriction and may “skip” neighboring genes to regulate more distant ones. Transcriptional regulation may involve elements located in a chromosome different to one where the promoter resides. Proximal enhancers or promoters of neighboring genes may serve as platforms to recruit more distal elements. [00051] “Frameshift” or “frameshift mutation” as used interchangeably herein refers to a type of gene mutation wherein the addition or deletion of one or more nucleotides causes a shift in the reading frame of the codons in the mRNA. The shift in reading frame may lead to the alteration in the amino acid sequence at protein translation, such as a missense mutation or a premature stop codon. [00052] “Functional” and “full-functional” as used herein describes protein that has biological activity. A “functional gene” refers to a gene transcribed to mRNA, which is translated to a functional protein. [00053] “Fusion protein” as used herein refers to a chimeric protein created through the joining of two or more genes that originally coded for separate proteins. The translation of the fusion gene results in a single polypeptide with functional properties derived from each of the original proteins. [00054] “Genetic construct" as used herein refers to the DNA or RNA molecules that comprise a polynucleotide that encodes a protein. The coding sequence includes initiation and termination signals operably linked to regulatory elements including a promoter and polyadenylation signal capable of directing expression in the cells of the individual to whom the nucleic acid molecule is administered. As used herein, the term “expressible form” refers to gene constructs that contain the necessary regulatory elements operable linked to a coding sequence that encodes a protein such that when present in the cell of the individual, the coding sequence will be expressed. The regulatory elements may include, for example, a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal. [00055] “Genome editing” or “gene editing” as used herein refers to changing the DNA sequence of a gene. Genome editing may include correcting or restoring a mutant gene or adding additional mutations. Genome editing may include knocking out a gene, such as a mutant gene or a normal gene. Genome editing may be used to treat disease or, for example, enhance muscle repair, by changing the gene of interest. In some embodiments, the compositions and methods detailed herein are for use in somatic cells and not germ line cells. [00056] The term “heterologous” as used herein refers to nucleic acid comprising two or more subsequences that are not found in the same relationship to each other in nature. For instance, a nucleic acid that is recombinantly produced typically has two or more sequences from unrelated genes synthetically arranged to make a new functional nucleic acid, for example, a promoter from one source and a coding region from another source. The two nucleic acids are thus heterologous to each other in this context. When added to a cell, the recombinant nucleic acids would also be heterologous to the endogenous genes of the cell. Thus, in a chromosome, a heterologous nucleic acid would include a non-native (non- naturally occurring) nucleic acid that has integrated into the chromosome, or a non-native (non-naturally occurring) extrachromosomal nucleic acid. Similarly, a heterologous protein indicates that the protein comprises two or more subsequences that are not found in the same relationship to each other in nature (for example, a “fusion protein,” where the two subsequences are encoded by a single nucleic acid sequence). [00057] “Homology-directed repair” or “HDR” as used interchangeably herein refers to a mechanism in cells to repair double strand DNA lesions when a homologous piece of DNA is present in the nucleus, mostly in G2 and S phase of the cell cycle. HDR uses a donor DNA template to guide repair and may be used to create specific sequence changes to the genome, including the targeted addition of whole genes. If a donor template is provided along with the CRISPR/Cas9-based gene editing system, then the cellular machinery will repair the break by homologous recombination, which is enhanced several orders of magnitude in the presence of DNA cleavage. When the homologous DNA piece is absent, non-homologous end joining may take place instead. [00058] “Identical” or “identity” as used herein in the context of two or more polynucleotide or polypeptide sequences means that the sequences have a specified percentage of residues that are the same over a specified region. The percentage may be calculated by optimally aligning the two sequences, comparing the two sequences over the specified region, determining the number of positions at which the identical residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the specified region, and multiplying the result by 100 to yield the percentage of sequence identity. In cases where the two sequences are of different lengths or the alignment produces one or more staggered ends and the specified region of comparison includes only a single sequence, the residues of single sequence are included in the denominator but not the numerator of the calculation. When comparing DNA and RNA, thymine (T) and uracil (U) may be considered equivalent. Identity may be performed manually or by using a computer sequence algorithm such as BLAST or BLAST 2.0. [00059] “Mutant gene” or “mutated gene” as used interchangeably herein refers to a gene that has undergone a detectable mutation. A mutant gene has undergone a change, such as the loss, gain, or exchange of genetic material, which affects the normal transmission and expression of the gene. A “disrupted gene” as used herein refers to a mutant gene that has a mutation that causes a premature stop codon. The disrupted gene product is truncated relative to a full-length undisrupted gene product. [00060] “Non-homologous end joining (NHEJ) pathway” as used herein refers to a pathway that repairs double-strand breaks in DNA by directly ligating the break ends without the need for a homologous template. The template-independent re-ligation of DNA ends by NHEJ is a stochastic, error-prone repair process that introduces random micro-insertions and micro-deletions (indels) at the DNA breakpoint. This method may be used to intentionally disrupt, delete, or alter the reading frame of targeted gene sequences. NHEJ typically uses short homologous DNA sequences called microhomologies to guide repair. These microhomologies are often present in single-stranded overhangs on the end of double-strand breaks. When the overhangs are perfectly compatible, NHEJ usually repairs the break accurately, yet imprecise repair leading to loss of nucleotides may also occur, but is much more common when the overhangs are not compatible. “Nuclease mediated NHEJ” as used herein refers to NHEJ that is initiated after a nuclease cuts double stranded DNA. [00061] “Normal gene” as used herein refers to a gene that has not undergone a change, such as a loss, gain, or exchange of genetic material. The normal gene undergoes normal gene transmission and gene expression. For example, a normal gene may be a wild-type gene. [00062] “Nucleic acid” or “oligonucleotide” or “polynucleotide” as used herein means at least two nucleotides covalently linked together. The depiction of a single strand also defines the sequence of the complementary strand. Thus, a polynucleotide also encompasses the complementary strand of a depicted single strand. Many variants of a polynucleotide may be used for the same purpose as a given polynucleotide. Thus, a polynucleotide also encompasses substantially identical polynucleotides and complements thereof. A single strand provides a probe that may hybridize to a target sequence under stringent hybridization conditions. Thus, a polynucleotide also encompasses a probe that hybridizes under stringent hybridization conditions. Polynucleotides may be single stranded or double stranded or may contain portions of both double stranded and single stranded sequence. The polynucleotide can be nucleic acid, natural or synthetic, DNA, genomic DNA, cDNA, RNA, mRNA, or a hybrid, where the polynucleotide can contain combinations of deoxyribo- and ribo-nucleotides, and combinations of bases including, for example, uracil, adenine, thymine, cytosine, guanine, inosine, xanthine hypoxanthine, isocytosine, and isoguanine. Polynucleotides can be obtained by chemical synthesis methods or by recombinant methods. [00063] “Open reading frame” refers to a stretch of codons that begins with a start codon and ends at a stop codon. In eukaryotic genes with multiple exons, introns are removed, and exons are then joined together after transcription to yield the final mRNA for protein translation. An open reading frame may be a continuous stretch of codons. In some embodiments, the open reading frame only applies to spliced mRNAs, not genomic DNA, for expression of a protein. [00064] “Operably linked” as used herein means that expression of a gene is under the control of a promoter with which it is spatially connected. A promoter may be positioned 5' (upstream) or 3' (downstream) of a gene under its control. The distance between the promoter and a gene may be approximately the same as the distance between that promoter and the gene it controls in the gene from which the promoter is derived. As is known in the art, variation in this distance may be accommodated without loss of promoter function. Nucleic acid or amino acid sequences are “operably linked” (or “operatively linked”) when placed into a functional relationship with one another. For instance, a promoter or enhancer is operably linked to a coding sequence if it regulates, or contributes to the modulation of, the transcription of the coding sequence. Operably linked DNA sequences are typically contiguous, and operably linked amino acid sequences are typically contiguous and in the same reading frame. However, since enhancers generally function when separated from the promoter by up to several kilobases or more and intronic sequences may be of variable lengths, some polynucleotide elements may be operably linked but not contiguous. Similarly, certain amino acid sequences that are non-contiguous in a primary polypeptide sequence may nonetheless be operably linked due to, for example folding of a polypeptide chain. With respect to fusion polypeptides, the terms “operatively linked” and “operably linked” can refer to the fact that each of the components performs the same function in linkage to the other component as it would if it were not so linked. [00065] “Partially-functional” as used herein describes a protein that is encoded by a mutant gene and has less biological activity than a functional protein but more than a non- functional protein. [00066] A “peptide” or “polypeptide” is a linked sequence of two or more amino acids linked by peptide bonds. The polypeptide can be natural, synthetic, or a modification or combination of natural and synthetic. Peptides and polypeptides include proteins such as binding proteins, receptors, and antibodies. The terms “polypeptide”, “protein,” and “peptide” are used interchangeably herein. “Primary structure” refers to the amino acid sequence of a particular peptide. “Secondary structure” refers to locally ordered, three dimensional structures within a polypeptide. These structures are commonly known as domains, for example, enzymatic domains, extracellular domains, transmembrane domains, pore domains, and cytoplasmic tail domains. “Domains” are portions of a polypeptide that form a compact unit of the polypeptide and are typically 15 to 350 amino acids long. Exemplary domains include domains with enzymatic activity or ligand binding activity. Typical domains are made up of sections of lesser organization such as stretches of beta-sheet and alpha- helices. “Tertiary structure” refers to the complete three-dimensional structure of a polypeptide monomer. “Quaternary structure” refers to the three-dimensional structure formed by the noncovalent association of independent tertiary units. A “motif” is a portion of a polypeptide sequence and includes at least two amino acids. A motif may be 2 to 20, 2 to 15, or 2 to 10 amino acids in length. In some embodiments, a motif includes 3, 4, 5, 6, or 7 sequential amino acids. A domain may be comprised of a series of the same type of motif. [00067] “Premature stop codon” or “out-of-frame stop codon” as used interchangeably herein refers to nonsense mutation in a sequence of DNA, which results in a stop codon at location not normally found in the wild-type gene. A premature stop codon may cause a protein to be truncated or shorter compared to the full-length version of the protein. [00068] “Promoter” as used herein means a synthetic or naturally derived molecule which is capable of conferring, activating or enhancing expression of a nucleic acid in a cell. A promoter may comprise one or more specific transcriptional regulatory sequences to further enhance expression and/or to alter the spatial expression and/or temporal expression of same. A promoter may also comprise distal enhancer or repressor elements, which may be located as much as several thousand base pairs from the start site of transcription. A promoter may be derived from sources including viral, bacterial, fungal, plants, insects, and animals. A promoter may regulate the expression of a gene component constitutively, or differentially with respect to cell, the tissue or organ in which expression occurs or, with respect to the developmental stage at which expression occurs, or in response to external stimuli such as physiological stresses, pathogens, metal ions, or inducing agents. Representative examples of promoters include the bacteriophage T7 promoter, bacteriophage T3 promoter, SP6 promoter, lac operator-promoter, tac promoter, SV40 late promoter, SV40 early promoter, RSV-LTR promoter, CMV IE promoter, SV40 early promoter or SV40 late promoter, human U6 (hU6) promoter, and CMV IE promoter. Promoters that target muscle-specific stem cells may include the CK8 promoter, the Spc5-12 promoter, and the MHCK7 promoter. [00069] The term “recombinant” when used with reference to, for example, a cell, nucleic acid, protein, or vector, indicates that the cell, nucleic acid, protein, or vector, has been modified by the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found within the native (naturally occurring) form of the cell or express a second copy of a native gene that is otherwise normally or abnormally expressed, under expressed, or not expressed at all. [00070] The term “shRNA” stands for short hairpin RNA or small hairpin RNA. A shRNA is an artificial RNA molecule with a tight hairpin turn that can be used to silence target gene expression via RNA interference (RNAi). Expression of shRNA in cells may be facilitated by delivery of plasmids or viral or bacterial vectors. The shRNA is processed by Dicer into siRNA. [00071] The term “siRNA” stands for small interfering RNA siRNA, sometimes also known as short interfering RNA or silencing RNA. A siRNA is a class of double-stranded RNA molecule. The siRNA may be natural or artificial. The siRNA forms a complex with the RNA-induced silencing complex (RISC). The antisense (guide) strand of siRNA directs RISC to mRNA that has a complementary sequence, and then the mRNA is cleaved by RISC or its translation is repressed. [00072] “Sample” or “test sample” as used herein can mean any sample in which the presence and/or level of a target is to be detected or determined or any sample comprising a DNA targeting or gene editing system or component thereof as detailed herein. Samples may include liquids, solutions, emulsions, or suspensions. Samples may include a medical sample. Samples may include any biological fluid or tissue, such as blood, whole blood, fractions of blood such as plasma and serum, muscle, interstitial fluid, sweat, saliva, urine, tears, synovial fluid, bone marrow, cerebrospinal fluid, nasal secretions, sputum, amniotic fluid, bronchoalveolar lavage fluid, gastric lavage, emesis, fecal matter, lung tissue, peripheral blood mononuclear cells, total white blood cells, lymph node cells, spleen cells, tonsil cells, cancer cells, tumor cells, bile, digestive fluid, skin, or combinations thereof. In some embodiments, the sample comprises an aliquot. In other embodiments, the sample comprises a biological fluid. Samples can be obtained by any means known in the art. The sample can be used directly as obtained from a patient or can be pre-treated, such as by filtration, distillation, extraction, concentration, centrifugation, inactivation of interfering components, addition of reagents, and the like, to modify the character of the sample in some manner as discussed herein or otherwise as is known in the art. [00073] “Subject” and “patient” as used herein interchangeably refers to any vertebrate, including, but not limited to, a mammal that wants or is in need of the herein described compositions or methods. The subject may be a human or a non-human. The subject may be a vertebrate. The subject may be a mammal. The mammal may be a primate or a non- primate. The mammal can be a non-primate such as, for example, cow, pig, camel, llama, hedgehog, anteater, platypus, elephant, alpaca, horse, goat, rabbit, sheep, hamster, guinea pig, cat, dog, rat, and mouse. The mammal can be a primate such as a human. The mammal can be a non-human primate such as, for example, monkey, cynomolgous monkey, rhesus monkey, chimpanzee, gorilla, orangutan, and gibbon. The subject may be of any age or stage of development, such as, for example, an adult, an adolescent, a child, such as age 0-2, 2-4, 2-6, or 6-12 years, or an infant, such as age 0-1 years. The subject may be male. The subject may be female. In some embodiments, the subject has a specific genetic marker. The subject may be undergoing other forms of treatment. The subject may have a disease or condition. In some embodiments, the subject has cancer. In some embodiments, the subject is human. [00074] “Substantially identical” can mean that a first and second amino acid or polynucleotide sequence are at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99%, or less than 100% over a region of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100 amino acids or nucleotides, respectively. [00075] “Target gene” as used herein refers to any nucleotide sequence encoding a known or putative gene product. The target gene may be a mutated gene involved in a genetic disease. The target gene may encode a known or putative gene product that is intended to be corrected or for which its expression is intended to be modulated. In certain embodiments, the target gene is a gene detailed herein as a modulator of T cells. [00076] “Target region” as used herein refers to the region of the target gene to which the CRISPR/Cas9-based gene editing or targeting system is designed to bind. [00077] “T cells” are a type of white blood cell of the immune system and play a central role in the adaptive immune response. T cells express a T-cell receptor (TCR) on their cell surface. The T cell receptor (TCR) of a T cell is able to interact with immunogenic peptides (epitopes) bound to major histocompatibility complex (MHC) molecules and presented on the surface of target cells. Specific binding of the TCR triggers a signal cascade inside the T cell leading to proliferation and differentiation into a maturated effector T cell. T cells may differentiate into different types of T cells. T cells may include, for example, CD8+ T cells (“killer T cells” or “cytotoxic T cells) and CD4+ T cells (“helper T cells”). CD8+ T cells and CD4+ T cells may further differentiate into other types of T cells including, for example, regulatory T cells (“suppressor T cells”) and memory T cells. In some embodiments herein, the T cell is a memory T cell. An antigen-naïve T cell expands and differentiates into a memory T cell after encountering the cognate antigen within the context of a major histocompatibility complex (MHC) molecule on the surface of an antigen presenting cell. Memory T cells may be CD8+ or CD4+. Memory T cells are long-lived and can quickly expand to large numbers of effector T cells upon re-exposure to their cognate antigen. Tissue-resident memory T cells (TRM cells) are a subset of a long-lived memory T cells that occupy epithelial, mucosal, and other tissues such as skin, mucosa, lung, brain, pancreas, and gastrointestinal tract, without recirculating. TRM cells may be transcriptionally, phenotypically, and functionally different from central memory (TCM) and effector memory (TEM) T cells that recirculate between blood, the T cell zones of secondary lymphoid organ, lymph tissues, and nonlymphoid tissues. TRM cells can develop from circulating effector memory T cell precursors in response to an antigen. TRM cells may be CD103+. TRM cells may provide superior protection against infection in extralymphoid tissues. A T cell detailed herein may be a TRM cell. [00078] “Transgene” as used herein refers to a gene or genetic material containing a gene sequence that has been isolated from one organism and is introduced into a different organism. This non-native segment of DNA may retain the ability to produce RNA or protein in the transgenic organism, or it may alter the normal function of the transgenic organism's genetic code. The introduction of a transgene has the potential to change the phenotype of an organism. [00079] “Transcriptional regulatory elements” or “regulatory elements” refers to a genetic element which can control the expression of nucleic acid sequences, such as activate, enhancer, or decrease expression, or alter the spatial and/or temporal expression of a nucleic acid sequence. Examples of regulatory elements include, for example, promoters, enhancers, splicing signals, polyadenylation signals, and termination signals. A regulatory element can be “endogenous,” “exogenous,” or “heterologous” with respect to the gene to which it is operably linked. An “endogenous” regulatory element is one which is naturally linked with a given gene in the genome. An “exogenous” or “heterologous” regulatory element is one which is not normally linked with a given gene but is placed in operable linkage with a gene by genetic manipulation. [00080] “Treatment” or “treating” or “therapy” when referring to protection of a subject from a disease, means suppressing, repressing, reversing, alleviating, ameliorating, or inhibiting the progress of disease, or completely eliminating a disease. A treatment may be either performed in an acute or chronic way. The term also refers to reducing the severity of a disease or symptoms associated with such disease prior to affliction with the disease. Treatment may result in a reduction in the incidence, frequency, severity, and/or duration of symptoms of the disease. Preventing the disease involves administering a composition of the present invention to a subject prior to onset of the disease. Suppressing the disease involves administering a composition of the present invention to a subject after induction of the disease but before its clinical appearance. Repressing or ameliorating the disease involves administering a composition of the present invention to a subject after clinical appearance of the disease. [00081] As used herein, the term “gene therapy” refers to a method of treating a patient wherein polypeptides or nucleic acid sequences are transferred into cells of a patient such that activity and/or the expression of a particular gene is modulated. In certain embodiments, the expression of the gene is suppressed. In certain embodiments, the expression of the gene is enhanced. In certain embodiments, the temporal or spatial pattern of the expression of the gene is modulated. [00082] “Variant” used herein with respect to a polynucleotide means (i) a portion or fragment of a referenced nucleotide sequence; (ii) the complement of a referenced nucleotide sequence or portion thereof; (iii) a nucleic acid that is substantially identical to a referenced nucleic acid or the complement thereof; or (iv) a nucleic acid that hybridizes under stringent conditions to the referenced nucleic acid, complement thereof, or a sequence substantially identical thereto. A variant can be a polynucleotide sequence that is substantially identical over the full length of the full polynucleotide sequence or a fragment thereof. The polynucleotide sequence can be 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, or less than 100% identical over the full length of the polynucleotide sequence or a fragment thereof. [00083] “Variant” with respect to a peptide or polypeptide that differs in amino acid sequence by the insertion, deletion, or conservative substitution of amino acids, but retain at least one biological activity. Variant may also mean a protein with an amino acid sequence that is substantially identical to a referenced protein with an amino acid sequence that retains at least one biological activity. Representative examples of “biological activity” include the ability to be bound by a specific antibody or polypeptide or to promote an immune response. Variant can mean a functional fragment thereof. Variant can also mean multiple copies of a polypeptide. The multiple copies can be in tandem or separated by a linker. A conservative substitution of an amino acid, for example, replacing an amino acid with a different amino acid of similar properties (for example, hydrophilicity, degree and distribution of charged regions) is recognized in the art as typically involving a minor change. These minor changes may be identified, in part, by considering the hydropathic index of amino acids, as understood in the art (Kyte et al., J. Mol. Biol.1982, 157, 105-132). The hydropathic index of an amino acid is based on a consideration of its hydrophobicity and charge. It is known in the art that amino acids of similar hydropathic indexes may be substituted and still retain protein function. In one aspect, amino acids having hydropathic indexes of ±2 are substituted. The hydrophilicity of amino acids may also be used to reveal substitutions that would result in proteins retaining biological function. A consideration of the hydrophilicity of amino acids in the context of a peptide permits calculation of the greatest local average hydrophilicity of that peptide. Substitutions may be performed with amino acids having hydrophilicity values within ±2 of each other. Both the hydrophobicity index and the hydrophilicity value of amino acids are influenced by the particular side chain of that amino acid. Consistent with that observation, amino acid substitutions that are compatible with biological function are understood to depend on the relative similarity of the amino acids, and particularly the side chains of those amino acids, as revealed by the hydrophobicity, hydrophilicity, charge, size, and other properties. A variant can be an amino acid sequence that is substantially identical over the full length of the amino acid sequence or fragment thereof. The amino acid sequence can be 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, or less than 100% identical over the full length of the amino acid sequence or a fragment thereof. [00084] “Vector” as used herein means a nucleic acid sequence containing an origin of replication. A vector may be capable of directing the delivery or transfer of a polynucleotide sequence to target cells, where it can be replicated or expressed. A vector may contain an origin of replication, one or more regulatory elements, and/or one or more coding sequences. A vector may be a viral vector, bacteriophage, bacterial artificial chromosome, plasmid, cosmid, or yeast artificial chromosome. A vector may be a DNA or RNA vector. A vector may be a self-replicating extrachromosomal vector. Viral vectors include, but are not limited to, adenovirus vector, adeno-associated virus (AAV) vector, retrovirus vector, or lentivirus vector. A vector may be an adeno-associated virus (AAV) vector. The vector may encode a Cas9 protein and at least one gRNA molecule. [00085] Unless otherwise defined herein, scientific and technical terms used in connection with the present disclosure shall have the meanings that are commonly understood by those of ordinary skill in the art. For example, any nomenclatures used in connection with, and techniques of, cell and tissue culture, molecular biology, immunology, microbiology, genetics, and protein and nucleic acid chemistry and hybridization described herein are those that are well known and commonly used in the art. The meaning and scope of the terms should be clear; in the event however of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. 2. Modulators of T Cells [00086] Provided herein are modulators of T cells. Modifying or modulating may include increasing or decreasing, for example. In some embodiments, the compositions and methods comprise an agent that increases T cells. Increasing T cells may include increasing the number of T cells and/or increasing the number of memory T cells and/or increasing the lifetime of a T cell and/or preventing T cell exhaustion and/or reducing T cell exhaustion and/or reversing T cell exhaustion and/or enhancing the therapeutic potential of T cells. Modifying a T cell may include modifying the expression of a target gene within the T cell. Modifying a T cell may include delivery of a gene to T cells. Modifying a T cell may include enhancing or increasing specific features of memory T cells, countering T cell exhaustion, reversing T cell exhaustion, inhibiting or preventing T cell exhaustion, improving tumor control, increasing T cell biodistribution, enhancing T cell infiltration into tissues, increasing infiltration of T cells into tissues, increasing T cell durability, engineering T cells with enhanced durability and therapeutic potential, increasing T cells, increasing T cell numbers, increasing memory T cells, increasing memory T cell numbers, or a combination thereof. The compositions and methods detailed herein may engineer or modify the gene expression programs within T cells by engineering the T cells directly. In some embodiments, the compositions and methods comprise an agent that increases expression or activity of CD103 or IL7Ra, or a combination thereof, in T cells. IL7Ra may comprise an amino acid sequence of SEQ ID NO: 193, encoded by a polynucleotide comprising the sequence of SEQ ID NO: 194. CD103 may comprise an amino acid sequence of SEQ ID NO: 195, encoded by a polynucleotide comprising the sequence of SEQ ID NO: 196. Expression or activity of CD103 or IL7Ra, or a combination thereof, may each be increased by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. Expression or activity of CD103 or IL7Ra, or a combination thereof, may each be increased by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. Expression or activity of CD103 or IL7Ra, or a combination thereof, may each be increased by about 5- 95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. In some embodiments, the compositions and methods comprise an agent that decreases expression or activity of CCR7 and/or TOX in T cells. Expression of a marker, such as IL7Ra or CD103, may be done by any suitable means in the art, including, for example, ELISA, immunohistochemistry, flow cytometry, FACS, DNA or RNA sequencing, and hybridization of reporters or probes to RNA transcripts. [00087] The modulator of T cells may target a gene or a regulatory element thereof. Regulatory elements include, for example, promoters and enhancers. Regulatory elements may be within 1000 base pairs of the transcription start site. Regulatory elements may be within 600 base pairs of the transcription start site. The agent, or the composition or the method comprising the agent, may modify the expression of a gene. For example, the agent, or the composition or the method comprising the agent, may reduce, inhibit, decrease, activate, increase, or enhance the expression or activity of a gene or its gene protein product. The agent, or the composition or the method comprising the agent, may directly or indirectly modulate the activity of the gene’s protein product. For example, the modulator of T cells may increase or decrease the binding or enzymatic activity of the gene’s protein product, or inhibit the binding of the gene’s protein product to another molecule or ligand, or increase the binding of the gene’s protein product to another molecule or ligand, or increase or decrease the degradation of the gene’s protein product, or a combination thereof. [00088] The modulator of T cells may be an activator of a gene. The activator may comprise or be the gene or coding sequence thereof itself. Provided herein is an activator of a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a regulatory element thereof, or a region thereof, or a combination thereof. The gene may exist in several isoforms. Further provided herein is an activator of a gene selected fromTGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a regulatory element thereof, or a region thereof, or a combination thereof. In some embodiments, the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a regulatory element thereof, or a region thereof, or a combination thereof. The gene may encode a transcription factor. Further provided herein is an activator of a transcription factor selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. In some embodiments, the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a regulatory element thereof, or a region thereof, or a combination thereof. [00089] As an activator, an agent may activate or enhance expression or activity of a gene or gene protein product to increase T cells. The activator may increase the level of polynucleotide encoding the gene or encoding the gene product. The activator may increase the level of transcription of a polynucleotide encoding the gene or encoding the gene product. The activator may increase the level of translation of a mRNA encoding the gene product. The activator may increase the expression of protein encoded by the gene or the open reading frame thereof. The activator may increase the level or amount of protein expressed from the gene. The activator may increase the level or amount of protein expressed from the open reading frame of the gene. [00090] The agent may comprise, for example, a polynucleotide, a polypeptide, a small molecule, a lipid, a carbohydrate, or a combination thereof. In some embodiments, the agent comprises a polynucleotide. The agent may comprise a polynucleotide encoding the gene or a fragment thereof. The agent may comprise a polynucleotide comprising a cDNA of the gene or a fragment thereof. For example, the polynucleotide may comprise a sequence selected from SEQ ID NOs: 75-97, or a fragment thereof. The polynucleotide may comprise a sequence having at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%, or at least 98%, or greater identity to a sequence selected from SEQ ID NOs: 75- 97, or a fragment thereof. The polynucleotide may comprise a sequence having one, two, three, four, five or more changes selected from nucleotide substitutions, insertions, or deletions, relative to a sequence selected from SEQ ID NOs: 75-97, or a fragment thereof. [00091] In some embodiments, the agent comprises a protein. The agent may comprise a polypeptide comprising a protein product of the gene or a fragment thereof. For example, the polypeptide may comprise a sequence selected from SEQ ID NOs: 98-120, or a fragment thereof. The polypeptide may comprise a sequence having at least 75%, at least 80%, at least 85%, at least 90%, or at least 95%, or at least 98%, or greater identity to a sequence selected from SEQ ID NOs: 98-120, or a fragment thereof. The polypeptide may comprise a sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to a sequence selected from SEQ ID NOs: 98-120, or a fragment thereof. In some embodiments, the agent comprises a DNA targeting composition as detailed herein or at least one component thereof. [00092] Examples of genes for modulating T cells are shown in TABLE 1 and TABLE 2.
Figure imgf000028_0001
Figure imgf000029_0001
Figure imgf000030_0001
Figure imgf000031_0001
Figure imgf000032_0001
Figure imgf000033_0001
Figure imgf000034_0001
Figure imgf000035_0001
Figure imgf000036_0001
Figure imgf000037_0001
Figure imgf000038_0001
Figure imgf000039_0001
Figure imgf000040_0001
Figure imgf000041_0001
Figure imgf000042_0001
Figure imgf000043_0001
Figure imgf000044_0001
Figure imgf000045_0001
Figure imgf000046_0001
Figure imgf000047_0001
Figure imgf000048_0001
Figure imgf000049_0001
[00093] T cells may be modulated by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. T cells may be modulated by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold, relative to a control. T cells may be modulated by about 5- 95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. T cells may be reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold, relative to a control. T cells may be reduced by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. T cells may be reduced by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. T cells may be increased by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. T cells may be increased by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. T cells may be increased by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. [00094] In some embodiments, the modulator of T cells is administered with or as a therapy such as cancer therapy. In some embodiments, the modulator of T cells is used to modify a T cell that is then or later administered with or used as a cancer therapy. Accordingly, further provided herein is a T cell modified by an activator as detailed herein. The T cell may be modified in vitro or ex vivo or in vivo. A T cell modified by an activator as detailed herein may be administered to a subject. The cancer therapy may include chemotherapy or immunotherapy. The cancer therapy may include adoptive T cell therapy (ACT) therapy. The cancer therapy may include a chimeric antigen receptor (CAR). For example, a modulator of T cells as detailed herein may be used to modify a CAR T cell to generate a modified or engineered CAR T cell. Further provided herein is a T cell modified by an activator as detailed herein, which is then or later modified to generate a CAR T cell. A chimeric antigen receptor (CAR) may also be known as chimeric immunoreceptor, chimeric T cell receptor, or artificial T cell receptor. CARs are receptor proteins that have been engineered to give T cells the new ability to target a specific antigen. CARs are chimeric in that they may combine both antigen-binding and T cell activating functions into a single receptor. CARs may include an antigen binding domain specific for an antigen on a cancer cell. The premise of CAR T immunotherapy is to modify T cells to recognize cancer cells in order to target and destroy them. T cells are harvested from a subject, the T cells are genetically altered to add a chimeric antigen receptor (CAR) that specifically recognizes cancer cells, and the resulting CAR T cells may be administered to the subject to attack their tumors. CAR T cell therapy and modification to T cells are described in, for example, WO2012/079000 and WO2012/129514 and WO2018/005712, each of which is incorporated herein by reference in its entirety. [00095] The cancer therapy may include a T cell receptor (TCR) therapy. A modulator of T cells as detailed herein may be used to modify a T cell to generate a modified or engineered T cell for use in TCR therapy. Like CAR T cell therapy, TCR therapy involves the modification of T cells. However, unlike CAR T cell therapy, which may use man-made receptors that targets antigens on the surface of cells, TCR therapy capitalizes on the natural mechanisms of T cells. Unlike hematological cancer, solid tumors lack cell surface antigens to be recognized by typical CARs. Intracellular antigens from solid tumors may be presented by HLA molecules as HLA-antigen epitope complexes on the cell surface, which may be specifically recognized by TCRs on the surface of tumor-specific T cells, thereby eliciting anti-tumor cytotoxicity by the tumor-specific T cells. TCR therapies may include transformed T cells expressing at least one vector encoding a TCR that is capable of binding to an HLA molecule and antigen on tumor cells. TCR therapy and modification to T cells are described in, for example, WO2019/196088, WO2019/196924, U.S. Patent No.10,889,629, and WO2020/223537, each of which is incorporated herein by reference in its entirety. [00096] In some embodiments, the modulator of T cells, or the resulting modified T cell, is administered concurrently with a cancer therapy, or subsequent to a cancer therapy, or prior to a cancer therapy, or as a cancer therapy. a. DNA Targeting Systems [00097] In some embodiments, the agent comprises a DNA targeting composition or at least one component thereof. A “DNA Targeting System” as used herein is a system capable of specifically targeting a particular region of DNA and modulating gene expression by binding to that region. Non-limiting examples of these systems are CRISPR-Cas-based systems, meganucleases, zinc finger (ZF)-based systems, and/or transcription activator-like effector (TALE)-based systems. The DNA Targeting System may be a nuclease system that acts through mutating or editing the target region (such as by insertion, deletion or substitution) or it may be a system that delivers a functional second polypeptide domain, such as an activator or repressor, to the target region. [00098] Each of these systems comprises a DNA-binding portion or domain, such as a Cas protein and guide RNA, or a meganuclease, or a ZF, or a TALE, that specifically recognizes and binds to a particular target region of a target DNA. The DNA-binding portion (for example, Cas protein, ZF, or TALE) can be linked to a second protein domain, such as a polypeptide with transcription activation activity, transcription repression activity, transcription release factor activity, histone modification activity, nuclease activity, nucleic acid association activity, methylase activity, demethylase activity, acetylation activity, or deacetylation activity, to form a fusion protein. Exemplary second polypeptide domains are detailed further below (see “Cas Fusion Protein”). For example, the DNA-binding portion can be linked to an activator and thus guide the activator to a specific target region of the target DNA. Similarly, the DNA-binding portion can be linked to a repressor and thus guide the repressor to a specific target region of the target DNA. [00099] In some embodiments, the DNA targeting composition comprises a meganuclease. A meganuclease is an endodeoxyribonuclease characterized by a large recognition site, such as double-stranded DNA sequences of 12 to 40 base pairs. The recognition site may occur only once in any given genome. A meganuclease may be a homing endonuclease selected from an intron endonuclease or an intein endonuclease. Meganucleases may include, for example, the LAGLIDADG family of homing endonucleases. [000100] In some embodiments, the DNA-binding portion comprises a Cas protein, such as a Cas9 protein. Some CRISPR-Cas-based systems can operate to activate or repress expression using the Cas protein alone, not linked to an activator or repressor. For example, a nuclease-null Cas9 can act as a repressor on its own, a nuclease-active Cas9 can act as a repressor on its own, or a nuclease-active Cas9 can act as an activator when paired with an inactive (dead) guide RNA. In addition, RNA or DNA that hybridizes to a particular target region of the target DNA can be directly linked (covalently or non-covalently) to an activator or repressor. Some CRISPR-Cas-based systems can operate to activate or repress expression using the Cas protein linked to a second protein domain, such as, for example, an activator or repressor. i) DNA Binding Protein [000101] The DNA Targeting System may include a DNA binding protein. The DNA binding protein may comprise, for example, a zinc finger protein or a transcription activator- like effector (TALE). The zinc finger protein or TALE may target a gene selected fromTGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a regulatory element thereof. (1) Zinc Finger Protein [000102] A zinc finger protein is a protein that includes one or more zinc finger domains. Zinc finger domains are relatively small protein motifs that contain multiple finger-like protrusions that make tandem contacts with their target molecule such as a DNA target molecule. A zinc finger domain may bind one or more zinc ions or other metal ion such as iron, or in some cases a zinc finger domain forms salt bridges to stabilize the finger-like folds. The zinc binding portion of a zinc finger protein may include one or more cysteine residues and/or one or more histidine residues to coordinate the zinc or other metal ion. A zinc finger protein recognizes and binds to a particular DNA sequence via the zinc finger domain. In some embodiments, a zinc finger protein is fused to or includes a nuclease domain and may be referred to as a zinc finger nuclease (ZFN). The nuclease domain may include, for example, the endonuclease FokI. ZFNs may recognize target sites that consist of two zinc-finger binding sites that flank a 5- to 7-base pair (bp) spacer sequence recognized by the endonuclease FokI cleavage domain. (2) Transcription Activator-like Effector (TALE) [000103] A TALE is another type of protein that recognizes and binds to a particular DNA sequence. The DNA-binding domain of a TALE includes an array of tandem 33-35 amino acid repeats, also known as RVD modules. Each RVD module specifically recognizes a single base pair of DNA. RVD modules may be arranged in any order to assemble an array that recognizes a defined DNA sequence. The binding specificity of a TALE DNA-binding domain is determined by the RVD array followed by a single truncated repeat of, for example, 20 amino acids. A TALE DNA-binding domain may have an array of 12 to 27 RVD modules, each RVD module recognizing a single base pair of DNA. Specific RVDs have been identified that recognize each of the four possible DNA nucleotides (A, T, C, and G). Because the TALE DNA-binding domains are modular, repeats that recognize the four different DNA nucleotides may be linked together to recognize any particular DNA sequence. These targeted DNA-binding domains may then be combined with catalytic domains to create functional enzymes, including artificial transcription factors and/or nucleases. In some embodiments, a TALE is fused to or includes a nuclease domain and may be referred to as a TALE nuclease (TALEN). The nuclease domain may include, for example, the endonuclease FokI. TALENs may recognize target sites that consist of two TALE DNA- binding sites that flank a 12-bp to 20-bp spacer sequence recognized by the FokI cleavage domain. (3) DNA Binding Fusion Protein [000104] Additionally or alternatively, a zinc finger protein or TALE can be fused to a polypeptide domain and referred to as a DNA binding fusion protein or fusion protein. The fusion protein may act as a synthetic transcription factor. The fusion protein comprises two heterologous polypeptide domains, including a first polypeptide domain comprising the zinc finger protein or the TALE or a Cas9 protein as further detailed below, and a second polypeptide domain having an activity selected from transcription activation activity, transcription repression activity, nuclease activity, transcription release factor activity, histone modification activity, nucleic acid association activity, methylase activity, and demethylase activity. A zinc finger protein or TALE can be fused to a polypeptide domain having epigenetic modifying activity to mediate targeted gene regulation. A fusion protein comprising a zinc finger protein or TALE, and a second polypeptide domain having transcription repression activity, may mediate targeted gene repression. A fusion protein comprising a zinc finger protein or TALE, and a second polypeptide domain having transcription activation activity, may mediate targeted gene activation. The second polypeptide domain is further detailed below (see “Cas Fusion Protein”). ii) CRISPR/Cas-based Gene Editing System [000105] Provided herein are CRISPR/Cas-based gene editing systems. The CRISPR/Cas-based gene editing system may be used to modulate T cells and/or enhance ACT. The CRISPR/Cas-based gene editing system may include a Cas protein or a fusion protein, and at least one gRNA, and may also be referred to as a “CRISPR-Cas system.” [000106] “Clustered Regularly Interspaced Short Palindromic Repeats” and “CRISPRs”, as used interchangeably herein, refers to loci containing multiple short direct repeats that are found in the genomes of approximately 40% of sequenced bacteria and 90% of sequenced archaea. The CRISPR system is a microbial nuclease system involved in defense against invading phages and plasmids that provides a form of acquired immunity. The CRISPR loci in microbial hosts contain a combination of CRISPR-associated (Cas) genes as well as non- coding RNA elements capable of programming the specificity of the CRISPR-mediated nucleic acid cleavage. Short segments of foreign DNA, called spacers, are incorporated into the genome between CRISPR repeats, and serve as a “memory” of past exposures. Cas proteins include, for example, Cas12 such as Cas12a, Cas9, Cas13, Cascade proteins, and IscB/TnpB proteins. Cas12a may also be referred to as “Cpf1.” Cas12a causes a staggered cut in double stranded DNA, while Cas9 produces a blunt cut. In some embodiments, the Cas protein comprises Cas12a. Cas12a is described in, for example, WO 2018/017754, which is incorporated herein by reference. Cas13 is an RNA-guided RNA endonuclease. Cas13 cleaves single-stranded RNA, and it does not cleave DNA. In some embodiments, the Cas protein comprises Cas13. In some embodiments, the Cas protein comprises Cas9. Cas9 forms a complex with the 3’ end of the sgRNA (which may be referred interchangeably herein as “gRNA”), and the protein-RNA pair recognizes its genomic target by complementary base pairing between the 5’ end of the gRNA sequence and a predefined 20 bp DNA sequence, known as the protospacer. This complex is directed to homologous loci of pathogen DNA via regions encoded within the crRNA, i.e., the protospacers, and protospacer-adjacent motifs (PAMs) within the pathogen genome. The non-coding CRISPR array is transcribed and cleaved within direct repeats into short crRNAs containing individual spacer sequences, which direct Cas nucleases to the target site (protospacer). By simply exchanging the 20 bp recognition sequence of the expressed gRNA, the Cas9 nuclease can be directed to new genomic targets. CRISPR spacers are used to recognize and silence exogenous genetic elements in a manner analogous to RNAi in eukaryotic organisms. [000107] Three classes of CRISPR systems (Types I, II, and III effector systems) are known. The Type II effector system carries out targeted DNA double-strand break in four sequential steps, using a single effector enzyme, Cas9, to cleave dsDNA. Compared to the Type I and Type III effector systems, which require multiple distinct effectors acting as a complex, the Type II effector system may function in alternative contexts such as eukaryotic cells. The Type II effector system consists of a long preǦcrRNA, which is transcribed from the spacerǦcontaining CRISPR locus, the Cas9 protein, and a tracrRNA, which is involved in pre-crRNA processing. The tracrRNAs hybridize to the repeat regions separating the spacers of the preǦcrRNA, thus initiating dsRNA cleavage by endogenous RNase III. This cleavage is followed by a second cleavage event within each spacer by Cas9, producing mature crRNAs that remain associated with the tracrRNA and Cas9, forming a Cas9:crRNA- tracrRNA complex. Cas12a systems include crRNA for successful targeting, whereas Cas9 systems include both crRNA and tracrRNA. [000108] The Cas9:crRNA-tracrRNA complex unwinds the DNA duplex and searches for sequences matching the crRNA to cleave. Target recognition occurs upon detection of complementarity between a “protospacer” sequence in the target DNA and the remaining spacer sequence in the crRNA. Cas9 mediates cleavage of target DNA if a correct protospacer-adjacent motif (PAM) is also present at the 3’ end of the protospacer. For protospacer targeting, the sequence must be immediately followed by the protospacer- adjacent motif (PAM), a short sequence recognized by the Cas9 nuclease that is required for DNA cleavage. Different Cas and Cas Type II systems have differing PAM requirements. For example, Cas12a may function with PAM sequences rich in thymine “T.” [000109] An engineered form of the Type II effector system of S. pyogenes was shown to function in human cells for genome engineering. In this system, the Cas9 protein was directed to genomic target sites by a synthetically reconstituted “guide RNA” (“gRNA”, also used interchangeably herein as a chimeric single guide RNA (“sgRNA”)), which is a crRNA- tracrRNA fusion that obviates the need for RNase III and crRNA processing in general. Provided herein are CRISPR/Cas9-based engineered systems for use in gene editing and treating genetic diseases. The CRISPR/Cas9-based engineered systems can be designed to target any gene, including genes involved in, for example, a genetic disease, aging, tissue regeneration, or wound healing. The CRISPR/Cas9-based gene editing system can include a Cas9 protein or a Cas9 fusion protein. iii) Cas9 Protein [000110] Cas9 protein is an endonuclease that cleaves nucleic acid and is encoded by the CRISPR loci and is involved in the Type II CRISPR system. The Cas9 protein can be from any bacterial or archaea species, including, but not limited to, Streptococcus pyogenes, Staphylococcus aureus (S. aureus), Acidovorax avenae, Actinobacillus pleuropneumoniae, Actinobacillus succinogenes, Actinobacillus suis, Actinomyces sp., cycliphilus denitrificans, Aminomonas paucivorans, Bacillus cereus, Bacillus smithii, Bacillus thuringiensis, Bacteroides sp., Blastopirellula marina, Bradyrhizobium sp., Brevibacillus laterosporus, Campylobacter coli, Campylobacter jejuni, Campylobacter lari, Candidatus Puniceispirillum, Clostridium cellulolyticum, Clostridium perfringens, Corynebacterium accolens, Corynebacterium diphtheria, Corynebacterium matruchotii, Dinoroseobacter shibae, Eubacterium dolichum, gamma proteobacterium, Gluconacetobacter diazotrophicus, Haemophilus parainfluenzae, Haemophilus sputorum, Helicobacter canadensis, Helicobacter cinaedi, Helicobacter mustelae, Ilyobacter polytropus, Kingella kingae, Lactobacillus crispatus, Listeria ivanovii, Listeria monocytogenes, Listeriaceae bacterium, Methylocystis sp., Methylosinus trichosporium, Mobiluncus mulieris, Neisseria bacilliformis, Neisseria cinerea, Neisseria flavescens, Neisseria lactamica, Neisseria sp., Neisseria wadsworthii, Nitrosomonas sp., Parvibaculum lavamentivorans, Pasteurella multocida, Phascolarctobacterium succinatutens, Ralstonia syzygii, Rhodopseudomonas palustris, Rhodovulum sp., Simonsiella muelleri, Sphingomonas sp., Sporolactobacillus vineae, Staphylococcus lugdunensis, Streptococcus sp., Subdoligranulum sp., Tistrella mobilis, Treponema sp., or Verminephrobacter eiseniae. In certain embodiments, the Cas9 molecule is a Streptococcus pyogenes Cas9 molecule (also referred herein as “SpCas9”). SpCas9 may comprise an amino acid sequence of SEQ ID NO: 26. In certain embodiments, the Cas9 molecule is a Staphylococcus aureus Cas9 molecule (also referred herein as “SaCas9”). SaCas9 may comprise an amino acid sequence of SEQ ID NO: 27. [000111] A Cas9 molecule or a Cas9 fusion protein can interact with one or more gRNA molecule(s) and, in concert with the gRNA molecule(s), can localize to a site which comprises a target domain, and in certain embodiments, a PAM sequence. The Cas9 protein forms a complex with the 3’ end of a gRNA. The ability of a Cas9 molecule or a Cas9 fusion protein to recognize a PAM sequence can be determined, for example, by using a transformation assay as known in the art. [000112] The specificity of the CRISPR-based system may depend on two factors: the target sequence and the protospacer-adjacent motif (PAM). The target sequence is located on the 5’ end of the gRNA and is designed to bond with base pairs on the host DNA at the correct DNA sequence known as the protospacer. By simply exchanging the recognition sequence of the gRNA, the Cas9 protein can be directed to new genomic targets. The PAM sequence is located on the DNA to be altered and is recognized by a Cas9 protein. PAM recognition sequences of the Cas9 protein can be species specific. [000113] In certain embodiments, the ability of a Cas9 molecule or a Cas9 fusion protein to interact with and cleave a target nucleic acid is PAM sequence dependent. A PAM sequence is a sequence in the target nucleic acid. In certain embodiments, cleavage of the target nucleic acid occurs upstream from the PAM sequence. Cas9 molecules from different bacterial species can recognize different sequence motifs (for example, PAM sequences). A Cas9 molecule of S. pyogenes may recognize the PAM sequence of NRG (5’-NRG-3’, where R is any nucleotide residue, and in some embodiments, R is either A or G, SEQ ID NO: 1). In certain embodiments, a Cas9 molecule of S. pyogenes may naturally prefer and recognize the sequence motif NGG (SEQ ID NO: 2) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from that sequence. In some embodiments, a Cas9 molecule of S. pyogenes accepts other PAM sequences, such as NAG (SEQ ID NO: 3) in engineered systems (Hsu et al., Nature Biotechnology 2013 doi:10.1038/nbt.2647, incorporated herein by reference). In certain embodiments, a Cas9 molecule of S. thermophilus recognizes the sequence motif NGGNG (SEQ ID NO: 4) and/or NNAGAAW (W = A or T) (SEQ ID NO: 5) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from these sequences. In certain embodiments, a Cas9 molecule of S. mutans recognizes the sequence motif NGG (SEQ ID NO: 2) and/or NAAR (R = A or G) (SEQ ID NO: 6) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5 bp, upstream from this sequence. In certain embodiments, a Cas9 molecule of S. aureus recognizes the sequence motif NNGRR (R = A or G) (SEQ ID NO: 7) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from that sequence. In certain embodiments, a Cas9 molecule of S. aureus recognizes the sequence motif NNGRRN (R = A or G) (SEQ ID NO: 8) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from that sequence. In certain embodiments, a Cas9 molecule of S. aureus recognizes the sequence motif NNGRRT (R = A or G) (SEQ ID NO: 9) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from that sequence. In certain embodiments, a Cas9 molecule of S. aureus recognizes the sequence motif NNGRRV (R = A or G; V = A or C or G) (SEQ ID NO: 10) and directs cleavage of a target nucleic acid sequence 1 to 10, for example, 3 to 5, bp upstream from that sequence. A Cas9 molecule derived from Neisseria meningitidis (NmCas9) normally has a native PAM of NNNNGATT (SEQ ID NO: 11), but may have activity across a variety of PAMs, including a highly degenerate NNNNGNNN PAM (SEQ ID NO: 12) (Esvelt et al. Nature Methods 2013 doi:10.1038/nmeth.2681, incorporated herein by reference). In the aforementioned embodiments, N can be any nucleotide residue, for example, any of A, G, C, or T. Cas9 molecules can be engineered to alter the PAM specificity of the Cas9 molecule. [000114] In some embodiments, the Cas9 protein recognizes a PAM sequence NGG (SEQ ID NO: 2) or NGA (SEQ ID NO: 13) or NNNRRT (R = A or G) (SEQ ID NO: 14) or ATTCCT (SEQ ID NO: 15) or NGAN (SEQ ID NO: 16) or NGNG (SEQ ID NO: 17). In some embodiments, the Cas9 protein is a Cas9 protein of S. aureus and recognizes the sequence motif NNGRR (R = A or G) (SEQ ID NO: 7), NNGRRN (R = A or G) (SEQ ID NO: 8), NNGRRT (R = A or G) (SEQ ID NO: 9), or NNGRRV (R = A or G; V = A or C or G) (SEQ ID NO: 10). In the aforementioned embodiments, N can be any nucleotide residue, for example, any of A, G, C, or T. [000115] Additionally or alternatively, a nucleic acid encoding a Cas9 molecule or Cas9 polypeptide may comprise a nuclear localization sequence (NLS). Nuclear localization sequences are known in the art, for example, SV40 NLS (Pro-Lys-Lys-Lys-Arg-Lys-Val; SEQ ID NO: 20). [000116] In some embodiments, the at least one Cas9 molecule is a mutant Cas9 molecule. The Cas9 protein can be mutated so that the nuclease activity is inactivated. An inactivated Cas9 protein (“iCas9”, also referred to as “dCas9”) with no endonuclease activity has been targeted to genes in bacteria, yeast, and human cells by gRNAs to silence gene expression through steric hindrance. Exemplary mutations with reference to the S. pyogenes Cas9 sequence to inactivate the nuclease activity include: D10A, E762A, H840A, N854A, N863A and/or D986A. A S. pyogenes Cas9 protein with the D10A mutation may comprise an amino acid sequence of SEQ ID NO: 28. A S. pyogenes Cas9 protein with D10A and H849A mutations may comprise an amino acid sequence of SEQ ID NO: 29. Exemplary mutations with reference to the S. aureus Cas9 sequence to inactivate the nuclease activity include D10A and N580A. In certain embodiments, the mutant S. aureus Cas9 molecule comprises a D10A mutation. The nucleotide sequence encoding this mutant S. aureus Cas9 is set forth in SEQ ID NO: 30. In certain embodiments, the mutant S. aureus Cas9 molecule comprises a N580A mutation. The nucleotide sequence encoding this mutant S. aureus Cas9 molecule is set forth in SEQ ID NO: 31. [000117] In some embodiments, the Cas9 protein is a VQR variant. The VQR variant of Cas9 is a mutant with a different PAM recognition, as detailed in Kleinstiver, et al. (Nature 2015, 523, 481–485, incorporated herein by reference). [000118] A polynucleotide encoding a Cas9 molecule can be a synthetic polynucleotide. For example, the synthetic polynucleotide can be chemically modified. The synthetic polynucleotide can be codon optimized, for example, at least one non-common codon or less-common codon has been replaced by a common codon. For example, the synthetic polynucleotide can direct the synthesis of an optimized messenger mRNA, for example, optimized for expression in a mammalian expression system, as described herein. An exemplary codon optimized nucleic acid sequence encoding a Cas9 molecule of S. pyogenes is set forth in SEQ ID NO: 32. Exemplary codon optimized nucleic acid sequences encoding a Cas9 molecule of S. aureus, and optionally containing nuclear localization sequences (NLSs), are set forth in SEQ ID NOs: 33-39. Another exemplary codon optimized nucleic acid sequence encoding a Cas9 molecule of S. aureus comprises the nucleotides 1293-4451 of SEQ ID NO: 40. iv) Cas Fusion Protein [000119] Alternatively or additionally, the CRISPR/Cas-based gene editing system can include a fusion protein. The fusion protein can comprise two heterologous polypeptide domains. The first polypeptide domain comprises a Cas protein or a mutated Cas protein. The first polypeptide domain is fused to at least one second polypeptide domain. The second polypeptide domain has a different activity that what is endogenous to Cas protein. The second polypeptide domain may have any DNA editing activity. The second polypeptide domain may have an activity such as transcription activation activity, transcription repression activity, transcription release factor activity, histone modification activity, nuclease activity, nucleic acid association activity, histone methylase activity, DNA methylase activity, histone demethylase activity, DNA demethylase activity, acetylation activity, and/or deacetylation activity. The activity of the second polypeptide domain may be direct or indirect. The second polypeptide domain may have this activity itself (direct), or it may recruit and/or interact with a polypeptide domain that has this activity (indirect). In some embodiments, the second polypeptide domain has transcription activation activity. In some embodiments, the second polypeptide domain has transcription repression activity. In some embodiments, the second polypeptide domain comprises a synthetic transcription factor. The second polypeptide domain may be at the C-terminal end of the first polypeptide domain, or at the N-terminal end of the first polypeptide domain, or a combination thereof. The fusion protein may include one second polypeptide domain. In some embodiments, the fusion protein comprises more than one second polypeptide domain. The fusion protein may include two of the second polypeptide domains. For example, the fusion protein may include a second polypeptide domain at the N-terminal end of the first polypeptide domain as well as a second polypeptide domain at the C-terminal end of the first polypeptide domain. In other embodiments, the fusion protein may include a single first polypeptide domain and more than one (for example, two or three) second polypeptide domains in tandem. [000120] The linkage from the first polypeptide domain to the second polypeptide domain can be through reversible or irreversible covalent linkage or through a non-covalent linkage, as long as the linker does not interfere with the function of the second polypeptide domain. For example, a Cas polypeptide can be linked to a second polypeptide domain as part of a fusion protein. As another example, they can be linked through reversible non-covalent interactions such as avidin (or streptavidin)-biotin interaction, histidine-divalent metal ion interaction (such as, Ni, Co, Cu, Fe), interactions between multimerization (such as, dimerization) domains, or glutathione S-transferase (GST)-glutathione interaction. As yet another example, they can be linked covalently but reversibly with linkers such as dibromomaleimide (DBM) or amino-thiol conjugation. [000121] In some embodiments, the fusion protein includes at least one linker. A linker may be included anywhere in the polypeptide sequence of the fusion protein, for example, between the first and second polypeptide domains. A linker may be of any length and design to promote or restrict the mobility of components in the fusion protein. A linker may comprise any amino acid sequence of about 2 to about 100, about 5 to about 80, about 10 to about 60, or about 20 to about 50 amino acids. A linker may comprise an amino acid sequence of at least about 2, 3, 4, 5, 10, 15, 20, 25, or 30 amino acids. A linker may comprise an amino acid sequence of less than about 100, 90, 80, 70, 60, 50, or 40 amino acids. A linker may include sequential or tandem repeats of an amino acid sequence that is 2 to 20 amino acids in length. Linkers may include, for example, a GS linker (Gly-Gly-Gly- Gly-Ser) n , wherein n is an integer between 0 and 10 (SEQ ID NO: 21). In a GS linker, n can be adjusted to optimize the linker length and achieve appropriate separation of the functional domains. Other examples of linkers may include, for example, Gly-Gly-Gly-Gly-Gly (SEQ ID NO: 22), Gly-Gly-Ala-Gly-Gly (SEQ ID NO: 23), Gly/Ser rich linkers such as Gly-Gly-Gly-Gly- Ser-Ser-Ser (SEQ ID NO: 24), or Gly/Ala rich linkers such as Gly-Gly-Gly-Gly-Ala-Ala-Ala (SEQ ID NO: 25). [000122] In some embodiments, the agent and/or Cas protein and/or the Cas fusion protein and/or gRNAs detailed herein may be used in compositions and methods for modulating expression of gene. Modulating may include, for example, increasing or enhancing expression of the gene, or reducing or inhibiting expression of the gene. The expression of the gene may be modulated by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. The expression of the gene may be modulated by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. The expression of the gene may be modulated by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. The expression of the gene may be reduced by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. The expression of the gene may be reduced by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. The expression of the gene may be reduced by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. The expression of the gene may be increased by at least about 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6- fold, 7-fold, 8-fold, 9-fold, or 10-fold, relative to a control. The expression of the gene may be increased by less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 1.5-fold, 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold, relative to a control. The expression of the gene may be increased by about 5-95%, 10-90%, 15-85%, 20-80%, or 1.5-fold to 10-fold, relative to a control. In some embodiments, the expression of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1 is increased. In some embodiments, the expression of a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1 is increased. In some embodiments, the level of a transcription factor protein selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1 is increased. (1) Transcription Activation Activity [000123] The second polypeptide domain can have transcription activation activity, for example, a transactivation domain. For example, gene expression of endogenous mammalian genes, such as human genes, can be achieved by targeting a fusion protein of a first polypeptide domain, such as dCas9, and a transactivation domain to mammalian promoters via combinations of gRNAs. The transactivation domain can include a VP16 protein, multiple VP16 proteins, such as a VP48 domain or VP64 domain, p65 domain of NF kappa B transcription activator activity, TET1, VPR, VPH, Rta, and/or p300. For example, the fusion protein may comprise dCas9-p300. In some embodiments, p300 comprises a polypeptide having the amino acid sequence of SEQ ID NO: 41 or SEQ ID NO: 42. In other embodiments, the fusion protein comprises dCas9-VP64. In other embodiments, the fusion protein comprises VP64-dCas9-VP64. VP64-dCas9-VP64 may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 43, encoded by the polynucleotide of SEQ ID NO: 44. VPH may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 53, encoded by the polynucleotide of SEQ ID NO: 54. VPR may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 55, encoded by the polynucleotide of SEQ ID NO: 56. (2) Transcription Repression Activity [000124] The second polypeptide domain can have transcription repression activity. Non- limiting examples of repressors include Kruppel associated box activity such as a KRAB domain or KRAB, MECP2, EED, ERF repressor domain (ERD), Mad mSIN3 interaction domain (SID) or Mad-SID repressor domain, SID4X repressor domain, Mxil repressor domain, SUV39H1, SUV39H2, G9A, ESET/SETBD1, Cir4, Su(var)3-9, Pr-SET7/8, SUV4- 20H1, PR-set7, Suv4-20, Set9, EZH2, RIZ1, JMJD2A/JHDM3A, JMJD2B, JMJ2D2C/GASC1, JMJD2D, Rph1, JARID1A/RBP2, JARID1B/PLU-1, JARID1C/SMCX, JARID1D/SMCY, Lid, Jhn2, Jmj2, HDAC1, HDAC2, HDAC3, HDAC8, Rpd3, Hos1, Cir6, HDAC4, HDAC5, HDAC7, HDAC9, Hda1, Cir3, SIRT1, SIRT2, Sir2, Hst1, Hst2, Hst3, Hst4, HDAC11, DNMT1, DNMT3a/3b, DNMT3A-3L, MET1, DRM3, ZMET2, CMT1, CMT2, Laminin A, Laminin B, CTCF, and/or a domain having TATA box binding protein activity, or a combination thereof. In some embodiments, the second polypeptide domain has a KRAB domain activity, ERF repressor domain activity, Mxil repressor domain activity, SID4X repressor domain activity, Mad-SID repressor domain activity, DNMT3A or DNMT3L or fusion thereof activity, LSD1 histone demethylase activity, or TATA box binding protein activity. In some embodiments, the polypeptide domain comprises KRAB. KRAB may comprise a polypeptide having the amino acid sequence of SEQ ID NO: 45, encoded by a polynucleotide comprising the sequence of SEQ ID NO: 46. For example, the fusion protein may be S. pyogenes dCas9-KRAB (protein sequence comprising SEQ ID NO: 47; polynucleotide sequence comprising SEQ ID NO: 48). The fusion protein may be S. aureus dCas9-KRAB (protein sequence comprising SEQ ID NO: 49; polynucleotide sequence comprising SEQ ID NO: 50). (3) Transcription Release Factor Activity [000125] The second polypeptide domain can have transcription release factor activity. The second polypeptide domain can have eukaryotic release factor 1 (ERF1) activity or eukaryotic release factor 3 (ERF3) activity. (4) Histone Modification Activity [000126] The second polypeptide domain can have histone modification activity. The second polypeptide domain can have histone deacetylase, histone acetyltransferase, histone demethylase, or histone methyltransferase activity. The histone acetyltransferase may be p300 or CREB-binding protein (CBP) protein, or fragments thereof. For example, the fusion protein may be dCas9-p300. In some embodiments, p300 comprises a polypeptide of SEQ ID NO: 41 or SEQ ID NO: 42. (5) Nuclease Activity [000127] The second polypeptide domain can have nuclease activity that is different from the nuclease activity of the Cas9 protein. A nuclease, or a protein having nuclease activity, is an enzyme capable of cleaving the phosphodiester bonds between the nucleotide subunits of nucleic acids. Nucleases are usually further divided into endonucleases and exonucleases, although some of the enzymes may fall in both categories. Well known nucleases include deoxyribonuclease and ribonuclease. In some embodiments, the second polypeptide domain includes a meganuclease, as detailed above. In some embodiments, the polypeptide domain having nuclease activity comprises FokI. (6) Nucleic Acid Association Activity [000128] The second polypeptide domain can have nucleic acid association activity or nucleic acid binding protein-DNA-binding domain (DBD). A DBD is an independently folded protein domain that contains at least one motif that recognizes double- or single-stranded DNA. A DBD can recognize a specific DNA sequence (a recognition sequence) or have a general affinity to DNA. A nucleic acid association region may be selected from helix-turn- helix region, leucine zipper region, winged helix region, winged helix-turn-helix region, helix- loop-helix region, immunoglobulin fold, B3 domain, Zinc finger, HMG-box, Wor3 domain, and TAL effector DNA-binding domain. (7) Base Editing Activity [000129] The second polypeptide domain may have base editing activity. Base editing enables the direct, irreversible conversion of a specific DNA base into another base at a targeted genomic locus without requiring double-stranded DNA breaks (DSB). A base editing domain has sequence requirements for activity. In a 20 nucleotide protospacer, the target base may be within 4-8 nucleotides from the PAM-distal end. An exemplary splice acceptor is an “AG” immediately before the exon, and an exemplary splice donor is a “GT” immediately following the exon. Cas9 molecules from different species may use different PAMs, and thereby provide some flexibility in selecting the base to edit. Disruption of canonical splice sites can lead to exon skipping or activation of cryptic splice sites. Both adenine and cytosine base editors may be capable of disrupting an “AG” splice acceptor, converting it to either a “GG” or “AA”, respectively. In some embodiments, the base-editing domain includes an adenine base editor (ABE). Adenine base editors may include, for example, ecTadA, including wild-type and mutants thereof. The adenine base editor may be as described in Gaudelli et al. (Nature 2017, 551, 464–471), Koblan et al. (Nature Biotech. 2018, 36, 843–846), Richter et al. (Nature Biotech.2020, 38, 883–891), and Gaudelli et al. (Nature Biotech.2020, 38, 892–900), each of which is incorporated herein by reference. The ABE may comprise a polypeptide selected from SEQ ID NOs: 57-64 and/or be encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 65-72, respectively. In some embodiments, the base-editing domain includes a cytidine deaminase domain. A cytidine deaminase domain can convert the DNA base cytosine to uracil. In some embodiments, the cytidine deaminase domain can include an apolipoprotein B mRNA- editing enzyme, catalytic polypeptide-like (APOBEC) family deaminase. In some embodiments, the cytidine deaminase domain can include an APOBEC 1 deaminase, APOBEC2 deaminase, APOBEC3A deaminase, APOBEC3B deaminase, APOBEC3C deaminase, APOBEC3D deaminase, APOBEC3F deaminase, APOBEC3G deaminase, APOBEC3H deaminase, or a combination thereof. Base editing domains are detailed in, for example, WO 2020/210776 and WO 2022/081612, each of which is incorporated herein by reference. (8) Methylase Activity [000130] The second polypeptide domain can have methylase activity, which involves transferring a methyl group to DNA, RNA, protein, small molecule, cytosine, or adenine. In some embodiments, the second polypeptide domain includes a DNA methyltransferase. (9) Demethylase Activity [000131] The second polypeptide domain can have demethylase activity. The second polypeptide domain can include an enzyme that removes methyl (CH3-) groups from nucleic acids, proteins (in particular histones), and other molecules. Alternatively, the second polypeptide can convert the methyl group to hydroxymethylcytosine in a mechanism for demethylating DNA. The second polypeptide can catalyze this reaction. For example, the second polypeptide that catalyzes this reaction can be Tet1, also known as Tet1CD (Ten- eleven translocation methylcytosine dioxygenase 1; amino acid sequence comprising SEQ ID NO: 51; polynucleotide sequence comprising SEQ ID NO: 52). In some embodiments, the second polypeptide domain has histone demethylase activity. In some embodiments, the second polypeptide domain has DNA demethylase activity. v) Guide RNA (gRNA) [000132] The CRISPR/Cas-based gene editing system includes at least one gRNA molecule. For example, the CRISPR/Cas-based gene editing system may include two gRNA molecules. The at least one gRNA molecule can bind and recognize a target region. The gRNA is the part of the CRISPR-Cas system that provides DNA targeting specificity to the CRISPR/Cas-based gene editing system. The gRNA is a fusion of two noncoding RNAs: a crRNA and a tracrRNA. gRNA mimics the naturally occurring crRNA:tracrRNA duplex involved in the Type II Effector system. This duplex, which may include, for example, a 42- nucleotide crRNA and a 75-nucleotide tracrRNA, acts as a guide for the Cas9 to bind, and in some cases, cleave the target nucleic acid. The gRNA may target any desired DNA sequence by exchanging the sequence encoding a 20 bp protospacer which confers targeting specificity through complementary base pairing with the desired DNA target. The “target region” or “target sequence” or “protospacer” refers to the region of the target gene to which the CRISPR/Cas9-based gene editing system targets and binds. The portion of the gRNA that targets the target sequence in the genome may be referred to as the “targeting sequence” or “targeting portion” or “targeting domain.” “Protospacer” or “gRNA spacer” may refer to the region of the target gene to which the CRISPR/Cas9-based gene editing system targets and binds; “protospacer” or “gRNA spacer” may also refer to the portion of the gRNA that is complementary to the targeted sequence in the genome. The gRNA may include a gRNA scaffold. A gRNA scaffold facilitates Cas9 binding to the gRNA and may facilitate endonuclease activity. The gRNA scaffold is a polynucleotide sequence that follows the portion of the gRNA corresponding to sequence that the gRNA targets. Together, the gRNA targeting portion and gRNA scaffold form one polynucleotide. The constant region of the gRNA may include the sequence of SEQ ID NO: 19 (RNA), which is encoded by a sequence comprising SEQ ID NO: 18 (DNA). The CRISPR/Cas9-based gene editing system may include at least one gRNA, wherein the gRNAs target different DNA sequences. The target DNA sequences may be overlapping. The gRNA may comprise at its 5’ end the targeting domain that is sufficiently complementary to the target region to be able to hybridize to, for example, about 10 to about 20 nucleotides of the target region of the target gene, when it is followed by an appropriate Protospacer Adjacent Motif (PAM). The target region or protospacer is followed by a PAM sequence at the 3’ end of the protospacer in the genome. Different Type II systems have differing PAM requirements, as detailed above. [000133] The targeting domain of the gRNA does not need to be perfectly complementary to the target region of the target DNA. In some embodiments, the targeting domain of the gRNA is at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or at least 99% complementary to (or has 1, 2 or 3 mismatches compared to) the target region over a length of, such as, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides. For example, the DNA-targeting domain of the gRNA may be at least 80% complementary over at least 18 nucleotides of the target region. The target region may be on either strand of the target DNA. [000134] The gRNA may target the Cas9 protein or fusion protein to a gene or a regulatory element thereof. The gRNA may target the Cas protein or fusion protein to a non-open chromatin region, an open chromatin region, a transcribed region of the target gene, a region upstream of a transcription start site of the target gene, a regulatory element of the target gene, an intron of the target gene, or an exon of the target gene, or a combination thereof. In some embodiments, the gRNA targets the Cas9 protein or fusion protein to a promoter of a gene. In some embodiments, the target region is located between about 1 to about 1000 base pairs upstream of a transcription start site of a target gene. In some embodiments, the DNA targeting composition comprises two or more gRNAs, each gRNA binding to a different target region. [000135] The gRNA may target a region of a gene that modulates T cells. The gRNA may target a region of a gene encoding a transcription factor that modulates T cells. The gRNA may target a region of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, or a regulatory element thereof. The gRNA may target a region of a gene selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof, or a regulatory element thereof. The gRNA may target a region of TGIF2LX, or a regulatory element thereof. In some embodiments, the gRNA targets a gene and is used in combination with a Cas9 fusion protein wherein the second polypeptide domain has transcription activation activity, to activate or enhance expression of the gene to increase T cells. In some embodiments, the gRNA targets a gene and is used in combination with a Cas9 fusion protein wherein the second polypeptide domain has transcription repression activity, to inhibit or reduce or decrease expression of the gene to increase T cells. The gRNA may comprise a polynucleotide selected from at least one of SEQ ID NOs: 157-192, or a complement thereof, or a variant thereof, or a truncation thereof. The gRNA may be encoded by a polynucleotide sequence comprising at least one of SEQ ID NOs: 121-156, or a complement thereof, or a variant thereof, or a truncation thereof. The gRNA may bind and target a polynucleotide sequence comprising at least one of SEQ ID NOs: 121-156, or a complement thereof, or a variant thereof, or a truncation thereof. A truncation may be 1, 2, 3, 4, 5, 6, 7, 8, or 9 nucleotides shorter than the sequence of any one of SEQ ID NOs: 121-192. The gRNA may be used, for example, in a CRISPR/Cas-based gene editing system with dCas9, such as a fusion protein comprising dCas9 with VP64 or p300. Exemplary gRNA sequences for modulating T cells are shown in TABLE 3.
Figure imgf000067_0001
Figure imgf000068_0001
Figure imgf000069_0001
[000136] As described above, the gRNA molecule comprises a targeting domain (also referred to as targeted or targeting sequence), which is a polynucleotide sequence complementary to the target DNA sequence. The gRNA may comprise a “G” at the 5’ end of the targeting domain or complementary polynucleotide sequence. The CRISPR/Cas9-based gene editing system may use gRNAs of varying sequences and lengths. The targeting domain of a gRNA molecule may comprise at least a 10 base pair, at least a 11 base pair, at least a 12 base pair, at least a 13 base pair, at least a 14 base pair, at least a 15 base pair, at least a 16 base pair, at least a 17 base pair, at least a 18 base pair, at least a 19 base pair, at least a 20 base pair, at least a 21 base pair, at least a 22 base pair, at least a 23 base pair, at least a 24 base pair, at least a 25 base pair, at least a 30 base pair, or at least a 35 base pair complementary polynucleotide sequence of the target DNA sequence followed by a PAM sequence. In certain embodiments, the targeting domain of a gRNA molecule has 19-25 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 20 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 21 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 22 nucleotides in length. In certain embodiments, the targeting domain of a gRNA molecule is 23 nucleotides in length. [000137] The number of gRNA molecules that may be included in the CRISPR/Cas9- based gene editing system can be at least 1 gRNA, at least 2 different gRNAs, at least 3 different gRNAs, at least 4 different gRNAs, at least 5 different gRNAs, at least 6 different gRNAs, at least 7 different gRNAs, at least 8 different gRNAs, at least 9 different gRNAs, at least 10 different gRNAs, at least 11 different gRNAs, at least 12 different gRNAs, at least 13 different gRNAs, at least 14 different gRNAs, at least 15 different gRNAs, at least 16 different gRNAs, at least 17 different gRNAs, at least 18 different gRNAs, at least 18 different gRNAs, at least 20 different gRNAs, at least 25 different gRNAs, at least 30 different gRNAs, at least 35 different gRNAs, at least 40 different gRNAs, at least 45 different gRNAs, or at least 50 different gRNAs. The number of gRNA molecules that may be included in the CRISPR/Cas9-based gene editing system can be less than 50 different gRNAs, less than 45 different gRNAs, less than 40 different gRNAs, less than 35 different gRNAs, less than 30 different gRNAs, less than 25 different gRNAs, less than 20 different gRNAs, less than 19 different gRNAs, less than 18 different gRNAs, less than 17 different gRNAs, less than 16 different gRNAs, less than 15 different gRNAs, less than 14 different gRNAs, less than 13 different gRNAs, less than 12 different gRNAs, less than 11 different gRNAs, less than 10 different gRNAs, less than 9 different gRNAs, less than 8 different gRNAs, less than 7 different gRNAs, less than 6 different gRNAs, less than 5 different gRNAs, less than 4 different gRNAs, less than 3 different gRNAs, or less than 2 different gRNAs. The number of gRNAs that may be included in the CRISPR/Cas9-based gene editing system can be between at least 1 gRNA to at least 50 different gRNAs, at least 1 gRNA to at least 45 different gRNAs, at least 1 gRNA to at least 40 different gRNAs, at least 1 gRNA to at least 35 different gRNAs, at least 1 gRNA to at least 30 different gRNAs, at least 1 gRNA to at least 25 different gRNAs, at least 1 gRNA to at least 20 different gRNAs, at least 1 gRNA to at least 16 different gRNAs, at least 1 gRNA to at least 12 different gRNAs, at least 1 gRNA to at least 8 different gRNAs, at least 1 gRNA to at least 4 different gRNAs, at least 4 gRNAs to at least 50 different gRNAs, at least 4 different gRNAs to at least 45 different gRNAs, at least 4 different gRNAs to at least 40 different gRNAs, at least 4 different gRNAs to at least 35 different gRNAs, at least 4 different gRNAs to at least 30 different gRNAs, at least 4 different gRNAs to at least 25 different gRNAs, at least 4 different gRNAs to at least 20 different gRNAs, at least 4 different gRNAs to at least 16 different gRNAs, at least 4 different gRNAs to at least 12 different gRNAs, at least 4 different gRNAs to at least 8 different gRNAs, at least 8 different gRNAs to at least 50 different gRNAs, at least 8 different gRNAs to at least 45 different gRNAs, at least 8 different gRNAs to at least 40 different gRNAs, at least 8 different gRNAs to at least 35 different gRNAs, 8 different gRNAs to at least 30 different gRNAs, at least 8 different gRNAs to at least 25 different gRNAs, 8 different gRNAs to at least 20 different gRNAs, at least 8 different gRNAs to at least 16 different gRNAs, or 8 different gRNAs to at least 12 different gRNAs. vi) Repair Pathways [000138] The CRISPR/Cas9-based gene editing system may be used to introduce site- specific double strand breaks at targeted genomic loci, such as a gene for modulating T cells as detailed herein. Site-specific double-strand breaks are created when the CRISPR/Cas9- based gene editing system binds to a target DNA sequences, thereby permitting cleavage of the target DNA. This DNA cleavage may stimulate the natural DNA-repair machinery, leading to one of two possible repair pathways: homology-directed repair (HDR) or the non- homologous end joining (NHEJ) pathway. (1) Homology-Directed Repair (HDR) [000139] Restoration of protein expression from a gene may involve homology-directed repair (HDR). A donor template may be administered to a cell. A donor sequence comprises a polynucleotide sequence to be inserted into a genome. The donor template may include a nucleotide sequence encoding a full-functional protein or a partially functional protein. In such embodiments, the donor template may include fully functional gene construct for restoring a mutant gene, or a fragment of the gene that after homology-directed repair, leads to restoration of the mutant gene. In other embodiments, the donor template may include a nucleotide sequence encoding a mutated version of an inhibitory regulatory element of a gene. Mutations may include, for example, nucleotide substitutions, insertions, deletions, or a combination thereof. In such embodiments, introduced mutation(s) into the inhibitory regulatory element of the gene may reduce the transcription of or binding to the inhibitory regulatory element. (2) Non-Homologous End Joining (NHEJ) [000140] Restoration of protein expression from gene may be through template-free NHEJ- mediated DNA repair. In certain embodiments, NHEJ is a nuclease mediated NHEJ, which in certain embodiments, refers to NHEJ that is initiated a Cas9 molecule that cuts double stranded DNA. The method comprises administering a presently disclosed CRISPR/Cas9- based gene editing system or a composition comprising thereof to a subject for gene editing. [000141] Nuclease mediated NHEJ may correct a mutated target gene and offer several potential advantages over the HDR pathway. For example, NHEJ does not require a donor template, which may cause nonspecific insertional mutagenesis. In contrast to HDR, NHEJ operates efficiently in all stages of the cell cycle and therefore may be effectively exploited in both cycling and post-mitotic cells, such as muscle fibers. This provides a robust, permanent gene restoration alternative to oligonucleotide-based exon skipping or pharmacologic forced read-through of stop codons and could theoretically require as few as one drug treatment. 3. Genetic Constructs [000142] The transcription factor, such as one selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, may be encoded by one or more genetic constructs. A polynucleotide encoding a transcription factor, such as one selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, may be comprised within one or more genetic constructs. The genetic construct may comprise an open reading frame (ORF) of the transcription factor. In some embodiments, the genetic construct comprises a sequence selected from SEQ ID NOs: 75-97 or encodes a polypeptide comprising a sequence selected from SEQ ID NOs: 98-120. The CRISPR/Cas9-based gene editing system may be encoded by or comprised within one or more genetic constructs. The CRISPR/Cas9-based gene editing system may comprise one or more genetic constructs. The genetic construct, such as a plasmid or expression vector, may comprise a nucleic acid that encodes the transcription factor or the CRISPR/Cas9-based gene editing system and/or at least one of the gRNAs. In certain embodiments, a genetic construct encodes one gRNA molecule, i.e., a first gRNA molecule, and optionally a Cas9 molecule or fusion protein. In some embodiments, a genetic construct encodes two gRNA molecules, i.e., a first gRNA molecule and a second gRNA molecule, and optionally a Cas9 molecule or fusion protein. In some embodiments, a first genetic construct encodes one gRNA molecule, i.e., a first gRNA molecule, and optionally a Cas9 molecule or fusion protein, and a second genetic construct encodes one gRNA molecule, i.e., a second gRNA molecule, and optionally a Cas9 molecule or fusion protein. In some embodiments, a first genetic construct encodes one gRNA molecule and one donor sequence, and a second genetic construct encodes a Cas9 molecule or fusion protein. In some embodiments, a first genetic construct encodes one gRNA molecule and a Cas9 molecule or fusion protein, and a second genetic construct encodes one donor sequence. [000143] Genetic constructs may include polynucleotides such as vectors and plasmids. The genetic construct may be a linear minichromosome including centromere, telomeres, or plasmids or cosmids. The vector may be an expression vectors or system to produce protein by routine techniques and readily available starting materials including Sambrook et al., Molecular Cloning and Laboratory Manual, Second Ed., Cold Spring Harbor (1989), which is incorporated fully by reference. The construct may be recombinant. The genetic construct may be part of a genome of a recombinant viral vector, including recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus. Viral vectors are further detailed below. The genetic construct may comprise regulatory elements for gene expression of the coding sequences of the nucleic acid. The regulatory elements may be a promoter, an enhancer, an initiation codon, a stop codon, or a polyadenylation signal. [000144] The genetic construct may comprise heterologous nucleic acid encoding the transcription factor or the CRISPR/Cas-based gene editing system and may further comprise an initiation codon, which may be upstream of the transcription factor or the CRISPR/Cas- based gene editing system coding sequence, and a stop codon, which may be downstream of the transcription factor or the CRISPR/Cas-based gene editing system coding sequence. The genetic construct may include more than one stop codon, which may be downstream of the transcription factor or the CRISPR/Cas-based gene editing system coding sequence. In some embodiments, the genetic construct includes 1, 2, 3, 4, or 5 stop codons. In some embodiments, the genetic construct includes 1, 2, 3, 4, or 5 stop codons downstream of the sequence encoding the donor sequence. A stop codon may be in-frame with a coding sequence in the transcription factor or the CRISPR/Cas-based gene editing system. For example, one or more stop codons may be in-frame with the donor sequence. The genetic construct may include one or more stop codons that are out of frame of a coding sequence in the transcription factor or the CRISPR/Cas-based gene editing system. For example, one stop codon may be in-frame with the donor sequence, and two other stop codons may be included that are in the other two possible reading frames. A genetic construct may include a stop codon for all three potential reading frames. The initiation and termination codon may be in frame with the transcription factor or the CRISPR/Cas-based gene editing system coding sequence. [000145] The vector may also comprise a promoter that is operably linked to the transcription factor coding sequence or the CRISPR/Cas-based gene editing system coding sequence. In some embodiments, the vector comprises a promoter operably linked to a polynucleotide sequence encoding the transcription factor. The promoter may be a constitutive promoter, a ubiquitous promoter, an inducible promoter, a cell-specific promoter, a tissue-specific promoter, a repressible promoter, or a regulatable promoter. In some embodiments, the promoter is a ubiquitous promoter. The promoter may be non- endogenous to the transcription factor. The promoter may be non-native to the transcription factor. The promoter may be a cell-specific promoter. For example, the promoter may be a promoter specific for T cells. The promoter may be a tissue-specific promoter. The tissue specific promoter may be a muscle specific promoter. The tissue specific promoter may be a skin specific promoter. The transcription factor coding sequence or the CRISPR/Cas- based gene editing system may be under the light-inducible or chemically inducible control to enable the dynamic control of gene/genome editing in space and time. The promoter operably linked to the transcription factor coding sequence or the CRISPR/Cas-based gene editing system coding sequence may be a promoter from simian virus 40 (SV40), a mouse mammary tumor virus (MMTV) promoter, a human immunodeficiency virus (HIV) promoter such as the bovine immunodeficiency virus (BIV) long terminal repeat (LTR) promoter, a Moloney virus promoter, an avian leukosis virus (ALV) promoter, a cytomegalovirus (CMV) promoter such as the CMV immediate early promoter, Epstein Barr virus (EBV) promoter, or a Rous sarcoma virus (RSV) promoter. The promoter may also be a promoter from a human gene such as human ubiquitin C (hUbC), human actin, human myosin, human hemoglobin, human muscle creatine, or human metalothionein. Examples of a tissue specific promoter, such as a muscle or skin specific promoter, natural or synthetic, are described in U.S. Patent Application Publication No. US20040175727, the contents of which are incorporated herein in its entirety. The promoter may be a CK8 promoter, a Spc512 promoter, a MHCK7 promoter, for example. [000146] The genetic construct may also comprise a polyadenylation signal, which may be downstream of the transcription factor coding sequence or the CRISPR/Cas-based gene editing system. The polyadenylation signal may be a SV40 polyadenylation signal, LTR polyadenylation signal, bovine growth hormone (bGH) polyadenylation signal, human growth hormone (hGH) polyadenylation signal, or human ȕ-globin polyadenylation signal. The SV40 polyadenylation signal may be a polyadenylation signal from a pCEP4 vector (Invitrogen, San Diego, CA). [000147] Coding sequences in the genetic construct may be optimized for stability and high levels of expression. In some instances, codons are selected to reduce secondary structure formation of the RNA such as that formed due to intramolecular bonding. [000148] The genetic construct may also comprise an enhancer upstream of the transcription factor coding sequence or the CRISPR/Cas-based gene editing system or gRNAs. The enhancer may be necessary for DNA expression. The enhancer may be human actin, human myosin, human hemoglobin, human muscle creatine or a viral enhancer such as one from CMV, HA, RSV, or EBV. Polynucleotide function enhancers are described in U.S. Patent Nos.5,593,972, 5,962,428, and WO94/016737, the contents of each are fully incorporated by reference. The genetic construct may also comprise a mammalian origin of replication in order to maintain the vector extrachromosomally and produce multiple copies of the vector in a cell. The genetic construct may also comprise a regulatory sequence, which may be well suited for gene expression in a mammalian or human cell into which the vector is administered. The genetic construct may also comprise a reporter gene, such as polynucleotide encoding a reporter protein and/or a selectable marker, such as hygromycin (“Hygro”). The reporter protein may include any protein or peptide that is suitably detectable, such as, by fluorescence, chemiluminescence, enzyme activity such as beta galactosidase or alkaline phosphatase, and/or antibody binding detection. The reporter protein may comprise a fluorescent protein. The reporter protein may comprise a protein or peptide detectable with an antibody. For example, the reporter protein may comprise green fluorescent protein (“GFP”), YFP, RFP, CFP, DsRed, luciferase, and/or Thy1. [000149] The genetic construct may be useful for transfecting cells with nucleic acid encoding the transcription factor coding sequence or the CRISPR/Cas-based gene editing system, which the transformed host cell is cultured and maintained under conditions wherein expression of the transcription factor coding sequence or the CRISPR/Cas-based gene editing system takes place. The genetic construct may be transformed or transduced into a cell. The genetic construct may be formulated into any suitable type of delivery vehicle including, for example, a viral vector, lentiviral expression, mRNA electroporation, and lipid- mediated transfection for delivery into a cell. The genetic construct may be part of the genetic material in attenuated live microorganisms or recombinant microbial vectors which live in cells. The genetic construct may be present in the cell as a functioning extrachromosomal molecule. [000150] Further provided herein is a cell transformed or transduced with a system or component thereof as detailed herein. Suitable cell types are detailed herein. In some embodiments, the cell is an immune cell. In some embodiments, the cell is a T cell. The cell may be a CD8+ T cell. The cell may be a CD4+ T cell. In some embodiments, the cell is a stem cell. The stem cell may be a human stem cell. In some embodiments, the cell is an embryonic stem cell. The stem cell may be a human pluripotent stem cell (iPSCs). a. Viral Vectors [000151] A genetic construct may be a viral vector. Further provided herein is a viral delivery system. Viral delivery systems may include, for example, lentivirus, retrovirus, adenovirus, mRNA electroporation, or nanoparticles. In some embodiments, the vector is a lentiviral vector. Lentiviruses are a subclass of Retroviruses. Lentiviruses resemble Ȗ- retroviruses (Ȗ-RV) in their ability to stably integrate into the target cell genome, resulting in persistent expression of the gene of interest. Species of lentivirus include, for example, human immunodeficiency virus 1 (HIV-1), human immunodeficiency virus 2 (HIV-2), simian immunodeficiency virus (SIV), bovine immunodeficiency virus (BIV), and feline immunodeficiency virus (FIV). In some embodiments, the vector is a modified lentiviral vector. In some embodiments, the vector is an engineered lentiviral vector. Lentiviruses may include, for example, pseudo-type lentivirus, integrase-deficient lentivirus, and virus-like particles. In some embodiments, the viral vector is an adeno-associated virus (AAV) vector. The AAV vector is a small virus belonging to the genus Dependovirus of the Parvoviridae family that infects humans and some other primate species. [000152] Viral vectors may be used to deliver the transcription factor coding sequence or the CRISPR/Cas9-based gene editing systems using various construct configurations. For example, AAV vectors may deliver Cas9 or fusion protein and gRNA expression cassettes on separate vectors or on the same vector. Alternatively, if the small Cas9 proteins or fusion proteins, derived from species such as Staphylococcus aureus or Neisseria meningitidis, are used then both the Cas9 and up to two gRNA expression cassettes may be combined in a single AAV vector. In some embodiments, the AAV vector has a 4.7 kb packaging limit. [000153] In some embodiments, the AAV vector is a modified AAV vector. In some embodiments, the AAV vector is an engineered AAV vector. The AAV vector may include an engineered AAV capsid. The AAV vector may be an engineered AAV vector for a specific cell type. For example, the AAV vector may be an engineered AAV vector for T cells. The modified AAV vector may have enhanced cell type tropism. The modified AAV vector may have enhanced cardiac and/or skeletal muscle tissue tropism. The modified AAV vector may be capable of delivering and expressing the transcription factor coding sequence or the CRISPR/Cas9-based gene editing system in the cell of a mammal. For example, the modified AAV vector may be an AAV-SASTG vector (Piacentino et al. Human Gene Therapy 2012, 23, 635–646, incorporated herein by reference). The modified AAV vector may be based on one or more of several capsid types, including AAV1, AAV2, AAV5, AAV6, AAV8, and AAV9. The modified AAV vector may be based on AAV2 pseudotype with alternative muscle-tropic AAV capsids, such as AAV2/1, AAV2/6, AAV2/7, AAV2/8, AAV2/9, AAV2.5, and AAV/SASTG vectors that efficiently transduce skeletal muscle or cardiac muscle by systemic and local delivery (Seto et al. Current Gene Therapy 2012, 12, 139-151, incorporated herein by reference). The modified AAV vector may be AAV2i8G9 (Shen et al. J. Biol. Chem.2013, 288, 28814-28823, incorporated herein by reference). 4. Additional Therapies [000154] The compositions and methods detailed herein may further include at least one additional therapy, such as at least one cancer therapy or at least one antiviral therapy, or a combination thereof. As used herein, the term “standard of care treatment” or “additional therapy” or “additional treatment” are used interchangeably and refer to any other standard treatments/additional treatments that do not include the specific compositions detailed herein for modifying a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. Additional therapies may comprise a small molecule, peptide, polypeptide, antibody, nucleotide, polynucleotide, lipid, or carbohydrate, or a combination thereof. Additional therapies may be synthesized and/or extracted and/or purified by any suitable means known in the art. Additional therapies may be commercially available. An effective amount of the additional therapy may be administered. a. Cancer Therapies [000155] The compositions and methods detailed herein may further include at least one cancer therapy. The term “standard of care treatment” or “additional therapy” or “additional treatment” are used interchangeably and refer to any other standard cancer treatments/additional cancer treatments that do not include the specific compositions detailed herein for modifying a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. Additional cancer therapies may comprise a small molecule, peptide, polypeptide, antibody, nucleotide, polynucleotide, lipid, or carbohydrate, or a combination thereof. Additional cancer therapies may be synthesized and/or extracted and/or purified by any suitable means known in the art. Additional cancer therapies may be commercially available. Additional cancer therapies may include, for example, chemotherapy, immunotherapy, radiation therapy, hormone therapy, targeted drug therapy, cryoablation, antibody drug conjugates, and surgery, or a combination thereof. Hormone therapy, for example, may block hormone synthesis such as blocking estrogen synthesis. An effective amount of the additional cancer therapy may be administered. [000156] Chemotherapy may include, for example, an antimitotic agent, an alkylating agent, an antimetabolite, an antimicrotubule agent, a topoisomerase inhibitor, a cytotoxic agent, a cell cycle inhibitor, a growth factor inhibitor, a histone deacetylase (HDAC) inhibitor, and an inhibitor of a pathway that cross-talks with and activates ER transcriptional activity, or a combination thereof. [000157] Alkylating agents may include, for example, cisplatin (PLATINOL®), oxaliplatin (ELOXATIN®), chlorambucil (LEUKERAN®), procarbazine (MATULANE®; NATULAN®), or carmustine (BiCNU®), or a combination thereof. Antimetabolites may include, for example, methotrexate (also known as amethopterin), 5-fluorouracil, cytarabine (also known as cytosine arabinoside or ara-C; CYTOSAR®), or gemcitabine (GEMZAR®), or a combination thereof. Antimicrotubule agents may include, for example, vinblastine (VELBAN®; VELBE®), or paclitaxel (TAXOL®), or a combination thereof. Topoisomerase inhibitors may include, for example, etoposide (VEPESID®), or doxorubicin (ADRIAMYCIN®; MYOCET®), or a combination thereof. Cytotoxic agents may include, for example, bleomycin (BLENOXANE®). Growth factor inhibitors may include, for example, human epidermal growth factor receptor 2 (HER2) inhibitors. HER2 inhibitors include, for example, trastuzumab (HERCEPTIN®), deruxtecan, sacitizumab, and/or ado-trastuzumab emtansine (KADCYLA®). HDAC inhibitors may include, for example, vorinostat (ZOLINZA®), romidepsin (ISTODAX®), chidamide (also known as tucidinostat; EPIDAZA®; HIYASTA™), panobinostat (FARYDAK®), belinostat (also known as BELEODAQ® or PXD101), valproic acid (DEPAKOTE®; DEPAKENE®; STAVZOR®)), mocetinostat (also known as MGCD0103), abexinostat (also known as PCI-24781), entinostat (also known as SNDX-275 or MS-275), pracinostat (also known as SB939), resminostat (also known as 4SC-201 or RAS2410), givinostat (also known as gavinostat or ITF2357), quisinostat (also known as JNJ-26481585), kevetrin, CUDC-101, AR-42, tefinostat (also known as CHR-2845), nanatinostat (also known as CHR-3996), domatinostat (also known as 4SC-202), ivaltinostat (also known as CG-200745), rocilinostat (also known as ACY-1215), or sulforaphane, or a combination thereof. Inhibitors of a pathway that cross-talks with and activates ER transcriptional activity may include, for example, a phosphoinositide 3-kinase (PI3K) inhibitor, a heat shock protein 90 (HSP90) inhibitor, or a mammalian target of rapamycin (mTOR) inhibitor. mTOR inhibitors include, for example, everolimus (AFINITOR®; VOTUBIA®; ZORTRESS®). In some embodiments, the HDAC inhibitor comprises vorinostat (ZOLINZA®) and /or romidepsin (ISTODAX®). [000158] Immunotherapies may include, for example, a checkpoint inhibitor, or denosumab (PROLIA®; XGEVA®), or a combination thereof. “Checkpoint inhibitor” or “immune checkpoint inhibitor” may also be referred to as an immune checkpoint blockade (ICB) therapy. Checkpoint inhibitors may comprise an antibody. Checkpoint inhibitors may include, for example, an antibody to programmed cell death protein 1 (PD1) (anti-PD1), or an antibody to cytotoxic T-lymphocyte-associated protein 4 (CTLA4) (anti-CTLA4), or an antibody to programmed death-ligand 1 (PDL1) (anti-PDL1), or DMXAA (sting agonist; also known as ASA404, vadimezan, or dimethylxanthone acetic acid) or a combination thereof. “Anti-PD1” refers to an antibody that binds PD1, “anti-CTLA4” refers to an antibody that binds CTLA4, and “anti-PDL1” refers to an antibody that binds PDL1. In some embodiments, the PD-1 antibody comprises pembrolizumab (KEYTRUDA®) or nivolumab (OPDIVOo®). In some embodiments, the CTLA-4 antibody comprises ipilimumab (YERVOY®). [000159] Antibody drug conjugates may include, for example, gemtuzumab ozogamicin (MYLOTARG™), brentuximab vedotin (ADCETRIS®), ado-trastuzumab emtansine (KADCYLA®), inotuzumab ozogamicin (BESPONSA®), polatuzumab vedotin (POLIVY®), enfortumab vedotin (PADCEV®), fam-trastuzumab deruxtecan (ENHERTU®), sacituzumab govitecan (TRODELVY®), loncastuximab tesirine (ZYNLONTA®), tisotumab vedotin (TIVDAK®), mirvetuximab soravtansinegynx (ELAHERE™), moxetumomab pasudotox (LUMOXITI™), belantamab mafodotin-blmf (BLENREP®), cetuximab saratolacan (AKALUX®), or disitamab vedotin (AIDIXI®), or a combination thereof. b. Antiviral Therapies [000160] The compositions and methods detailed herein may further include at least one antiviral therapy. The term “standard of care treatment” or “additional therapy” or “additional treatment” are used interchangeably and refer to any other standard antiviral treatments/additional antiviral treatments that do not include the specific compositions detailed herein for modifying a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. Additional antiviral therapies may comprise a small molecule, peptide, polypeptide, antibody, nucleotide, polynucleotide, lipid, or carbohydrate, or a combination thereof. Additional antiviral therapies may be synthesized and/or extracted and/or purified by any suitable means known in the art. Additional antiviral therapies may be commercially available. An effective amount of the additional antiviral therapy may be administered. Antiviral therapies and viruses targeted are described in, for example, US 20200165594, and De Clercq et al. (Clin. Microbiol. Rev.2016, 29, 695-747), each of which is incorporated herein by reference. Antiviral therapies may include, for example, the antiviral therapies listed in TABLE 4 below.
Figure imgf000079_0001
Figure imgf000080_0001
p p
Figure imgf000081_0001
Figure imgf000082_0001
Figure imgf000083_0001
Figure imgf000084_0001
Figure imgf000085_0001
Figure imgf000086_0001
[000161] The antivirals described herein may target one or more viruses. The virus may be a DNA virus (single or double stranded, positive or negative sense, or ambisense) or an RNA virus (single or double stranded, positive or negative sense, or ambisense). In some embodiments, the virus may be Ebola, measles, SARS, Chikungunya, hepatitis, Marburg, yellow fever, MERS, Dengue, Lassa, influenza, rhabdovirus, COVID-19, or HIV. A hepatitis virus may include hepatitis A, hepatitis B, or hepatitis C. An influenza virus may include, for example, influenza A or influenza B. An HIV may include HIV 1 or HIV 2. In some embodiments, the virus may be a human respiratory syncytial virus, Sudan Ebola virus, Bundibugyo virus, Tai Forest Ebola virus, Reston Ebola virus, Achimota, Aedes flavivirus, Aguacate virus, Akabane virus, Alethinophid reptarenavirus, Allpahuayo mammarenavirus, Amapari mmarenavirus, Andes virus, Apoi virus, Aravan virus, Aroa virus, Arumwot virus, Atlantic salmon paramyoxivirus, Australian bat lyssavirus, Avian bornavirus, Avian metapneumovirus, Avian paramyoxviruses, penguin or Falkland Islandsvirus, BK polyomavirus, Bagaza virus, Banna virus, Bat hepevirus, Bat sapovirus, Bear Canon mammarenavirus, Beilong virus, Betacoronoavirus, Betapapillomavirus 1-6, Bhanja virus, Bokeloh bat lyssavirus, Borna disease virus, Bourbon virus, Bovine hepacivirus, Bovine parainfluenza virus 3, Bovine respiratory syncytial virus, Brazoran virus, Bunyamwere virus, Caliciviridae virus. California encephalitis virus, Candiru virus, Canine distemper virus, Canaine pneumovirus, Cedar virus, Cell fusing agent virus, Cetacean morbillivirus, Chandipura virus, Chaoyang virus, Chapare mammarenavirus, Chikungunya virus, Colobus monkey papillomavirus, Colorado tick fever virus, Cowpox virus, Crimean-Congo hemorrhagic fever virus, Culex flavivirus, Cupixi mammarenavirus, Dengue virus, Dobrava- Belgrade virus, Donggang virus, Dugbe virus, Duvenhage virus, Eastern equine encephalitis virus, Entebbe bat virus, Enterovirus A-D, European bat lyssavirus 1-2, Eyach virus, Feline morbillivirus, Fer-de-Lance paramyxovirus, Fitzroy River virus, Flaviviridae virus, Flexal mammarenavirus, GB virus C, Gairo virus, Gemycircularvirus, Goose paramyoxiviurs SF02, Great Island virus, Guanarito mammarenavirus, Hantaan virus, Hantavirus Z10, Heartland virus, Hendra virus, Hepatitis A/B/C/E, Hepatitis delta virus, Human bocavirus, Human coronavirus, Human endogenous retrovirus K, Human enteric coronavirus, Human gential- associated circular DNA virus-1, Human herpesvirus 1-8, Human immunodeficiency virus 1/2, Huan mastadenovirus A-G, Human papillomavirus, Human parainfluenza virus 1-4, Human paraechovirus, Human picobirnavirus, Human smacovirus, Ikoma lyssavirus, Ilheus virus, Influenza A-C, Ippy mammarenavirus, Irkut virus, J-virus, JC polyomavirus, Japanses encephalitis virus, Junin mammarenavirus, KI polyomavirus, Kadipiro virus, Kamiti River virus, Kedougou virus, Khujand virus, Kokobera virus, Kyasanur forest disease virus, Lagos bat virus, Langat virus, Lassa mammarenavirus, Latino mammarenavirus, Leopards Hill virus, Liao ning virus, Ljungan virus, Lloviu virus, Louping ill virus, Lujo mammarenavirus, Luna mammarenavirus, Lunk virus, Lymphocytic choriomeningitis mammarenavirus, Lyssavirus Ozernoe, MSSI2\.225 virus, Machupo mammarenavirus, Mamastrovirus 1, Manzanilla virus, Mapuera virus, Marburg virus, Mayaro virus, Measles virus, Menangle virus, Mercadeo virus, Merkel cell polyomavirus, Middle East respiratory syndrome coronavirus, Mobala mammarenavirus, Modoc virus, Moijang virus, Mokolo virus, Monkeypox virus, Montana myotis leukoenchalitis virus, Mopeia lassa virus reassortant 29, Mopeia mammarenavirus, Morogoro virus, Mossman virus, Mumps virus, Murine pneumonia virus, Murray Valley encephalitis virus, Nariva virus, Newcastle disease virus, Nipah virus, Norwalk virus, Norway rat hepacivirus, Ntaya virus, O'nyong-nyong virus, Oliveros mammarenavirus, Omsk hemorrhagic fever virus, Oropouche virus, Parainfluenza virus 5, Parana mammarenavirus, Parramatta River virus, Peste-des-petits-ruminants virus, Pichande mammarenavirus, Picornaviridae virus, Pirital mammarenavirus, Piscihepevirus A, Procine parainfluenza virus 1, porcine rubulavirus, Powassan virus, Primate T-lymphotropic virus 1-2, Primate erythroparvovirus 1, Punta Toro virus, Puumala virus, Quang Binh virus, Rabies virus, Razdan virus, Reptile bornavirus 1, Rhinovirus A-B, Rift Valley fever virus, Rinderpest virus, Rio Bravo virus, Rodent Torque Teno virus, Rodent hepacivirus, Ross River virus, Rotavirus A-I, Royal Farm virus, Rubella virus, Sabia mammarenavirus, Salem virus, Sandfly fever Naples virus, Sandfly fever Sicilian virus, Sapporo virus, Sathuperi virus, Seal anellovirus, Semliki Forest virus, Sendai virus, Seoul virus, Sepik virus, Severe acute respiratory syndrome-related coronavirus, Severe fever with thrombocytopenia syndrome virus, Shamonda virus, Shimoni bat virus, Shuni virus, Simbu virus, Simian torque teno virus, Simian virus 40-41, Sin Nombre virus, Sindbis virus, Small anellovirus, Sosuga virus, Spanish goat encephalitis virus, Spondweni virus, St. Louis encephalitis virus, Sunshine virus, TTV-like mini virus, Tacaribe mammarenavirus, Taila virus, Tamana bat virus, Tamiami mammarenavirus, Tembusu virus, Thogoto virus, Thottapalayam virus, Tick-borne encephalitis virus, Tioman virus, Togaviridae virus, Torque teno canis virus, Torque teno douroucouli virus, Torque teno felis virus, Torque teno midi virus, Torque teno sus virus, Torque teno tamarin virus, Torque teno virus, Torque teno zalophus virus, Tuhoko virus, Tula virus, Tupaia paramyxovirus, Usutu virus, Uukuniemi virus, Vaccinia virus, Variola virus, Venezuelan equine encephalitis virus, Vesicular stomatitis Indiana virus, WU Polyomavirus, Wesselsbron virus, West Caucasian bat virus, West Nile virus, Western equine encephalitis virus, Whitewater Arroyo mammarenavirus, Yellow fever virus, Yokose virus, Yug Bogdanovac virus, Zaire ebolavirus, Zika virus, or Zygosaccharomyces bailii virus Z viral sequence. Examples of RNA viruses include one or more of (or any combination of) Coronaviridae virus, a Picornaviridae virus, a Caliciviridae virus, a Flaviviridae virus, a Togaviridae virus, a Bornaviridae, a Filoviridae, a Paramyxoviridae, a Pneumoviridae, a Rhabdoviridae, an Arenaviridae, a Bunyaviridae, an Orthomyxoviridae, or a Deltavirus. In some embodiments, the virus is Coronavirus, SARS, Poliovirus, Rhinovirus, Hepatitis A, Norwalk virus, Yellow fever virus, West Nile virus, Hepatitis C virus, Dengue fever virus, Zika virus, Rubella virus, Ross River virus, Sindbis virus, Chikungunya virus, Borna disease virus, Ebola virus, Marburg virus, Measles virus, Mumps virus, Nipah virus, Hendra virus, Newcastle disease virus, Human respiratory syncytial virus, Rabies virus, Lassa virus, Hantavirus, Crimean-Congo hemorrhagic fever virus, Influenza, or Hepatitis D virus. 5. Pharmaceutical Compositions [000162] Further provided herein are pharmaceutical compositions comprising the above- described modulator of T cells or genetic constructs or gene editing systems. In some embodiments, the composition further includes at least one cancer therapy such as a chimeric antigen receptor (CAR). In some embodiments, the pharmaceutical composition may comprise about 1 ng to about 10 mg of DNA encoding the transcription factor or the CRISPR/Cas-based gene editing system. The systems or genetic constructs as detailed herein, or at least one component thereof, may be formulated into pharmaceutical compositions in accordance with standard techniques well known to those skilled in the pharmaceutical art. The pharmaceutical compositions can be formulated according to the mode of administration to be used. In cases where pharmaceutical compositions are injectable pharmaceutical compositions, they are sterile, pyrogen free, and particulate free. An isotonic formulation is preferably used. Generally, additives for isotonicity may include sodium chloride, dextrose, mannitol, sorbitol and lactose. In some cases, isotonic solutions such as phosphate buffered saline are preferred. Stabilizers include gelatin and albumin. In some embodiments, a vasoconstriction agent is added to the formulation. [000163] The composition may further comprise a pharmaceutically acceptable excipient. The pharmaceutically acceptable excipient may be functional molecules as vehicles, adjuvants, carriers, or diluents. The term “pharmaceutically acceptable carrier,” may be a non-toxic, inert solid, semi-solid or liquid filler, diluent, encapsulating material or formulation auxiliary of any type. Pharmaceutically acceptable carriers include, for example, diluents, lubricants, binders, disintegrants, colorants, flavors, sweeteners, antioxidants, preservatives, glidants, solvents, suspending agents, wetting agents, surfactants, emollients, propellants, humectants, powders, pH adjusting agents, and combinations thereof. The pharmaceutically acceptable excipient may be a transfection facilitating agent, which may include surface active agents, such as immune-stimulating complexes (ISCOMS), Freunds incomplete adjuvant, LPS analog including monophosphoryl lipid A, muramyl peptides, quinone analogs, vesicles such as squalene and squalene, hyaluronic acid, lipids, liposomes, calcium ions, viral proteins, polyanions, polycations, or nanoparticles, or other known transfection facilitating agents. The transfection facilitating agent may be a polyanion, polycation, including poly-L-glutamate (LGS), or lipid. The transfection facilitating agent may be poly-L- glutamate, and more preferably, the poly-L-glutamate may be present in the composition for gene editing in skeletal muscle or cardiac muscle at a concentration less than 6 mg/mL. 6. Administration [000164] The systems or genetic constructs as detailed herein, or at least one component thereof, may be administered or delivered to a cell. Methods of introducing a nucleic acid into a host cell are known in the art, and any known method can be used to introduce a nucleic acid (e.g., an expression construct) into a cell. Suitable methods include, for example, viral or bacteriophage infection, transfection, conjugation, protoplast fusion, polycation or lipid:nucleic acid conjugates, lipofection, electroporation, nucleofection, immunoliposomes, calcium phosphate precipitation, polyethyleneimine (PEI)-mediated transfection, DEAE-dextran mediated transfection, liposome-mediated transfection, particle gun technology, calcium phosphate precipitation, direct micro injection, nanoparticle- mediated nucleic acid delivery, and the like. In some embodiments, the composition may be delivered by mRNA delivery and ribonucleoprotein (RNP) complex delivery. The system, genetic construct, or composition comprising the same, may be electroporated using BioRad Gene Pulser Xcell or Amaxa Nucleofector IIb devices or other electroporation device. Several different buffers may be used, including BioRad electroporation solution, Sigma phosphate-buffered saline product #D8537 (PBS), Invitrogen OptiMEM I (OM), or Amaxa Nucleofector solution V (N.V.). Transfections may include a transfection reagent, such as Lipofectamine 2000. The activators detailed herein may be delivered or administered, for example, to a cell ex vivo or to a subject in vivo, by a method including viral delivery (such as, for example, lentivirus, retrovirus, or AAV vectors as detailed above), virus-like particles (VLPs), and integrase-defective lentivirus, non-viral integrating methods (such as, for example, transposons, integrases, and gene editing), and non-viral transient methods (such as, for example, mRNA, plasmids, and minicircles as detailed above). The activators detailed herein may be delivered by any delivery method suitable for ex vivo engineered cell therapy. The activators detailed herein may be delivered by any suitable delivery method, including, for example, lipid nanoparticles and micelles. In some embodiments, the activator or a polynucleotide encoding the activator is encapsulated within a lipid nanoparticle or polymeric carrier. [000165] The systems or genetic constructs as detailed herein, or at least one component thereof, or the pharmaceutical compositions comprising the same, may be administered to a subject or a cell. Such compositions can be administered in dosages and by techniques well known to those skilled in the medical arts taking into consideration such factors as the age, sex, weight, and condition of the particular subject, and the route of administration. The presently disclosed systems, or at least one component thereof, genetic constructs, or compositions comprising the same, may be administered to a subject by different routes including orally, parenterally, sublingually, transdermally, rectally, transmucosally, topically, intranasal, intravaginal, via inhalation, via buccal administration, intrapleurally, intravenous, intraarterial, intraperitoneal, subcutaneous, intradermally, epidermally, intramuscular, intranasal, intrathecal, intracranial, and intraarticular or combinations thereof. In certain embodiments, the system, genetic construct, or composition comprising the same, is administered to a subject intramuscularly, intravenously, or a combination thereof. The systems, genetic constructs, or compositions comprising the same may be delivered to a subject by several technologies including DNA injection (also referred to as DNA vaccination) with and without in vivo electroporation, liposome mediated, nanoparticle facilitated, recombinant vectors such as recombinant lentivirus, recombinant adenovirus, and recombinant adenovirus associated virus. The composition may be injected into the brain or other component of the central nervous system. The composition may be injected into the skeletal muscle or cardiac muscle. For example, the composition may be injected into the tibialis anterior muscle or tail. For veterinary use, the systems, genetic constructs, or compositions comprising the same may be administered as a suitably acceptable formulation in accordance with normal veterinary practice. The veterinarian may readily determine the dosing regimen and route of administration that is most appropriate for a particular animal. The systems, genetic constructs, or compositions comprising the same may be administered by traditional syringes, needleless injection devices, “microprojectile bombardment gone guns,” or other physical methods such as electroporation (“EP”), “hydrodynamic method”, or ultrasound. Alternatively, transient in vivo delivery of CRISPR/Cas-based systems by non-viral or non-integrating viral gene transfer, or by direct delivery of purified proteins and gRNAs containing cell-penetrating motifs may enable highly specific correction and/or restoration in situ with minimal or no risk of exogenous DNA integration. [000166] Upon delivery of the presently disclosed modulator of T cells, a variety of effects may be elicited, such as, for example, T cells may be increased, T cell numbers may be increased, memory T cells may be increased, T cell biodistribution may be increased, infiltration into tissues may be increased, T cell durability may be increased, T cell exhaustion may be inhibited or prevented, T cell exhaustion may be reversed, cancer therapy may be enhanced or its effectiveness increased, antiviral therapy may be enhanced or its effectiveness increased, or a combination thereof. Upon delivery of the presently disclosed systems or genetic constructs as detailed herein, or at least one component thereof, or the pharmaceutical compositions comprising the same, and thereupon the vector into the cells of the subject, the transfected cells may express the gene or gene product thereof, or the gRNA molecule(s) and the Cas9 molecule or fusion protein. a. Cell Types [000167] Any of the delivery methods and/or routes of administration detailed herein can be utilized with a myriad of cell types. Further provided herein is a cell transformed or transduced with a system or component thereof as detailed herein. For example, provided herein is a cell comprising an isolated polynucleotide encoding the protein product from a gene as detailed herein. For example, provided herein is a cell comprising an isolated polynucleotide encoding a transcription factor or a CRISPR/Cas9 system as detailed herein. Suitable cell types are detailed herein. The cell may be isolated, as from a subject or tissue, or ex vivo, or in vivo. The cell may be autologous to the subject. The cell may be allogenic to the subject. In some embodiments, the cell is an immune cell. Immune cells may include, for example, lymphocytes such as T cells and B cells and natural killer (NK) cells. In some embodiments, the cell is a T cell. T cells may be divided into cytotoxic T cells and helper T cells, which are in turn categorized as TH1 or TH2 helper T cells. Immune cells may further include innate immune cells, adaptive immune cells, tumor-primed T cells, NKT cells, IFN-Ȗ producing killer dendritic cells (IKDC), memory T cells (TCMs), and effector T cells (TEs). The cell may be a stem cell such as a human stem cell. In some embodiments, the cell is an embryonic stem cell or a hematopoietic stem cell. The stem cell may be a human induced pluripotent stem cell (iPSCs). In some embodiments, the cell is a T cell. In some embodiments, the cell is a CD8+ T cell. In some embodiments, the cell is a CD4+ T cell. 7. Kits [000168] Provided herein is a kit, which may be used to modulate, such as increase, T cells. The kit may be used in conjunction with ACT to enhance the ACT. The kit comprises an activator as detailed herein, or genetic constructs or a composition comprising the same, as described above, and instructions for using said composition. The kit includes an activator of a gene, or a genetic construct encoding the activator of a gene, wherein the gene is selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. In some embodiments, the kit comprises at least one polynucleotide sequence selected from SEQ ID NOs: 75-97 or 121-192, a complement thereof, a variant thereof, or fragment thereof. In some embodiments, the kit comprises at least one polypeptide sequence selected from SEQ ID NOs: 98-120, a variant thereof, or fragment thereof. In some embodiments, the kit comprises at least one gRNA comprising a polynucleotide sequence selected from SEQ ID NOs: 157-192, a complement thereof, a variant thereof, or fragment thereof, or at least one gRNA targeting or encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 121-156, a complement thereof, a variant thereof, or fragment thereof. The kit may further include instructions for using the activator or the CRISPR/Cas-based gene editing system. [000169] Instructions included in kits may be affixed to packaging material or may be included as a package insert. While the instructions are typically written on printed materials they are not limited to such. Any medium capable of storing such instructions and communicating them to an end user is contemplated by this disclosure. Such media include, but are not limited to, electronic storage media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the like. As used herein, the term “instructions” may include the address of an internet site that provides the instructions. [000170] As detailed above, the genetic constructs or a composition comprising thereof for modulating T cells may include a vector such as a modified lentiviral or AAV vector that comprises a cDNA encoding a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, and/or includes a gRNA molecule(s) and a Cas9 protein or fusion protein, as described above, that specifically binds a region of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a regulatory element thereof. 8. Methods a. Methods of Modulating T Cells [000171] Provided herein are methods of modulating T cells. The methods may include administering to a cell or a subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof. In some embodiments, modulating T cells comprises increasing T cells, or increasing memory T cells, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof. In some embodiments, the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell. The activators of T cells detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo. The activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification. In some embodiments, an activator as detailed herein is administered to a cell isolated from a subject. The cell may be autologous. The cell may be allogenic. A T cell modified by an activator as detailed herein may be administered to a subject. b. Methods of Increasing T Cells [000172] Provided herein are methods of increasing T cells. The methods may include administering to a cell or a subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof. In some embodiments, the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell. The activators detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo. The activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification. In some embodiments, an activator as detailed herein is administered to a cell isolated from a subject. The cell may be autologous. The cell may be allogenic. A T cell modified by an activator as detailed herein may be administered to a subject. c. Methods of Enhancing Adoptive T Cell Therapy (ACT) [000173] Provided herein are methods of enhancing adoptive T cell therapy (ACT) in a subject. The methods may include administering to the subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof. In some embodiments, the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell. The activators detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo. The activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification. In some embodiments, an activator as detailed herein is administered to a cell isolated from a subject. The cell may be autologous. The cell may be allogenic. A T cell modified by an activator as detailed herein may be administered to a subject. d. Methods of Treating Cancer [000174] Provided herein are methods of treating cancer in a subject. The methods may include administering to the subject a composition as detailed herein, or an isolated polynucleotide sequence as detailed herein, or a vector as detailed herein, or a cell as detailed herein, or a pharmaceutical composition as detailed herein, or a combination thereof. In some embodiments, the methods increase expression or activity of CD103 or IL7Ra, or a combination thereof, in a T cell. The activators detailed herein may be delivered or administered, for example, to a cell in vitro or ex vivo or to a subject in vivo. The activators detailed herein may be delivered or administered to a cell for in vivo or ex vivo cell modification. In some embodiments, an activator as detailed herein is administered to a cell isolated from a subject. The cell may be autologous. The cell may be allogenic. A T cell modified by an activator as detailed herein may be administered to a subject. 9. Examples [000175] The foregoing may be better understood by reference to the following examples, which are presented for purposes of illustration and are not intended to limit the scope of the invention. The present disclosure has multiple aspects and embodiments, illustrated by the appended non-limiting examples. Example 1 Materials and Methods [000176] Culturing primary human T cells. Human T cells were isolated from frozen peripheral blood mononuclear cells using negative selection kits or pre-isolated cells purchased directly. T cells were activated using CD3/CD28 dynabeads at a 3:1 bead:cell ratio. Cells were cultured in Immunocult-XF medium supplemented with 1% penicillin/streptomycin and 100 IU/mL rh-IL2. [000177] Lentivirus generation and human primary T cell transduction.7 x 106 HEK293T cells were plated in a 10 cm dish in the afternoon with 12 mL of complete opti- MEM (Opti-MEM™ I Reduced Serum Medium supplemented with 1x Glutamax, 5% FBS, 1 mM Sodium Pyruvate, and 1x MEM Non-Essential Amino Acids). The next morning, HEK293T cells were transfected with 3.25 ^g pMD2.G, 9.75 ^g psPAX2, and 4.3 ^g transgene using Lipofectamine 3000. Media was exchanged 6 hours after transfection, and lentiviral supernatant was collected and pooled at 24 hours and 48 hours after transfection. Collected lentivirus was centrifuged at 500g for 10 min to remove cellular debris. Supernatant was concentrated 100X using Lenti-X concentrator. Concentrated lentivirus was snap frozen and stored at -80°C until use. Human primary T cells were transduced 24 hours post activation with 10% v/v concentrated lentivirus. [000178] cDNA overexpression screens. Human primary CD8+ T cells were activated using dynabeads as described above. 24 hours post-activation, T cells were transduced with lentivirus encoding a library of human transcription factors (PMID: 36608654; Joung et al. Cell 2023, 186, 209-229, incorporated herein by reference). Cells were expanded for at least 7 days at which point they were collected for sorting. For the IL7Ra screen, the top and bottom 10% of cells were sorted based on IL7Ra protein levels. For the CD103 screen, CD103+ and CD103- negative populations were sorted. Genomic DNA was isolated using Qiagen’s (Hilden, Germany) DNeasy Blood and Tissue Kit from the sorted populations. Transcription factor barcodes were amplified and sequenced as previously described (PMID: 36608654; Joung et al. Cell 2023, 186, 209-229, incorporated herein by reference). Transcription factor barcode read counts were compared across the different groups using DESeq2. Hits were determined as transcription factors with DESeq2 adjusted P values < .05. [000179] Flow cytometry. For antibody staining, cells were spun down at 300g for 5 minutes and resuspended in Cell Staining Buffer (Biolegend, San Diego, CA) with the appropriate antibody dilutions. Cells were stained for 30 minutes on ice in the dark and washed once before sorting or analysis. [000180] RNA sequencing. CAR T cells were incubated with target cells at a 2:1 ratio in Immunocult-XF media unless otherwise indicated. After complete target cell elimination (2 days), cells were washed in fresh media. RNA was isolated using Norgen Biotek’s (Thorold, Ontario, Canada) Total RNA purification plus kit and submitted to Azenta for standard RNA- seq with polyA selection. Gene expression matrices were generated using salmon (v1.10.2) with the GRCh38 genome. Differential gene expression analysis was performed using DESeq2. [000181] scRNA-sequencing. CAR T cells generated from pan T cells were expanded for 7 days. Cells were spun down and resuspended in 49.5 μL cell staining buffer. Fc receptors were blocked by adding 0.5 μL TruStainFcX PLUS and incubating on ice for 10 minutes. 50 μL of additional cell staining buffer was then added. 1 μg of cell hashing antibodies was added to each CAR T population based on perturbation. Cells were incubated on ice for 30 minutes. Afterwards, cells were washed 3X in cell staining buffer and resuspended in 300 μL. Equal cell numbers were then pooled, pelleted, and resuspended at 2e7 cells/mL in cell staining buffer. A 25 μL aliquot of cells was staining with 25 μL of TotalSeqTM-B Human Universal Cocktail, V1.0 for 30 minutes at 4°C. Stained cells were washed 3X and resuspended in 100 μL PBS with 1% BSA. Diluted cells were loaded onto the Chromium X and captured using the Single Cell 3’ Gene Expression Kit (v3.1). Resulting libraries were sequenced using the Novaseq X. CellRanger (v7.0.0) was used to process and generate UMI matrices. CellBender (v0.3.0) was used to remove noise due to ambient RNA molecules. Cleaned matrices were loaded into Seurat (v5) and subset to include cells with nFeature_RNA > 1800, percent.mt < 10, and nCount_RNA < 50,000. High quality cells were assigned their perturbation (based on cell hashing antibody) using HTODemux. [000182] Multiplex ELISA. HER2-targeting CAR T cells were generated from pan T cells. CAR T cells were incubated with target cells (SKBR3) at a ratio of 4:1 in Immunocult-XF media without rh-IL2 supplementation. 20 hours after incubation, cells were centrifuged to pellet. Supernatant was collected and stored at -80°C. Thawed supernatant was later analyzed using the LEGENDplexTM Human CD8/NK Panel on a Sony SH800. [000183] In vitro killing assay. CD8 T cells were transduced with lentiviruses encoding either aHER2 CAR T2A-TGIF2LX or aHER2 CAR T2A-THY1.1 and expanded for at least 7 days. The afternoon before assay start, 625 target cells (SKBR3-GFP) were plated into a 96-well plate and allowed to attach overnight. The following day, CAR T cells were collected and diluted to 10 cells/μL in Immunocult-XF with 100 IU/rh-IL2. Media was removed from the target cell plate and replaced with 250 μL CAR T cell suspension resulting in a 4:1 E:T ratio. Plates were imaged on the IncuCyte® (Sartorius, Göttingen, Germany) S3 every 4 hours for 5 days. GFP-expressing target cells were enumerated and tracked over time using the IncuCyte® (Sartorius, Göttingen, Germany) Analysis Software. [000184] In vivo tumor model. 500,000 HER2+ HCC1952 were implanted orthotopically into the mammary fat pad of nod/scid mice. CD8 T cells were transduced with lentivirus encoding one of the following constructs: aHER2 CAR-T2A-THY1.1, aHER2 CAR-T2A- TGIFL2X, or aHER2 CAR-T2A-RUNX3, and expanded for 9 days. 500,000 CAR T cells were infused intravenously 21 days after tumor implantation. Immediately before CAR T administration, tumors were measured and mice were randomized into groups. Tumor volumes were tracked based on caliper measurements every 4-6 days. [000185] TGF-b1 incubation. CD8 T cells were activated and transduced 24 hours later with lentivirus encoding either aHER2 CAR T2A-TGIF2LX or aHER2 CAR T2A-THY1.1. One day later, media was replaced with fresh Immunocult-XF supplemented with 100 IU/rhIL2 and 2 ng/mL TGF-b1. Five days later cells were collected and prepared for flow cytometry. Example 2 Resident Memory T Cells for Adoptive Cell Therapy [000186] Memory T cells exhibit distinct functional characteristics depending on their location within the body. Recent advancements have unveiled a significant population of memory T cells residing in non-lymphoid tissues. These resident memory T cells have a role in combating infections, and their abundance has a positive correlation with improved survival rates across many solid tumor types. Resident memory T cells were to be used for adoptive cell therapy (FIG.1 and FIG.2). However, isolating these cells from donor-derived tissues was impractical. Therefore, epigenetic reprogramming of blood-derived T cells towards a tissue resident state was used. [000187] Two cDNA overexpression screens were conducted, using open reading frames (ORFs) encoding all known transcription factors, in primary CD8 T cells to identify genetic regulators of CD103 (a residency marker) and IL7Ra (a memory marker; FIGS.3A-3B, FIG. 14). The intersection of these screens identified the transcription factor TGIF2LX, capable of upregulating both markers (FIGS.4A-4C). The initial activation of naïve T cells by antigen is driven by transcription factors including BATF, IRF-4, and NFAT-AP1. Next, differentiation of the activated cells into KLRG-1loCD127hi memory precursor T cells (MPECs) occurs. MPECs can be differentiated into various types of cells depending on antigen levels and disease setting. Effector T cells (Teff cells) that form from MPECs are characterized by high KLRG-1 and effector functions, such as cytokine production, driven in part by the transcription factors T-bet and Blimp-1. Differentiation of MPECs to memory T cells (Tmem cells) occurs following antigen clearance in acute infections and vaccination, and self- renewing Tmem cells use the transcription factors Tcf-1 and Eomes. TRM cells do not circulate in the body but are rather retained in tissues and are characterized by high CD103 and CD69 expression and low CD127 expression. Additionally, resident memory cells are associated with low KLF-2 and expression of Hobit. Early exhausted cells form during chronic infections and cancer, where an antigen persists. Early exhausted T cells (Tex cells) are characterized by intermediate expression of PD-1 and low Eomes, with a role for Tcf-1 in a progenitor population. Early Tex cells give rise to terminally exhausted T cells that are characterized by high expression of PD-1 and eomesodermin (Eomes) and loss of Tcf-1 and loss of the ability to proliferate further upon additional antigen stimulation. Interestingly, this transcription factor is only expressed in male reproductive tissues, although other TGIF family members have been implicated in T cell function (FIGS.5A-5C). FIGS.6A-6E show that TGIF1 is upregulated in early resident memory. FIGS.15A-15B show the TGIF2LX domains that may be responsible for residency reprogramming. [000188] Hits were obtained from two separate marker-based screens (CD103 and IL7Ra). CD103 is a tissue-resident memory marker and IL7Ra is another memory marker. The upregulation of both CD103 and IL7Ra provided greater confidence that the listed transcription factors were skewing towards a more memory-like state. Additionally, upregulation of this combination of markers may gave a distinct phenotype (tissue- residency) from other perturbations previously identified that skew towards circulating T cell phenotypes. Example 3 TGIF2LX Expressing CAR Ts [000189] FIGS.7A-7E show a molecular characterization of transcription factor-engineered chimeric antigen receptor T cells (CAR Ts). Bulk RNA-sequencing revealed that TGIF2LX overexpression in T cells lead to widespread transcriptional reprogramming, including activation of genes associated with increased tissue residency programs (for example, ITGAE (i.e. CD103), CD69, CXCR6) and decreases in genes associated with circulation programs (e.g. KLF2, S1PR1; FIGS.8A-8J). TGIF2LX-overexpressing T cells were characterized using a multimodal single cell readout, pairing scRNA-seq with a CITE-seq library of 134 unique cell surface markers (FIGS.9A-9G). This demonstrated a distinct cell state with TGIFL2X-overexpressing T cells existing in clusters unique from those containing unenhanced cells. Of note, these cells also clustered separately from T cells overexpressing RUNX3, a transcription factor previously identified to enhance tissue residency and improve tumor control, suggesting a novel axis for controlling tissue residency (FIG.9D). FIGS.10A- 10F show that TGIF2LX increased tissue-residence associated surface markers. FIGS. 11A-11C show that CD8 subpopulations mirrored memory to effector hierarchy. FIGS.12A- 12E show that TGIF2LX CAR Ts maintained cytotoxic capabilities in vitro. [000190] A proliferation-based cDNA overexpression screen in CD8 T cells was completed. A library of about 3,500 transcription factors was introduced into HER2-targeting CAR Ts. The CAR Ts were repeatedly stimulated with cancer cells to induce exhaustion. CAR Ts were collected before and after repeated stimulation to identify transcription factors that modulate CAR T abundance with the hypothesis that transcription factors that increase CAR T durability will be more highly represented at the end of the screen. [000191] A marker-based cDNA overexpression screen in CD8 T cells was completed. The same library as above was introduced into HER2-targeting CAR Ts. The CAR Ts were expanded for 10 days at which point, CD103 high cells were collected as well as an unsorted population. The goal was to identify transcription factors that increase expression of CD103. CD103 is a tissue-resident memory T cell marker, and these cells are associated with improved survival in the context of checkpoint inhibitors. Example 4 TGIF2LX Overexpression Modulates CAR T Efficacy in a Human Breast Cancer Model [000192] The effect of TGIF2LX overexpression in modulating CAR T efficacy in a human breast cancer xenograft model was also tested. The protocol shown in FIG.13A was used. HCC1954 breast cancer cells were implanted orthotopically into the mammary fat of Nod/scid mice. After 10 days, HER2-targeting CAR T cells were administered intravenously at a sub-curative dose (5x105 cells). TGIF2LX-overexpression dramatically improved tumor control by CAR T cells relative to the unmodified control and RUNX3-overexpressing CAR Ts (FIG.13B). These data provided proof-of-concept of a strategy for enhancing adoptive cell therapies for solid tumors. [000193] TGIF2LX overexpression was found to give circulating T cells a tissue-resident- like phenotype. In addition, TGIF2LX overexpression was found to improve tumor control in the breast cancer CAR T xenograft model. The trafficking and effector function of TGIF2LX- overexpressing T cells in tumor and infection models may be examined further. Example 5 CRISPR Activation of TFs to Modulate T Cells [000194] CRISPRa will be used to increase the expression of a gene selected from TGIF2LX, TGIF1, TGIF2, CARF, EBF3, FOS, HMX3, HNF4A, KLF8, LHX4, LMX1A, NFKBIZ, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. A dCas9 protein (such as dSpCas9) will be fused to an activation domain, such as at least one domain of VP64, p300, or p300 core. The resulting fusion protein may be VP64-dCas9-VP64 (SEQ ID NO: 43). T cells will be transduced with a lentiviral vector encoding the fusion protein such as VP64- dCas9-VP64 and a gRNA targeting a gene selected from TGIF2LX, TGIF1, TGIF2, CARF, EBF3, FOS, HMX3, HNF4A, KLF8, LHX4, LMX1A, NFKBIZ, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. Exemplary gRNAs are shown in TABLE 3. The transduced T cells will be assayed for CD103 and IL7Ra protein levels (via FACS) and transcriptomic changes (via bulk RNA-seq), as described in Example 5. The transduced cells will show increased CD103 and IL7Ra expression, decreased circulatory marker expression, and increased tissue residency marker expression. *** [000195] The foregoing description of the specific aspects will so fully reveal the general nature of the invention that others can, by applying knowledge within the skill of the art, readily modify and/or adapt for various applications such specific aspects, without undue experimentation, without departing from the general concept of the present disclosure. Therefore, such adaptations and modifications are intended to be within the meaning and range of equivalents of the disclosed aspects, based on the teaching and guidance presented herein. It is to be understood that the phraseology or terminology herein is for the purpose of description and not of limitation, such that the terminology or phraseology of the present specification is to be interpreted by the skilled artisan in light of the teachings and guidance. [000196] The breadth and scope of the present disclosure should not be limited by any of the above-described exemplary aspects, but should be defined only in accordance with the following claims and their equivalents. [000197] All publications, patents, patent applications, and/or other documents cited in this application are incorporated by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application, and/or other document were individually indicated to be incorporated by reference for all purposes. [000198] For reasons of completeness, various aspects of the invention are set out in the following numbered clauses: [000199] Clause 1. An isolated polynucleotide encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. [000200] Clause 2. An isolated polynucleotide encoding a transcription factor selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof. [000201] Clause 3. The isolated polynucleotide of clause 1 or 2, wherein the isolated polynucleotide comprises a sequence selected from SEQ ID NOs: 75-97. [000202] Clause 4. The isolated polynucleotide of any one of clauses 1-3, wherein the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. [000203] Clause 5. A vector encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. [000204] Clause 6. A vector encoding a transcription factor selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof. [000205] Clause 7. The vector of clause 5 or 6, wherein the vector comprises a promoter operably linked to a polynucleotide sequence encoding the transcription factor. [000206] Clause 8. The vector of clause 7, wherein the promoter is non-endogenous to the transcription factor. [000207] Clause 9. The vector of clause 7 or 8, wherein the promoter is a constitutive promoter, or a ubiquitous promoter, or an inducible promoter, or a cell-specific promoter, or a tissue-specific promoter. [000208] Clause 10. The vector of any one of clauses 5-9, wherein the vector comprises an open reading frame (ORF) of the transcription factor. [000209] Clause 11. The vector of any one of clauses 5-10, wherein the vector comprises a sequence selected from SEQ ID NOs: 75-97 or encodes a polypeptide comprising a sequence selected from SEQ ID NOs: 98-120. [000210] Clause 12. The vector of any one of clauses 5-11, wherein the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. [000211] Clause 13. The vector of any one of clauses 5-12, wherein the vector is a viral vector. [000212] Clause 14. The vector of clause 13, wherein the vector is a lentiviral vector. [000213] Clause 15. The vector of clause 13, wherein the vector is an adeno-associated virus (AAV) vector. [000214] Clause 16. The vector of clause 15, wherein the AAV vector is selected from AAV1, AAV2, AAV5, AAV6, AAV8, AAV9, and an engineered AAV vector. [000215] Clause 17. A method of modulating T cells, the method comprising administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [000216] Clause 18. The method of clause 17, wherein modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof. [000217] Clause 19. A method of increasing T cells, the method comprising administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [000218] Clause 20. A method of enhancing adoptive T cell therapy (ACT) in a subject, the method comprising administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [000219] Clause 21. A method of treating cancer in a subject, the method comprising administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene. [000220] Clause 22. The method of any one of clauses 17-21, wherein the gene is selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof. [000221] Clause 23. The method of any one of clauses 17-22, wherein the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. [000222] Clause 24. The method of any one of clauses 17-23, wherein the activator modulates T cells, and wherein modulating T cells comprises increasing T cells, or increasing T cell distribution, or increasing tissue infiltration, or increasing memory T cells, or increasing the lifetime of a T cell, or preventing T cell exhaustions, or reversing T cell exhaustions, or reducing T cell exhaustion, or enhancing the therapeutic potential of T cells, or a combination thereof. [000223] Clause 25. The method of any one of clauses 17-24, wherein administration of the activator to the T cell results in a modified T cell, and suitably wherein the modified T cell is administered to a subject. [000224] Clause 26. The method of any one of clauses 17-25, wherein the T cell is autologous or allogenic. [000225] Clause 27. The method of any one of clauses 17-26, wherein the activator modulates gene expression within the T cell. [000226] Clause 28. The method of any one of clauses 17-26, wherein the activator increases expression of CD103 or IL7Ra, or a combination thereof, in the T cell. [000227] Clause 29. The method of any one of clauses 17-28, wherein the activator comprises a polypeptide, or a polynucleotide, or a small molecule, or a combination thereof. [000228] Clause 30. The method of any one of clauses 17-29, wherein the activator comprises a polynucleotide encoding the gene. [000229] Clause 31. The method of any one of clauses 17-30, wherein the activator comprises a polynucleotide comprising the open reading frame of the gene or a polynucleotide encoding a protein encoded by the gene. [000230] Clause 32. The method of any one of clauses 17-31, wherein the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polynucleotide encoding a polypeptide comprising an amino acid sequence selected from SEQ ID NOs: 98-120. [000231] Clause 33. The method of any one of clauses 17-32, wherein the activator comprises a polypeptide comprising a protein encoded by the gene. [000232] Clause 34. The method of any one of clauses 17-29, wherein the activator comprises a polypeptide selected from SEQ ID NOs: 98-120. [000233] Clause 35. The method of any one of clauses 17-32, wherein the activator comprises the vector of any one of clauses 5-16. [000234] Clause 36. The method of any one of clauses 17-35, wherein the activator or a polynucleotide encoding the activator is encapsulated within a lipid nanoparticle or polymeric carrier. [000235] Clause 37. The method of any one of clauses 17-36, the method further comprising administering at least one cancer therapy or at least one antiviral therapy. [000236] Clause 38. A vector comprising the isolated polynucleotide of any one of clauses 1-4. [000237] Clause 39. A cell comprising the isolated polynucleotide of any one of clauses 1- 4, or the vector of any one of clauses 5-16, or the vector of clause 38. [000238] Clause 40. The cell of clause 39, wherein the cell is a CD8+ T cell or a CD4+ T cell. [000239] Clause 41. A pharmaceutical composition comprising: the isolated polynucleotide of any one of clauses 1-4, or the vector of any one of clauses 5-16, or the vector of clause 38, or a combination thereof. [000240] Clause 42. The pharmaceutical composition of clause 41, further comprising at least one cancer therapy or at least one antiviral therapy. [000241] Clause 43. A composition for increasing T cells, the composition comprising an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof. [000242] Clause 44. The composition of clause 43, wherein the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. [000243] Clause 45. The composition of clause 43 or 44, wherein the activator comprises a polynucleotide encoding the gene, or a polynucleotide encoding the open reading frame of the gene, or a polypeptide encoded by the gene, or a combination thereof. [000244] Clause 46. The composition of clause 45, wherein the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polypeptide selected from SEQ ID NOs: 98-120. [000245] Clause 47. The composition of any one of clauses 43-46, further comprising at least one cancer therapy or at least one antiviral therapy. [000246] Clause 48. The composition of any one of clauses 43-47, wherein the activator comprises a DNA targeting composition, the DNA targeting composition comprising: (a) a Cas9 protein and at least one guide RNA (gRNA) that targets the Cas9 protein to the gene or a regulatory element thereof; or (b) a meganuclease, or (c) a fusion protein, wherein the fusion protein comprises two heterologous polypeptide domains, wherein the first polypeptide domain comprises a zinc finger protein or a TALE or a Cas12 protein or a Cas13 protein or a Cas9 protein, and wherein the second polypeptide domain has an activity selected from transcription activation activity, transcription repression activity, nuclease activity, base editing activity, prime editing activity, transcription release factor activity, histone modification activity, nucleic acid association activity, methylase activity, and demethylase activity, wherein when the first polypeptide domain comprises a Cas9 protein the DNA targeting composition further comprises at least one guide RNA (gRNA) that targets the Cas9 protein to the gene or a regulatory element thereof. [000247] Clause 49. A DNA targeting composition comprising: a Cas9 protein or a fusion protein, wherein the fusion protein comprises two heterologous polypeptide domains, wherein the first polypeptide domain comprises a zinc finger protein or a TALE or a Cas12 protein or a Cas13 protein or a Cas9 protein, and wherein the second polypeptide domain has an activity selected from transcription activation activity, transcription repression activity, nuclease activity, base editing activity, prime editing activity, transcription release factor activity, histone modification activity, nucleic acid association activity, methylase activity, and demethylase activity; and at least one guide RNA (gRNA) that targets the Cas9 protein to a target gene or a regulatory element thereof when the DNA targeting composition comprises a Cas9 protein, wherein the target gene is selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1. [000248] Clause 50. The composition of any one of clauses 48-49, wherein the gene is selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5. [000249] Clause 51. The composition of any one of clauses 48-50, wherein the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof. [000250] Clause 52. The composition of any one of clauses 48-51, wherein the gRNA is encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 121-156, or comprises a sequence selected from SEQ ID NOs: 157-192. [000251] Clause 53. The composition of any one of clauses 48-52, wherein the Cas protein comprises a Streptococcus pyogenes Cas9 protein, or a Staphylococcus aureus Cas9 protein, or any fragment thereof. [000252] Clause 54. The composition of any one of clauses 48-53, wherein the Cas9 protein comprises the amino acid sequence of one of SEQ ID NOs: 26-29, or any fragment thereof, and/or wherein the Cas9 protein is encoded by a polynucleotide comprising a sequence selected from SEQ ID NOs: 30-39, and/or wherein the Cas9 protein comprises an amino acid sequence having at least 90% or greater identity to a sequence selected from SEQ ID NOs: 26-29, or any fragment thereof, and/or wherein the Cas9 protein is encoded by a polynucleotide comprising a sequence having at least 90% or greater identity to a sequence selected from SEQ ID NOs: 30-39, or any fragment thereof, and/or wherein the Cas9 protein comprises an amino acid sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to a sequence selected from SEQ ID NOs: 26-29, or any fragment thereof, and/or wherein the Cas9 protein is encoded by a polynucleotide comprising a sequence having one, two, three, four, five or more changes selected from nucleotide substitutions, insertions, or deletions, relative to a sequence selected from SEQ ID NOs: 30-39, or any fragment thereof. [000253] Clause 55. The composition of any one of clauses 48-54, wherein the fusion protein comprises more than one second polypeptide domain. [000254] Clause 56. The composition of any one of clauses 48-55, wherein the second polypeptide domain has transcription activation activity. [000255] Clause 57. The composition of clause 56, wherein the second polypeptide domain comprises a polypeptide selected from VP16, VP64, p65, TET1, VPR, VPH, Rta, and p300, or a fragment thereof. [000256] Clause 58. The composition of clause 57, wherein the second polypeptide domain comprises VP64, p300, VPH, or VPR, or a fragment thereof. [000257] Clause 59. The composition of one of clauses 48-58, wherein the second polypeptide domain comprises the amino acid sequence of SEQ ID NO: 41, 42, 53, or 55, or any fragment thereof, and/or wherein the second polypeptide domain is encoded by a polynucleotide comprising the sequence of SEQ ID NO: 54 or 56, and/or wherein the second polypeptide domain comprises an amino acid sequence having at least 90% or greater identity to SEQ ID NO: 41, 42, 53, or 55, or any fragment thereof, and/or wherein the second polypeptide domain is encoded by a polynucleotide comprising a sequence having at least 90% or greater identity to SEQ ID NO: 54 or 56, or any fragment thereof, and/or wherein the second polypeptide domain comprises an amino acid sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to SEQ ID NO: 41, 42, 53, or 55, or any fragment thereof, and/or wherein the second polypeptide domain is encoded by a polynucleotide comprising a sequence having one, two, three, four, five or more changes selected from nucleotide substitutions, insertions, or deletions, relative to SEQ ID NO: 54 or 56, or any fragment thereof. [000258] Clause 60. The composition of any one of clauses 48-59, wherein the fusion protein comprises the amino acid sequence of SEQ ID NO: 43, or any fragment thereof, and/or wherein the fusion protein is encoded by a polynucleotide comprising the sequence of SEQ ID NO: 44, and/or wherein the fusion protein comprises an amino acid sequence having at least 90% or greater identity to SEQ ID NO: 43, or any fragment thereof, and/or wherein the fusion protein is encoded by a polynucleotide comprising a sequence having at least 90% or greater identity to SEQ ID NO: 44, or any fragment thereof, and/or wherein the fusion protein comprises an amino acid sequence having one, two, three, four, five or more changes selected from amino acid substitutions, insertions, or deletions, relative to SEQ ID NO: 43, or any fragment thereof, and/or wherein the fusion protein is encoded by a polynucleotide comprising a sequence having one, two, three, four, five or more changes selected from nucleotide substitutions, insertions, or deletions, relative to SEQ ID NO: 44. [000259] Clause 61. The composition of any one of clauses 48-60, further comprising at least one cancer therapy or at least one antiviral therapy. [000260] Clause 62. An isolated polynucleotide sequence encoding the composition of any one of clauses 48-61. [000261] Clause 63. A vector comprising the isolated polynucleotide sequence of clause 62. [000262] Clause 64. A cell comprising the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or a combination thereof. [000263] Clause 65. The cell of clause 64, wherein the cell is a CD8+ T cell or a CD4+ T cell. [000264] Clause 66. A pharmaceutical composition comprising: the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or a combination thereof. [000265] Clause 67. A method of modulating T cells, the method comprising administering to a T cell or a subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof. [000266] Clause 68. The method of clause 67, wherein modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof. [000267] Clause 69. The method of clause 67 or 68, wherein the composition or isolated polynucleotide sequence or vector is administered to a T cell, and wherein the T cell thereby increases expression of CD103 or IL7Ra, or a combination thereof. [000268] Clause 70. A method of increasing T cells, the method comprising administering to a T cell or a subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof. [000269] Clause 71. A method of enhancing adoptive T cell therapy (ACT) in a subject, the method comprising administering to a T cell or the subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof. [000270] Clause 72. A method of treating cancer in a subject, the method comprising administering to a T cell or the subject the composition of any one of clauses 48-61, or the isolated polynucleotide sequence of clause 62, or the vector of clause 63, or the cell of clause 64 or 65, or the pharmaceutical composition of clause 66, or a combination thereof. SEQUENCES SEQ ID NO: 1 NRG (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 2 NGG (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 3 NAG (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 4 NGGNG (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 5 NNAGAAW (W = A or T; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 6 NAAR (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 7 NNGRR (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 8 NNGRRN (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 9 NNGRRT (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 10 NNGRRV (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T; V = A or C or G) SEQ ID NO: 11 NNNNGATT (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 12 NNNNGNNN (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 13 NGA (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 14 NNNRRT (R = A or G; N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 15 ATTCCT SEQ ID NO: 16 NGAN (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 17 NGNG (N can be any nucleotide residue, e.g., any of A, G, C, or T) SEQ ID NO: 18 DNA sequence of the gRNA constant region gtttaagagctatgctggaaacagcatagcaagtttaaataaggctagtccgttatcaacttgaaaaa gtggcaccgagtcggtgc SEQ ID NO: 19 RNA sequence of the gRNA constant region guuuaagagcuaugcuggaaacagcauagcaaguuuaaauaaggcuaguccguuaucaacuugaaaaa guggcaccgagucggugc SEQ ID NO: 20 SV40 NLS (Pro-Lys-Lys-Lys-Arg-Lys-Val) SEQ ID NO: 21 GS linker (Gly-Gly-Gly-Gly-Ser) n , wherein n is an integer between 0 and 10 SEQ ID NO: 22 Gly-Gly-Gly-Gly-Gly SEQ ID NO: 23 Gly-Gly-Ala-Gly-Gly SEQ ID NO: 24 Gly-Gly-Gly-Gly-Ser-Ser-Ser SEQ ID NO: 25 Gly-Gly-Gly-Gly-Ala-Ala-Ala SEQ ID NO: 26 Streptococcus pyogenes Cas9 MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTA RRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIY HLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINAS GVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYD DDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVR QQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNG SIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPW NFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQ KKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEEN EDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTIL DFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELV KVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYL QNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWR QLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIRE VKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRK MIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLS MPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKK LKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGN ELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLS AYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRI DLSQLGGD SEQ ID NO: 27 Staphylococcus aureus Cas9 MKRNYILGLDIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRGARRLKRRRRHRIQRVK KLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHNVNEVEEDTGNELSTKE QISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQKAYHQLDQSFIDTYIDL LETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLYNALNDLNNLVITRDEN EKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFTNLKVYHDIKDITARKE IIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGTHNLSLKAINLILDELW HTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKVINAIIKKYGLPNDIII ELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHDMQEGKCLYSLEAIPLE DLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEENSKKGNRTPFQYLSSSDSKISYETFKKHILNLA KGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYFRVNNLDVKVKSINGGF TSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQMFEEKQAESMPEIETEQ EYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRELINDTLYSTRKDDKGNTLIVNNLNGLYDKDNDKL KKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKYSKKDNGPVIKKIKYYG NKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKKENYYEVNSKCYEEAKK LKKISNQAEFIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREYLENMNDKRPPRIIKTI ASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKG SEQ ID NO: 28 Streptococcus pyogenes Cas9 (with D10A) MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTA RRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIY HLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINAS GVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYD DDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVR QQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNG SIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPW NFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQ KKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEEN EDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTIL DFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELV KVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYL QNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWR QLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIRE VKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRK MIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLS MPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKK LKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGN ELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLS AYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRI DLSQLGGD SEQ ID NO: 29 Streptococcus pyogenes Cas9 (with D10A, H849A) MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTA RRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIY HLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINAS GVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYD DDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVR QQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNG SIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPW NFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQ KKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEEN EDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTIL DFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELV KVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYL QNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWR QLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIRE VKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRK MIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLS MPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKK LKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGN ELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLS AYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRI DLSQLGGD SEQ ID NO: 30 Polynucleotide sequence of D10A mutant of S. aureus Cas9 atgaaaagga actacattct ggggctggcc atcgggatta caagcgtggg gtatgggatt attgactatg aaacaaggga cgtgatcgac gcaggcgtca gactgttcaa ggaggccaac gtggaaaaca atgagggacg gagaagcaag aggggagcca ggcgcctgaa acgacggaga aggcacagaa tccagagggt gaagaaactg ctgttcgatt acaacctgct gaccgaccat tctgagctga gtggaattaa tccttatgaa gccagggtga aaggcctgag tcagaagctg tcagaggaag agttttccgc agctctgctg cacctggcta agcgccgagg agtgcataac gtcaatgagg tggaagagga caccggcaac gagctgtcta caaaggaaca gatctcacgc aatagcaaag ctctggaaga gaagtatgtc gcagagctgc agctggaacg gctgaagaaa gatggcgagg tgagagggtc aattaatagg ttcaagacaa gcgactacgt caaagaagcc aagcagctgc tgaaagtgca gaaggcttac caccagctgg atcagagctt catcgatact tatatcgacc tgctggagac tcggagaacc tactatgagg gaccaggaga agggagcccc ttcggatgga aagacatcaa ggaatggtac gagatgctga tgggacattg cacctatttt ccagaagagc tgagaagcgt caagtacgct tataacgcag atctgtacaa cgccctgaat gacctgaaca acctggtcat caccagggat gaaaacgaga aactggaata ctatgagaag ttccagatca tcgaaaacgt gtttaagcag aagaaaaagc ctacactgaa acagattgct aaggagatcc tggtcaacga agaggacatc aagggctacc gggtgacaag cactggaaaa ccagagttca ccaatctgaa agtgtatcac gatattaagg acatcacagc acggaaagaa atcattgaga acgccgaact gctggatcag attgctaaga tcctgactat ctaccagagc tccgaggaca tccaggaaga gctgactaac ctgaacagcg agctgaccca ggaagagatc gaacagatta gtaatctgaa ggggtacacc ggaacacaca acctgtccct gaaagctatc aatctgattc tggatgagct gtggcataca aacgacaatc agattgcaat ctttaaccgg ctgaagctgg tcccaaaaaa ggtggacctg agtcagcaga aagagatccc aaccacactg gtggacgatt tcattctgtc acccgtggtc aagcggagct tcatccagag catcaaagtg atcaacgcca tcatcaagaa gtacggcctg cccaatgata tcattatcga gctggctagg gagaagaaca gcaaggacgc acagaagatg atcaatgaga tgcagaaacg aaaccggcag accaatgaac gcattgaaga gattatccga actaccggga aagagaacgc aaagtacctg attgaaaaaa tcaagctgca cgatatgcag gagggaaagt gtctgtattc tctggaggcc atccccctgg aggacctgct gaacaatcca ttcaactacg aggtcgatca tattatcccc agaagcgtgt ccttcgacaa ttcctttaac aacaaggtgc tggtcaagca ggaagagaac tctaaaaagg gcaataggac tcctttccag tacctgtcta gttcagattc caagatctct tacgaaacct ttaaaaagca cattctgaat ctggccaaag gaaagggccg catcagcaag accaaaaagg agtacctgct ggaagagcgg gacatcaaca gattctccgt ccagaaggat tttattaacc ggaatctggt ggacacaaga tacgctactc gcggcctgat gaatctgctg cgatcctatt tccgggtgaa caatctggat gtgaaagtca agtccatcaa cggcgggttc acatcttttc tgaggcgcaa atggaagttt aaaaaggagc gcaacaaagg gtacaagcac catgccgaag atgctctgat tatcgcaaat gccgacttca tctttaagga gtggaaaaag ctggacaaag ccaagaaagt gatggagaac cagatgttcg aagagaagca ggccgaatct atgcccgaaa tcgagacaga acaggagtac aaggagattt tcatcactcc tcaccagatc aagcatatca aggatttcaa ggactacaag tactctcacc gggtggataa aaagcccaac agagagctga tcaatgacac cctgtatagt acaagaaaag acgataaggg gaataccctg attgtgaaca atctgaacgg actgtacgac aaagataatg acaagctgaa aaagctgatc aacaaaagtc ccgagaagct gctgatgtac caccatgatc ctcagacata tcagaaactg aagctgatta tggagcagta cggcgacgag aagaacccac tgtataagta ctatgaagag actgggaact acctgaccaa gtatagcaaa aaggataatg gccccgtgat caagaagatc aagtactatg ggaacaagct gaatgcccat ctggacatca cagacgatta ccctaacagt cgcaacaagg tggtcaagct gtcactgaag ccatacagat tcgatgtcta tctggacaac ggcgtgtata aatttgtgac tgtcaagaat ctggatgtca tcaaaaagga gaactactat gaagtgaata gcaagtgcta cgaagaggct aaaaagctga aaaagattag caaccaggca gagttcatcg cctcctttta caacaacgac ctgattaaga tcaatggcga actgtatagg gtcatcgggg tgaacaatga tctgctgaac cgcattgaag tgaatatgat tgacatcact taccgagagt atctggaaaa catgaatgat aagcgccccc ctcgaattat caaaacaatt gcctctaaga ctcagagtat caaaaagtac tcaaccgaca ttctgggaaa cctgtatgag gtgaagagca aaaagcaccc tcagattatc aaaaagggc SEQ ID NO: 31 Polynucleotide sequence of N580A mutant of S. aureus Cas9 atgaaaagga actacattct ggggctggac atcgggatta caagcgtggg gtatgggatt attgactatg aaacaaggga cgtgatcgac gcaggcgtca gactgttcaa ggaggccaac gtggaaaaca atgagggacg gagaagcaag aggggagcca ggcgcctgaa acgacggaga aggcacagaa tccagagggt gaagaaactg ctgttcgatt acaacctgct gaccgaccat tctgagctga gtggaattaa tccttatgaa gccagggtga aaggcctgag tcagaagctg tcagaggaag agttttccgc agctctgctg cacctggcta agcgccgagg agtgcataac gtcaatgagg tggaagagga caccggcaac gagctgtcta caaaggaaca gatctcacgc aatagcaaag ctctggaaga gaagtatgtc gcagagctgc agctggaacg gctgaagaaa gatggcgagg tgagagggtc aattaatagg ttcaagacaa gcgactacgt caaagaagcc aagcagctgc tgaaagtgca gaaggcttac caccagctgg atcagagctt catcgatact tatatcgacc tgctggagac tcggagaacc tactatgagg gaccaggaga agggagcccc ttcggatgga aagacatcaa ggaatggtac gagatgctga tgggacattg cacctatttt ccagaagagc tgagaagcgt caagtacgct tataacgcag atctgtacaa cgccctgaat gacctgaaca acctggtcat caccagggat gaaaacgaga aactggaata ctatgagaag ttccagatca tcgaaaacgt gtttaagcag aagaaaaagc ctacactgaa acagattgct aaggagatcc tggtcaacga agaggacatc aagggctacc gggtgacaag cactggaaaa ccagagttca ccaatctgaa agtgtatcac gatattaagg acatcacagc acggaaagaa atcattgaga acgccgaact gctggatcag attgctaaga tcctgactat ctaccagagc tccgaggaca tccaggaaga gctgactaac ctgaacagcg agctgaccca ggaagagatc gaacagatta gtaatctgaa ggggtacacc ggaacacaca acctgtccct gaaagctatc aatctgattc tggatgagct gtggcataca aacgacaatc agattgcaat ctttaaccgg ctgaagctgg tcccaaaaaa ggtggacctg agtcagcaga aagagatccc aaccacactg gtggacgatt tcattctgtc acccgtggtc aagcggagct tcatccagag catcaaagtg atcaacgcca tcatcaagaa gtacggcctg cccaatgata tcattatcga gctggctagg gagaagaaca gcaaggacgc acagaagatg atcaatgaga tgcagaaacg aaaccggcag accaatgaac gcattgaaga gattatccga actaccggga aagagaacgc aaagtacctg attgaaaaaa tcaagctgca cgatatgcag gagggaaagt gtctgtattc tctggaggcc atccccctgg aggacctgct gaacaatcca ttcaactacg aggtcgatca tattatcccc agaagcgtgt ccttcgacaa ttcctttaac aacaaggtgc tggtcaagca ggaagaggcc tctaaaaagg gcaataggac tcctttccag tacctgtcta gttcagattc caagatctct tacgaaacct ttaaaaagca cattctgaat ctggccaaag gaaagggccg catcagcaag accaaaaagg agtacctgct ggaagagcgg gacatcaaca gattctccgt ccagaaggat tttattaacc ggaatctggt ggacacaaga tacgctactc gcggcctgat gaatctgctg cgatcctatt tccgggtgaa caatctggat gtgaaagtca agtccatcaa cggcgggttc acatcttttc tgaggcgcaa atggaagttt aaaaaggagc gcaacaaagg gtacaagcac catgccgaag atgctctgat tatcgcaaat gccgacttca tctttaagga gtggaaaaag ctggacaaag ccaagaaagt gatggagaac cagatgttcg aagagaagca ggccgaatct atgcccgaaa tcgagacaga acaggagtac aaggagattt tcatcactcc tcaccagatc aagcatatca aggatttcaa ggactacaag tactctcacc gggtggataa aaagcccaac agagagctga tcaatgacac cctgtatagt acaagaaaag acgataaggg gaataccctg attgtgaaca atctgaacgg actgtacgac aaagataatg acaagctgaa aaagctgatc aacaaaagtc ccgagaagct gctgatgtac caccatgatc ctcagacata tcagaaactg aagctgatta tggagcagta cggcgacgag aagaacccac tgtataagta ctatgaagag actgggaact acctgaccaa gtatagcaaa aaggataatg gccccgtgat caagaagatc aagtactatg ggaacaagct gaatgcccat ctggacatca cagacgatta ccctaacagt cgcaacaagg tggtcaagct gtcactgaag ccatacagat tcgatgtcta tctggacaac ggcgtgtata aatttgtgac tgtcaagaat ctggatgtca tcaaaaagga gaactactat gaagtgaata gcaagtgcta cgaagaggct aaaaagctga aaaagattag caaccaggca gagttcatcg cctcctttta caacaacgac ctgattaaga tcaatggcga actgtatagg gtcatcgggg tgaacaatga tctgctgaac cgcattgaag tgaatatgat tgacatcact taccgagagt atctggaaaa catgaatgat aagcgccccc ctcgaattat caaaacaatt gcctctaaga ctcagagtat caaaaagtac tcaaccgaca ttctgggaaa cctgtatgag gtgaagagca aaaagcaccc tcagattatc aaaaagggc SEQ ID NO: 32 codon optimized polynucleotide encoding S. pyogenes Cas9 atggataaaa agtacagcat cgggctggac atcggtacaa actcagtggg gtgggccgtg attacggacg agtacaaggt accctccaaa aaatttaaag tgctgggtaa cacggacaga cactctataa agaaaaatct tattggagcc ttgctgttcg actcaggcga gacagccgaa gccacaaggt tgaagcggac cgccaggagg cggtatacca ggagaaagaa ccgcatatgc tacctgcaag aaatcttcag taacgagatg gcaaaggttg acgatagctt tttccatcgc ctggaagaat cctttcttgt tgaggaagac aagaagcacg aacggcaccc catctttggc aatattgtcg acgaagtggc atatcacgaa aagtacccga ctatctacca cctcaggaag aagctggtgg actctaccga taaggcggac ctcagactta tttatttggc actcgcccac atgattaaat ttagaggaca tttcttgatc gagggcgacc tgaacccgga caacagtgac gtcgataagc tgttcatcca acttgtgcag acctacaatc aactgttcga agaaaaccct ataaatgctt caggagtcga cgctaaagca atcctgtccg cgcgcctctc aaaatctaga agacttgaga atctgattgc tcagttgccc ggggaaaaga aaaatggatt gtttggcaac ctgatcgccc tcagtctcgg actgacccca aatttcaaaa gtaacttcga cctggccgaa gacgctaagc tccagctgtc caaggacaca tacgatgacg acctcgacaa tctgctggcc cagattgggg atcagtacgc cgatctcttt ttggcagcaa agaacctgtc cgacgccatc ctgttgagcg atatcttgag agtgaacacc gaaattacta aagcacccct tagcgcatct atgatcaagc ggtacgacga gcatcatcag gatctgaccc tgctgaaggc tcttgtgagg caacagctcc ccgaaaaata caaggaaatc ttctttgacc agagcaaaaa cggctacgct ggctatatag atggtggggc cagtcaggag gaattctata aattcatcaa gcccattctc gagaaaatgg acggcacaga ggagttgctg gtcaaactta acagggagga cctgctgcgg aagcagcgga cctttgacaa cgggtctatc ccccaccaga ttcatctggg cgaactgcac gcaatcctga ggaggcagga ggatttttat ccttttctta aagataaccg cgagaaaata gaaaagattc ttacattcag gatcccgtac tacgtgggac ctctcgcccg gggcaattca cggtttgcct ggatgacaag gaagtcagag gagactatta caccttggaa cttcgaagaa gtggtggaca agggtgcatc tgcccagtct ttcatcgagc ggatgacaaa ttttgacaag aacctcccta atgagaaggt gctgcccaaa cattctctgc tctacgagta ctttaccgtc tacaatgaac tgactaaagt caagtacgtc accgagggaa tgaggaagcc ggcattcctt agtggagaac agaagaaggc gattgtagac ctgttgttca agaccaacag gaaggtgact gtgaagcaac ttaaagaaga ctactttaag aagatcgaat gttttgacag tgtggaaatt tcaggggttg aagaccgctt caatgcgtca ttggggactt accatgatct tctcaagatc ataaaggaca aagacttcct ggacaacgaa gaaaatgagg atattctcga agacatcgtc ctcaccctga ccctgttcga agacagggaa atgatagaag agcgcttgaa aacctatgcc cacctcttcg acgataaagt tatgaagcag ctgaagcgca ggagatacac aggatgggga agattgtcaa ggaagctgat caatggaatt agggataaac agagtggcaa gaccatactg gatttcctca aatctgatgg cttcgccaat aggaacttca tgcaactgat tcacgatgac tctcttacct tcaaggagga cattcaaaag gctcaggtga gcgggcaggg agactccctt catgaacaca tcgcgaattt ggcaggttcc cccgctatta aaaagggcat ccttcaaact gtcaaggtgg tggatgaatt ggtcaaggta atgggcagac ataagccaga aaatattgtg atcgagatgg cccgcgaaaa ccagaccaca cagaagggcc agaaaaatag tagagagcgg atgaagagga tcgaggaggg catcaaagag ctgggatctc agattctcaa agaacacccc gtagaaaaca cacagctgca gaacgaaaaa ttgtacttgt actatctgca gaacggcaga gacatgtacg tcgaccaaga acttgatatt aatagactgt ccgactatga cgtagaccat atcgtgcccc agtccttcct gaaggacgac tccattgata acaaagtctt gacaagaagc gacaagaaca ggggtaaaag tgataatgtg cctagcgagg aggtggtgaa aaaaatgaag aactactggc gacagctgct taatgcaaag ctcattacac aacggaagtt cgataatctg acgaaagcag agagaggtgg cttgtctgag ttggacaagg cagggtttat taagcggcag ctggtggaaa ctaggcagat cacaaagcac gtggcgcaga ttttggacag ccggatgaac acaaaatacg acgaaaatga taaactgata cgagaggtca aagttatcac gctgaaaagc aagctggtgt ccgattttcg gaaagacttc cagttctaca aagttcgcga gattaataac taccatcatg ctcacgatgc gtacctgaac gctgttgtcg ggaccgcctt gataaagaag tacccaaagc tggaatccga gttcgtatac ggggattaca aagtgtacga tgtgaggaaa atgatagcca agtccgagca ggagattgga aaggccacag ctaagtactt cttttattct aacatcatga atttttttaa gacggaaatt accctggcca acggagagat cagaaagcgg ccccttatag agacaaatgg tgaaacaggt gaaatcgtct gggataaggg cagggatttc gctactgtga ggaaggtgct gagtatgcca caggtaaata tcgtgaaaaa aaccgaagta cagaccggag gattttccaa ggaaagcatt ttgcctaaaa gaaactcaga caagctcatc gcccgcaaga aagattggga ccctaagaaa tacgggggat ttgactcacc caccgtagcc tattctgtgc tggtggtagc taaggtggaa aaaggaaagt ctaagaagct gaagtccgtg aaggaactct tgggaatcac tatcatggaa agatcatcct ttgaaaagaa ccctatcgat ttcctggagg ctaagggtta caaggaggtc aagaaagacc tcatcattaa actgccaaaa tactctctct tcgagctgga aaatggcagg aagagaatgt tggccagcgc cggagagctg caaaagggaa acgagcttgc tctgccctcc aaatatgtta attttctcta tctcgcttcc cactatgaaa agctgaaagg gtctcccgaa gataacgagc agaagcagct gttcgtcgaa cagcacaagc actatctgga tgaaataatc gaacaaataa gcgagttcag caaaagggtt atcctggcgg atgctaattt ggacaaagta ctgtctgctt ataacaagca ccgggataag cctattaggg aacaagccga gaatataatt cacctcttta cactcacgaa tctcggagcc cccgccgcct tcaaatactt tgatacgact atcgaccgga aacggtatac cagtaccaaa gaggtcctcg atgccaccct catccaccag tcaattactg gcctgtacga aacacggatc gacctctctc aactgggcgg cgactag SEQ ID NO: 33 codon optimized nucleic acid sequences encoding S. aureus Cas9 atgaaaagga actacattct ggggctggac atcgggatta caagcgtggg gtatgggatt attgactatg aaacaaggga cgtgatcgac gcaggcgtca gactgttcaa ggaggccaac gtggaaaaca atgagggacg gagaagcaag aggggagcca ggcgcctgaa acgacggaga aggcacagaa tccagagggt gaagaaactg ctgttcgatt acaacctgct gaccgaccat tctgagctga gtggaattaa tccttatgaa gccagggtga aaggcctgag tcagaagctg tcagaggaag agttttccgc agctctgctg cacctggcta agcgccgagg agtgcataac gtcaatgagg tggaagagga caccggcaac gagctgtcta caaaggaaca gatctcacgc aatagcaaag ctctggaaga gaagtatgtc gcagagctgc agctggaacg gctgaagaaa gatggcgagg tgagagggtc aattaatagg ttcaagacaa gcgactacgt caaagaagcc aagcagctgc tgaaagtgca gaaggcttac caccagctgg atcagagctt catcgatact tatatcgacc tgctggagac tcggagaacc tactatgagg gaccaggaga agggagcccc ttcggatgga aagacatcaa ggaatggtac gagatgctga tgggacattg cacctatttt ccagaagagc tgagaagcgt caagtacgct tataacgcag atctgtacaa cgccctgaat gacctgaaca acctggtcat caccagggat gaaaacgaga aactggaata ctatgagaag ttccagatca tcgaaaacgt gtttaagcag aagaaaaagc ctacactgaa acagattgct aaggagatcc tggtcaacga agaggacatc aagggctacc gggtgacaag cactggaaaa ccagagttca ccaatctgaa agtgtatcac gatattaagg acatcacagc acggaaagaa atcattgaga acgccgaact gctggatcag attgctaaga tcctgactat ctaccagagc tccgaggaca tccaggaaga gctgactaac ctgaacagcg agctgaccca ggaagagatc gaacagatta gtaatctgaa ggggtacacc ggaacacaca acctgtccct gaaagctatc aatctgattc tggatgagct gtggcataca aacgacaatc agattgcaat ctttaaccgg ctgaagctgg tcccaaaaaa ggtggacctg agtcagcaga aagagatccc aaccacactg gtggacgatt tcattctgtc acccgtggtc aagcggagct tcatccagag catcaaagtg atcaacgcca tcatcaagaa gtacggcctg cccaatgata tcattatcga gctggctagg gagaagaaca gcaaggacgc acagaagatg atcaatgaga tgcagaaacg aaaccggcag accaatgaac gcattgaaga gattatccga actaccggga aagagaacgc aaagtacctg attgaaaaaa tcaagctgca cgatatgcag gagggaaagt gtctgtattc tctggaggcc tccccctgg aggacctgct gaacaatcca ttcaactacg aggtcgatca tattatcccc agaagcgtgt ccttcgacaa ttcctttaac aacaaggtgc tggtcaagca ggaagagaac tctaaaaagg gcaataggac tcctttccag tacctgtcta gttcagattc caagatctct tacgaaacct ttaaaaagca cattctgaat ctggccaaag gaaagggccg catcagcaag accaaaaagg agtacctgct ggaagagcgg gacatcaaca gattctccgt ccagaaggat tttattaacc ggaatctggt ggacacaaga tacgctactc gcggcctgat gaatctgctg cgatcctatt tccgggtgaa caatctggat gtgaaagtca agtccatcaa cggcgggttc acatcttttc tgaggcgcaa atggaagttt aaaaaggagc gcaacaaagg gtacaagcac catgccgaag atgctctgat tatcgcaaat gccgacttca tctttaagga gtggaaaaag ctggacaaag ccaagaaagt gatggagaac cagatgttcg aagagaagca ggccgaatct atgcccgaaa tcgagacaga acaggagtac aaggagattt tcatcactcc tcaccagatc aagcatatca aggatttcaa ggactacaag tactctcacc gggtggataa aaagcccaac agagagctga tcaatgacac cctgtatagt acaagaaaag acgataaggg gaataccctg attgtgaaca atctgaacgg actgtacgac aaagataatg acaagctgaa aaagctgatc aacaaaagtc ccgagaagct gctgatgtac caccatgatc ctcagacata tcagaaactg aagctgatta tggagcagta cggcgacgag aagaacccac tgtataagta ctatgaagag actgggaact acctgaccaa gtatagcaaa aaggataatg gccccgtgat caagaagatc aagtactatg ggaacaagct gaatgcccat ctggacatca cagacgatta ccctaacagt cgcaacaagg tggtcaagct gtcactgaag ccatacagat tcgatgtcta tctggacaac ggcgtgtata aatttgtgac tgtcaagaat ctggatgtca tcaaaaagga gaactactat gaagtgaata gcaagtgcta cgaagaggct aaaaagctga aaaagattag caaccaggca gagttcatcg cctcctttta caacaacgac ctgattaaga tcaatggcga actgtatagg gtcatcgggg tgaacaatga tctgctgaac cgcattgaag tgaatatgat tgacatcact taccgagagt atctggaaaa catgaatgat aagcgccccc ctcgaattat caaaacaatt gcctctaaga ctcagagtat caaaaagtac tcaaccgaca ttctgggaaa cctgtatgag gtgaagagca aaaagcaccc tcagattatc aaaaagggc SEQ ID NO: 34 codon optimized nucleic acid sequences encoding S. aureus Cas9 atgaagcgga actacatcct gggcctggac atcggcatca ccagcgtggg ctacggcatc atcgactacg agacacggga cgtgatcgat gccggcgtgc ggctgttcaa agaggccaac gtggaaaaca acgagggcag gcggagcaag agaggcgcca gaaggctgaa gcggcggagg cggcatagaa tccagagagt gaagaagctg ctgttcgact acaacctgct gaccgaccac agcgagctga gcggcatcaa cccctacgag gccagagtga agggcctgag ccagaagctg agcgaggaag agttctctgc cgccctgctg cacctggcca agagaagagg cgtgcacaac gtgaacgagg tggaagagga caccggcaac gagctgtcca ccaaagagca gatcagccgg aacagcaagg ccctggaaga gaaatacgtg gccgaactgc agctggaacg gctgaagaaa gacggcgaag tgcggggcag catcaacaga ttcaagacca gcgactacgt gaaagaagcc aaacagctgc tgaaggtgca gaaggcctac caccagctgg accagagctt catcgacacc tacatcgacc tgctggaaac ccggcggacc tactatgagg gacctggcga gggcagcccc ttcggctgga aggacatcaa agaatggtac gagatgctga tgggccactg cacctacttc cccgaggaac tgcggagcgt gaagtacgcc tacaacgccg acctgtacaa cgccctgaac gacctgaaca atctcgtgat caccagggac gagaacgaga agctggaata ttacgagaag ttccagatca tcgagaacgt gttcaagcag aagaagaagc ccaccctgaa gcagatcgcc aaagaaatcc tcgtgaacga agaggatatt aagggctaca gagtgaccag caccggcaag cccgagttca ccaacctgaa ggtgtaccac gacatcaagg acattaccgc ccggaaagag attattgaga acgccgagct gctggatcag attgccaaga tcctgaccat ctaccagagc agcgaggaca tccaggaaga actgaccaat ctgaactccg agctgaccca ggaagagatc gagcagatct ctaatctgaa gggctatacc ggcacccaca acctgagcct gaaggccatc aacctgatcc tggacgagct gtggcacacc aacgacaacc agatcgctat cttcaaccgg ctgaagctgg tgcccaagaa ggtggacctg tcccagcaga aagagatccc caccaccctg gtggacgact tcatcctgag ccccgtcgtg aagagaagct tcatccagag catcaaagtg atcaacgcca tcatcaagaa gtacggcctg cccaacgaca tcattatcga gctggcccgc gagaagaact ccaaggacgc ccagaaaatg atcaacgaga tgcagaagcg gaaccggcag accaacgagc ggatcgagga aatcatccgg accaccggca aagagaacgc caagtacctg atcgagaaga tcaagctgca cgacatgcag gaaggcaagt gcctgtacag cctggaagcc atccctctgg aagatctgct gaacaacccc ttcaactatg aggtggacca catcatcccc agaagcgtgt ccttcgacaa cagcttcaac aacaaggtgc tcgtgaagca ggaagaaaac agcaagaagg gcaaccggac cccattccag tacctgagca gcagcgacag caagatcagc tacgaaacct tcaagaagca catcctgaat ctggccaagg gcaagggcag aatcagcaag accaagaaag agtatctgct ggaagaacgg gacatcaaca ggttctccgt gcagaaagac ttcatcaacc ggaacctggt ggataccaga tacgccacca gaggcctgat gaacctgctg cggagctact tcagagtgaa caacctggac gtgaaagtga agtccatcaa tggcggcttc accagctttc tgcggcggaa gtggaagttt aagaaagagc ggaacaaggg gtacaagcac cacgccgagg acgccctgat cattgccaac gccgatttca tcttcaaaga gtggaagaaa ctggacaagg ccaaaaaagt gatggaaaac cagatgttcg aggaaaagca ggccgagagc atgcccgaga tcgaaaccga gcaggagtac aaagagatct tcatcacccc ccaccagatc aagcacatta aggacttcaa ggactacaag tacagccacc gggtggacaa gaagcctaat agagagctga ttaacgacac cctgtactcc acccggaagg acgacaaggg caacaccctg atcgtgaaca atctgaacgg cctgtacgac aaggacaatg acaagctgaa aaagctgatc aacaagagcc ccgaaaagct gctgatgtac caccacgacc cccagaccta ccagaaactg aagctgatta tggaacagta cggcgacgag aagaatcccc tgtacaagta ctacgaggaa accgggaact acctgaccaa gtactccaaa aaggacaacg gccccgtgat caagaagatt aagtattacg gcaacaaact gaacgcccat ctggacatca ccgacgacta ccccaacagc agaaacaagg tcgtgaagct gtccctgaag ccctacagat tcgacgtgta cctggacaat ggcgtgtaca agttcgtgac cgtgaagaat ctggatgtga tcaaaaaaga aaactactac gaagtgaata gcaagtgcta tgaggaagct aagaagctga agaagatcag caaccaggcc gagtttatcg cctccttcta caacaacgat ctgatcaaga tcaacggcga gctgtataga gtgatcggcg tgaacaacga cctgctgaac cggatcgaag tgaacatgat cgacatcacc taccgcgagt acctggaaaa catgaacgac aagaggcccc ccaggatcat taagacaatc gcctccaaga cccagagcat taagaagtac agcacagaca ttctgggcaa cctgtatgaa gtgaaatcta agaagcaccc tcagatcatc aaaaagggc SEQ ID NO: 35 codon optimized nucleic acid sequence encoding S. aureus Cas9 atgaagcgca actacatcct cggactggac atcggcatta cctccgtggg atacggcatc atcgattacg aaactaggga tgtgatcgac gctggagtca ggctgttcaa agaggcgaac gtggagaaca acgaggggcg gcgctcaaag aggggggccc gccggctgaa gcgccgccgc agacatagaa tccagcgcgt gaagaagctg ctgttcgact acaaccttct gaccgaccac tccgaacttt ccggcatcaa cccatatgag gctagagtga agggattgtc ccaaaagctg tccgaggaag agttctccgc cgcgttgctc cacctcgcca agcgcagggg agtgcacaat gtgaacgaag tggaagaaga taccggaaac gagctgtcca ccaaggagca gatcagccgg aactccaagg ccctggaaga gaaatacgtg gcggaactgc aactggagcg gctgaagaaa gacggagaag tgcgcggctc gatcaaccgc ttcaagacct cggactacgt gaaggaggcc aagcagctcc tgaaagtgca aaaggcctat caccaacttg accagtcctt tatcgatacc tacatcgatc tgctcgagac tcggcggact tactacgagg gtccagggga gggctcccca tttggttgga aggatattaa ggagtggtac gaaatgctga tgggacactg cacatacttc cctgaggagc tgcggagcgt gaaatacgca tacaacgcag acctgtacaa cgcgctgaac gacctgaaca atctcgtgat cacccgggac gagaacgaaa agctcgagta ttacgaaaag ttccagatta ttgagaacgt gttcaaacag aagaagaagc cgacactgaa gcagattgcc aaggaaatcc tcgtgaacga agaggacatc aagggctatc gagtgacctc aacgggaaag ccggagttca ccaatctgaa ggtctaccac gacatcaaag acattaccgc ccggaaggag atcattgaga acgcggagct gttggaccag attgcgaaga ttctgaccat ctaccaatcc tccgaggata ttcaggaaga actcaccaac ctcaacagcg aactgaccca ggaggagata gagcaaatct ccaacctgaa gggctacacc ggaactcata acctgagcct gaaggccatc aacttgatcc tggacgagct gtggcacacc aacgataacc agatcgctat tttcaatcgg ctgaagctgg tccccaagaa agtggacctc tcacaacaaa aggagatccc tactaccctt gtggacgatt tcattctgtc ccccgtggtc aagagaagct tcatacagtc aatcaaagtg atcaatgcca ttatcaagaa atacggtctg cccaacgaca ttatcattga gctcgcccgc gagaagaact cgaaggacgc ccagaagatg attaacgaaa tgcagaagag gaaccgacag actaacgaac ggatcgaaga aatcatccgg accaccggga aggaaaacgc gaagtacctg atcgaaaaga tcaagctcca tgacatgcag gaaggaaagt gtctgtactc gctggaggcc attccgctgg aggacttgct gaacaaccct tttaactacg aagtggatca tatcattccg aggagcgtgt cattcgacaa ttccttcaac aacaaggtcc tcgtgaagca ggaggaaaac tcgaagaagg gaaaccgcac gccgttccag tacctgagca gcagcgactc caagatttcc tacgaaacct tcaagaagca catcctcaac ctggcaaagg ggaagggtcg catctccaag accaagaagg aatatctgct ggaagaaaga gacatcaaca gattctccgt gcaaaaggac ttcatcaacc gcaacctcgt ggatactaga tacgctactc ggggtctgat gaacctcctg agaagctact ttagagtgaa caatctggac gtgaaggtca agtcgattaa cggaggtttc acctccttcc tgcggcgcaa gtggaagttc aagaaggaac ggaacaaggg ctacaagcac cacgccgagg acgccctgat cattgccaac gccgacttca tcttcaaaga atggaagaaa cttgacaagg ctaagaaggt catggaaaac cagatgttcg aagaaaagca ggccgagtct atgcctgaaa tcgagactga acaggagtac aaggaaatct ttattacgcc acaccagatc aaacacatca aggatttcaa ggattacaag tactcacatc gcgtggacaa aaagccgaac agggaactga tcaacgacac cctctactcc acccggaagg atgacaaagg gaataccctc atcgtcaaca accttaacgg cctgtacgac aaggacaacg ataagctgaa gaagctcatt aacaagtcgc ccgaaaagtt gctgatgtac caccacgacc ctcagactta ccagaagctc aagctgatca tggagcagta tggggacgag aaaaacccgt tgtacaagta ctacgaagaa actgggaatt atctgactaa gtactccaag aaagataacg gccccgtgat taagaagatt aagtactacg gcaacaagct gaacgcccat ctggacatca ccgatgacta ccctaattcc cgcaacaagg tcgtcaagct gagcctcaag ccctaccggt ttgatgtgta ccttgacaat ggagtgtaca agttcgtgac tgtgaagaac cttgacgtga tcaagaagga gaactactac gaagtcaact ccaagtgcta cgaggaagca aagaagttga agaagatctc gaaccaggcc gagttcattg cctccttcta taacaacgac ctgattaaga tcaacggcga actgtaccgc gtcattggcg tgaacaacga tctcctgaac cgcatcgaag tgaacatgat cgacatcact taccgggaat acctggagaa tatgaacgac aagcgcccgc cccggatcat taagactatc gcctcaaaga cccagtcgat caagaagtac agcaccgaca tcctgggcaa cctgtacgag gtcaaatcga agaagcaccc ccagatcatc aagaaggga SEQ ID NO: 36 codon optimized nucleic acid sequence encoding S. aureus Cas9 atggccccaaagaagaagcggaaggtcggtatccacggagtcccagcagccaagcggaactacatcct gggcctggacatcggcatcaccagcgtgggctacggcatcatcgactacgagacacgggacgtgatcg atgccggcgtgcggctgttcaaagaggccaacgtggaaaacaacgagggcaggcggagcaagagaggc gccagaaggctgaagcggcggaggcggcatagaatccagagagtgaagaagctgctgttcgactacaa cctgctgaccgaccacagcgagctgagcggcatcaacccctacgaggccagagtgaagggcctgagcc agaagctgagcgaggaagagttctctgccgccctgctgcacctggccaagagaagaggcgtgcacaac gtgaacgaggtggaagaggacaccggcaacgagctgtccaccagagagcagatcagccggaacagcaa ggccctggaagagaaatacgtggccgaactgcagctggaacggctgaagaaagacggcgaagtgcggg gcagcatcaacagattcaagaccagcgactacgtgaaagaagccaaacagctgctgaaggtgcagaag gcctaccaccagctggaccagagcttcatcgacacctacatcgacctgctggaaacccggcggaccta ctatgagggacctggcgagggcagccccttcggctggaaggacatcaaagaatggtacgagatgctga tgggccactgcacctacttccccgaggaactgcggagcgtgaagtacgcctacaacgccgacctgtac aacgccctgaacgacctgaacaatctcgtgatcaccagggacgagaacgagaagctggaatattacga gaagttccagatcatcgagaacgtgttcaagcagaagaagaagcccaccctgaagcagatcgccaaag aaatcctcgtgaacgaagaggatattaagggctacagagtgaccagcaccggcaagcccgagttcacc aacctgaaggtgtaccacgacatcaaggacattaccgcccggaaagagattattgagaacgccgagct gctggatcagattgccaagatcctgaccatctaccagagcagcgaggacatccaggaagaactgacca atctgaactccgagctgacccaggaagagatcgagcagatctctaatctgaagggctataccggcacc cacaacctgagcctgaaggccatcaacctgatcctggacgagctgtggcacaccaacgacaaccagat cgctatcttcaaccggctgaagctggtgcccaagaaggtggacctgtcccagcagaaagagatcccca ccaccctggtggacgacttcatcctgagccccgtcgtgaagagaagcttcatccagagcatcaaagtg atcaacgccatcatcaagaagtacggcctgcccaacgacatcattatcgagctggcccgcgagaagaa ctccaaggacgcccagaaaatgatcaacgagatgcagaagcggaaccggcagaccaacgagcggatcg aggaaatcatccggaccaccggcaaagagaacgccaagtacctgatcgagaagatcaagctgcacgac atgcaggaaggcaagtgcctgtacagcctggaagccatccctctggaagatctgctgaacaacccctt caactatgaggtggaccacatcatccccagaagcgtgtccttcgacaacagcttcaacaacaaggtgc tcgtgaagcaggaagaaaacagcaagaagggcaaccggaccccattccagtacctgagcagcagcgac agcaagatcagctacgaaaccttcaagaagcacatcctgaatctggccaagggcaagggcagaatcag caagaccaagaaagagtatctgctggaagaacgggacatcaacaggttctccgtgcagaaagacttca tcaaccggaacctggtggataccagatacgccaccagaggcctgatgaacctgctgcggagctacttc agagtgaacaacctggacgtgaaagtgaagtccatcaatggcggcttcaccagctttctgcggcggaa gtggaagtttaagaaagagcggaacaaggggtacaagcaccacgccgaggacgccctgatcattgcca acgccgatttcatcttcaaagagtggaagaaactggacaaggccaaaaaagtgatggaaaaccagatg ttcgaggaaaggcaggccgagagcatgcccgagatcgaaaccgagcaggagtacaaagagatcttcat caccccccaccagatcaagcacattaaggacttcaaggactacaagtacagccaccgggtggacaaga agcctaatagagagctgattaacgacaccctgtactccacccggaaggacgacaagggcaacaccctg atcgtgaacaatctgaacggcctgtacgacaaggacaatgacaagctgaaaaagctgatcaacaagag ccccgaaaagctgctgatgtaccaccacgacccccagacctaccagaaactgaagctgattatggaac agtacggcgacgagaagaatcccctgtacaagtactacgaggaaaccgggaactacctgaccaagtac tccaaaaaggacaacggccccgtgatcaagaagattaagtattacggcaacaaactgaacgcccatct ggacatcaccgacgactaccccaacagcagaaacaaggtcgtgaagctgtccctgaagccctacagat tcgacgtgtacctggacaatggcgtgtacaagttcgtgaccgtgaagaatctggatgtgatcaaaaaa gaaaactactacgaagtgaatagcaagtgctatgaggaagctaagaagctgaagaagatcagcaacca ggccgagtttatcgcctccttctacaacaacgatctgatcaagatcaacggcgagctgtatagagtga tcggcgtgaacaacgacctgctgaaccggatcgaagtgaacatgatcgacatcacctaccgcgagtac ctggaaaacatgaacgacaagaggccccccaggatcattaagacaatcgcctccaagacccagagcat taagaagtacagcacagacattctgggcaacctgtatgaagtgaaatctaagaagcaccctcagatca tcaaaaagggcaaaaggccggcggccacgaaaaaggccggccaggcaaaaaagaaaaag SEQ ID NO: 37 codon optimized nucleic acid sequence encoding S. aureus Cas9 accggtgcca ccatgtaccc atacgatgtt ccagattacg cttcgccgaa gaaaaagcgc aaggtcgaag cgtccatgaa aaggaactac attctggggc tggacatcgg gattacaagc gtggggtatg ggattattga ctatgaaaca agggacgtga tcgacgcagg cgtcagactg ttcaaggagg ccaacgtgga aaacaatgag ggacggagaa gcaagagggg agccaggcgc ctgaaacgac ggagaaggca cagaatccag agggtgaaga aactgctgtt cgattacaac ctgctgaccg accattctga gctgagtgga attaatcctt atgaagccag ggtgaaaggc ctgagtcaga agctgtcaga ggaagagttt tccgcagctc tgctgcacct ggctaagcgc cgaggagtgc ataacgtcaa tgaggtggaa gaggacaccg gcaacgagct gtctacaaag gaacagatct cacgcaatag caaagctctg gaagagaagt atgtcgcaga gctgcagctg gaacggctga agaaagatgg cgaggtgaga gggtcaatta ataggttcaa gacaagcgac tacgtcaaag aagccaagca gctgctgaaa gtgcagaagg cttaccacca gctggatcag agcttcatcg atacttatat cgacctgctg gagactcgga gaacctacta tgagggacca ggagaaggga gccccttcgg atggaaagac atcaaggaat ggtacgagat gctgatggga cattgcacct attttccaga agagctgaga agcgtcaagt acgcttataa cgcagatct tacaacgccc tgaatgacct gaacaacctg gtcatcacca gggatgaaaa cgagaaactg gaatactatg agaagttcca gatcatcgaa aacgtgttta agcagaagaa aaagcctaca ctgaaacaga ttgctaagga gatcctggtc aacgaagagg acatcaaggg ctaccgggtg acaagcactg gaaaaccaga gttcaccaat ctgaaagtgt atcacgatat taaggacatc acagcacgga aagaaatcat tgagaacgcc gaactgctgg atcagattgc taagatcctg actatctacc agagctccga ggacatccag gaagagctga ctaacctgaa cagcgagctg acccaggaag agatcgaaca gattagtaat ctgaaggggt acaccggaac acacaacctg tccctgaaag ctatcaatct gattctggat gagctgtggc atacaaacga caatcagatt gcaatcttta accggctgaa gctggtccca aaaaaggtgg acctgagtca gcagaaagag atcccaacca cactggtgga cgatttcatt ctgtcacccg tggtcaagcg gagcttcatc cagagcatca aagtgatcaa cgccatcatc aagaagtacg gcctgcccaa tgatatcatt atcgagctgg ctagggagaa gaacagcaag gacgcacaga agatgatcaa tgagatgcag aaacgaaacc ggcagaccaa tgaacgcatt gaagagatta tccgaactac cgggaaagag aacgcaaagt acctgattga aaaaatcaag ctgcacgata tgcaggaggg aaagtgtctg tattctctgg aggccatccc cctggaggac ctgctgaaca atccattcaa ctacgaggtc gatcatatta tccccagaag cgtgtccttc gacaattcct ttaacaacaa ggtgctggtc aagcaggaag agaactctaa aaagggcaat aggactcctt tccagtacct gtctagttca gattccaaga tctcttacga aacctttaaa aagcacattc tgaatctggc caaaggaaag ggccgcatca gcaagaccaa aaaggagtac ctgctggaag agcgggacat caacagattc tccgtccaga aggattttat taaccggaat ctggtggaca caagatacgc tactcgcggc ctgatgaatc tgctgcgatc ctatttccgg gtgaacaatc tggatgtgaa agtcaagtcc atcaacggcg ggttcacatc ttttctgagg cgcaaatgga agtttaaaaa ggagcgcaac aaagggtaca agcaccatgc cgaagatgct ctgattatcg caaatgccga cttcatcttt aaggagtgga aaaagctgga caaagccaag aaagtgatgg agaaccagat gttcgaagag aagcaggccg aatctatgcc cgaaatcgag acagaacagg agtacaagga gattttcatc actcctcacc agatcaagca tatcaaggat ttcaaggact acaagtactc tcaccgggtg gataaaaagc ccaacagaga gctgatcaat gacaccctgt atagtacaag aaaagacgat aaggggaata ccctgattgt gaacaatctg aacggactgt acgacaaaga taatgacaag ctgaaaaagc tgatcaacaa aagtcccgag aagctgctga tgtaccacca tgatcctcag acatatcaga aactgaagct gattatggag cagtacggcg acgagaagaa cccactgtat aagtactatg aagagactgg gaactacctg accaagtata gcaaaaagga taatggcccc gtgatcaaga agatcaagta ctatgggaac aagctgaatg cccatctgga catcacagac gattacccta acagtcgcaa caaggtggtc aagctgtcac tgaagccata cagattcgat gtctatctgg acaacggcgt gtataaattt gtgactgtca agaatctgga tgtcatcaaa aaggagaact actatgaagt gaatagcaag tgctacgaag aggctaaaaa gctgaaaaag attagcaacc aggcagagtt catcgcctcc ttttacaaca acgacctgat taagatcaat ggcgaactgt atagggtcat cggggtgaac aatgatctgc tgaaccgcat tgaagtgaat atgattgaca tcacttaccg agagtatctg gaaaacatga atgataagcg cccccctcga attatcaaaa caattgcctc taagactcag agtatcaaaa agtactcaac cgacattctg ggaaacctgt atgaggtgaa gagcaaaaag caccctcaga ttatcaaaaa gggctaagaa ttc SEQ ID NO: 38 codon optimized nucleic acid sequences encoding S. aureus Cas9 atggccccaaagaagaagcggaaggtcggtatccacggagtcccagcagccaagcggaactacatcct gggcctggacatcggcatcaccagcgtgggctacggcatcatcgactacgagacacgggacgtgatcg atgccggcgtgcggctgttcaaagaggccaacgtggaaaacaacgagggcaggcggagcaagagaggc gccagaaggctgaagcggcggaggcggcatagaatccagagagtgaagaagctgctgttcgactacaa cctgctgaccgaccacagcgagctgagcggcatcaacccctacgaggccagagtgaagggcctgagcc agaagctgagcgaggaagagttctctgccgccctgctgcacctggccaagagaagaggcgtgcacaac gtgaacgaggtggaagaggacaccggcaacgagctgtccaccaaagagcagatcagccggaacagcaa ggccctggaagagaaatacgtggccgaactgcagctggaacggctgaagaaagacggcgaagtgcggg gcagcatcaacagattcaagaccagcgactacgtgaaagaagccaaacagctgctgaaggtgcagaag gcctaccaccagctggaccagagcttcatcgacacctacatcgacctgctggaaacccggcggaccta ctatgagggacctggcgagggcagccccttcggctggaaggacatcaaagaatggtacgagatgctga tgggccactgcacctacttccccgaggaactgcggagcgtgaagtacgcctacaacgccgacctgtac aacgccctgaacgacctgaacaatctcgtgatcaccagggacgagaacgagaagctggaatattacga gaagttccagatcatcgagaacgtgttcaagcagaagaagaagcccaccctgaagcagatcgccaaag aaatcctcgtgaacgaagaggatattaagggctacagagtgaccagcaccggcaagcccgagttcacc aacctgaaggtgtaccacgacatcaaggacattaccgcccggaaagagattattgagaacgccgagct gctggatcagattgccaagatcctgaccatctaccagagcagcgaggacatccaggaagaactgacca atctgaactccgagctgacccaggaagagatcgagcagatctctaatctgaagggctataccggcacc cacaacctgagcctgaaggccatcaacctgatcctggacgagctgtggcacaccaacgacaaccagat cgctatcttcaaccggctgaagctggtgcccaagaaggtggacctgtcccagcagaaagagatcccca ccaccctggtggacgacttcatcctgagccccgtcgtgaagagaagcttcatccagagcatcaaagtg atcaacgccatcatcaagaagtacggcctgcccaacgacatcattatcgagctggcccgcgagaagaa ctccaaggacgcccagaaaatgatcaacgagatgcagaagcggaaccggcagaccaacgagcggatcg aggaaatcatccggaccaccggcaaagagaacgccaagtacctgatcgagaagatcaagctgcacgac atgcaggaaggcaagtgcctgtacagcctggaagccatccctctggaagatctgctgaacaacccctt caactatgaggtggaccacatcatccccagaagcgtgtccttcgacaacagcttcaacaacaaggtgc tcgtgaagcaggaagaaaacagcaagaagggcaaccggaccccattccagtacctgagcagcagcgac agcaagatcagctacgaaaccttcaagaagcacatcctgaatctggccaagggcaagggcagaatcag caagaccaagaaagagtatctgctggaagaacgggacatcaacaggttctccgtgcagaaagacttca tcaaccggaacctggtggataccagatacgccaccagaggcctgatgaacctgctgcggagctacttc agagtgaacaacctggacgtgaaagtgaagtccatcaatggcggcttcaccagctttctgcggcggaa gtggaagtttaagaaagagcggaacaaggggtacaagcaccacgccgaggacgccctgatcattgcca acgccgatttcatcttcaaagagtggaagaaactggacaaggccaaaaaagtgatggaaaaccagatg ttcgaggaaaagcaggccgagagcatgcccgagatcgaaaccgagcaggagtacaaagagatcttcat caccccccaccagatcaagcacattaaggacttcaaggactacaagtacagccaccgggtggacaaga agcctaatagagagctgattaacgacaccctgtactccacccggaaggacgacaagggcaacaccctg atcgtgaacaatctgaacggcctgtacgacaaggacaatgacaagctgaaaaagctgatcaacaagag ccccgaaaagctgctgatgtaccaccacgacccccagacctaccagaaactgaagctgattatggaac agtacggcgacgagaagaatcccctgtacaagtactacgaggaaaccgggaactacctgaccaagtac tccaaaaaggacaacggccccgtgatcaagaagattaagtattacggcaacaaactgaacgcccatct ggacatcaccgacgactaccccaacagcagaaacaaggtcgtgaagctgtccctgaagccctacagat tcgacgtgtacctggacaatggcgtgtacaagttcgtgaccgtgaagaatctggatgtgatcaaaaaa gaaaactactacgaagtgaatagcaagtgctatgaggaagctaagaagctgaagaagatcagcaacca ggccgagtttatcgcctccttctacaacaacgatctgatcaagatcaacggcgagctgtatagagtga tcggcgtgaacaacgacctgctgaaccggatcgaagtgaacatgatcgacatcacctaccgcgagtac ctggaaaacatgaacgacaagaggccccccaggatcattaagacaatcgcctccaagacccagagcat taagaagtacagcacagacattctgggcaacctgtatgaagtgaaatctaagaagcaccctcagatca tcaaaaagggcaaaaggccggcggccacgaaaaaggccggccaggcaaaaaagaaaaag SEQ ID NO: 39 codon optimized nucleic acid sequences encoding S. aureus Cas9 aagcggaactacatcctgggcctggacatcggcatcaccagcgtgggctacggcatcatcgactacga gacacgggacgtgatcgatgccggcgtgcggctgttcaaagaggccaacgtggaaaacaacgagggca ggcggagcaagagaggcgccagaaggctgaagcggcggaggcggcatagaatccagagagtgaagaag ctgctgttcgactacaacctgctgaccgaccacagcgagctgagcggcatcaacccctacgaggccag agtgaagggcctgagccagaagctgagcgaggaagagttctctgccgccctgctgcacctggccaaga gaagaggcgtgcacaacgtgaacgaggtggaagaggacaccggcaacgagctgtccaccaaagagcag atcagccggaacagcaaggccctggaagagaaatacgtggccgaactgcagctggaacggctgaagaa agacggcgaagtgcggggcagcatcaacagattcaagaccagcgactacgtgaaagaagccaaacagc tgctgaaggtgcagaaggcctaccaccagctggaccagagcttcatcgacacctacatcgacctgctg gaaacccggcggacctactatgagggacctggcgagggcagccccttcggctggaaggacatcaaaga atggtacgagatgctgatgggccactgcacctacttccccgaggaactgcggagcgtgaagtacgcct acaacgccgacctgtacaacgccctgaacgacctgaacaatctcgtgatcaccagggacgagaacgag aagctggaatattacgagaagttccagatcatcgagaacgtgttcaagcagaagaagaagcccaccct gaagcagatcgccaaagaaatcctcgtgaacgaagaggatattaagggctacagagtgaccagcaccg gcaagcccgagttcaccaacctgaaggtgtaccacgacatcaaggacattaccgcccggaaagagatt attgagaacgccgagctgctggatcagattgccaagatcctgaccatctaccagagcagcgaggacat ccaggaagaactgaccaatctgaactccgagctgacccaggaagagatcgagcagatctctaatctga agggctataccggcacccacaacctgagcctgaaggccatcaacctgatcctggacgagctgtggcac accaacgacaaccagatcgctatcttcaaccggctgaagctggtgcccaagaaggtggacctgtccca gcagaaagagatccccaccaccctggtggacgacttcatcctgagccccgtcgtgaagagaagcttca tccagagcatcaaagtgatcaacgccatcatcaagaagtacggcctgcccaacgacatcattatcgag ctggcccgcgagaagaactccaaggacgcccagaaaatgatcaacgagatgcagaagcggaaccggca gaccaacgagcggatcgaggaaatcatccggaccaccggcaaagagaacgccaagtacctgatcgaga agatcaagctgcacgacatgcaggaaggcaagtgcctgtacagcctggaagccatccctctggaagat ctgctgaacaaccccttcaactatgaggtggaccacatcatccccagaagcgtgtccttcgacaacag cttcaacaacaaggtgctcgtgaagcaggaagaaaacagcaagaagggcaaccggaccccattccagt acctgagcagcagcgacagcaagatcagctacgaaaccttcaagaagcacatcctgaatctggccaag ggcaagggcagaatcagcaagaccaagaaagagtatctgctggaagaacgggacatcaacaggttctc cgtgcagaaagacttcatcaaccggaacctggtggataccagatacgccaccagaggcctgatgaacc tgctgcggagctacttcagagtgaacaacctggacgtgaaagtgaagtccatcaatggcggcttcacc agctttctgcggcggaagtggaagtttaagaaagagcggaacaaggggtacaagcaccacgccgagga cgccctgatcattgccaacgccgatttcatcttcaaagagtggaagaaactggacaaggccaaaaaag tgatggaaaaccagatgttcgaggaaaagcaggccgagagcatgcccgagatcgaaaccgagcaggag tacaaagagatcttcatcaccccccaccagatcaagcacattaaggacttcaaggactacaagtacag ccaccgggtggacaagaagcctaatagagagctgattaacgacaccctgtactccacccggaaggacg acaagggcaacaccctgatcgtgaacaatctgaacggcctgtacgacaaggacaatgacaagctgaaa aagctgatcaacaagagccccgaaaagctgctgatgtaccaccacgacccccagacctaccagaaact gaagctgattatggaacagtacggcgacgagaagaatcccctgtacaagtactacgaggaaaccggga actacctgaccaagtactccaaaaaggacaacggccccgtgatcaagaagattaagtattacggcaac aaactgaacgcccatctggacatcaccgacgactaccccaacagcagaaacaaggtcgtgaagctgtc cctgaagccctacagattcgacgtgtacctggacaatggcgtgtacaagttcgtgaccgtgaagaatc tggatgtgatcaaaaaagaaaactactacgaagtgaatagcaagtgctatgaggaagctaagaagctg aagaagatcagcaaccaggccgagtttatcgcctccttctacaacaacgatctgatcaagatcaacgg cgagctgtatagagtgatcggcgtgaacaacgacctgctgaaccggatcgaagtgaacatgatcgaca tcacctaccgcgagtacctggaaaacatgaacgacaagaggccccccaggatcattaagacaatcgcc tccaagacccagagcattaagaagtacagcacagacattctgggcaacctgtatgaagtgaaatctaa gaagcaccctcagatcatcaaaaagggc SEQ ID NO: 40 Vector (pDO242) encoding codon optimized nucleic acid sequence encoding S. aureus Cas9 ctaaattgtaagcgttaatattttgttaaaattcgcgttaaatttttgttaaatcagctcatttttta accaataggccgaaatcggcaaaatcccttataaatcaaaagaatagaccgagatagggttgagtgtt gttccagtttggaacaagagtccactattaaagaacgtggactccaacgtcaaagggcgaaaaaccgt ctatcagggcgatggcccactacgtgaaccatcaccctaatcaagttttttggggtcgaggtgccgta aagcactaaatcggaaccctaaagggagcccccgatttagagcttgacggggaaagccggcgaacgtg gcgagaaaggaagggaagaaagcgaaaggagcgggcgctagggcgctggcaagtgtagcggtcacgct gcgcgtaaccaccacacccgccgcgcttaatgcgccgctacagggcgcgtcccattcgccattcaggc tgcgcaactgttgggaagggcgatcggtgcgggcctcttcgctattacgccagctggcgaaaggggga tgtgctgcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaaaacgacggc cagtgagcgcgcgtaatacgactcactatagggcgaattgggtacCtttaattctagtactatgcaTg cgttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccata tatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcc cattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgacgtcaatgg gtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcatatgccaagtacgccccc tattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttc ctacttggcagtacatctacgtattagtcatcgctattaccatggtgatgcggttttggcagtacatc aatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggag tttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaa tgggcggtaggcgtgtacggtgggaggtctatataagcagagctctctggctaactaccggtgccacc ATGAAAAGGAACTACATTCTGGGGCTGGACATCGGGATTACAAGCGTGGGGTATGGGATTATTGACTA TGAAACAAGGGACGTGATCGACGCAGGCGTCAGACTGTTCAAGGAGGCCAACGTGGAAAACAATGAGG GACGGAGAAGCAAGAGGGGAGCCAGGCGCCTGAAACGACGGAGAAGGCACAGAATCCAGAGGGTGAAG AAACTGCTGTTCGATTACAACCTGCTGACCGACCATTCTGAGCTGAGTGGAATTAATCCTTATGAAGC CAGGGTGAAAGGCCTGAGTCAGAAGCTGTCAGAGGAAGAGTTTTCCGCAGCTCTGCTGCACCTGGCTA AGCGCCGAGGAGTGCATAACGTCAATGAGGTGGAAGAGGACACCGGCAACGAGCTGTCTACAAAGGAA CAGATCTCACGCAATAGCAAAGCTCTGGAAGAGAAGTATGTCGCAGAGCTGCAGCTGGAACGGCTGAA GAAAGATGGCGAGGTGAGAGGGTCAATTAATAGGTTCAAGACAAGCGACTACGTCAAAGAAGCCAAGC AGCTGCTGAAAGTGCAGAAGGCTTACCACCAGCTGGATCAGAGCTTCATCGATACTTATATCGACCTG CTGGAGACTCGGAGAACCTACTATGAGGGACCAGGAGAAGGGAGCCCCTTCGGATGGAAAGACATCAA GGAATGGTACGAGATGCTGATGGGACATTGCACCTATTTTCCAGAAGAGCTGAGAAGCGTCAAGTACG CTTATAACGCAGATCTGTACAACGCCCTGAATGACCTGAACAACCTGGTCATCACCAGGGATGAAAAC GAGAAACTGGAATACTATGAGAAGTTCCAGATCATCGAAAACGTGTTTAAGCAGAAGAAAAAGCCTAC ACTGAAACAGATTGCTAAGGAGATCCTGGTCAACGAAGAGGACATCAAGGGCTACCGGGTGACAAGCA CTGGAAAACCAGAGTTCACCAATCTGAAAGTGTATCACGATATTAAGGACATCACAGCACGGAAAGAA ATCATTGAGAACGCCGAACTGCTGGATCAGATTGCTAAGATCCTGACTATCTACCAGAGCTCCGAGGA CATCCAGGAAGAGCTGACTAACCTGAACAGCGAGCTGACCCAGGAAGAGATCGAACAGATTAGTAATC TGAAGGGGTACACCGGAACACACAACCTGTCCCTGAAAGCTATCAATCTGATTCTGGATGAGCTGTGG CATACAAACGACAATCAGATTGCAATCTTTAACCGGCTGAAGCTGGTCCCAAAAAAGGTGGACCTGAG TCAGCAGAAAGAGATCCCAACCACACTGGTGGACGATTTCATTCTGTCACCCGTGGTCAAGCGGAGCT TCATCCAGAGCATCAAAGTGATCAACGCCATCATCAAGAAGTACGGCCTGCCCAATGATATCATTATC GAGCTGGCTAGGGAGAAGAACAGCAAGGACGCACAGAAGATGATCAATGAGATGCAGAAACGAAACCG GCAGACCAATGAACGCATTGAAGAGATTATCCGAACTACCGGGAAAGAGAACGCAAAGTACCTGATTG AAAAAATCAAGCTGCACGATATGCAGGAGGGAAAGTGTCTGTATTCTCTGGAGGCCATCCCCCTGGAG GACCTGCTGAACAATCCATTCAACTACGAGGTCGATCATATTATCCCCAGAAGCGTGTCCTTCGACAA TTCCTTTAACAACAAGGTGCTGGTCAAGCAGGAAGAGAACTCTAAAAAGGGCAATAGGACTCCTTTCC AGTACCTGTCTAGTTCAGATTCCAAGATCTCTTACGAAACCTTTAAAAAGCACATTCTGAATCTGGCC AAAGGAAAGGGCCGCATCAGCAAGACCAAAAAGGAGTACCTGCTGGAAGAGCGGGACATCAACAGATT CTCCGTCCAGAAGGATTTTATTAACCGGAATCTGGTGGACACAAGATACGCTACTCGCGGCCTGATGA ATCTGCTGCGATCCTATTTCCGGGTGAACAATCTGGATGTGAAAGTCAAGTCCATCAACGGCGGGTTC ACATCTTTTCTGAGGCGCAAATGGAAGTTTAAAAAGGAGCGCAACAAAGGGTACAAGCACCATGCCGA AGATGCTCTGATTATCGCAAATGCCGACTTCATCTTTAAGGAGTGGAAAAAGCTGGACAAAGCCAAGA AAGTGATGGAGAACCAGATGTTCGAAGAGAAGCAGGCCGAATCTATGCCCGAAATCGAGACAGAACAG GAGTACAAGGAGATTTTCATCACTCCTCACCAGATCAAGCATATCAAGGATTTCAAGGACTACAAGTA CTCTCACCGGGTGGATAAAAAGCCCAACAGAGAGCTGATCAATGACACCCTGTATAGTACAAGAAAAG ACGATAAGGGGAATACCCTGATTGTGAACAATCTGAACGGACTGTACGACAAAGATAATGACAAGCTG AAAAAGCTGATCAACAAAAGTCCCGAGAAGCTGCTGATGTACCACCATGATCCTCAGACATATCAGAA ACTGAAGCTGATTATGGAGCAGTACGGCGACGAGAAGAACCCACTGTATAAGTACTATGAAGAGACTG GGAACTACCTGACCAAGTATAGCAAAAAGGATAATGGCCCCGTGATCAAGAAGATCAAGTACTATGGG AACAAGCTGAATGCCCATCTGGACATCACAGACGATTACCCTAACAGTCGCAACAAGGTGGTCAAGCT GTCACTGAAGCCATACAGATTCGATGTCTATCTGGACAACGGCGTGTATAAATTTGTGACTGTCAAGA ATCTGGATGTCATCAAAAAGGAGAACTACTATGAAGTGAATAGCAAGTGCTACGAAGAGGCTAAAAAG CTGAAAAAGATTAGCAACCAGGCAGAGTTCATCGCCTCCTTTTACAACAACGACCTGATTAAGATCAA TGGCGAACTGTATAGGGTCATCGGGGTGAACAATGATCTGCTGAACCGCATTGAAGTGAATATGATTG ACATCACTTACCGAGAGTATCTGGAAAACATGAATGATAAGCGCCCCCCTCGAATTATCAAAACAATT GCCTCTAAGACTCAGAGTATCAAAAAGTACTCAACCGACATTCTGGGAAACCTGTATGAGGTGAAGAG CAAAAAGCACCCTCAGATTATCAAAAAGGGCagcggaggcaagcgtcctgctgctactaagaaagctg gtcaagctaagaaaaagaaaggatcctacccatacgatgttccagattacgcttaagaattcctagag ctcgctgatcagcctcgactgtgccttctagttgccagccatctgttgtttgcccctcccccgtgcct tccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattg tctgagtaggtgtcattctattctggggggtggggtggggcaggacagcaagggggaggattgggaag agaatagcaggcatgctggggaggtagcggccgcCCgcggtggagctccagcttttgttccctttagt gagggttaattgcgcgcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctc acaattccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagcta actcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcatt aatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcact gactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggtt atccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaacc gtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcga cgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctc cctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaa gcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctg ggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtc caacccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagagcgaggt atgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtattt ggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaaca aaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctc aagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagggatt ttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatc aatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatct cagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatacgg gagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagattt atcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctcca tccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgtt gttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttc ccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctc cgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattct cttactgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgaga atagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccacatagca gaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctg ttgagatccagttcgatgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccag cgtttctgggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaat gttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatgagc ggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagt gccac SEQ ID NO: 41 Human p300 (with L553M mutation) protein MAENVVEPGPPSAKRPKLSSPALSASASDGTDFGSLFDLEHDLPDELINSTELGLTNGGDINQLQTSL GMVQDAASKHKQLSELLRSGSSPNLNMGVGGPGQVMASQAQQSSPGLGLINSMVKSPMTQAGLTSPNM GMGTSGPNQGPTQSTGMMNSPVNQPAMGMNTGMNAGMNPGMLAAGNGQGIMPNQVMNGSIGAGRGRQN MQYPNPGMGSAGNLLTEPLQQGSPQMGGQTGLRGPQPLKMGMMNNPNPYGSPYTQNPGQQIGASGLGL QIQTKTVLSNNLSPFAMDKKAVPGGGMPNMGQQPAPQVQQPGLVTPVAQGMGSGAHTADPEKRKLIQQ QLVLLLHAHKCQRREQANGEVRQCNLPHCRTMKNVLNHMTHCQSGKSCQVAHCASSRQIISHWKNCTR HDCPVCLPLKNAGDKRNQQPILTGAPVGLGNPSSLGVGQQSAPNLSTVSQIDPSSIERAYAALGLPYQ VNQMPTQPQVQAKNQQNQQPGQSPQGMRPMSNMSASPMGVNGGVGVQTPSLLSDSMLHSAINSQNPMM SENASVPSMGPMPTAAQPSTTGIRKQWHEDITQDLRNHLVHKLVQAIFPTPDPAALKDRRMENLVAYA RKVEGDMYESANNRAEYYHLLAEKIYKIQKELEEKRRTRLQKQNMLPNAAGMVPVSMNPGPNMGQPQP GMTSNGPLPDPSMIRGSVPNQMMPRITPQSGLNQFGQMSMAQPPIVPRQTPPLQHHGQLAQPGALNPP MGYGPRMQQPSNQGQFLPQTQFPSQGMNVTNIPLAPSSGQAPVSQAQMSSSSCPVNSPIMPPGSQGSH IHCPQLPQPALHQNSPSPVPSRTPTPHHTPPSIGAQQPPATTIPAPVPTPPAMPPGPQSQALHPPPRQ TPTPPTTQLPQQVQPSLPAAPSADQPQQQPRSQQSTAASVPTPTAPLLPPQPATPLSQPAVSIEGQVS NPPSTSSTEVNSQAIAEKQPSQEVKMEAKMEVDQPEPADTQPEDISESKVEDCKMESTETEERSTELK TEIKEEEDQPSTSATQSSPAPGQSKKKIFKPEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPD YFDIVKSPMDLSTIKRKLDTGQYQEPWQYVDDIWLMFNNAWLYNRKTSRVYKYCSKLSEVFEQEIDPV MQSLGYCCGRKLEFSPQTLCCYGKQLCTIPRDATYYSYQNRYHFCEKCFNEIQGESVSLGDDPSQPQT TINKEQFSKRKNDTLDPELFVECTECGRKMHQICVLHHEIIWPAGFVCDGCLKKSARTRKENKFSAKR LPSTRLGTFLENRVNDFLRRQNHPESGEVTVRVVHASDKTVEVKPGMKARFVDSGEMAESFPYRTKAL FAFEEIDGVDLCFFGMHVQEYGSDCPPPNQRRVYISYLDSVHFFRPKCLRTAVYHEILIGYLEYVKKL GYTTGHIWACPPSEGDDYIFHCHPPDQKIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDRLT SAKELPYFEGDFWPNVLEESIKELEQEEEERKREENTSNESTDVTKGDSKNAKKKNNKKTSKNKSSLS RGNKKKPGMPNVSNDLSQKLYATMEKHKEVFFVIRLIAGPAANSLPPIVDPDPLIPCDLMDGRDAFLT LARDKHLEFSSLRRAQWSTMCMLVELHTQSQDRFVYTCNECKHHVETRWHCTVCEDYDLCITCYNTKN HDHKMEKLGLGLDDESNNQQAAATQSPGDSRRLSIQRCIQSLVHACQCRNANCSLPSCQKMKRVVQHT KGCKRKTNGGCPICKQLIALCCYHAKHCQENKCPVPFCLNIKQKLRQQQLQHRLQQAQMLRRRMASMQ RTGVVGQQQGLPSPTPATPTTPTGQQPTTPQTPQPTSQPQPTPPNSMPPYLPRTQAAGPVSQGKAAGQ VTPPTPPQTAQPPLPGPPPAAVEMAMQIQRAAETQRQMAHVQIFQRPIQHQMPPMTPMAPMGMNPPPM TRGPSGHLEPGMGPTGMQQQPPWSQGGLPQPQQLQSGMPRPAMMSVAQHGQPLNMAPQPGLGQVGISP LKPGTVSQQALQNLLRTLRSPSSPLQQQQVLSILHANPQLLAAFIKQRAAKYANSNPQPIPGQPGMPQ GQPGLQPPTMPGQQGVHSNPAMQNMNPMQAGVQRAGLPQQQPQQQLQPPMGGMSPQAQQMNMNHNTMP SQFRDILRRQQMMQQQQQQGAGPGIGPGMANHNQFQQPQGVGYPPQQQQRMQHHMQQMQQGNMGQIGQ LPQALGAEAGASLQAYQQRLLQQQMGSPVQPNPMSPQQHMLPNQAQSPHLQGQQIPNSLSNQVRSPQP VPSPRPQSQPPHSSPSPRMQPQPSPHHVSPQTSSPHPGLVAAQANPMEQGHFASPDQNSMLSQLASNP GMANLHGASATDLGLSTDNSDLNSNLSQSTLDIH SEQ ID NO: 42 Human p300 Core Effector protein (aa 1048-1664 of SEQ ID NO: 41) IFKPEELRQALMPTLEALYRQDPESLPFRQPVDPQLLGIPDYFDIVKSPMDLSTIKRKLDTGQYQEPW QYVDDIWLMFNNAWLYNRKTSRVYKYCSKLSEVFEQEIDPVMQSLGYCCGRKLEFSPQTLCCYGKQLC TIPRDATYYSYQNRYHFCEKCFNEIQGESVSLGDDPSQPQTTINKEQFSKRKNDTLDPELFVECTECG RKMHQICVLHHEIIWPAGFVCDGCLKKSARTRKENKFSAKRLPSTRLGTFLENRVNDFLRRQNHPESG EVTVRVVHASDKTVEVKPGMKARFVDSGEMAESFPYRTKALFAFEEIDGVDLCFFGMHVQEYGSDCPP PNQRRVYISYLDSVHFFRPKCLRTAVYHEILIGYLEYVKKLGYTTGHIWACPPSEGDDYIFHCHPPDQ KIPKPKRLQEWYKKMLDKAVSERIVHDYKDIFKQATEDRLTSAKELPYFEGDFWPNVLEESIKELEQE EEERKREENTSNESTDVTKGDSKNAKKKNNKKTSKNKSSLSRGNKKKPGMPNVSNDLSQKLYATMEKH KEVFFVIRLIAGPAANSLPPIVDPDPLIPCDLMDGRDAFLTLARDKHLEFSSLRRAQWSTMCMLVELH TQSQD SEQ ID NO: 43 VP64-dCas9-VP64 protein RADALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMVNPKKKRKVGRGMDKKY SIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYT RRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKK LVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAK AILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDN LLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPE KYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQ IHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEV VDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIV DLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILE DIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKS DGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGR HKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRD MYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNA KLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVIT LKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKS EQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVN IVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVK ELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALP SKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKH RDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQL GGDSRADPKKKRKVASRADALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML I SEQ ID NO: 44 VP64-dCas9-VP64 DNA cgggctgacgcattggacgattttgatctggatatgctgggaagtgacgccctcgatgattttgacct tgacatgcttggttcggatgcccttgatgactttgacctcgacatgctcggcagtgacgcccttgatg atttcgacctggacatggttaaccccaagaagaagaggaaggtgggccgcggaatggacaagaagtac tccattgggctcgccatcggcacaaacagcgtcggctgggccgtcattacggacgagtacaaggtgcc gagcaaaaaattcaaagttctgggcaataccgatcgccacagcataaagaagaacctcattggcgccc tcctgttcgactccggggaaaccgccgaagccacgcggctcaaaagaacagcacggcgcagatatacc cgcagaaagaatcggatctgctacctgcaggagatctttagtaatgagatggctaaggtggatgactc tttcttccataggctggaggagtcctttttggtggaggaggataaaaagcacgagcgccacccaatct ttggcaatatcgtggacgaggtggcgtaccatgaaaagtacccaaccatatatcatctgaggaagaag cttgtagacagtactgataaggctgacttgcggttgatctatctcgcgctggcgcatatgatcaaatt tcggggacacttcctcatcgagggggacctgaacccagacaacagcgatgtcgacaaactctttatcc aactggttcagacttacaatcagcttttcgaagagaacccgatcaacgcatccggagttgacgccaaa gcaatcctgagcgctaggctgtccaaatcccggcggctcgaaaacctcatcgcacagctccctgggga gaagaagaacggcctgtttggtaatcttatcgccctgtcactcgggctgacccccaactttaaatcta acttcgacctggccgaagatgccaagcttcaactgagcaaagacacctacgatgatgatctcgacaat ctgctggcccagatcggcgaccagtacgcagacctttttttggcggcaaagaacctgtcagacgccat tctgctgagtgatattctgcgagtgaacacggagatcaccaaagctccgctgagcgctagtatgatca agcgctatgatgagcaccaccaagacttgactttgctgaaggcccttgtcagacagcaactgcctgag aagtacaaggaaattttcttcgatcagtctaaaaatggctacgccggatacattgacggcggagcaag ccaggaggaattttacaaatttattaagcccatcttggaaaaaatggacggcaccgaggagctgctgg taaagcttaacagagaagatctgttgcgcaaacagcgcactttcgacaatggaagcatcccccaccag attcacctgggcgaactgcacgctatcctcaggcggcaagaggatttctacccctttttgaaagataa cagggaaaagattgagaaaatcctcacatttcggataccctactatgtaggccccctcgcccggggaa attccagattcgcgtggatgactcgcaaatcagaagagaccatcactccctggaacttcgaggaagtc gtggataagggggcctctgcccagtccttcatcgaaaggatgactaactttgataaaaatctgcctaa cgaaaaggtgcttcctaaacactctctgctgtacgagtacttcacagtttataacgagctcaccaagg tcaaatacgtcacagaagggatgagaaagccagcattcctgtctggagagcagaagaaagctatcgtg gacctcctcttcaagacgaaccggaaagttaccgtgaaacagctcaaagaagactatttcaaaaagat tgaatgtttcgactctgttgaaatcagcggagtggaggatcgcttcaacgcatccctgggaacgtatc acgatctcctgaaaatcattaaagacaaggacttcctggacaatgaggagaacgaggacattcttgag gacattgtcctcacccttacgttgtttgaagatagggagatgattgaagaacgcttgaaaacttacgc tcatctcttcgacgacaaagtcatgaaacagctcaagaggcgccgatatacaggatgggggcggctgt caagaaaactgatcaatgggatccgagacaagcagagtggaaagacaatcctggattttcttaagtcc gatggatttgccaaccggaacttcatgcagttgatccatgatgactctctcacctttaaggaggacat ccagaaagcacaagtttctggccagggggacagtcttcacgagcacatcgctaatcttgcaggtagcc cagctatcaaaaagggaatactgcagaccgttaaggtcgtggatgaactcgtcaaagtaatgggaagg cataagcccgagaatatcgttatcgagatggcccgagagaaccaaactacccagaagggacagaagaa cagtagggaaaggatgaagaggattgaagagggtataaaagaactggggtcccaaatccttaaggaac acccagttgaaaacacccagcttcagaatgagaagctctacctgtactacctgcagaacggcagggac atgtacgtggatcaggaactggacatcaatcggctctccgactacgacgtggatgccatcgtgcccca gtcttttctcaaagatgattctattgataataaagtgttgacaagatccgataaaaatagagggaaga gtgataacgtcccctcagaagaagttgtcaagaaaatgaaaaattattggcggcagctgctgaacgcc aaactgatcacacaacggaagttcgataatctgactaaggctgaacgaggtggcctgtctgagttgga taaagccggcttcatcaaaaggcagcttgttgagacacgccagatcaccaagcacgtggcccaaattc tcgattcacgcatgaacaccaagtacgatgaaaatgacaaactgattcgagaggtgaaagttattact ctgaagtctaagctggtctcagatttcagaaaggactttcagttttataaggtgagagagatcaacaa ttaccaccatgcgcatgatgcctacctgaatgcagtggtaggcactgcacttatcaaaaaatatccca agcttgaatctgaatttgtttacggagactataaagtgtacgatgttaggaaaatgatcgcaaagtct gagcaggaaataggcaaggccaccgctaagtacttcttttacagcaatattatgaattttttcaagac cgagattacactggccaatggagagattcggaagcgaccacttatcgaaacaaacggagaaacaggag aaatcgtgtgggacaagggtagggatttcgcgacagtccggaaggtcctgtccatgccgcaggtgaac atcgttaaaaagaccgaagtacagaccggaggcttctccaaggaaagtatcctcccgaaaaggaacag cgacaagctgatcgcacgcaaaaaagattgggaccccaagaaatacggcggattcgattctcctacag tcgcttacagtgtactggttgtggccaaagtggagaaagggaagtctaaaaaactcaaaagcgtcaag gaactgctgggcatcacaatcatggagcgatcaagcttcgaaaaaaaccccatcgactttctcgaggc gaaaggatataaagaggtcaaaaaagacctcatcattaagcttcccaagtactctctctttgagcttg aaaacggccggaaacgaatgctcgctagtgcgggcgagctgcagaaaggtaacgagctggcactgccc tctaaatacgttaatttcttgtatctggccagccactatgaaaagctcaaagggtctcccgaagataa tgagcagaagcagctgttcgtggaacaacacaaacactaccttgatgagatcatcgagcaaataagcg aattctccaaaagagtgatcctcgccgacgctaacctcgataaggtgctttctgcttacaataagcac agggataagcccatcagggagcaggcagaaaacattatccacttgtttactctgaccaacttgggcgc gcctgcagccttcaagtacttcgacaccaccatagacagaaagcggtacacctctacaaaggaggtcc tggacgccacactgattcatcagtcaattacggggctctatgaaacaagaatcgacctctctcagctc ggtggagacagcagggctgaccccaagaagaagaggaaggtggctagccgcgccgacgcgctggacga tttcgatctcgacatgctgggttctgatgccctcgatgactttgacctggatatgttgggaagcgacg cattggatgactttgatctggacatgctcggctccgatgctctggacgatttcgatctcgatatgtta atc SEQ ID NO: 45 Polypeptide sequence of KRAB protein RTLVTFKDVFVDFTREEWKLLDTAQQILYRNVMLENYKNLVSLGYQLTKPDVILRLEKGEEP WLV SEQ ID NO: 46 Polynucleotide sequence for KRAB cggacactggtgaccttcaaggatgtgtttgtggacttcaccagggaggagtggaagctgct ggacactgctcagcagatcctgtacagaaatgtgatgctggagaactataagaacctggttt ccttgggttatcagcttactaagccagatgtgatcctccggttggagaagggagaagagccc tggctggtg SEQ ID NO: 47 Polypeptide sequence of Streptococcus pyogenes dCas9-KRAB protein MDYKDHDGDYKDHDIDYKDDDDKMAPKKKRKVGRGMDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFK VLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRL EESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFL IEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGL FGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDI LRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFY KFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIE KILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLP KHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDS VEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDD KVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQV SGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERM KRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKD DSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFI KRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAH DAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLA NGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIA RKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFLEAKGYKE VKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQL FVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFK YFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSRADPKKKRKVASDAKSLTAWSRTL VTFKDVFVDFTREEWKLLDTAQQILYRNVMLENYKNLVSLGYQLTKPDVILRLEKGEEPWLVEREIHQ ETHPDSETAFEIKSSVPKKKRKV SEQ ID NO: 48 Polynucleotide sequence encoding Streptococcus pyogenes dCas9-KRAB atggactacaaagaccatgacggtgattataaagatcatgacatcgattacaaggatgacgatgacaa gatggcccccaagaagaagaggaaggtgggccgcggaatggacaagaagtactccattgggctcgcca tcggcacaaacagcgtcggctgggccgtcattacggacgagtacaaggtgccgagcaaaaaattcaaa gttctgggcaataccgatcgccacagcataaagaagaacctcattggcgccctcctgttcgactccgg ggaaaccgccgaagccacgcggctcaaaagaacagcacggcgcagatatacccgcagaaagaatcgga tctgctacctgcaggagatctttagtaatgagatggctaaggtggatgactctttcttccataggctg gaggagtcctttttggtggaggaggataaaaagcacgagcgccacccaatctttggcaatatcgtgga cgaggtggcgtaccatgaaaagtacccaaccatatatcatctgaggaagaagcttgtagacagtactg ataaggctgacttgcggttgatctatctcgcgctggcgcatatgatcaaatttcggggacacttcctc atcgagggggacctgaacccagacaacagcgatgtcgacaaactctttatccaactggttcagactta caatcagcttttcgaagagaacccgatcaacgcatccggagttgacgccaaagcaatcctgagcgcta ggctgtccaaatcccggcggctcgaaaacctcatcgcacagctccctggggagaagaagaacggcctg tttggtaatcttatcgccctgtcactcgggctgacccccaactttaaatctaacttcgacctggccga agatgccaagcttcaactgagcaaagacacctacgatgatgatctcgacaatctgctggcccagatcg gcgaccagtacgcagacctttttttggcggcaaagaacctgtcagacgccattctgctgagtgatatt ctgcgagtgaacacggagatcaccaaagctccgctgagcgctagtatgatcaagcgctatgatgagca ccaccaagacttgactttgctgaaggcccttgtcagacagcaactgcctgagaagtacaaggaaattt tcttcgatcagtctaaaaatggctacgccggatacattgacggcggagcaagccaggaggaattttac aaatttattaagcccatcttggaaaaaatggacggcaccgaggagctgctggtaaagcttaacagaga agatctgttgcgcaaacagcgcactttcgacaatggaagcatcccccaccagattcacctgggcgaac tgcacgctatcctcaggcggcaagaggatttctacccctttttgaaagataacagggaaaagattgag aaaatcctcacatttcggataccctactatgtaggccccctcgcccggggaaattccagattcgcgtg gatgactcgcaaatcagaagagaccatcactccctggaacttcgaggaagtcgtggataagggggcct ctgcccagtccttcatcgaaaggatgactaactttgataaaaatctgcctaacgaaaaggtgcttcct aaacactctctgctgtacgagtacttcacagtttataacgagctcaccaaggtcaaatacgtcacaga agggatgagaaagccagcattcctgtctggagagcagaagaaagctatcgtggacctcctcttcaaga cgaaccggaaagttaccgtgaaacagctcaaagaagactatttcaaaaagattgaatgtttcgactct gttgaaatcagcggagtggaggatcgcttcaacgcatccctgggaacgtatcacgatctcctgaaaat cattaaagacaaggacttcctggacaatgaggagaacgaggacattcttgaggacattgtcctcaccc ttacgttgtttgaagatagggagatgattgaagaacgcttgaaaacttacgctcatctcttcgacgac aaagtcatgaaacagctcaagaggcgccgatatacaggatgggggcggctgtcaagaaaactgatcaa tgggatccgagacaagcagagtggaaagacaatcctggattttcttaagtccgatggatttgccaacc ggaacttcatgcagttgatccatgatgactctctcacctttaaggaggacatccagaaagcacaagtt tctggccagggggacagtcttcacgagcacatcgctaatcttgcaggtagcccagctatcaaaaaggg aatactgcagaccgttaaggtcgtggatgaactcgtcaaagtaatgggaaggcataagcccgagaata tcgttatcgagatggcccgagagaaccaaactacccagaagggacagaagaacagtagggaaaggatg aagaggattgaagagggtataaaagaactggggtcccaaatccttaaggaacacccagttgaaaacac ccagcttcagaatgagaagctctacctgtactacctgcagaacggcagggacatgtacgtggatcagg aactggacatcaatcggctctccgactacgacgtggatgccatcgtgccccagtcttttctcaaagat gattctattgataataaagtgttgacaagatccgataaaaatagagggaagagtgataacgtcccctc agaagaagttgtcaagaaaatgaaaaattattggcggcagctgctgaacgccaaactgatcacacaac ggaagttcgataatctgactaaggctgaacgaggtggcctgtctgagttggataaagccggcttcatc aaaaggcagcttgttgagacacgccagatcaccaagcacgtggcccaaattctcgattcacgcatgaa caccaagtacgatgaaaatgacaaactgattcgagaggtgaaagttattactctgaagtctaagctgg tctcagatttcagaaaggactttcagttttataaggtgagagagatcaacaattaccaccatgcgcat gatgcctacctgaatgcagtggtaggcactgcacttatcaaaaaatatcccaagcttgaatctgaatt tgtttacggagactataaagtgtacgatgttaggaaaatgatcgcaaagtctgagcaggaaataggca aggccaccgctaagtacttcttttacagcaatattatgaattttttcaagaccgagattacactggcc aatggagagattcggaagcgaccacttatcgaaacaaacggagaaacaggagaaatcgtgtgggacaa gggtagggatttcgcgacagtccggaaggtcctgtccatgccgcaggtgaacatcgttaaaaagaccg aagtacagaccggaggcttctccaaggaaagtatcctcccgaaaaggaacagcgacaagctgatcgca cgcaaaaaagattgggaccccaagaaatacggcggattcgattctcctacagtcgcttacagtgtact ggttgtggccaaagtggagaaagggaagtctaaaaaactcaaaagcgtcaaggaactgctgggcatca caatcatggagcgatcaagcttcgaaaaaaaccccatcgactttctcgaggcgaaaggatataaagag gtcaaaaaagacctcatcattaagcttcccaagtactctctctttgagcttgaaaacggccggaaacg aatgctcgctagtgcgggcgagctgcagaaaggtaacgagctggcactgccctctaaatacgttaatt tcttgtatctggccagccactatgaaaagctcaaagggtctcccgaagataatgagcagaagcagctg ttcgtggaacaacacaaacactaccttgatgagatcatcgagcaaataagcgaattctccaaaagagt gatcctcgccgacgctaacctcgataaggtgctttctgcttacaataagcacagggataagcccatca gggagcaggcagaaaacattatccacttgtttactctgaccaacttgggcgcgcctgcagccttcaag tacttcgacaccaccatagacagaaagcggtacacctctacaaaggaggtcctggacgccacactgat tcatcagtcaattacggggctctatgaaacaagaatcgacctctctcagctcggtggagacagcaggg ctgaccccaagaagaagaggaaggtggctagcgatgctaagtcactgactgcctggtcccggacactg gtgaccttcaaggatgtgtttgtggacttcaccagggaggagtggaagctgctggacactgctcagca gatcctgtacagaaatgtgatgctggagaactataagaacctggtttccttgggttatcagcttacta agccagatgtgatcctccggttggagaagggagaagagccctggctggtggagagagaaattcaccaa gagacccatcctgattcagagactgcatttgaaatcaaatcatcagttccgaaaaagaaacgcaaagt ttga SEQ ID NO: 49 Polypeptide sequence of Staphylococcus aureus dCas9-KRAB protein MAPKKKRKVGIHGVPAAKRNYILGLAIGITSVGYGIIDYETRDVIDAGVRLFKEANVENNEGRRSKRG ARRLKRRRRHRIQRVKKLLFDYNLLTDHSELSGINPYEARVKGLSQKLSEEEFSAALLHLAKRRGVHN VNEVEEDTGNELSTKEQISRNSKALEEKYVAELQLERLKKDGEVRGSINRFKTSDYVKEAKQLLKVQK AYHQLDQSFIDTYIDLLETRRTYYEGPGEGSPFGWKDIKEWYEMLMGHCTYFPEELRSVKYAYNADLY NALNDLNNLVITRDENEKLEYYEKFQIIENVFKQKKKPTLKQIAKEILVNEEDIKGYRVTSTGKPEFT NLKVYHDIKDITARKEIIENAELLDQIAKILTIYQSSEDIQEELTNLNSELTQEEIEQISNLKGYTGT HNLSLKAINLILDELWHTNDNQIAIFNRLKLVPKKVDLSQQKEIPTTLVDDFILSPVVKRSFIQSIKV INAIIKKYGLPNDIIIELAREKNSKDAQKMINEMQKRNRQTNERIEEIIRTTGKENAKYLIEKIKLHD MQEGKCLYSLEAIPLEDLLNNPFNYEVDHIIPRSVSFDNSFNNKVLVKQEEASKKGNRTPFQYLSSSD SKISYETFKKHILNLAKGKGRISKTKKEYLLEERDINRFSVQKDFINRNLVDTRYATRGLMNLLRSYF RVNNLDVKVKSINGGFTSFLRRKWKFKKERNKGYKHHAEDALIIANADFIFKEWKKLDKAKKVMENQM FEEKQAESMPEIETEQEYKEIFITPHQIKHIKDFKDYKYSHRVDKKPNRELINDTLYSTRKDDKGNTL IVNNLNGLYDKDNDKLKKLINKSPEKLLMYHHDPQTYQKLKLIMEQYGDEKNPLYKYYEETGNYLTKY SKKDNGPVIKKIKYYGNKLNAHLDITDDYPNSRNKVVKLSLKPYRFDVYLDNGVYKFVTVKNLDVIKK ENYYEVNSKCYEEAKKLKKISNQAEFIASFYNNDLIKINGELYRVIGVNNDLLNRIEVNMIDITYREY LENMNDKRPPRIIKTIASKTQSIKKYSTDILGNLYEVKSKKHPQIIKKGKRPAATKKAGQAKKKKGSD AKSLTAWSRTLVTFKDVFVDFTREEWKLLDTAQQILYRNVMLENYKNLVSLGYQLTKPDVILRLEKGE EPWLVEREIHQETHPDSETAFEIKSSVPKKKRKV SEQ ID NO: 50 Polynucleotide sequence of Staphylococcus aureus dCas9-KRAB protein atggccccaaagaagaagcggaaggtcggtatccacggagtcccagcagccaagcggaactacatcct gggcctggccatcggcatcaccagcgtgggctacggcatcatcgactacgagacacgggacgtgatcg atgccggcgtgcggctgttcaaagaggccaacgtggaaaacaacgagggcaggcggagcaagagaggc gccagaaggctgaagcggcggaggcggcatagaatccagagagtgaagaagctgctgttcgactacaa cctgctgaccgaccacagcgagctgagcggcatcaacccctacgaggccagagtgaagggcctgagcc agaagctgagcgaggaagagttctctgccgccctgctgcacctggccaagagaagaggcgtgcacaac gtgaacgaggtggaagaggacaccggcaacgagctgtccaccaaagagcagatcagccggaacagcaa ggccctggaagagaaatacgtggccgaactgcagctggaacggctgaagaaagacggcgaagtgcggg gcagcatcaacagattcaagaccagcgactacgtgaaagaagccaaacagctgctgaaggtgcagaag gcctaccaccagctggaccagagcttcatcgacacctacatcgacctgctggaaacccggcggaccta ctatgagggacctggcgagggcagccccttcggctggaaggacatcaaagaatggtacgagatgctga tgggccactgcacctacttccccgaggaactgcggagcgtgaagtacgcctacaacgccgacctgtac aacgccctgaacgacctgaacaatctcgtgatcaccagggacgagaacgagaagctggaatattacga gaagttccagatcatcgagaacgtgttcaagcagaagaagaagcccaccctgaagcagatcgccaaag aaatcctcgtgaacgaagaggatattaagggctacagagtgaccagcaccggcaagcccgagttcacc aacctgaaggtgtaccacgacatcaaggacattaccgcccggaaagagattattgagaacgccgagct gctggatcagattgccaagatcctgaccatctaccagagcagcgaggacatccaggaagaactgacca atctgaactccgagctgacccaggaagagatcgagcagatctctaatctgaagggctataccggcacc cacaacctgagcctgaaggccatcaacctgatcctggacgagctgtggcacaccaacgacaaccagat cgctatcttcaaccggctgaagctggtgcccaagaaggtggacctgtcccagcagaaagagatcccca ccaccctggtggacgacttcatcctgagccccgtcgtgaagagaagcttcatccagagcatcaaagtg atcaacgccatcatcaagaagtacggcctgcccaacgacatcattatcgagctggcccgcgagaagaa ctccaaggacgcccagaaaatgatcaacgagatgcagaagcggaaccggcagaccaacgagcggatcg aggaaatcatccggaccaccggcaaagagaacgccaagtacctgatcgagaagatcaagctgcacgac atgcaggaaggcaagtgcctgtacagcctggaagccatccctctggaagatctgctgaacaacccctt caactatgaggtggaccacatcatccccagaagcgtgtccttcgacaacagcttcaacaacaaggtgc tcgtgaagcaggaagaagccagcaagaagggcaaccggaccccattccagtacctgagcagcagcgac agcaagatcagctacgaaaccttcaagaagcacatcctgaatctggccaagggcaagggcagaatcag caagaccaagaaagagtatctgctggaagaacgggacatcaacaggttctccgtgcagaaagacttca tcaaccggaacctggtggataccagatacgccaccagaggcctgatgaacctgctgcggagctacttc agagtgaacaacctggacgtgaaagtgaagtccatcaatggcggcttcaccagctttctgcggcggaa gtggaagtttaagaaagagcggaacaaggggtacaagcaccacgccgaggacgccctgatcattgcca acgccgatttcatcttcaaagagtggaagaaactggacaaggccaaaaaagtgatggaaaaccagatg ttcgaggaaaagcaggccgagagcatgcccgagatcgaaaccgagcaggagtacaaagagatcttcat caccccccaccagatcaagcacattaaggacttcaaggactacaagtacagccaccgggtggacaaga agcctaatagagagctgattaacgacaccctgtactccacccggaaggacgacaagggcaacaccctg atcgtgaacaatctgaacggcctgtacgacaaggacaatgacaagctgaaaaagctgatcaacaagag ccccgaaaagctgctgatgtaccaccacgacccccagacctaccagaaactgaagctgattatggaac agtacggcgacgagaagaatcccctgtacaagtactacgaggaaaccgggaactacctgaccaagtac tccaaaaaggacaacggccccgtgatcaagaagattaagtattacggcaacaaactgaacgcccatct ggacatcaccgacgactaccccaacagcagaaacaaggtcgtgaagctgtccctgaagccctacagat tcgacgtgtacctggacaatggcgtgtacaagttcgtgaccgtgaagaatctggatgtgatcaaaaaa gaaaactactacgaagtgaatagcaagtgctatgaggaagctaagaagctgaagaagatcagcaacca ggccgagtttatcgcctccttctacaacaacgatctgatcaagatcaacggcgagctgtatagagtga tcggcgtgaacaacgacctgctgaaccggatcgaagtgaacatgatcgacatcacctaccgcgagtac ctggaaaacatgaacgacaagaggccccccaggatcattaagacaatcgcctccaagacccagagcat taagaagtacagcacagacattctgggcaacctgtatgaagtgaaatctaagaagcaccctcagatca tcaaaaagggcaaaaggccggcggccacgaaaaaggccggccaggcaaaaaagaaaaagggatccgat gctaagtcactgactgcctggtcccggacactggtgaccttcaaggatgtgtttgtggacttcaccag ggaggagtggaagctgctggacactgctcagcagatcctgtacagaaatgtgatgctggagaactata agaacctggtttccttgggttatcagcttactaagccagatgtgatcctccggttggagaagggagaa gagccctggctggtggagagagaaattcaccaagagacccatcctgattcagagactgcatttgaaat caaatcatcagttccgaaaaagaaacgcaaagtt SEQ ID NO: 51 Polypeptide sequence of Tet1CD LPTCSCLDRVIQKDKGPYYTHLGAGPSVAAVREIMENRYGQKGNAIRIEIVVYTGKEGKSSHGCPIAK WVLRRSSDEEKVLCLVRQRTGHHCPTAVMVVLIMVWDGIPLPMADRLYTELTENLKSYNGHPTDRRCT LNENRTCTCQGIDPETCGASFSFGCSWSMYFNGCKFGRSPSPRRFRIDPSSPLHEKNLEDNLQSLATR LAPIYKQYAPVAYQNQVEYENVARECRLGSKEGRPFSGVTACLDFCAHPHRDIHNMNNGSTVVCTLTR EDNRSLGVIPQDEQLHVLPLYKLSDTDEFGSKEGMEAKIKSGAIEVLAPRRKKRTCFTQPVPRSGKKR AAMMTEVLAHKIRAVEKKPIPRIKRKNNSTTTNNSKPSSLPTLGSNTETVQPEVKSETEPHFILKSSD NTKTYSLMPSAPHPVKEASPGFSWSPKTASATPAPLKNDATASCGFSERSSTPHCTMPSGRLSGANAA AADGPGISQLGEVAPLPTLSAPVMEPLINSEPSTGVTEPLTPHQPNHQPSFLTSPQDLASSPMEEDEQ HSEADEPPSDEPLSDDPLSPAEEKLPHIDEYWSDSEHIFLDANIGGVAIAPAHGSVLIECARRELHAT TPVEHPNRNHPTRLSLVFYQHKNLNKPQHGFELNKIKFEAKEAKNKKMKASEQKDQAANEGPEQSSEV NELNQIPSHKALTLTHDNVVTVSPYALTHVAGPYNHWV SEQ ID NO: 52 Polynucleotide sequence of Tet1CD CTGCCCACCTGCAGCTGTCTTGATCGAGTTATACAAAAAGACAAAGGCCCATATTATACACACCTTGG GGCAGGACCAAGTGTTGCTGCTGTCAGGGAAATCATGGAGAATAGGTATGGTCAAAAAGGAAACGCAA TAAGGATAGAAATAGTAGTGTACACCGGTAAAGAAGGGAAAAGCTCTCATGGGTGTCCAATTGCTAAG TGGGTTTTAAGAAGAAGCAGTGATGAAGAAAAAGTTCTTTGTTTGGTCCGGCAGCGTACAGGCCACCA CTGTCCAACTGCTGTGATGGTGGTGCTCATCATGGTGTGGGATGGCATCCCTCTTCCAATGGCCGACC GGCTATACACAGAGCTCACAGAGAATCTAAAGTCATACAATGGGCACCCTACCGACAGAAGATGCACC CTCAATGAAAATCGTACCTGTACATGTCAAGGAATTGATCCAGAGACTTGTGGAGCTTCATTCTCTTT TGGCTGTTCATGGAGTATGTACTTTAATGGCTGTAAGTTTGGTAGAAGCCCAAGCCCCAGAAGATTTA GAATTGATCCAAGCTCTCCCTTACATGAAAAAAACCTTGAAGATAACTTACAGAGTTTGGCTACACGA TTAGCTCCAATTTATAAGCAGTATGCTCCAGTAGCTTACCAAAATCAGGTGGAATATGAAAATGTTGC CCGAGAATGTCGGCTTGGCAGCAAGGAAGGTCGACCCTTCTCTGGGGTCACTGCTTGCCTGGACTTCT GTGCTCATCCCCACAGGGACATTCACAACATGAATAATGGAAGCACTGTGGTTTGTACCTTAACTCGA GAAGATAACCGCTCTTTGGGTGTTATTCCTCAAGATGAGCAGCTCCATGTGCTACCTCTTTATAAGCT TTCAGACACAGATGAGTTTGGCTCCAAGGAAGGAATGGAAGCCAAGATCAAATCTGGGGCCATCGAGG TCCTGGCACCCCGCCGCAAAAAAAGAACGTGTTTCACTCAGCCTGTTCCCCGTTCTGGAAAGAAGAGG GCTGCGATGATGACAGAGGTTCTTGCACATAAGATAAGGGCAGTGGAAAAGAAACCTATTCCCCGAAT CAAGCGGAAGAATAACTCAACAACAACAAACAACAGTAAGCCTTCGTCACTGCCAACCTTAGGGAGTA ACACTGAGACCGTGCAACCTGAAGTAAAAAGTGAAACCGAACCCCATTTTATCTTAAAAAGTTCAGAC AACACTAAAACTTATTCGCTGATGCCATCCGCTCCTCACCCAGTGAAAGAGGCATCTCCAGGCTTCTC CTGGTCCCCGAAGACTGCTTCAGCCACACCAGCTCCACTGAAGAATGACGCAACAGCCTCATGCGGGT TTTCAGAAAGAAGCAGCACTCCCCACTGTACGATGCCTTCGGGAAGACTCAGTGGTGCCAATGCTGCA GCTGCTGATGGCCCTGGCATTTCACAGCTTGGCGAAGTGGCTCCTCTCCCCACCCTGTCTGCTCCTGT GATGGAGCCCCTCATTAATTCTGAGCCTTCCACTGGTGTGACTGAGCCGCTAACGCCTCATCAGCCAA ACCACCAGCCCTCCTTCCTCACCTCTCCTCAAGACCTTGCCTCTTCTCCAATGGAAGAAGATGAGCAG CATTCTGAAGCAGATGAGCCTCCATCAGACGAACCCCTATCTGATGACCCCCTGTCACCTGCTGAGGA GAAATTGCCCCACATTGATGAGTATTGGTCAGACAGTGAGCACATCTTTTTGGATGCAAATATTGGTG GGGTGGCCATCGCACCTGCTCACGGCTCGGTTTTGATTGAGTGTGCCCGGCGAGAGCTGCACGCTACC ACTCCTGTTGAGCACCCCAACCGTAATCATCCAACCCGCCTCTCCCTTGTCTTTTACCAGCACAAAAA CCTAAATAAGCCCCAACATGGTTTTGAACTAAACAAGATTAAGTTTGAGGCTAAAGAAGCTAAGAATA AGAAAATGAAGGCCTCAGAGCAAAAAGACCAGGCAGCTAATGAAGGTCCAGAACAGTCCTCTGAAGTA AATGAATTGAACCAAATTCCTTCTCATAAAGCATTAACATTAACCCATGACAATGTTGTCACCGTGTC CCCTTATGCTCTCACACACGTTGCGGGGCCCTATAACCATTGGGTC SEQ ID NO: 53 Protein sequence for VPH DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSLPSASVEFEGSGGPSG QISNQALALAPSSAPVLAQTMVPSSAMVPLAQPPAPAPVLTPGPPQSLSAPVPKSTQAGEGTLSEALL HLQFDADEDLGALLGNSTDPGVFTDLASVDNSEFQQLLNQGVSMSHSTAEPMLMEYPEAITRLVTGSQ RPPDPAPTPLGTSGLPNGLSGDEDFSSIADMDFSALLSQISSSGQGGGGSGFSVDTSALLDLFSPSVT VPDMSLPDLDSSLASIQELLSPQEPPRPPEAENSSPDSGKQLVHYTAQPLFLLDPGSVDTGSNDLPVL FELGEGSYFSEGDGFAEDPTISLLTGSEPPKAKDPTVS SEQ ID NO: 54 DNA sequence for VPH Gatgctttagacgattttgacttagatatgcttggttcagacgcgttagacgacttcgacctagacat gttaggctcagatgcattggacgacttcgatttagatatgttgggctccgatgccctagatgactttg atctagatatgctagggtcactacccagcgccagcgtcgagttcgaaggcagcggcgggccttcaggg cagatcagcaaccaggccctggctctggcccctagctccgctccagtgctggcccagactatggtgcc ctctagtgctatggtgcctctggcccagccacctgctccagcccctgtgctgaccccaggaccacccc agtcactgagcgccccagtgcccaagtctacacaggccggcgaggggactctgagtgaagctctgctg cacctgcagttcgacgctgatgaggacctgggagctctgctggggaacagcaccgatcccggagtgtt cacagatctggcctccgtggacaactctgagtttcagcagctgctgaatcagggcgtgtccatgtctc atagtacagccgaaccaatgctgatggagtaccccgaagccattacccggctggtgaccggcagccag cggccccccgaccccgctccaactcccctgggaaccagcggcctgcctaatgggctgtccggagatga agacttctcaagcatcgctgatatggactttagtgccctgctgtcacagatttcctctagtgggcagg gaggaggtggaagcggcttcagcgtggacaccagtgccctgctggacctgttcagcccctcggtgacc gtgcccgacatgagcctgcctgaccttgacagcagcctggccagtatccaagagctcctgtctcccca ggagccccccaggcctcccgaggcagagaacagcagcccggattcagggaagcagctggtgcactaca cagcgcagccgctgttcctgctggaccccggctccgtggacaccgggagcaacgacctgccggtgctg tttgagctgggagagggctcctacttctccgaaggggacggcttcgccgaggaccccaccatctccct gctgacaggctcggagcctcccaaagccaaggaccccactgtctcc SEQ ID NO: 55 Protein sequence for VPR DALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSPKKKRKVGSQYLPDTD DRHRIEEKRKRTYETFKSIMKKSPFSGPTDPRPPPRRIAVPSRSSASVPKPAPQPYPFTSSLSTINYD EFPTMVFPSGQISQASALAPAPPQVLPQAPAPAPAPAMVSALAQAPAPVPVLAPGPPQAVAPPAPKPT QAGEGTLSEALLQLQFDDEDLGALLGNSTDPAVFTDLASVDNSEFQQLLNQGIPVAPHTTEPMLMEYP EAITRLVTGAQRPPDPAPAPLGAPGLPNGLLSGDEDFSSIADMDFSALLSQISSGSGSGSRDSREGMF LPKPEAGSAISDVFEGREVCQPKRIRPFHPPGSPWANRPLPASLAPTPTGPVHEPVGSLTPAPVPQPL DPAPAVTPEASHLLEDPDEETSQAVKALREMADTVIPQKEEAAICGQMDLSHPPPRGHLDELTTTLES MTEDLNLDSPLTPELNEILDTFLNDECLLHAMHISTGLSIFDTSLF SEQ ID NO: 56 DNA sequence for VPR gatgctttagacgattttgacttagatatgcttggttcagacgcgttagacgacttcgacctagacat gttaggctcagatgcattggacgacttcgatttagatatgttgggctccgatgccctagatgactttg atctagatatgctaggtagtcccaaaaagaagaggaaagtgggatcccagtatctgcccgacacagat gatagacaccgaatcgaagagaaacgcaagcgaacgtatgaaaccttcaaatcgatcatgaagaaatc gcccttctcgggtccgaccgatcccaggcccccaccgagaaggattgcggtcccgtcccgctcgtcgg ccagcgtgccgaagcctgcgccgcagccctaccccttcacgtcgagcctgagcacaatcaattatgac gagttcccgacgatggtgttcccctcgggacaaatctcacaagcctcggcgctcgcaccagcgcctcc ccaagtccttccgcaagcgcctgccccagcgcctgcaccggcaatggtgtccgccctcgcacaggccc ctgcgcccgtccccgtgctcgcgcctggaccgccccaggcggtcgctccaccggctccgaagccgacg caggccggagagggaacactctccgaagcacttcttcaactccagtttgatgacgaggatcttggagc actccttggaaactcgacagaccctgcggtgtttaccgacctcgcgtcagtagataactccgaatttc agcagcttttgaaccagggtatcccggtcgcgccacatacaacggagcccatgttgatggaatacccc gaagcaatcacgagacttgtgacgggagcgcagcggcctcccgatcccgcacccgcacctttgggggc acctggcctccctaacggacttttgagcggcgacgaggatttctcctccatcgccgatatggatttct cagccttgctgtcacagatttccagcggctctggcagcggcagccgggattccagggaagggatgttt ttgccgaagcctgaggccggctccgctattagtgacgtgtttgagggccgcgaggtgtgccagccaaa acgaatccggccatttcatcctccaggaagtccatgggccaaccgcccactccccgccagcctcgcac caacaccaaccggtccagtacatgagccagtcgggtcactgaccccggcaccagtccctcagccactg gatccagcgcccgcagtgactcccgaggccagtcacctgttggaggatcccgatgaagagacgagcca ggctgtcaaagcccttcgggagatggccgatactgtgattccccagaaggaagaggctgcaatctgtg gccaaatggacctttcccatccgcccccaaggggccatctggatgagctgacaaccacacttgagtcc atgaccgaggatctgaacctggactcacccctgaccccggaattgaacgagattctggataccttcct gaacgacgagtgcctcttgcatgccatgcatatcagcacaggactgtccatcttcgacacatctctgt tt SEQ ID NO: 57 ABE ecTadA wild-type, protein SEVEFSHEYWMRHALTLAKRAWDEREVPVGAVLVHNNRVIGEGWNRPIGRHDPTAHAEIMALRQGGLV MQNYRLIDATLYVTLEPCVMCAGAMIHSRIGRVVFGARDAKTGAAGSLMDVLHHPGMNHRVEITEGIL ADECAALLSDFFRMRRQEIKAQKKAQSSTD SEQ ID NO: 58 ABE ecTadA*7.9, protein SEVEFSHEYWMRHALTLAKRALDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGIL ADECNALLCYFFRMPRQVFNAQKKAQSSTD SEQ ID NO: 59 ABE ecTadA*7.10, protein SEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGIL ADECAALLCYFFRMPRQVFNAQKKAQSSTD SEQ ID NO: 60 ABE ecTadA*8e, protein SEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNSKRGAAGSLMNVLNYPGMNHRVEITEGIL ADECAALLCDFYRMPRQVFNAQKKAQSSIN SEQ ID NO: 61 ABE ecTadA*8.8, protein SEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLIDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHHPGMNHRVEITEGIL ADECAALLCRFFRMPRRVFNAQKKAQSSTD SEQ ID NO: 62 ABE ecTadA*8.13, protein SEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLYDATLYVTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHHPGMNHRVEITEGIL ADECAALLCRFFRMPRRVFNAQKKAQSSTD SEQ ID NO: 63 ABE ecTadA*8.17, protein SEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLIDATLYSTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHYPGMNHRVEITEGIL ADECAALLCYFFRMPRRVFNAQKKAQSSTD SEQ ID NO: 64 ABE ecTadA*8.20, protein ecTadA*8.20 SEVEFSHEYWMRHALTLAKRARDEREVPVGAVLVLNNRVIGEGWNRAIGLHDPTAHAEIMALRQGGLV MQNYRLYDATLYSTFEPCVMCAGAMIHSRIGRVVFGVRNAKTGAAGSLMDVLHHPGMNHRVEITEGIL ADECAALLCRFFRMPRRVFNAQKKAQSSTD SEQ ID NO: 65 ABE ecTadA wild-type, DNA tctgaagtcgagtttagccacgagtattggatgaggcacgcactgaccctggcaaagcgagcatggga tgaaagagaagtccccgtgggcgccgtgctggtgcacaacaatagagtgatcggagagggatggaaca ggccaatcggccgccacgaccctaccgcacacgcagagatcatggcactgaggcagggaggcctggtc atgcagaattaccgcctgatcgatgccaccctgtatgtgacactggagccatgcgtgatgtgcgcagg agcaatgatccacagcaggatcggaagagtggtgttcggagcacgggacgccaagaccggcgcagcag gctccctgatggatgtgctgcaccaccccggcatgaaccaccgggtggagatcacagagggaatcctg gcagacgagtgcgccgccctgctgagcgatttctttagaatgcggagacaggagatcaaggcccagaa gaaggcacagagctccaccgac SEQ ID NO: 66 ABE ecTadA*7.9, DNA tccgaagtcgagttttcccatgagtactggatgagacacgcattgactctcgcaaagagggctctcga tgaacgcgaggtgcccgtgggggcagtactcgtgctcaacaatcgcgtaatcggcgaaggttggaata gggcaatcggactccacgaccccactgcacatgcggaaatcatggcccttcgacagggagggcttgtg atgcagaattatcgacttatcgatgcgacgctgtacgtcacgtttgaaccttgcgtaatgtgcgcggg agctatgattcactcccgcattggacgagttgtattcggtgttcgcaacgccaagacgggtgccgcag gttcactgatggacgtgctgcattacccaggcatgaaccaccgggtagaaatcacagaaggcatattg gcggacgaatgtaacgcgctgttgtgttacttttttcgcatgcccaggcaggtctttaacgcccagaa aaaagcacaatcctctactgac SEQ ID NO: 67 ABE ecTadA*7.10, DNA tccgaagtcgagttttcccatgagtactggatgagacacgcattgactctcgcaaagagggctcgaga tgaacgcgaggtgcccgtgggggcagtactcgtgctcaacaatcgcgtaatcggcgaaggttggaata gggcaatcggactccacgaccccactgcacatgcggaaatcatggcccttcgacagggagggcttgtg atgcagaattatcgacttatcgatgcgacgctgtacgtcacgtttgaaccttgcgtaatgtgcgcggg agctatgattcactcccgcattggacgagttgtattcggtgttcgcaacgccaagacgggtgccgcag gttcactgatggacgtgctgcattacccaggcatgaaccaccgggtagaaatcacagaaggcatattg gcggacgaatgtgcggcgctgttgtgttacttttttcgcatgcccaggcaggtctttaacgcccagaa aaaagcacaatcctctactgac SEQ ID NO: 68 ABE ecTadA*8e, DNA tctgaggtggagttttcccacgagtactggatgagacatgccctgaccctggccaagagggcacggga tgagagggaggtgcctgtgggagccgtgctggtgctgaacaatagagtgatcggcgagggctggaaca gagccatcggcctgcacgacccaacagcccatgccgaaattatggccctgagacagggcggcctggtc atgcagaactacagactgattgacgccaccctgtacgtgacattcgagccttgcgtgatgtgcgccgg cgccatgatccactctaggatcggccgcgtggtgtttggcgtgaggaactcaaaaagaggcgccgcag gctccctgatgaacgtgctgaactaccccggcatgaatcaccgcgtcgaaattaccgagggaatcctg gcagatgaatgtgccgccctgctgtgcgatttctatcggatgcctagacaggtgttcaatgctcagaa gaaggcccagagctccatcaac SEQ ID NO: 69 ABE ecTadA*8.8, DNA tccgaagtcgagttttcccatgagtactggatgagacacgcattgactctcgcaaagagggctcgaga tgaacgcgaggtgcccgtgggggcagtactcgtgctcaacaatcgcgtaatcggcgaaggttggaata gggcaatcggactccacgaccccactgcacatgcggaaatcatggcccttcgacagggagggcttgtg atgcagaattatcgacttatcgatgcgacgctgtacgtcacgtttgaaccttgcgtaatgtgcgcggg agctatgattcactcccgcattggacgagttgtattcggtgttcgcaacgccaagacgggtgccgcag gttcactgatggacgtgctgcatcatccaggcatgaaccaccgggtagaaatcacagaaggcatattg gcggacgaatgtgcggcgctgttgtgtcgtttttttcgcatgcccaggcgggtctttaacgcccagaa aaaagcacaatcctctactgactctggtggttcttctggtggttctagcggcagcgagactcccggga cctcagagtccgccacacccgaaagttctggtggttcttctggtggttct SEQ ID NO: 70 ABE ecTadA*8.13, DNA tccgaagtcgagttttcccatgagtactggatgagacacgcattgactctcgcaaagagggctcgaga tgaacgcgaggtgcccgtgggggcagtactcgtgctcaacaatcgcgtaatcggcgaaggttggaata gggcaatcggactccacgaccccactgcacatgcggaaatcatggcccttcgacagggagggcttgtg atgcagaattatcgactttatgatgcgacgctgtacgtcacgtttgaaccttgcgtaatgtgcgcggg agctatgattcactcccgcattggacgagttgtattcggtgttcgcaacgccaagacgggtgccgcag gttcactgatggacgtgctgcatcatccaggcatgaaccaccgggtagaaatcacagaaggcatattg gcggacgaatgtgcggcgctgttgtgtcgtttttttcgcatgcccaggcgggtctttaacgcccagaa aaaagcacaatcctctactgac SEQ ID NO: 71 ABE ecTadA*8.17, DNA tccgaagtcgagttttcccatgagtactggatgagacacgcattgactctcgcaaagagggctcgaga tgaacgcgaggtgcccgtgggggcagtactcgtgctcaacaatcgcgtaatcggcgaaggttggaata gggcaatcggactccacgaccccactgcacatgcggaaatcatggcccttcgacagggagggcttgtg atgcagaattatcgacttatcgatgcgacgctgtactcgacgtttgaaccttgcgtaatgtgcgcggg agctatgattcactcccgcattggacgagttgtattcggtgttcgcaacgccaagacgggtgccgcag gttcactgatggacgtgctgcattacccaggcatgaaccaccgggtagaaatcacagaaggcatattg gcggacgaatgtgcggcgctgttgtgttacttttttcgcatgcccaggcgtgtctttaacgcccagaa aaaagcacaatcctctactgac SEQ ID NO: 72 ABE ecTadA*8.20, DNA tccgaagtcgagttttcccatgagtactggatgagacacgcattgactctcgcaaagagggctcgaga tgaacgcgaggtgcccgtgggggcagtactcgtgctcaacaatcgcgtaatcggcgaaggttggaata gggcaatcggactccacgaccccactgcacatgcggaaatcatggcccttcgacagggagggcttgtg atgcagaattatcgactttatgatgcgacgctgtactcgacgtttgaaccttgcgtaatgtgcgcggg agctatgattcactcccgcattggacgagttgtattcggtgttcgcaacgccaagacgggtgccgcag gttcactgatggacgtgctgcatcatccaggcatgaaccaccgggtagaaatcacagaaggcatattg gcggacgaatgtgcggcgctgttgtgtcgtttttttcgcatgcccaggcgggtctttaacgcccagaa aaaagcacaatcctctactgac SEQ ID NO: 73 TOX protein sequence MDVRFYPPPAQPAAAPDAPCLGPSPCLDPYYCNKFDGENMYMSMTEPSQDYVPASQSYPGPSLESEDF NIPPITPPSLPDHSLVHLNEVESGYHSLCHPMNHNGLLPFHPQNMDLPEITVSNMLGQDGTLLSNSIS VMPDIRNPEGTQYSSHPQMAAMRPRGQPADIRQQPGMMPHGQLTTINQSQLSAQLGLNMGGSNVPHNS PSPPGSKSATPSPSSSVHEDEGDDTSKINGGEKRPASDMGKKPKTPKKKKKKDPNEPQKPVSAYALFF RDTQAAIKGQNPNATFGEVSKIVASMWDGLGEEQKQVYKKKTEAAKKEYLKQLAAYRASLVSKSYSEP VDVKTSQPPQLINSKPSVFHGPSQAHSALYLSSHYHQQPGMNPHLTAMHPSLPRNIAPKPNNQMPVTV SIANMAVSPPPPLQISPPLHQHLNMQQHQPLTMQQPLGNQLPMQVQSALHSPTMQQGFTLQPDYQTII NPTSTAAQVVTQAMEYVRSGCRNPPPQPVDWNNDYCSSGGMQRDKALYLT SEQ ID NO: 74 TOX DNA sequence atggacgtaagattttatccacctccagcccagcccgccgctgcgcccgacgctccctgtctgggacc ttctccctgcctggacccctactattgcaacaagtttgacggtgagaacatgtatatgagcatgacag agccgagccaggactatgtgccagccagccagtcctaccctggtccaagcctggaaagtgaagacttc aacattccaccaattactcctccttccctcccagaccactcgctggtgcacctgaatgaagttgagtc tggttaccattctctgtgtcaccccatgaaccataatggcctgctaccatttcatccacaaaacatgg acctccctgaaatcacagtctccaatatgctgggccaggatggaacactgctttctaattccatttct gtgatgccagatatacgaaacccagaaggaactcagtacagttcccatcctcagatggcagccatgag accaaggggccagcctgcagacatcaggcagcagccaggaatgatgccacatggccagctgactacca ttaaccagtcacagctaagtgctcaacttggtttgaatatgggaggaagcaatgttccccacaactca ccatctccacctggaagcaagtctgcaactccttcaccatccagttcagtgcatgaagatgaaggcga tgatacctctaagatcaatggtggagagaagcggcctgcctctgatatggggaaaaaaccaaaaactc ccaaaaagaagaagaagaaggatcccaatgagccccagaagcctgtgtctgcctatgcgttattcttt cgtgatactcaggccgccatcaagggccaaaatccaaacgctacctttggcgaagtctctaaaattgt ggcttcaatgtgggacggtttaggagaagagcaaaaacaggtctataaaaagaaaaccgaggctgcga agaaggagtacctgaagcaactcgcagcatacagagccagccttgtatccaagagctacagtgaacct gttgacgtgaagacatctcaacctcctcagctgatcaattcgaagccgtcggtgttccatgggcccag ccaggcccactcggccctgtacctaagttcccactatcaccaacaaccgggaatgaatcctcacctaa ctgccatgcatcctagtctccccaggaacatagcccccaagccgaataaccaaatgccagtgactgtc tctatagcaaacatggctgtgtcccctcctcctcccctccagatcagcccgcctcttcaccagcatct caacatgcagcagcaccagccgctcaccatgcagcagccccttgggaaccagctccccatgcaggtcc agtctgccttacactcacccaccatgcagcaaggatttactcttcaacccgactatcagactattatc aatcctacatctacagctgcacaagttgtcacccaggcaatggagtatgtgcgttcggggtgcagaaa tcctcccccacaaccggtggactggaataacgactactgcagtagtgggggcatgcagagggacaaag cactgtaccttact SEQ ID NO: 193 IL7Ra protein sequence MTILGTTFGMVFSLLQVVSGESGYAQNGDLEDAELDDYSFSCYSQLEVNGSQHSLTCAFEDPDVNITN LEFEICGALVEVKCLNFRKLQEIYFIETKKFLLIGKSNICVKVGEKSLTCKKIDLTTIVKPEAPFDLS VVYREGANDFVVTFNTSHLQKKYVKVLMHDVAYRQEKDENKWTHVNLSSTKLTLLQRKLQPAAMYEIK VRSIPDHYFKGFWSEWSPSYYFRTPEINNSSGEMDPILLTISILSFFSVALLVILACVLWKKRIKPIV WPSLPDHKKTLEHLCKKPRKNLNVSFNPESFLDCQIHRVDDIQARDEVEGFLQDTFPQQLEESEKQRL GGDVQSPNCPSEDVVITPESFGRDSSLTCLAGNVSACDAPILSSSRSLDCRESGKNGPHVYQDLLLSL GTTNSTLPPPFSLQSGILTLNPVAQGQPILTSLGSNQEEAYVTMSSFYQNQ SEQ ID NO: 194 IL7Ra DNA sequence atgacaattctaggtacaacttttggcatggttttttctttacttcaagtcgtttctggagaaagtgg ctatgctcaaaatggagacttggaagatgcagaactggatgactactcattctcatgctatagccagt tggaagtgaatggatcgcagcattcactgacctgtgcttttgaggacccagatgtcaacaccaccaat ctggaatttgaaatatgtggggccctcgtggaggtaaagtgcctgaatttcaggaaactacaagagat atatttcatcgagacaaagaaattcttactgattggaaagagcaatatatgtgtgaaggttggagaaa agagtctaacctgcaaaaaaatagacctaaccactatagttaaacctgaggctccttttgacctgagt gtcatctatcgggaaggagccaatgactttgtggtgacatttaatacatcacacttgcaaaagaagta tgtaaaagttttaatgcatgatgtagcttaccgccaggaaaaggatgaaaacaaatggacgcatgtga atttatccagcacaaagctgacactcctgcagagaaagctccaaccggcagcaatgtatgagattaaa gttcgatccatccctgatcactattttaaaggcttctggagtgaatggagtccaagttattacttcag aactccagagatcaataatagctcaggggagatggatcctatcttactaaccatcagcattttgagtt ttttctctgtcgctctgttggtcatcttggcctgtgtgttatggaaaaaaaggattaagcctatcgta tggcccagtctccccgatcataagaagactctggaacatctttgtaagaaaccaagaaaaaatttaaa tgtgagtttcaatcctgaaagtttcctggactgccagattcatagggtggatgacattcaagctagag atgaagtggaaggttttctgcaagatacgtttcctcagcaactagaagaatctgagaagcagaggctt ggaggggatgtgcagagccccaactgcccatctgaggatgtagtcgtcactccagaaagctttggaag agattcatccctcacatgcctggctgggaatgtcagtgcatgtgacgcccctattctctcctcttcca ggtccctagactgcagggagagtggcaagaatgggcctcatgtgtaccaggacctcctgcttagcctt gggactacaaacagcacgctgccccctccattttctctccaatctggaatcctgacattgaacccagt tgctcagggtcagcccattcttacttccctgggatcaaatcaagaagaagcatatgtcaccatgtcca gcttctaccaaaaccagtga SEQ ID NO: 195 CD103 protein sequence MWLFHTLLCIASLALLAAFNVDVARPWLTPKGGAPFVLSSLLHQDPSTNQTWLLVTSPRTKRTPGPLH RCSLVQDEILCHPVEHVPIPKGRHRGVTVVRSHHGVLICIQVLVRRPHSLSSELTGTCSLLGPDLRPQ AQANFFDLENLLDPDARVDTGDCYSNKEGGGEDDVNTARQRRALEKEEEEDKEEEEDEEEEEAGTEIA IILDGSGSIDPPDFQRAKDFISNMMRNFYEKCFECNFALVQYGGVIQTEFDLRDSQDVMASLARVQNI TQVGSVTKTASAMQHVLDSIFTSSHGSRRKASKVMVVLTDGGIFEDPLNLTTVINSPKMQGVERFAIG VGEEFKSARTARELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIISMEGTVGDALHYQLAQIGFSA QILDERQVLLGAVGAFDWSGGALLYDTRSRRGRFLNQTAAAAADAEAAQYSYLGYAVAVLHKTCSLSY IAGAPRYKHHGAVFELQKEGREASFLPVLEGEQMGSYFGSELCPVDIDMDGSTDFLLVAAPFYHVHGE EGRVYVYRLSEQDGSFSLARILSGHPGFTNARFGFAMAAMGDLSQDKLTDVAIGAPLEGFGADDGASF GSVYIYNGHWDGLSASPSQRIRASTVAPGLQYFGMSMAGGFDISGDGLADITVGTLGQAVVFRSRPVV RLKVSMAFTPSALPIGFNGVVNVRLCFEISSVTTASESGLREALLNFTLDVDVGKQRRRLQCSDVRSC LGCLREWSSGSQLCEDLLLMPTEGELCEEDCFSNASVKVSYQLQTPEGQTDHPQPILDRYTEPFAIFQ LPYEKACKNKLFCVAELQLATTVSQQELVVGLTKELTLNINLTNSGEDSYMTSMALNYPRNLQLKRMQ KPPSPNIQCDDPQPVASVLIMNCRIGHPVLKRSSAHVSVVWQLEENAFPNRTADITVTVTNSNERRSL ANETHTLQFRHGFVAVLSKPSIMYVNTGQGLSHHKEFLFHVHGENLFGAEYQLQICVPTKLRGLQVVA VKKLTRTQASTVCTWSQERACAYSSVQHVEEWHSVSCVIASDKENVTVAAEISWDHSEELLKDVTELQ ILGEISFNKSLYEGLNAENHRTKITVVFLKDEKYHSLPIIIKGSVGGLLVLIVILVILFKCGFFKRKY QQLNLESIRKAQLKSENLLEEEN SEQ ID NO: 196 CD103 DNA sequence atgtggctcttccacactctgctctgcatagccagcctggccctgctggccgctttcaatgtggatgt ggcccggccctggctcacgcccaagggaggtgcccctttcgtgctcagctcccttctgcaccaagacc ccagcaccaaccagacctggctcctggtcaccagccccagaaccaagaggacaccagggcccctccat cgatgttcccttgtccaggatgaaatcctttgccatcctgtagagcatgtccccatccccaaggggag gcaccggggagtgaccgttgtccggagccaccacggtgttttgatatgcattcaagtgctggtccggc ggcctcacagcctcagctcagaactcacaggcacctgtagcctcctgggccctgacctccgtccccag gctcaggccaacttcttcgaccttgaaaatctcctggatccagatgcacgtgtggacactggagactg ctacagcaacaaagaaggcggtggagaagacgatgtgaacacagccaggcagcgccgggctctggaga aggaggaggaggaagacaaggaggaggaggaagacgaggaggaggaggaagctggcaccgagattgcc atcatcctggatggctcaggaagcattgatcccccagactttcagagagccaaagacttcatctccaa catgatgaggaacttctatgaaaagtgttttgagtgcaactttgccttggtgcagtatggaggagtga tccagactgagtttgaccttcgggacagccaggatgtgatggcctccctcgccagagtccagaacatc actcaagtggggagtgtcaccaagactgcctcagccatgcaacacgtcttagacagcatcttcacctc aagccacggctccaggagaaaggcatccaaggtcatggtggtgctcaccgatggtggcatattcgagg accccctcaaccttacgacagtcatcaactcccccaaaatgcagggtgttgagcgctttgccattggg gtgggagaagaatttaagagtgctaggactgcgagggaactgaacctgatcgcctcagacccggatga gacccatgctttcaaggtgaccaactacatggcgctggatgggctgctgagcaaactgcggtacaaca tcatcagcatggaaggcacggttggagacgcccttcactaccagctggcacagattggcttcagtgct cagatcctggatgagcggcaggtgctgctcggcgccgtcggggcctttgactggtccggaggggcgtt gctctacgacacacgcagccgccggggccgcttcctgaaccagacagcggcggcggcggcagacgcgg aggctgcgcagtacagctacctgggttacgctgtggccgtgctgcacaagacctgcagcctctcctac atcgcgggggctccacggtacaaacatcatggggccgtgtttgagctccagaaggagggcagagaggc cagcttcctgccagtgctggagggagagcagatggggtcctattttggctctgagctgtgccctgtgg acattgacatggatggaagcacggacttcttgctggtggctgctccattttaccacgttcatggagaa gaaggcagagtctacgtgtaccgtctcagcgagcaggatggttctttctccttggcacgcatactgag tgggcaccccgggttcaccaatgcccgctttggctttgccatggcggctatgggggatctcagtcagg ataagctcacagatgtggccatcggggcccccctggaaggttttggggcagatgatggtgccagcttc ggcagtgtgtatatctacaatggacactgggacggcctctccgccagcccctcgcagcggatcagagc ctccacggtggccccaggactccagtacttcggcatgtccatggctggtggctttgatattagtggcg acggccttgccgacatcaccgtgggcactctgggccaggcggttgtgttccgctcccggcctgtggtt cgcctgaaggtctccatggccttcacccccagcgcactgcccatcggcttcaacggcgtcgtgaatgt ccgtttatgttttgaaatcagctctgtaaccacagcctctgagtcaggcctccgcgaggcacttctca acttcacgctggatgtggatgtggggaagcagaggagacggctgcagtgttcagacgtaagaagctgt ctgggctgcctgagggagtggagcagcggatcccagctttgtgaggacctcctgctcatgcccacaga gggagagctctgtgaggaggactgcttctccaatgccagtgtcaaagtcagctaccagctccagaccc ctgagggacagacggaccatccccagcccatcctggaccgctacactgagccctttgccatcttccag ctgccctatgagaaggcctgcaagaataagctgttttgtgtcgcagaattacagttggccaccaccgt ctctcagcaggagttggtggtgggtctcacaaaggagctgaccctgaacattaacctaactaactccg gggaagattcctacatgacaagcatggccttgaattaccccagaaacctgcagttgaagaggatgcaa aagcctccctctccaaacattcagtgtgatgaccctcagccggttgcttctgtcctgatcatgaactg caggattggtcaccccgtcctcaagaggtcatctgctcatgtttcagtcgtttggcagctagaggaga atgcctttccaaacaggacagcagacatcactgtgactgtcaccaattccaatgaaagacggtctttg gccaacgagacccacacccttcaattcaggcatggcttcgttgcagttctgtccaaaccatccataat gtacgtgaacacaggccaggggctttctcaccacaaagaattcctcttccatgtacatggggagaacc tctttggagcagaataccagttgcaaatttgcgtcccaaccaaattacgaggtctccaggttgtagca gtgaagaagctgacgaggactcaggcctccacggtgtgcacctggagtcaggagcgcgcttgtgcgta cagttcggttcagcatgtggaagaatggcattcagtgagctgtgtcatcgcttcagataaagaaaatg tcaccgtggctgcagagatctcctgggatcactctgaggagttactaaaagatgtaactgaactgcag atccttggtgaaatatctttcaacaaatctctatatgagggactgaatgcagagaaccacagaactaa gatcactgtcgtcttcctgaaagatgagaagtaccattctttgcctatcatcattaaaggcagcgttg gtggacttctggtgttgatcgtgattctggtcatcctgttcaagtgtggcttttttaaaagaaaatat caacaactgaacttggagagcatcaggaaggcccagctgaaatcagagaatctgctcgaagaagagaa ttag

Claims

CLAIMS 1. An isolated polynucleotide encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
2. An isolated polynucleotide encoding a transcription factor selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof.
3. The isolated polynucleotide of claim 1 or 2, wherein the isolated polynucleotide comprises a sequence selected from SEQ ID NOs: 75-97.
4. The isolated polynucleotide of any one of claims 1-3, wherein the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
5. A vector encoding a transcription factor selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
6. A vector encoding a transcription factor selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof.
7. The vector of claim 5 or 6, wherein the vector comprises a promoter operably linked to a polynucleotide sequence encoding the transcription factor.
8. The vector of claim 7, wherein the promoter is non-endogenous to the transcription factor.
9. The vector of claim 7 or 8, wherein the promoter is a constitutive promoter, or a ubiquitous promoter, or an inducible promoter, or a cell-specific promoter, or a tissue-specific promoter.
10. The vector of any one of claims 5-9, wherein the vector comprises an open reading frame (ORF) of the transcription factor.
11. The vector of any one of claims 5-10, wherein the vector comprises a sequence selected from SEQ ID NOs: 75-97 or encodes a polypeptide comprising a sequence selected from SEQ ID NOs: 98-120.
12. The vector of any one of claims 5-11, wherein the transcription factor is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
13. The vector of any one of claims 5-12, wherein the vector is a viral vector.
14. The vector of claim 13, wherein the vector is a lentiviral vector.
15. The vector of claim 13, wherein the vector is an adeno-associated virus (AAV) vector.
16. The vector of claim 15, wherein the AAV vector is selected from AAV1, AAV2, AAV5, AAV6, AAV8, AAV9, and an engineered AAV vector.
17. A method of modulating T cells, the method comprising administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
18. The method of claim 17, wherein modulating T cells comprises increasing T cells, or increasing memory T cells, or increasing T cell distribution, or increasing tissue infiltration, or preventing T cell exhaustions, or reversing T cell exhaustions, or a combination thereof.
19. A method of increasing T cells, the method comprising administering to a T cell or a subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
20. A method of enhancing adoptive T cell therapy (ACT) in a subject, the method comprising administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
21. A method of treating cancer in a subject, the method comprising administering to a T cell or the subject an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof, wherein the activator increases the expression of the gene or increases the level of a protein encoded by the gene.
22. The method of any one of claims 17-21, wherein the gene is selected from TGIF2LX_1, TGIF1_3, TGIF2_1, FOS_2, HNF4A_1, HNF4A_3, HNF4A_5, HNF4A_6, KLF8_2, NFKBIZ_2, CARF_1, EBF3_1, HMX3_1, LHX4_1, LMX1A_1, PLAG1_2, PLAGL1_2, POU2F3_2, SOX14_1, SOX14_2, TFAP2D_1, WT1_1, and WT1_5, or a combination thereof.
23. The method of any one of claims 17-22, wherein the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
24. The method of any one of claims 17-23, wherein the activator modulates T cells, and wherein modulating T cells comprises increasing T cells, or increasing T cell distribution, or increasing tissue infiltration, or increasing memory T cells, or increasing the lifetime of a T cell, or preventing T cell exhaustions, or reversing T cell exhaustions, or reducing T cell exhaustion, or enhancing the therapeutic potential of T cells, or a combination thereof.
25. The method of any one of claims 17-24, wherein administration of the activator to the T cell results in a modified T cell, and wherein the modified T cell is administered to a subject.
26. The method of any one of claims 17-25, wherein the T cell is autologous or allogenic.
27. The method of any one of claims 17-26, wherein the activator modulates gene expression within the T cell.
28. The method of any one of claims 17-26, wherein the activator increases expression of CD103 or IL7Ra, or a combination thereof, in the T cell.
29. The method of any one of claims 17-28, wherein the activator comprises a polypeptide, or a polynucleotide, or a small molecule, or a combination thereof.
30. The method of any one of claims 17-29, wherein the activator comprises a polynucleotide encoding the gene.
31. The method of any one of claims 17-30, wherein the activator comprises a polynucleotide comprising the open reading frame of the gene or a polynucleotide encoding a protein encoded by the gene.
32. The method of any one of claims 17-31, wherein the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polynucleotide encoding a polypeptide comprising an amino acid sequence selected from SEQ ID NOs: 98-120.
33. The method of any one of claims 17-32, wherein the activator comprises a polypeptide comprising a protein encoded by the gene.
34. The method of any one of claims 17-29, wherein the activator comprises a polypeptide selected from SEQ ID NOs: 98-120.
35. The method of any one of claims 17-32, wherein the activator comprises the vector of any one of claims 5-16.
36. The method of any one of claims 17-35, wherein the activator or a polynucleotide encoding the activator is encapsulated within a lipid nanoparticle or polymeric carrier.
37. The method of any one of claims 17-36, the method further comprising administering at least one cancer therapy or at least one antiviral therapy.
38. A vector comprising the isolated polynucleotide of any one of claims 1-4.
39. A cell comprising the isolated polynucleotide of any one of claims 1-4, or the vector of any one of claims 5-16, or the vector of claim 38.
40. The cell of claim 39, wherein the cell is a CD8+ T cell or a CD4+ T cell.
41. A pharmaceutical composition comprising: the isolated polynucleotide of any one of claims 1-4, or the vector of any one of claims 5-16, or the vector of claim 38, or a combination thereof.
42. The pharmaceutical composition of claim 41, further comprising at least one cancer therapy or at least one antiviral therapy.
43. A composition for increasing T cells, the composition comprising an activator of a gene selected from TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, CARF, EBF3, HMX3, LHX4, LMX1A, PLAG1, PLAGL1, POU2F3, SOX14, TFAP2D, and WT1, or a combination thereof.
44. The composition of claim 43, wherein the gene is TGIF2LX, TGIF1, TGIF2, FOS, HNF4A, KLF8, NFKBIZ, or CARF, or a combination thereof.
45. The composition of claim 43 or 44, wherein the activator comprises a polynucleotide encoding the gene, or a polynucleotide encoding the open reading frame of the gene, or a polypeptide encoded by the gene, or a combination thereof.
46. The composition of claim 45, wherein the activator comprises a polynucleotide selected from SEQ ID NOs: 75-97 or a polypeptide selected from SEQ ID NOs: 98-120.
47. The composition of any one of claims 43-46, further comprising at least one cancer therapy or at least one antiviral therapy.
PCT/US2024/044668 2023-08-30 2024-08-30 Novel regulators of t cells Pending WO2025049903A2 (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US202363579780P 2023-08-30 2023-08-30
US63/579,780 2023-08-30
US202463636706P 2024-04-19 2024-04-19
US63/636,706 2024-04-19
US202463636859P 2024-04-21 2024-04-21
US63/636,859 2024-04-21

Publications (2)

Publication Number Publication Date
WO2025049903A2 true WO2025049903A2 (en) 2025-03-06
WO2025049903A3 WO2025049903A3 (en) 2025-06-12

Family

ID=94820368

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2024/044668 Pending WO2025049903A2 (en) 2023-08-30 2024-08-30 Novel regulators of t cells

Country Status (1)

Country Link
WO (1) WO2025049903A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12428631B2 (en) 2016-04-13 2025-09-30 Duke University CRISPR/Cas9-based repressors for silencing gene targets in vivo and methods of use
US12509492B2 (en) 2019-01-19 2025-12-30 Duke University Genome engineering with CRISPR-Cas systems in eukaryotes

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019195675A1 (en) * 2018-04-06 2019-10-10 President And Fellows Of Harvard College Methods of identifying combinations of transcription factors
WO2023283631A2 (en) * 2021-07-08 2023-01-12 The Broad Institute, Inc. Methods for differentiating and screening stem cells

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12428631B2 (en) 2016-04-13 2025-09-30 Duke University CRISPR/Cas9-based repressors for silencing gene targets in vivo and methods of use
US12509492B2 (en) 2019-01-19 2025-12-30 Duke University Genome engineering with CRISPR-Cas systems in eukaryotes

Also Published As

Publication number Publication date
WO2025049903A3 (en) 2025-06-12

Similar Documents

Publication Publication Date Title
JP7681753B2 (en) CRISPR-CAS Related Methods, Compositions and Components for Cancer Immunotherapy
KR102503130B1 (en) Targeted replacement of endogenous T cell receptors
TW202035693A (en) Compositions and methods for immunotherapy
KR20200130826A (en) Gene-modulating compositions and methods for improved immunotherapy
US12275963B2 (en) Artificially manipulated immune cell
IL262772B1 (en) Genetically engineered cells and methods of making the same
KR20200133219A (en) Gene-modulating compositions and methods for improved immunotherapy
US20230340139A1 (en) Immune cells having co-expressed shrnas and logic gate systems
US20250197823A1 (en) Compositions and methods for epigenome editing to enhance t cell therapy
US20220235380A1 (en) Immune cells having co-expressed shrnas and logic gate systems
JP7642548B2 (en) Gene Regulatory Compositions and Methods for Improved Immunotherapy - Patent application
JP2025519070A (en) Compositions and methods for efficient in vivo delivery
CN110885799A (en) Gene editing method based on adenovirus
WO2025049903A2 (en) Novel regulators of t cells
WO2025038982A2 (en) Regulators of t cell exhaustion
CN111107856A (en) Compositions and methods for enhancing the efficacy of T cell-based immunotherapy
JP7672006B2 (en) Compositions and methods for modifying regulatory T cells
WO2024220947A2 (en) Epigenetic targets for enhancing cancer immunotherapy
WO2024229292A2 (en) Compositions and methods for engineering mature ipsc-derived and esc-derived hepatocytes
WO2022182788A1 (en) In vivo crispr screening system for discovering therapeutic targets in cd8 t cells
Nesic et al. Generation of a PARPi-sensitive homozygous BRCA1-methylated OVCAR8 cell line using targeted CRISPR gene editing
TW202426060A (en) Engineered retrons and methods of use
WO2025117868A2 (en) Compositions and methods for the potentiation of cellular responses to matrix stiffness
WO2025049792A1 (en) Compositions and methods for epigenetic regulation of tgfbr2 expression
Moghadam Developing safe and controllable Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-based therapies with design principles of synthetic biology

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24861139

Country of ref document: EP

Kind code of ref document: A2