[go: up one dir, main page]

WO2024180050A1 - Detection of viral presence in cells - Google Patents

Detection of viral presence in cells Download PDF

Info

Publication number
WO2024180050A1
WO2024180050A1 PCT/EP2024/054932 EP2024054932W WO2024180050A1 WO 2024180050 A1 WO2024180050 A1 WO 2024180050A1 EP 2024054932 W EP2024054932 W EP 2024054932W WO 2024180050 A1 WO2024180050 A1 WO 2024180050A1
Authority
WO
WIPO (PCT)
Prior art keywords
cell
dna
viral dna
virus
mammalian
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/EP2024/054932
Other languages
French (fr)
Inventor
Suki ROY
Sanjanaa NAGARAJAN
Florian Böhl
Lingzhi Huang
Kit Yeng WONG
Rose Whelan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Evonik Operations GmbH
Original Assignee
Evonik Operations GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Evonik Operations GmbH filed Critical Evonik Operations GmbH
Priority to AU2024227981A priority Critical patent/AU2024227981A1/en
Priority to KR1020257032659A priority patent/KR20250154495A/en
Priority to CN202480029675.0A priority patent/CN121039296A/en
Publication of WO2024180050A1 publication Critical patent/WO2024180050A1/en
Anticipated expiration legal-status Critical
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/70Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6827Hybridisation assays for detection of mutation or polymorphism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/154Methylation markers

Definitions

  • the present invention relates to a method that can detect the presence or absence of a virus and/or promoter thereof and the possible activity of the virus in the cell.
  • the method quantitatively measures the level of DNA methylation on the CpG sites of the viral DNA in the cells.
  • Viral infection results in a variety of human diseases including cancer.
  • the viruses hijack cellular machinery to promote their replication and produce viral particle progeny.
  • some viruses integrate their genomic DNA into the host genome which results in several consequences such as gene disruption, chromosome instability, gene mutation and oncogenesis.
  • provirus passively gets replicated along with the host genome; thus, it gets passed down from one generation to the next. Integration of viral DNA into the host genome may lead to a latent infection or a productive infection.
  • Latent infection refers to a state where the provirus is transcriptionally silent, however, the provirus gets activated to start transcription in response to changes in the host's environmental conditions. Hence, the host cell’s protein synthesis machinery is hijacked to generate more viruses leading to the destruction of the host cell.
  • Viral vectors are prevalently used to express transgene for gene therapy, cell line development, and therapeutic protein production.
  • a viral vector is designed by substituting a set of the viral gene responsible for viral replication and pathogenicity with the transgene cassette. Gene transfer using viral vectors results in a high and long-term expression of the transgene.
  • the viral promoter and enhancer are the major DNA regulatory elements in the viral vector that determines the transgene expression level in the cells.
  • the integration sites of the viral vector are susceptible to transcriptional regulation via epigenetic regulation such as histone modifications and DNA methylation. For example, the DNA methylation status of the viral vector is an important factor for protein production or expression stability in producer cells.
  • DNA methylation on the CpG sites of the viral promoter results in transgene silencing at transcription levels.
  • protein production variability in CHO cells has been associated with DNA methylation mediated regulation of Cytomegalovirus Major Immediate-Early and enhancer (CMV) promoter and simian vacuolating virus 40 (SV40) promoters which are the most frequently used promoters for the production of recombinant proteins in CHO cells.
  • CMV Cytomegalovirus Major Immediate-Early and enhancer
  • SV40 simian vacuolating virus 40
  • the presence of viral DNA in a host cell is not representative of the host cell being in serious danger or being infected by the virus.
  • the current methods used to test the presence or integration of viral DNA in a host cell lack the ability to determine the potency of the presence of the virus on the survival of the host cell. Accordingly, there is a need in the art that quantitatively and qualitatively not only determines the presence of a virus in a host cell but also the potency of the DNA integration on the host cell.
  • the present invention solves the problems above by providing a DNA-based method for determining the level of DNA methylation on the CpG sites on the viral DNA and/or viral promoter which can then be used to not only predict the viral activity on the host cell but also the expression of a transgene in cells.
  • the method according to any aspect of the present invention may be carried out on a DNA-based array.
  • the method according to any aspect of the present invention may thus provide an efficient, cheap and accurate means of determining not only the presence of a virus in a cell but also the activity of the virus in the cell.
  • a method of identifying integrated viral DNA in the genome of at least one mammalian test cell comprising the steps of:
  • step (b) determining the methylation status of at least one CpG site in the viral DNA of the DNA sample obtained from the mammalian test cell, wherein the presence of the viral DNA after step (a) and hypermethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being inactive respectively; the presence of the viral DNA after step (a) and baseline methylation or hypomethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the viral DNA after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on an array.
  • viral DNA integration used synonymously with “integrated viral DNA” or “genome integrated viral DNA” refers to the incorporation of the whole genome or part of the genome of at least one virus into a host cell, particularly an animal cell, more particularly a mammalian cell.
  • Viral DNA integration is a means of viral insertion that is not transient as the viral DNA is genomically integrated into the host cell, i.e. mammalian cell.
  • Viral DNA integration is a potential diagnostic and prognostic marker.
  • viral DNA integration is a unique enzymatic process where doublestranded linear viral DNA is inserted into the host genome in a catalysed by a virus-encoded integrase.
  • viruses lie dormant in the infected cell, although their DNA may be integrated into the DNA of the host cell chromosome. Some viruses, present in the host cells may also lead to the cell being malignant and eventually causing tumours.
  • the family of Polyomaviruses e.g. simian virus 40 (SV40)
  • viruses from the Retroviridae family and Herpesviridae family are examples of viruses that can also be associated with malignancy (causing death or illness) in the host cell.
  • the viral DNA may be transformed from the viral RNA.
  • a “mammalian cell” as used herein refers to is a cell from any member of the order Mammalia which includes a cell from a mouse, a rat, a monkey, a guinea pig, a dog, a mini-pig, a human being, a cow, a sheep, a pig, a goat, a horse, a donkey, a mule, a hamster or the like.
  • the mammalian cell may also include an established cell line or immortalized cell line.
  • the mammalian cell according to any aspect of the present invention may be a CHO cell line which refers to immortal Chinese Hamster Ovary cell line (CHO) derived from Cricetulus griseus.
  • the CHO cell line may be selected from the group consisting of CHO-K1 (ATCC), CHO- DG44 (Thermo Fisher Scientific), CHO-DXB11 (ATCC), ExpiCHO-STM cells (Thermo Fisher Scientific), FreestyleTM CHO-STM cells (Thermo Fisher Scientific), CHO 1-15 [subscript 500] (ATCC), Agarabi CHO (ATCC), and a CHOK1 SV cell including all variants (e.g. POTELLIGENT®, Lonza, Slough, UK), a CHOK1SV GS-KO (glutamine synthetase knockout) cell including all variants (e.g., XCEEDTM Lonza, Slough, UK).
  • ATCC CHO-K1
  • CHO-DG44 Thermo Fisher Scientific
  • CHO-DXB11 ATCC
  • ExpiCHO-STM cells Thermo Fisher Scientific
  • FreestyleTM CHO-STM cells Thermo Fisher Scientific
  • the mammalian cell may be from Baby Hamster Kidney fibroblasts (BHK (ATCC CCL-10), or Vero cell (ATCC CCL-81).
  • Exemplary human cells include human embryonic kidney (HEK) cells, such as HEK293 (ATCC CRL-1573) , HEK 293T (ATCC CRL-3216), a HeLa cell (ATCC CCL-2), a NS0 cell (ECACC 85110503), or a Sp2/0 cell (ATCC CRL-1581).
  • the mammalian cells according to any aspect of the present invention may include mammalian cell cultures which can be either adherent cultures or suspension cultures.
  • the first step involves:
  • viral titre refers to a numeric expression of the quantity of a virus in a given volume, generally expressed as viral particles, transducing units, or infections particles, per millilitre (mL).
  • the presence or absence of viral DNA in a DNA sample obtained from the mammalian test cell may be determined using a DNA-based array according to any aspect of the present invention.
  • the DNA-based array according to any aspect of the present invention may then comprise probes that bind to any region (i.e. CpG and non-CpG sites) that is specific to the viral DNA.
  • CpG and non-CpG sites any region that is specific to the viral DNA.
  • the array according to any aspect of the present invention comprises probes that are specific to non-CpG sites that can detect the presence of integrated viral DNA in the genome of the DNA sample to be tested.
  • step (a) The presence of specific CpG and non-CpG sites in step (a) is indicative of the viral strain or variant thereof.
  • variant as used herein in respect to a virus refers to a virus strain that has changed most likely through mutation to a version that is different from the original virus.
  • variant refers to a form of the virus strain that deviates from what occurs in nature or which is considered the wild type.
  • the variant may be 50, 55, 60, 65, 70, 75, 80, 85, 90, 95% similar in sequence compared to the wild-type strain.
  • the second step according to any aspect of the present invention involves:
  • step (a) If in step (a) the presence and integration of the viral DNA or integrated viral DNA in the mammalian DNA is confirmed, then the methylation status of at least one of the CpGs of the integrated viral DNA may be determined from the DNA of the mammalian test cell.
  • a “CpG site” or “methylation site” is a nucleotide within a nucleic acid (DNA or RNA) that is susceptible to methylation either by natural occurring events in vivo or by an event instituted to chemically methylate the nucleotide in vitro. Some of these sites may be hypermethylated and some may be hypomethylated in a cell.
  • a “methylated nucleic acid molecule” refers to a nucleic acid molecule that contains one or more nucleotides that is/are methylated.
  • a “methylated nucleotide” or a “methylated nucleotide base” refers to the presence of a methyl moiety on a nucleotide base, where the methyl moiety is usually not present in a recognized typical nucleotide base.
  • cytosine in its usual form does not contain a methyl moiety on its pyrimidine ring, but 5-methylcytosine contains a methyl moiety at position 5 of its pyrimidine ring. Therefore, cytosine in its usual form may not be considered a methylated nucleotide and 5-methylcytosine may be considered a methylated nucleotide.
  • thymine may contain a methyl moiety at position 5 of its pyrimidine ring, however, for purposes herein, thymine may not be considered a methylated nucleotide when present in DNA.
  • Typical nucleotide bases for DNA are thymine, adenine, cytosine and guanine.
  • Typical bases for RNA are uracil, adenine, cytosine and guanine.
  • a "methylation site" is the location in the target gene nucleic acid region where methylation has the possibility of occurring. For example, a location containing CpG is a methylation site wherein the cytosine may or may not be methylated.
  • methylated nucleotide refers to nucleotides that carry a methyl group attached to a position of a nucleotide that is accessible for methylation. These methylated nucleotides are usually found in nature and to date, methylated cytosine that occurs mostly in the context of the dinucleotide CpG, but also in the context of CpNpG- and CpNpN-sequences may be considered the most common. In principle, other naturally occurring nucleotides may also be methylated but they will not be taken into consideration with regard to any aspect of the present invention.
  • a “CpG island” as used herein describes a segment of DNA sequence that comprises a functionally or structurally deviated CpG density.
  • Yamada et al. have described a set of standards for determining a CpG island: it must be at least 400 nucleotides in length, has a greater than 50% GC content, and an OCF/ECF ratio greater than 0.6 (Yamada et al., 2004, Genome Research, 14, 247-266).
  • Others have defined a CpG island less stringently as a sequence at least 200 nucleotides in length, having a greater than 50% GC content, and an OCF/ECF ratio greater than 0.6 (Takai et al., 2002, Proc. Natl.
  • methylation profile “methylation pattern”, “methylation state” or “methylation status,” are used herein to describe the state, situation or condition of methylation of a genomic sequence, and such terms refer to the characteristics of a DNA segment at a particular genomic locus in relation to methylation. Such characteristics include, but are not limited to, whether any of the cytosine (C) residues within this DNA sequence are methylated, location of methylated C residue(s), percentage of methylated C at any particular stretch of residues, and allelic differences in methylation due to, e.g., difference in the origin of the alleles.
  • C cytosine
  • methylation status refers to the status of a specific methylation site (i.e. methylated vs. non-methylated) which means a residue or methylation site is methylated or not methylated. Then, based on the methylation status of one or more methylation sites, a methylation profile may be determined.
  • cytosine (C) residue(s) not typically methylated within a DNA sequence are more methylated in a sample, it may be referred to as "hypermethylated”; whereas if cytosine (C) residue(s) typically methylated within a DNA sequence are less methylated, it may be referred to as "hypomethylated”.
  • the cytosine (C) residue(s) within a DNA sequence e.g., sample nucleic acid
  • the cytosine (C) residue(s) within a DNA sequence are more methylated when compared to another sequence from a different region or from a different individual (e.g., relative to normal nucleic acid), that sequence is considered hypermethylated compared to the other sequence.
  • cytosine (C) residue(s) within a DNA sequence are less methylated as compared to another sequence from a different region or from a different individual, that sequence is considered hypomethylated compared to the other sequence.
  • These sequences are said to be “differentially methylated”. For example, when the methylation status differs between inflamed and non-inflamed tissues, the sequences are considered “differentially methylated”.
  • Measurement of the levels of differential methylation may be done by a variety of ways known to those skilled in the art. One method is to measure the methylation level of individual interrogated CpG sites determined by the bisulfite sequencing method, as a non-limiting example.
  • Bisulfite treatment’ of genomic DNA used interchangeably with the term ‘bisulfite modification’, refers to the treatment of the genomic DNA with a deaminating agent such as a bisulfite that may be used to treat all DNA, methylated or not.
  • a deaminating agent such as a bisulfite that may be used to treat all DNA, methylated or not.
  • bisulfite as used herein encompasses any suitable type of bisulfite, such as sodium bisulfite, or other chemical agents that are capable of chemically converting a cytosine (C) to an uracil (U) without chemically modifying a methylated cytosine and therefore can be used to differentially modify a DNA sequence based on the methylation status of the DNA, e.g., U.S. Pat. Pub. US 2010/0112595.
  • a reagent that "differentially modifies" methylated or non-methylated DNA encompasses any reagent that modifies methylated and/or unmethylated DNA in a process through which distinguishable products result from methylated and non-methylated DNA, thereby allowing the identification of the DNA methylation status.
  • processes may include, but are not limited to, chemical reactions (such as a C to U conversion by bisulfite) and enzymatic treatment (such as cleavage by a methylation-dependent endonuclease).
  • an enzyme that preferentially cleaves or digests methylated DNA is one capable of cleaving or digesting a DNA molecule at a much higher efficiency when the DNA is methylated, whereas an enzyme that preferentially cleaves or digests unmethylated DNA exhibits a significantly higher efficiency when the DNA is not methylated.
  • step (a) the genomic DNA contained/ obtained or extracted from the cell, is first bisulfite treated.
  • TET-assisted pyridine borane sequencing may be used for detection of 5mC and 5hmC (Yibin Liu, et al., Nature Biotechnology, 37: 424-429 (2019).
  • genomic material refers to nucleic acid molecules or fragments of the genome of the animal according to any aspect of the present invention.
  • nucleic acid molecules or fragments are DNA or RNA or hybrids thereof, and most preferably are molecules of the DNA genome of a subject or group of subjects.
  • biological sample as used herein may be selected from the group consisting of muscle, organ tissue, milk, blood, brain, sperm and any other tissue or sample that provides genomic DNA to be used in the method according to any aspect of the present invention.
  • the biological sample may comprise any biological material obtained from the subject that contains DNA, and may be liquid, solid or both, may be tissue or bone, or a body fluid such as blood, lymph, etc.
  • the biological sample useful for the present invention may comprise biological cells or fragments thereof.
  • the “DNA sample” refers to the DNA extracted from a cell of the mammal according to any aspect of the present invention using known methods in the art.
  • the DNA used for the testing is purified high-molecular-weight genomic fragment of size ranging from 50 to 150 kb. Further washing steps removes low molecular weight DNA fragments which represents episomal DNA, virion DNA and cccDNA.
  • test used in conjunction with the term mammal or mammalian cell herein refers to the mammal or cell that is to be introduced to the array according to any aspect of the present invention and is the basis for an analysis application of the present invention to determine if the cell has viral DNA integrated in the test cell genome and whether this virus is active or not.
  • a “test profile” is therefore from an (individual) subject or group of subjects being tested according to the invention or a profile being obtained or generated in this context.
  • sample used in accordance with any aspect of the present invention refers to an entity that may be subject to the method of the present invention.
  • reference or ‘control’ shall denote, mostly predetermined, entities which are used for a comparison with the test entity.
  • a sample may be extracted DNA from a test mammalian cell that may be subject to the method of the present invention to determine if there is viral DNA integrated in the mammalian cell and if the virus is active in the test cell by first determining the presence or absence of the virus in the cell and then determining the DNA methylation profile of the test cell. There may be a further step of comparing this test methylation profile with a control and a ‘control’ refers to a mammalian cell, possibly of the same taxon, species, or cell line where the features as mentioned above are already known and where the methylation status is already known and used as a reference.
  • the DNA methylation profile of step (a) and (b) according to any aspect of the present invention is determined using an array.
  • a bead-based array is advantageous as it not only enables early detection of integrated viral DNA in a host mammalian cell but also helps in determining the effectiveness of a virus vector in a cell to be used for heterologous protein production.
  • the method according to any aspect of the present invention may be used for a better understanding of genome stability of mammalian cell lines and enables better control over the manufacturing/ process development/ product development/ scaling up/ validation process, thereby aiding in the selection of better mammalian cell lines for industrial applications.
  • Arrays allow for a high-throughput and robust method to determine semi-quantitative/quantitative DNA-methylation information through a small sample of extracted DNA of interest.
  • These custom designed arrays may use Illumina iScan and Infinium platform technology or an equivalent thereof, which allows on each chip for example 100,000 different bead types that covalently bind DNA- methylation probes.
  • Each probe represents one CpG Methylation site at the end of the probe sequence.
  • DNA samples undergo bisulfite conversion, amplification, fragmentation, precipitation and resuspension steps before hybridization on an array chip. Once on the chip the DNA hybridizes to the beads for each CpG site so that methylation changes at each site can be detected specifically through single nucleotide extension.
  • the array technology has a much shorter turn-around time.
  • the volume and complexity of data generated is lesser compared to sequencing making it computationally less intensive. This allows for quicker computation to achieve interpretable results from experimental groups.
  • Overall microarray technology is roughly 10x faster and 10x cheaper than traditional sequencing while still quantifiable for the methylation level at specific CpG sites.
  • array refers to an intentionally created collection of probe molecules which can be prepared either synthetically or biosynthetically.
  • the probe molecules in the array can be identical or different from each other.
  • the array can assume a variety of formats, for example, libraries of soluble molecules; libraries of compounds tethered to resin beads, silica chips, or other solid supports.
  • an array provides a convenient platform for simultaneous analysis of large numbers of CpG sites, for example, at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 50, 100, 500, 1000, 5000, 10,000, 100,000 or more sites or loci.
  • the array comprises a plurality of different probe molecules that can be attached to a substrate or otherwise spatially distinguished in an array.
  • arrays that may be used according to any aspect of the present invention include slide arrays, silicon wafer arrays, liquid arrays, bead-based arrays and the like.
  • array technology used according to any aspect of the present invention combines a miniaturized array platform, a high level of assay multiplexing, and scalable automation for sample handling and data processing.
  • the array according to any aspect of the present invention may be an array of arrays, also referred to as a composite array, having a plurality of individual arrays that is configured to allow processing of multiple samples simultaneously.
  • a substrate of a composite array may include a plurality of individual array locations, each having a plurality of probes, and each physically separated from other assay locations on the same substrate such that a fluid contacting one array location is prevented from contacting another array location.
  • Each array location can have a plurality of different probe molecules that are directly attached to the substrate or that are attached to the substrate via rigid particles in wells (also referred to herein as beads in wells).
  • an array substrate can be a fibre optical bundle or array of bundles as described in US6,023,540, US6,200,737 and/or US6,327,410.
  • An optical fibre bundle or array of bundles can have probes attached directly to the fibres or via beads.
  • W02004110246 further discloses other substrates and methods of attaching beads to the substrates that may be used in the array according to any aspect of the present invention.
  • a surface of the substrate may have physical alterations to enable the attachment of probes or produce array locations.
  • the surface of a substrate can be modified to contain chemically modified sites that are useful for attaching, either-covalently or non-covalently, probe molecules or particles having attached probe molecules.
  • Probes may be attached using any of a variety of methods known in the art including, an ink-jet printing method, a spotting technique, a photolithographic synthesis method, or printing method utilizing a mask. W02004110246 discloses these techniques in more detail.
  • the array according to any aspect of the present invention may be a bead-based array, where the beads are associated with a solid support such as those commercially available from Illumina, Inc. (San Diego, Calif.).
  • An array of beads useful according to any aspect of the present invention can also be in a fluid format such as a fluid stream of a flow cytometer or similar device.
  • Commercially available fluid formats for distinguishing beads include, for example, those used in XMAP(TM) technologies from Luminex or MPSS(TM) methods from Lynx Therapeutics.
  • solid support refers to a material or group of materials having a rigid or semi-rigid surface or surfaces.
  • at least one surface of the solid support will be substantially flat, although in some examples it may be desirable to physically separate synthesis regions for different compounds with, for example, wells, raised regions, pins, etched trenches, or the like.
  • the array or microarray according to any aspect of the present invention may be a very high- density array, for example, those having from about 10,000,000 probes/cm 2 to about 2,000,000,000 probes/cm 2 or from about 100,000,000 probes/cm 2 to about 1 ,000,000,000 probes/cm 2 .
  • High density arrays are especially useful according to any aspect of the present invention for including the multitude of CpG sites on the array.
  • the array according to any aspect of the present invention may be used to analyse or evaluate such pluralities of loci simultaneously or sequentially as desired.
  • a plurality of different probe molecules can be attached to a substrate or otherwise spatially distinguished in an array.
  • Each probe is typically specific for a particular locus and can be used to distinguish methylation state of the locus.
  • probe molecules or ‘probes’ as used interchangeably herein refers to a surface- immobilized molecule that can be recognized by a particular target. Probes used in the array can be specific for the methylated allele of a CpG site, the non-methylated allele of the CpG site or both or for the methylated allele of a non-CpG site, the non-methylated allele of the non-CpG site or both.
  • target refers to a molecule that has an affinity for a given probe molecule.
  • Targets may be naturally occurring or man-made molecules. Also, they can be employed in their unaltered state or as aggregates. Targets may be attached, covalently or noncovalently, to a binding member, either directly or via a specific binding substance. Examples of targets which can be employed according to any aspect of the present invention are methylated and non- methylated CpG sites. Targets are sometimes referred to in the art as anti-probes. As the term targets is used herein, no difference in meaning is intended.
  • complementary refers to the hybridization or base pairing between nucleotides or nucleic acids, such as, for instance, between the two strands of a double stranded DNA molecule or between an oligonucleotide primer and a primer binding site on a single stranded nucleic acid to be sequenced or amplified.
  • Complementary nucleotides are, generally, A and T (or A and U), or C and G.
  • Two single stranded RNA or DNA molecules are said to be complementary when the nucleotides of one strand, optimally aligned and compared and with appropriate nucleotide insertions or deletions, pair with at least about 80% of the nucleotides of the other strand, usually at least about 90% to 95%, and more preferably from about 98 to 100%.
  • Perfectly complementary refers to 100% complementarity over the length of a sequence. For example, a 25- base probe is perfectly complementary to a target when all 25 bases of the probe are complementary to a contiguous 25 base sequence of the target with no mismatches between the probe and the target over the length of the probe.
  • the presence of the viral DNA after step (a) according to any aspect of the present invention and hypermethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being inactive respectively; and the presence of the viral DNA after step (a) and baseline methylation or hypomethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the viral DNA after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively.
  • hypomethylation refers to the average methylation state corresponding to an increased presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG dinucleotides within a normal control DNA sample.
  • hypomethylation refers to the average methylation state corresponding to a decreased presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG dinucleotides within a normal control DNA sample.
  • a virus is considered to be in an active state when the virus is not only integrated in the host mammalian cell but also capable of replication and protein production.
  • some viruses integrate their genomic DNA into the host genome which results in several consequences such as gene disruption, chromosome instability, gene mutation and oncogenesis.
  • viral DNA which is integrated into the host genome is known as provirus.
  • provirus passively gets replicated along with the host genome, thus it gets passed from one generation to the next.
  • Integrated viral DNA might lead to a latent infection or a productive infection.
  • productive infection the new virus particles will be produced via transcription of the provirus to generate mRNA and thus infect other cells.
  • 'Latent infection refers to a state where the provirus is transcriptionally silent, however, the provirus gets activated to start transcription in response to changes in the host's environmental conditions. Hence, the host cell’s protein synthesis machinery is hijacked to generate more viruses leading to the destruction of its host cell. Thus, it is an absolute necessity to check for viral DNA integration or integrated viral DNA for cell line assessment, vaccine development, patient samples etc. as a part of quality control. The converse is a true for a virus in an non-active state.
  • Viral promoters are rich in CpG sites which make them more prone to DNA methylation and thus suppressing the protein expression. Accordingly, studying DNA methylation of viruses and their promoters can be used to accurately determine the presence or absence of viral integration and activity of the integrated virus when present in the DNA.
  • promoter or “gene promoter” used interchangeably with the terms ‘regulatory region’ or ‘regulatory sequence’ refers to the respective contiguous gene DNA sequence extending from 1 .5 kb upstream to 1 .5 kb downstream relative to the transcription start site (TSS), or contiguous portions thereof.
  • regulatory region refers to the respective contiguous gene DNA sequence extending from 1 .5 kb upstream to 0.5 kb downstream relative to the TSS.
  • ‘regulatory region’ refers to the respective contiguous gene DNA sequence extending from 1 .5 kb upstream to the downstream edge of a CpG island that overlaps with the region from 1 .5 kb upstream to 1 .5 kb downstream from TSS (and is such cases, my thus extend even further beyond 1.5 kb downstream), and contiguous portions thereof.
  • Change in DNA methylation on the gene promoters responsible for protein glycosylation can lead to an improvement of protein quality. Protein glycosylation is a critical quality attribute that modulate the efficacy, stability, and half-life of a therapeutic protein. It is desirable to obtain a consistent glycoform profile in protein production due to regulatory concerns.
  • DNA methylation can act as a tool to determine integrated viral DNA in a mammalian host cell.
  • a method of identifying integrated viral DNA in the genome of at least one mammalian test cell comprising the steps of:
  • step (b) determining a test methylation profile of the pre-selected CpG and non-CpG sites within the DNA sample obtained from the mammalian test cell; and wherein the presence of one or more pre-selected CpG and non-CpG sites after step (a) and hypermethylation in (b) is indicative of the viral DNA being integrated in the mammalian test cell but the virus being inactive respectively; the presence of one or more pre-selected CpG and non-CpG sites after step (a) and baseline methylation or hypomethylation is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the one or more pre-selected CpG and non-CpG sites after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on an array.
  • pre-selected sites used interchangeably with “pre-selected CpG and non-CpG sites” refers to sites that were selected from genes or regions that showed the highest degree of methylation variation during the training of the method and fulfils certain quality criteria such as a minimum sequencing coverage of >5x were considered and for >5 qualified CpG sites. Additionally, genes that have an average methylation level ⁇ 0.1 or an average methylation level >0.9 can be excluded due to their limited dynamic range. “Reference methylation profiles” may be defined on the basis of multiple training samples using multivariate statistical methods, such as such as Principal Component analysis or Multi-Dimensional Scaling.
  • Step (c) is an optional as step (b) of measuring methylation status is sufficient to determine the presence of hypomethylation or hypermethylation of a CpG site which can provide information on the virus and protein production without doing a further comparison in step (c).
  • a DNA array-based method of identifying integrated viral DNA in at least one mammalian test cell comprising the steps of:
  • step (c) comparing the methylation status of the CpG site and/or non-CpG site from (b) with that of a mammalian control cell where the viral DNA is incorporated within the control cell, wherein the presence of the CpG site and/or non-CpG site after step (a) and hypermethylation in (c) is indicative of the viral DNA being incorporated in the mammalian test cell but the virus being inactive respectively; the presence of the CpG site and/or non-CpG site after step (a) and baseline methylation or hypomethylation is indicative of the viral DNA being incorporated in the mammalian test cell and the virus being active respectively; and the absence of the CpG site and/or non-CpG site after step (a) is indicative of the viral DNA not being incorporated in the mammalian test cell respectively.
  • the method according to any aspect of the present invention comprising the step of: (ii) contacting the DNA sample from the test cell with a DNA-based array.
  • the method according to any aspect of the present invention further comprising the step of: (i) performing bisulfite modification to the DNA sample before step (a).
  • contacting means bringing about direct contact between the genomic material sample and DNA-based array.
  • the genomic material sample may be DNA that is extracted from the biological sample from the test mammal, and this directly brought into contact with the probe in the DNA-based array.
  • a first CHO control cell where the viral DNA is integrated in the genome of the control cell and the first CHO control cell has optimal heterologous protein production;
  • a second CHO control cell where the viral DNA is integrated in the genome of the control cell and the first CHO control cell has no optimal heterologous protein production; wherein significant similarity of the test methylation status of the CpG site with the first CHO control cell is indicative of the test CHO cell having optimal heterologous protein production, or with the second CHO control cell is indicative of the test CHO cell not having optimal heterologous protein production, wherein difference of the test methylation status of the CpG site with the first CHO control cell is indicative of the test CHO cell not having optimal heterologous protein production, or with the second CHO control cell is indicative of the test CHO cell having optimal heterologous protein production.
  • heterologous protein production refers to the production of a protein which is not endogenous to the cell. It means an expression of a gene or part of a gene, particularly a transgene in a host CHO cell which does not naturally express this gene.
  • the assays that are commonly used to quantify heterologous protein production include enzyme-linked immunosorbent assay (ELISA), chromatography & bioprocess analyser.
  • host cell refers to a cellular system for the expression of heterologous protein. For example, CHO cells are the main hosts for the production of various therapeutic proteins.
  • therapeutic protein refers to genetically engineered versions of naturally occurring human proteins.
  • therapeutic proteins include antibody-based drugs, anticoagulants, blood factors, bone morphogenetic proteins, engineered protein scaffolds, enzymes, growth factors, hormones, interferons, interleukins and the like.
  • optimal heterologous protein production refers to CHO cells that are capable of high-level protein production, particularly during industrial production or large-scale production of recombinant proteins, where the protein is usually a functional protein that is not naturally occurring in the wild-type CHO cell.
  • a CHO cell line has minimized metabolic burdens and toxic effects to the cell.
  • optimal heterologous protein production refers to high level protein production where the CHO cell line not only produces a high yield of the protein of interest but also that the protein production is constantly maintained over the period of production (i.e., the prolonged period of culture) such that the quality of the protein produced is also consistent and maintained.
  • the cell must at least display one of more of the following phenotypes of interest: phenotypic homogeneity, protein productivity, and protein quality.
  • the CHO cell may comprise phenotypic homogeneity and protein productivity, or phenotypic homogeneity, and protein quality, or protein productivity, and protein quality, or phenotypic homogeneity, protein productivity, and protein quality and good protein titre.
  • Optimal heterologous protein productivity can also only happen when the cell (i.e. the CHO cell) is stable and has high cell survivability.
  • phenotypic homogeneity refers to a state when all the cells in a population exhibit the same phenotype under a certain condition.
  • protein productivity refers to a measure of the amount of protein made per viable cell at a single titer point. It is calculated by dividing the titer (mg) by the viable cell density (VCD or cells/ml), and the final measurement is represented as the amount of protein per cell (mg/cell).
  • protein quality refers to the posttranslational modification of the protein that determines the efficacy and function of the protein.
  • the modifications generally include phosphorylation, glycosylation, ubiquitination, methylation, acetylation, protein folding etc.
  • protein glycosylation is a critical quality attribute that modulates the efficacy, stability, and half-life of a therapeutic protein. Protein quality can be determined using Immunoprecipitation based techniques, Biochemical Assays, Mass spectrometry (MS) and the like.
  • cell survivability refers to the capability of a cell to be viable and perform cell proliferation.
  • Cell viability is a measure of the proportion of live cells within a population.
  • Cell proliferation refers to an increase in cell number due to cell division.
  • the assays that are commonly used to test cell survivability include BrdU Cell Proliferation Assay, MTT Cell Proliferation Assays, trypan blue cell counting, and ATP Cell Viability Assays.
  • cell exhaustion refers to the state of the cell where it loses its capability to perform metabolic activity including heterologous protein production. Cell exhaustion can be determined by Metabolite Detection Assays.
  • test profile is significantly similar to the pre-determined reference profile if more than 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 % of the methylation pattern/ profile overlaps with that of the reference profile.
  • a similarity of a test profile to more than one, such as two, three or even all reference profiles reduce the significance of the similarity.
  • pre-determined reference profile refers to a typical or standard methylation profile of the genomic material of a CHO cell line with a specific feature dependent on the context where the term is used.
  • predetermined reference profile refers to a typical or standard methylation profile of the genomic material of the CHO cell line displaying one or more of the phenotypic parameters selected from the group consisting of optimal glucose consumption, optimal growth rate, optimal lactic acid production, and optimal ammonia accumulation.
  • the pre-determined reference profile may be obtained from one or more reference CHO cell lines each expressing one or more phenotypic parameter.
  • the method according to this aspect of the present invention may use a methylation profile for a CHO cell line that has the potential for optimal heterologous protein production as the cell line may exhibit cell survivability, fitness, low cell exhaustion and good metabolic readouts.
  • the method according to this aspect of the present invention provides a prognostic methylation profile for ideal parental cell lines prior to transgene introduction. This methylation profile may be used as a control.
  • the cell according to any aspect of the present invention may be obtained from a biological sample selected from the group consisting of saliva, blood, brain, sperm and any other tissue or sample that provides genomic DNA.
  • the virus according to any aspect of the present invention may be from the Herpesviridae and/or Polyomaviridae genus.
  • the virus from the Herpesviridae genus is selected from the group consisting of Simplexvirus, Mardivirus, Varicellovirus, Lymphocryptovirus , Cytomegalovirus, Muromegalovirus, Roseolovirus, Lymphocryptovirus and Rhadinovirus
  • the virus from the Polyomaviridae genus is selected from the group consisting of Alphapolyomavirus , Betapolyomavirus, Deltapolyomavirus, Epsilonpolyomavirus, Gammapolyomavirus and Zetapolyomavirus.
  • the virus from the Herpesviridae genus is selected from the group consisting of Gallid alphaherpesvirus 2, Human Herpesvirus 1, Human Herpesvirus 2, Bovine Herpesvirus 1, Bovine Herpesvirus 5, Human Herpesvirus 3, Suid Herpesvirus 1, Human Herpesvirus 5, Murid Herpesvirus 1, Human Herpesvirus 6A, Human Herpesvirus 6B, Human Herpesvirus 7, Human Herpesvirus 4, Kaposi’s sarcoma-associated HV, and Murid Herpesvirus 4; and the virus from the Polyomavirus genus is selected from the group consisting of Human polyomavirus 5, Human polyomavirus 8, Human polyomavirus 9, Human polyomavirus 12, Human polyomavirus 13, Human polyomavirus 1, Human polyomavirus 2, Human polyomavirus 3, Human polyomavirus 4, Human polyomavirus 6, Human polyomavirus 7, Human polyomavirus 10, Human polyomavirus 11, and Human polyomavirus
  • the mammalian cell may be an immortalised cell line derived from the mammal.
  • CHO cells have Cytomegalovirus Major Immediate-Early and enhancer (CMV) promoter and simian vacuolating virus 40 (SV40) promoters which are the most frequently used promoters for the production of recombinant proteins in CHO cells.
  • CMV Cytomegalovirus Major Immediate-Early and enhancer
  • SV40 simian vacuolating virus 40
  • CHO DG44 (Thermo Fisher) is a parental CHO cell line which does not contact any viral promoter integrated into the genome.
  • Humira431 A*STAR Bioprocessing Technology Institute
  • Humira431 is a transgenic cell line derived from CHO DG44 cells which express biosimilar Humira from the modified CMV promoter.
  • DNA was extracted using the PureLink Genomic DNA Isolation Minikit kit (Invitrogen), including RNAase treatment following the manufacturer's instructions. DNA quantity was measured by PicoGreen assay and DNA quality is assessed via NanoDrop (Thermo Scientific) to ensure the A260/280 ratio is ⁇ 1 .8. A small amount of sample was then also analysed using automated electrophoresis on TapeStation (Agilent) to ensure each sample contains high molecular weight DNA.
  • the customized chip array data processing was performed in R version 4.1 .2 using sesame version 1 .14.2.
  • DNA methylation level for each site was calculated as methylation p-value.
  • Beta values were defined as methylated signal/(methylated signal + unmethylated signal). It can be computed using getBetas function.
  • the SeSAMe pipeline (Zhou et al. 2018) was used to generate normalized p-values and for quality control. Low intensity- based detection calling and making (based on p-value) was done with pOOBAH. Background subtraction based on normal-exponential deconvolution using out-of-band probes noob (Triche et al. 2013) and optionally with extra bleed-through subtraction were also implemented.
  • CHO Humira431 cell line contains CMV vector while CHO DG44 does not. The result shows that only CHO Humira431 cells can be detected by the probes targeted to CMV. In total, there are 538 CMV probes on TALOS1 . Table 1 lists the number of viral probes that can be successfully detected.
  • Table 1 Number of successfully detected CMV probes in 3 CHO DG44 samples and 3 CHO Humira431 samples
  • the protein production variability in CHO cells has been associated with DNA methylation mediated regulation of Cytomegalovirus Major Immediate-Early and enhancer (CMV) promoter and simian vacuolating virus 40 (SV40) promoters which are the most frequently used promoters for the production of recombinant proteins in CHO cells.
  • CMV Cytomegalovirus Major Immediate-Early and enhancer
  • SV40 simian vacuolating virus 40
  • CHO Humira431 cell line contains CMV vector while CHO DG44 does not.
  • Table 2 lists all the probes specific to CMV that can be used for CMV detection together with the beta values in CHO Humira431 samples.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Virology (AREA)
  • Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention is related to a method of identifying integrated viral DNA in the genome of at least one mammalian test cell, the method comprising the steps of: (a) determining presence or absence of viral DNA in a DNA sample obtained from the mammalian test cell; and (b) determining the methylation status of at least one CpG site in the viral DNA of the DNA sample obtained from the mammalian test cell, wherein the presence of the viral DNA after step (a) and hypermethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being inactive respectively; the presence of the viral DNA after step (a) and baseline methylation or hypomethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the viral DNA after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on an array.

Description

DETECTION OF VIRAL PRESENCE IN CELLS
FIELD OF THE INVENTION
The present invention relates to a method that can detect the presence or absence of a virus and/or promoter thereof and the possible activity of the virus in the cell. In particular, the method quantitatively measures the level of DNA methylation on the CpG sites of the viral DNA in the cells.
BACKGROUND OF THE INVENTION
Viral infection results in a variety of human diseases including cancer. Upon infecting host cells, the viruses hijack cellular machinery to promote their replication and produce viral particle progeny. To promote the long-term persistence of the virus in the cell, some viruses integrate their genomic DNA into the host genome which results in several consequences such as gene disruption, chromosome instability, gene mutation and oncogenesis. Such viral DNA which is integrated into the host genome is known as provirus. A provirus passively gets replicated along with the host genome; thus, it gets passed down from one generation to the next. Integration of viral DNA into the host genome may lead to a latent infection or a productive infection.
In case of a productive infection, the new virus particles will be produced via transcription of the provirus to generate mRNA and thus spreading infection to other cells. Latent infection on the other hand, refers to a state where the provirus is transcriptionally silent, however, the provirus gets activated to start transcription in response to changes in the host's environmental conditions. Hence, the host cell’s protein synthesis machinery is hijacked to generate more viruses leading to the destruction of the host cell.
Accordingly, it is necessary not to only check the presence or absence of viral DNA in a host cell but also predict the viral activity in the host cell to determine the seriousness of the viral presence. Further, in cell line assessment, vaccine development, patient samples etc as a part of quality control, it is an absolute necessity to check for viral DNA integration.
Viral vectors are prevalently used to express transgene for gene therapy, cell line development, and therapeutic protein production. A viral vector is designed by substituting a set of the viral gene responsible for viral replication and pathogenicity with the transgene cassette. Gene transfer using viral vectors results in a high and long-term expression of the transgene. The viral promoter and enhancer are the major DNA regulatory elements in the viral vector that determines the transgene expression level in the cells. The integration sites of the viral vector are susceptible to transcriptional regulation via epigenetic regulation such as histone modifications and DNA methylation. For example, the DNA methylation status of the viral vector is an important factor for protein production or expression stability in producer cells. An increase in DNA methylation on the CpG sites of the viral promoter results in transgene silencing at transcription levels. For example, the protein production variability in CHO cells has been associated with DNA methylation mediated regulation of Cytomegalovirus Major Immediate-Early and enhancer (CMV) promoter and simian vacuolating virus 40 (SV40) promoters which are the most frequently used promoters for the production of recombinant proteins in CHO cells.
However, as discussed above, the presence of viral DNA in a host cell is not representative of the host cell being in serious danger or being infected by the virus. The current methods used to test the presence or integration of viral DNA in a host cell, lack the ability to determine the potency of the presence of the virus on the survival of the host cell. Accordingly, there is a need in the art that quantitatively and qualitatively not only determines the presence of a virus in a host cell but also the potency of the DNA integration on the host cell.
DESCRIPTION OF THE INVENTION
The present invention solves the problems above by providing a DNA-based method for determining the level of DNA methylation on the CpG sites on the viral DNA and/or viral promoter which can then be used to not only predict the viral activity on the host cell but also the expression of a transgene in cells. In particular, the method according to any aspect of the present invention may be carried out on a DNA-based array. The method according to any aspect of the present invention may thus provide an efficient, cheap and accurate means of determining not only the presence of a virus in a cell but also the activity of the virus in the cell.
According to one aspect of the present invention, there is provided a method of identifying integrated viral DNA in the genome of at least one mammalian test cell, the method comprising the steps of:
(a) determining presence or absence of viral DNA in a DNA sample obtained from the mammalian test cell; and
(b) determining the methylation status of at least one CpG site in the viral DNA of the DNA sample obtained from the mammalian test cell, wherein the presence of the viral DNA after step (a) and hypermethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being inactive respectively; the presence of the viral DNA after step (a) and baseline methylation or hypomethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the viral DNA after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on an array.
The term “viral DNA integration” used synonymously with “integrated viral DNA” or “genome integrated viral DNA” refers to the incorporation of the whole genome or part of the genome of at least one virus into a host cell, particularly an animal cell, more particularly a mammalian cell. Viral DNA integration is a means of viral insertion that is not transient as the viral DNA is genomically integrated into the host cell, i.e. mammalian cell. Viral DNA integration is a potential diagnostic and prognostic marker. In particular, viral DNA integration is a unique enzymatic process where doublestranded linear viral DNA is inserted into the host genome in a catalysed by a virus-encoded integrase. Many of these viruses however lie dormant in the infected cell, although their DNA may be integrated into the DNA of the host cell chromosome. Some viruses, present in the host cells may also lead to the cell being malignant and eventually causing tumours. The family of Polyomaviruses, e.g. simian virus 40 (SV40), and viruses from the Retroviridae family and Herpesviridae family are examples of viruses that can also be associated with malignancy (causing death or illness) in the host cell. The viral DNA may be transformed from the viral RNA.
A “mammalian cell” as used herein refers to is a cell from any member of the order Mammalia which includes a cell from a mouse, a rat, a monkey, a guinea pig, a dog, a mini-pig, a human being, a cow, a sheep, a pig, a goat, a horse, a donkey, a mule, a hamster or the like. The mammalian cell may also include an established cell line or immortalized cell line. For example, the mammalian cell according to any aspect of the present invention may be a CHO cell line which refers to immortal Chinese Hamster Ovary cell line (CHO) derived from Cricetulus griseus. In particular, the CHO cell line may be selected from the group consisting of CHO-K1 (ATCC), CHO- DG44 (Thermo Fisher Scientific), CHO-DXB11 (ATCC), ExpiCHO-S™ cells (Thermo Fisher Scientific), Freestyle™ CHO-S™ cells (Thermo Fisher Scientific), CHO 1-15 [subscript 500] (ATCC), Agarabi CHO (ATCC), and a CHOK1 SV cell including all variants (e.g. POTELLIGENT®, Lonza, Slough, UK), a CHOK1SV GS-KO (glutamine synthetase knockout) cell including all variants (e.g., XCEED™ Lonza, Slough, UK). The mammalian cell may be from Baby Hamster Kidney fibroblasts (BHK (ATCC CCL-10), or Vero cell (ATCC CCL-81). Exemplary human cells include human embryonic kidney (HEK) cells, such as HEK293 (ATCC CRL-1573) , HEK 293T (ATCC CRL-3216), a HeLa cell (ATCC CCL-2), a NS0 cell (ECACC 85110503), or a Sp2/0 cell (ATCC CRL-1581). The mammalian cells according to any aspect of the present invention may include mammalian cell cultures which can be either adherent cultures or suspension cultures.
According to any aspect of the present invention, the first step involves:
(a) determining presence or absence of viral DNA in a DNA sample obtained from the mammalian test cell.
The presence or absence of a viral DNA in a DNA sample may be determined by establishing a viral titre in a biological sample, i.e. DNA from the test mammalian cell. As used herein “viral titre” refers to a numeric expression of the quantity of a virus in a given volume, generally expressed as viral particles, transducing units, or infections particles, per millilitre (mL).
The presence or absence of viral DNA in a DNA sample obtained from the mammalian test cell may be determined using a DNA-based array according to any aspect of the present invention. The DNA-based array according to any aspect of the present invention may then comprise probes that bind to any region (i.e. CpG and non-CpG sites) that is specific to the viral DNA. When a probe specific to at least one virus CpG or non CpG site binds to the DNA test sample, that is an indication of the presence of the viral DNA and/or the presence of the viral DNA being integrated into the DNA of the mammalian test cell. More in particular, the array according to any aspect of the present invention comprises probes that are specific to non-CpG sites that can detect the presence of integrated viral DNA in the genome of the DNA sample to be tested.
The presence of specific CpG and non-CpG sites in step (a) is indicative of the viral strain or variant thereof. The term ‘variant’ as used herein in respect to a virus refers to a virus strain that has changed most likely through mutation to a version that is different from the original virus. The term “variant” refers to a form of the virus strain that deviates from what occurs in nature or which is considered the wild type. The variant may be 50, 55, 60, 65, 70, 75, 80, 85, 90, 95% similar in sequence compared to the wild-type strain.
The second step according to any aspect of the present invention involves:
(b) determining the methylation status of at least one CpG site in the viral DNA of the DNA sample obtained from the mammalian test cell.
If in step (a) the presence and integration of the viral DNA or integrated viral DNA in the mammalian DNA is confirmed, then the methylation status of at least one of the CpGs of the integrated viral DNA may be determined from the DNA of the mammalian test cell.
As used herein, a “CpG site” or “methylation site” is a nucleotide within a nucleic acid (DNA or RNA) that is susceptible to methylation either by natural occurring events in vivo or by an event instituted to chemically methylate the nucleotide in vitro. Some of these sites may be hypermethylated and some may be hypomethylated in a cell.
As used herein, a “methylated nucleic acid molecule” refers to a nucleic acid molecule that contains one or more nucleotides that is/are methylated.
As used herein, a “methylated nucleotide” or a “methylated nucleotide base” refers to the presence of a methyl moiety on a nucleotide base, where the methyl moiety is usually not present in a recognized typical nucleotide base. For example, cytosine in its usual form does not contain a methyl moiety on its pyrimidine ring, but 5-methylcytosine contains a methyl moiety at position 5 of its pyrimidine ring. Therefore, cytosine in its usual form may not be considered a methylated nucleotide and 5-methylcytosine may be considered a methylated nucleotide. In another example, thymine may contain a methyl moiety at position 5 of its pyrimidine ring, however, for purposes herein, thymine may not be considered a methylated nucleotide when present in DNA. Typical nucleotide bases for DNA are thymine, adenine, cytosine and guanine. Typical bases for RNA are uracil, adenine, cytosine and guanine. Correspondingly a "methylation site" is the location in the target gene nucleic acid region where methylation has the possibility of occurring. For example, a location containing CpG is a methylation site wherein the cytosine may or may not be methylated. In particular, the term “methylated nucleotide” refers to nucleotides that carry a methyl group attached to a position of a nucleotide that is accessible for methylation. These methylated nucleotides are usually found in nature and to date, methylated cytosine that occurs mostly in the context of the dinucleotide CpG, but also in the context of CpNpG- and CpNpN-sequences may be considered the most common. In principle, other naturally occurring nucleotides may also be methylated but they will not be taken into consideration with regard to any aspect of the present invention.
A “CpG island” as used herein describes a segment of DNA sequence that comprises a functionally or structurally deviated CpG density. For example, Yamada et al. have described a set of standards for determining a CpG island: it must be at least 400 nucleotides in length, has a greater than 50% GC content, and an OCF/ECF ratio greater than 0.6 (Yamada et al., 2004, Genome Research, 14, 247-266). Others have defined a CpG island less stringently as a sequence at least 200 nucleotides in length, having a greater than 50% GC content, and an OCF/ECF ratio greater than 0.6 (Takai et al., 2002, Proc. Natl. Acad. Sci. USA, 99, 3740-3745). In context of the present invention, the terms “methylation profile”, “methylation pattern”, “methylation state” or “methylation status,” are used herein to describe the state, situation or condition of methylation of a genomic sequence, and such terms refer to the characteristics of a DNA segment at a particular genomic locus in relation to methylation. Such characteristics include, but are not limited to, whether any of the cytosine (C) residues within this DNA sequence are methylated, location of methylated C residue(s), percentage of methylated C at any particular stretch of residues, and allelic differences in methylation due to, e.g., difference in the origin of the alleles.
The term "methylation status" refers to the status of a specific methylation site (i.e. methylated vs. non-methylated) which means a residue or methylation site is methylated or not methylated. Then, based on the methylation status of one or more methylation sites, a methylation profile may be determined.
The term "methylation level" refers to the level of a specific methylation site which can range from 0 (=unmethylated) to 1 (= fully methylated). Thus, based on the methylation level of one or more methylation sites, a methylation profile may be determined. Accordingly, the term "methylation" profile" or also “methylation pattern” refers to the relative or absolute concentration of methylated C or unmethylated C at any particular stretch of residues in a biological sample. For example, if cytosine (C) residue(s) not typically methylated within a DNA sequence are more methylated in a sample, it may be referred to as "hypermethylated"; whereas if cytosine (C) residue(s) typically methylated within a DNA sequence are less methylated, it may be referred to as "hypomethylated". Likewise, if the cytosine (C) residue(s) within a DNA sequence (e.g., sample nucleic acid) are more methylated when compared to another sequence from a different region or from a different individual (e.g., relative to normal nucleic acid), that sequence is considered hypermethylated compared to the other sequence. Alternatively, if the cytosine (C) residue(s) within a DNA sequence are less methylated as compared to another sequence from a different region or from a different individual, that sequence is considered hypomethylated compared to the other sequence. These sequences are said to be "differentially methylated". For example, when the methylation status differs between inflamed and non-inflamed tissues, the sequences are considered "differentially methylated”. Measurement of the levels of differential methylation may be done by a variety of ways known to those skilled in the art. One method is to measure the methylation level of individual interrogated CpG sites determined by the bisulfite sequencing method, as a non-limiting example.
Bisulfite treatment’ of genomic DNA used interchangeably with the term ‘bisulfite modification’, refers to the treatment of the genomic DNA with a deaminating agent such as a bisulfite that may be used to treat all DNA, methylated or not. In particular, the term “bisulfite” as used herein encompasses any suitable type of bisulfite, such as sodium bisulfite, or other chemical agents that are capable of chemically converting a cytosine (C) to an uracil (U) without chemically modifying a methylated cytosine and therefore can be used to differentially modify a DNA sequence based on the methylation status of the DNA, e.g., U.S. Pat. Pub. US 2010/0112595. As used herein, a reagent that "differentially modifies" methylated or non-methylated DNA encompasses any reagent that modifies methylated and/or unmethylated DNA in a process through which distinguishable products result from methylated and non-methylated DNA, thereby allowing the identification of the DNA methylation status. Such processes may include, but are not limited to, chemical reactions (such as a C to U conversion by bisulfite) and enzymatic treatment (such as cleavage by a methylation-dependent endonuclease). Thus, an enzyme that preferentially cleaves or digests methylated DNA is one capable of cleaving or digesting a DNA molecule at a much higher efficiency when the DNA is methylated, whereas an enzyme that preferentially cleaves or digests unmethylated DNA exhibits a significantly higher efficiency when the DNA is not methylated.
Accordingly, before step (a) according to any aspect of the present invention is carried out, the genomic DNA contained/ obtained or extracted from the cell, is first bisulfite treated.
An alternative method available in the art may be used instead of bisulfite treatment. A skilled person will understand which other methods to use. In one example, TET-assisted pyridine borane sequencing (TAPS) may be used for detection of 5mC and 5hmC (Yibin Liu, et al., Nature Biotechnology, 37: 424-429 (2019).
As used herein, the term “genomic material” refers to nucleic acid molecules or fragments of the genome of the animal according to any aspect of the present invention. In particular, such nucleic acid molecules or fragments are DNA or RNA or hybrids thereof, and most preferably are molecules of the DNA genome of a subject or group of subjects.
The term ‘biological sample’ as used herein may be selected from the group consisting of muscle, organ tissue, milk, blood, brain, sperm and any other tissue or sample that provides genomic DNA to be used in the method according to any aspect of the present invention. In particular, the biological sample may comprise any biological material obtained from the subject that contains DNA, and may be liquid, solid or both, may be tissue or bone, or a body fluid such as blood, lymph, etc. In particular, the biological sample useful for the present invention may comprise biological cells or fragments thereof.
As used herein, the “DNA sample” refers to the DNA extracted from a cell of the mammal according to any aspect of the present invention using known methods in the art. In particular, in order to detect the presence or absence of integrated viral DNA in the genome of the DNA tested, the DNA used for the testing is purified high-molecular-weight genomic fragment of size ranging from 50 to 150 kb. Further washing steps removes low molecular weight DNA fragments which represents episomal DNA, virion DNA and cccDNA.
The term “test” used in conjunction with the term mammal or mammalian cell herein refers to the mammal or cell that is to be introduced to the array according to any aspect of the present invention and is the basis for an analysis application of the present invention to determine if the cell has viral DNA integrated in the test cell genome and whether this virus is active or not. A “test profile” is therefore from an (individual) subject or group of subjects being tested according to the invention or a profile being obtained or generated in this context. Similarly, the term ‘sample’ used in accordance with any aspect of the present invention refers to an entity that may be subject to the method of the present invention. Conversely, the term “reference” or ‘control’ shall denote, mostly predetermined, entities which are used for a comparison with the test entity. In particular, a sample may be extracted DNA from a test mammalian cell that may be subject to the method of the present invention to determine if there is viral DNA integrated in the mammalian cell and if the virus is active in the test cell by first determining the presence or absence of the virus in the cell and then determining the DNA methylation profile of the test cell. There may be a further step of comparing this test methylation profile with a control and a ‘control’ refers to a mammalian cell, possibly of the same taxon, species, or cell line where the features as mentioned above are already known and where the methylation status is already known and used as a reference.
The DNA methylation profile of step (a) and (b) according to any aspect of the present invention is determined using an array. In particular, a bead-based array. The array according to any aspect of the present invention is advantageous as it not only enables early detection of integrated viral DNA in a host mammalian cell but also helps in determining the effectiveness of a virus vector in a cell to be used for heterologous protein production. In particular, the method according to any aspect of the present invention may be used for a better understanding of genome stability of mammalian cell lines and enables better control over the manufacturing/ process development/ product development/ scaling up/ validation process, thereby aiding in the selection of better mammalian cell lines for industrial applications.
Arrays allow for a high-throughput and robust method to determine semi-quantitative/quantitative DNA-methylation information through a small sample of extracted DNA of interest. These custom designed arrays may use Illumina iScan and Infinium platform technology or an equivalent thereof, which allows on each chip for example 100,000 different bead types that covalently bind DNA- methylation probes. Each probe represents one CpG Methylation site at the end of the probe sequence. DNA samples undergo bisulfite conversion, amplification, fragmentation, precipitation and resuspension steps before hybridization on an array chip. Once on the chip the DNA hybridizes to the beads for each CpG site so that methylation changes at each site can be detected specifically through single nucleotide extension. This is especially advantageous as the arraybased method is simple and the results of the array are accurate and reproducible. Further, compared to traditional sequencing which can take weeks to generate data, the array technology has a much shorter turn-around time. The volume and complexity of data generated is lesser compared to sequencing making it computationally less intensive. This allows for quicker computation to achieve interpretable results from experimental groups. Overall microarray technology is roughly 10x faster and 10x cheaper than traditional sequencing while still quantifiable for the methylation level at specific CpG sites.
The term “array” as used herein refers to an intentionally created collection of probe molecules which can be prepared either synthetically or biosynthetically. The probe molecules in the array can be identical or different from each other. The array can assume a variety of formats, for example, libraries of soluble molecules; libraries of compounds tethered to resin beads, silica chips, or other solid supports.
In particular, an array provides a convenient platform for simultaneous analysis of large numbers of CpG sites, for example, at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 50, 100, 500, 1000, 5000, 10,000, 100,000 or more sites or loci. In particular, the array comprises a plurality of different probe molecules that can be attached to a substrate or otherwise spatially distinguished in an array. Examples of arrays that may be used according to any aspect of the present invention include slide arrays, silicon wafer arrays, liquid arrays, bead-based arrays and the like. In one example, array technology used according to any aspect of the present invention combines a miniaturized array platform, a high level of assay multiplexing, and scalable automation for sample handling and data processing.
In particular, the array according to any aspect of the present invention may be an array of arrays, also referred to as a composite array, having a plurality of individual arrays that is configured to allow processing of multiple samples simultaneously. Examples of composite arrays and the technology behind them are disclosed at least in US 6,429,027 and US 2002/0102578. A substrate of a composite array may include a plurality of individual array locations, each having a plurality of probes, and each physically separated from other assay locations on the same substrate such that a fluid contacting one array location is prevented from contacting another array location. Each array location can have a plurality of different probe molecules that are directly attached to the substrate or that are attached to the substrate via rigid particles in wells (also referred to herein as beads in wells).
In one example, an array substrate can be a fibre optical bundle or array of bundles as described in US6,023,540, US6,200,737 and/or US6,327,410. An optical fibre bundle or array of bundles can have probes attached directly to the fibres or via beads. A skilled person would be able to easily determine which substrate will be most suitable for the array according to any aspect of the present invention. W02004110246 further discloses other substrates and methods of attaching beads to the substrates that may be used in the array according to any aspect of the present invention.
In one example, a surface of the substrate may have physical alterations to enable the attachment of probes or produce array locations. For example, the surface of a substrate can be modified to contain chemically modified sites that are useful for attaching, either-covalently or non-covalently, probe molecules or particles having attached probe molecules. Probes may be attached using any of a variety of methods known in the art including, an ink-jet printing method, a spotting technique, a photolithographic synthesis method, or printing method utilizing a mask. W02004110246 discloses these techniques in more detail.
In one example, the array according to any aspect of the present invention may be a bead-based array, where the beads are associated with a solid support such as those commercially available from Illumina, Inc. (San Diego, Calif.). An array of beads useful according to any aspect of the present invention can also be in a fluid format such as a fluid stream of a flow cytometer or similar device. Commercially available fluid formats for distinguishing beads include, for example, those used in XMAP(TM) technologies from Luminex or MPSS(TM) methods from Lynx Therapeutics.
The term “solid support”, “support”, and “substrate” as used herein are used interchangeably and refer to a material or group of materials having a rigid or semi-rigid surface or surfaces. In many examples, at least one surface of the solid support will be substantially flat, although in some examples it may be desirable to physically separate synthesis regions for different compounds with, for example, wells, raised regions, pins, etched trenches, or the like.
The array or microarray according to any aspect of the present invention may be a very high- density array, for example, those having from about 10,000,000 probes/cm2 to about 2,000,000,000 probes/cm2 or from about 100,000,000 probes/cm2 to about 1 ,000,000,000 probes/cm2. High density arrays are especially useful according to any aspect of the present invention for including the multitude of CpG sites on the array.
The array according to any aspect of the present invention may be used to analyse or evaluate such pluralities of loci simultaneously or sequentially as desired. In one example, a plurality of different probe molecules can be attached to a substrate or otherwise spatially distinguished in an array. Each probe is typically specific for a particular locus and can be used to distinguish methylation state of the locus.
The term “probe molecules” or ‘probes’ as used interchangeably herein refers to a surface- immobilized molecule that can be recognized by a particular target. Probes used in the array can be specific for the methylated allele of a CpG site, the non-methylated allele of the CpG site or both or for the methylated allele of a non-CpG site, the non-methylated allele of the non-CpG site or both.
The term “target” as used herein refers to a molecule that has an affinity for a given probe molecule. Targets may be naturally occurring or man-made molecules. Also, they can be employed in their unaltered state or as aggregates. Targets may be attached, covalently or noncovalently, to a binding member, either directly or via a specific binding substance. Examples of targets which can be employed according to any aspect of the present invention are methylated and non- methylated CpG sites. Targets are sometimes referred to in the art as anti-probes. As the term targets is used herein, no difference in meaning is intended.
The term “complementary” as used herein refers to the hybridization or base pairing between nucleotides or nucleic acids, such as, for instance, between the two strands of a double stranded DNA molecule or between an oligonucleotide primer and a primer binding site on a single stranded nucleic acid to be sequenced or amplified. Complementary nucleotides are, generally, A and T (or A and U), or C and G. Two single stranded RNA or DNA molecules are said to be complementary when the nucleotides of one strand, optimally aligned and compared and with appropriate nucleotide insertions or deletions, pair with at least about 80% of the nucleotides of the other strand, usually at least about 90% to 95%, and more preferably from about 98 to 100%. Perfectly complementary refers to 100% complementarity over the length of a sequence. For example, a 25- base probe is perfectly complementary to a target when all 25 bases of the probe are complementary to a contiguous 25 base sequence of the target with no mismatches between the probe and the target over the length of the probe.
The presence of the viral DNA after step (a) according to any aspect of the present invention and hypermethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being inactive respectively; and the presence of the viral DNA after step (a) and baseline methylation or hypomethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the viral DNA after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively.
The term “hypermethylation” refers to the average methylation state corresponding to an increased presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG dinucleotides within a normal control DNA sample.
The term “hypomethylation” refers to the average methylation state corresponding to a decreased presence of 5-mCyt at one or a plurality of CpG dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt found at corresponding CpG dinucleotides within a normal control DNA sample.
A virus is considered to be in an active state when the virus is not only integrated in the host mammalian cell but also capable of replication and protein production. In order to promote the longterm persistence of the virus in the cell, some viruses integrate their genomic DNA into the host genome which results in several consequences such as gene disruption, chromosome instability, gene mutation and oncogenesis. Such viral DNA which is integrated into the host genome is known as provirus. A provirus passively gets replicated along with the host genome, thus it gets passed from one generation to the next. Integrated viral DNA might lead to a latent infection or a productive infection. In case of productive infection, the new virus particles will be produced via transcription of the provirus to generate mRNA and thus infect other cells. 'Latent infection’ as used herein refers to a state where the provirus is transcriptionally silent, however, the provirus gets activated to start transcription in response to changes in the host's environmental conditions. Hence, the host cell’s protein synthesis machinery is hijacked to generate more viruses leading to the destruction of its host cell. Thus, it is an absolute necessity to check for viral DNA integration or integrated viral DNA for cell line assessment, vaccine development, patient samples etc. as a part of quality control. The converse is a true for a virus in an non-active state.
Viral promoters are rich in CpG sites which make them more prone to DNA methylation and thus suppressing the protein expression. Accordingly, studying DNA methylation of viruses and their promoters can be used to accurately determine the presence or absence of viral integration and activity of the integrated virus when present in the DNA.
As used herein, the terms “promoter” or “gene promoter” used interchangeably with the terms ‘regulatory region’ or ‘regulatory sequence’ refers to the respective contiguous gene DNA sequence extending from 1 .5 kb upstream to 1 .5 kb downstream relative to the transcription start site (TSS), or contiguous portions thereof. In particular, ‘regulatory region’ refers to the respective contiguous gene DNA sequence extending from 1 .5 kb upstream to 0.5 kb downstream relative to the TSS. In some examples, ‘regulatory region’ refers to the respective contiguous gene DNA sequence extending from 1 .5 kb upstream to the downstream edge of a CpG island that overlaps with the region from 1 .5 kb upstream to 1 .5 kb downstream from TSS (and is such cases, my thus extend even further beyond 1.5 kb downstream), and contiguous portions thereof. Change in DNA methylation on the gene promoters responsible for protein glycosylation can lead to an improvement of protein quality. Protein glycosylation is a critical quality attribute that modulate the efficacy, stability, and half-life of a therapeutic protein. It is desirable to obtain a consistent glycoform profile in protein production due to regulatory concerns. Hence, DNA methylation can act as a tool to determine integrated viral DNA in a mammalian host cell.
According to another aspect of the present invention, there is provided a method of identifying integrated viral DNA in the genome of at least one mammalian test cell, the method comprising the steps of:
(a) determining presence or absence of one or more pre-selected CpG and non-CpG sites of the viral DNA in a DNA sample obtained from the mammalian test cell; and/or
(b) determining a test methylation profile of the pre-selected CpG and non-CpG sites within the DNA sample obtained from the mammalian test cell; and wherein the presence of one or more pre-selected CpG and non-CpG sites after step (a) and hypermethylation in (b) is indicative of the viral DNA being integrated in the mammalian test cell but the virus being inactive respectively; the presence of one or more pre-selected CpG and non-CpG sites after step (a) and baseline methylation or hypomethylation is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the one or more pre-selected CpG and non-CpG sites after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on an array.
As used herein, the term “pre-selected sites” used interchangeably with “pre-selected CpG and non-CpG sites” refers to sites that were selected from genes or regions that showed the highest degree of methylation variation during the training of the method and fulfils certain quality criteria such as a minimum sequencing coverage of >5x were considered and for >5 qualified CpG sites. Additionally, genes that have an average methylation level <0.1 or an average methylation level >0.9 can be excluded due to their limited dynamic range. “Reference methylation profiles” may be defined on the basis of multiple training samples using multivariate statistical methods, such as such as Principal Component analysis or Multi-Dimensional Scaling.
According to any aspect of the present invention, there may be a further step:
(c) comparing the methylation status of the CpG site from (b) with that of a mammalian control cell where the viral DNA is integrated within the control cell.
Step (c) is an optional as step (b) of measuring methylation status is sufficient to determine the presence of hypomethylation or hypermethylation of a CpG site which can provide information on the virus and protein production without doing a further comparison in step (c).
According to a further aspect of the present invention, there is provided a DNA array-based method of identifying integrated viral DNA in at least one mammalian test cell, the method comprising the steps of:
(a) determining presence of at least one viral CpG site and/or non-CpG site in a DNA sample obtained from the mammalian test cell; and
(b) determining the methylation status of multiple CpG sites and/or non-CpG sites in the DNA sample obtained from the mammalian test cell, and
(c) comparing the methylation status of the CpG site and/or non-CpG site from (b) with that of a mammalian control cell where the viral DNA is incorporated within the control cell, wherein the presence of the CpG site and/or non-CpG site after step (a) and hypermethylation in (c) is indicative of the viral DNA being incorporated in the mammalian test cell but the virus being inactive respectively; the presence of the CpG site and/or non-CpG site after step (a) and baseline methylation or hypomethylation is indicative of the viral DNA being incorporated in the mammalian test cell and the virus being active respectively; and the absence of the CpG site and/or non-CpG site after step (a) is indicative of the viral DNA not being incorporated in the mammalian test cell respectively.
The method according to any aspect of the present invention comprising the step of: (ii) contacting the DNA sample from the test cell with a DNA-based array.
The method according to any aspect of the present invention further comprising the step of: (i) performing bisulfite modification to the DNA sample before step (a). The term ‘contacting’, as used herein, means bringing about direct contact between the genomic material sample and DNA-based array. For example, the genomic material sample may be DNA that is extracted from the biological sample from the test mammal, and this directly brought into contact with the probe in the DNA-based array.
The method according to any aspect of the present invention, wherein the mammalian cell is a CHO cell line and there is a further step (c):
(c) comparing the methylation status of the CpG site and/or non-CpG site in the viral DNA from (b) with that of:
(i) a first CHO control cell where the viral DNA is integrated in the genome of the control cell and the first CHO control cell has optimal heterologous protein production; and/or
(ii) a second CHO control cell where the viral DNA is integrated in the genome of the control cell and the first CHO control cell has no optimal heterologous protein production; wherein significant similarity of the test methylation status of the CpG site with the first CHO control cell is indicative of the test CHO cell having optimal heterologous protein production, or with the second CHO control cell is indicative of the test CHO cell not having optimal heterologous protein production, wherein difference of the test methylation status of the CpG site with the first CHO control cell is indicative of the test CHO cell not having optimal heterologous protein production, or with the second CHO control cell is indicative of the test CHO cell having optimal heterologous protein production.
The term ‘heterologous protein production’ as used herein refers to the production of a protein which is not endogenous to the cell. It means an expression of a gene or part of a gene, particularly a transgene in a host CHO cell which does not naturally express this gene. The assays that are commonly used to quantify heterologous protein production include enzyme-linked immunosorbent assay (ELISA), chromatography & bioprocess analyser. The term ‘host cell’ as used herein refers to a cellular system for the expression of heterologous protein. For example, CHO cells are the main hosts for the production of various therapeutic proteins.
As used herein, the term ‘therapeutic protein’ refers to genetically engineered versions of naturally occurring human proteins. Examples of therapeutic proteins include antibody-based drugs, anticoagulants, blood factors, bone morphogenetic proteins, engineered protein scaffolds, enzymes, growth factors, hormones, interferons, interleukins and the like.
The term ‘optimal heterologous protein production’ herein refers to CHO cells that are capable of high-level protein production, particularly during industrial production or large-scale production of recombinant proteins, where the protein is usually a functional protein that is not naturally occurring in the wild-type CHO cell. In particular, for optimal heterologous protein production a CHO cell line has minimized metabolic burdens and toxic effects to the cell. More in particular, ‘optimal heterologous protein production’ refers to high level protein production where the CHO cell line not only produces a high yield of the protein of interest but also that the protein production is constantly maintained over the period of production (i.e., the prolonged period of culture) such that the quality of the protein produced is also consistent and maintained. In particular, for a CHO cell according to any aspect of the present invention to be capable of ‘optimal heterologous protein production’, the cell must at least display one of more of the following phenotypes of interest: phenotypic homogeneity, protein productivity, and protein quality. More in particular, for ‘optimal heterologous protein production’, the CHO cell may comprise phenotypic homogeneity and protein productivity, or phenotypic homogeneity, and protein quality, or protein productivity, and protein quality, or phenotypic homogeneity, protein productivity, and protein quality and good protein titre. Optimal heterologous protein productivity can also only happen when the cell (i.e. the CHO cell) is stable and has high cell survivability.
As used herein, the term ‘phenotypic homogeneity’ refers to a state when all the cells in a population exhibit the same phenotype under a certain condition.
The term ‘protein productivity’ as used herein refers to a measure of the amount of protein made per viable cell at a single titer point. It is calculated by dividing the titer (mg) by the viable cell density (VCD or cells/ml), and the final measurement is represented as the amount of protein per cell (mg/cell).
The term ‘protein quality’ refers to the posttranslational modification of the protein that determines the efficacy and function of the protein. The modifications generally include phosphorylation, glycosylation, ubiquitination, methylation, acetylation, protein folding etc. For example, protein glycosylation is a critical quality attribute that modulates the efficacy, stability, and half-life of a therapeutic protein. Protein quality can be determined using Immunoprecipitation based techniques, Biochemical Assays, Mass spectrometry (MS) and the like.
As used herein, the term ‘cell survivability’ refers to the capability of a cell to be viable and perform cell proliferation. Cell viability is a measure of the proportion of live cells within a population. Cell proliferation refers to an increase in cell number due to cell division. The assays that are commonly used to test cell survivability include BrdU Cell Proliferation Assay, MTT Cell Proliferation Assays, trypan blue cell counting, and ATP Cell Viability Assays.
As used herein, the term ‘cell exhaustion’ refers to the state of the cell where it loses its capability to perform metabolic activity including heterologous protein production. Cell exhaustion can be determined by Metabolite Detection Assays.
The term “significantly similar” as used herein, and in particular in context with the comparison of methylation profiles (such as the comparison between test profiles (from test subject(s) and reference profiles) shall mean a similarity observed by statistical means (i.e. by using bioinformatics) and/or also by observation using the eye. A significant similarity is observed for example if a test profile overlaps with a reference profile that is defined by multiple training samples through multivariate statistical methods, such as Principal Component analysis or Multi- Dimensional Scaling. In particular, a test profile is significantly similar to the pre-determined reference profile if more than 50, 55, 60, 65, 70, 75, 80, 85, 90, 95 % of the methylation pattern/ profile overlaps with that of the reference profile. A similarity of a test profile to more than one, such as two, three or even all reference profiles reduce the significance of the similarity.
The term “pre-determined reference profile” as used herein refers to a typical or standard methylation profile of the genomic material of a CHO cell line with a specific feature dependent on the context where the term is used. In one example, for a method of determining a CHO cell line that displays at least one phenotypic parameter according to any aspect of the present invention conferring the potential of optimal heterologous protein production on the cell line, the term “predetermined reference profile” refers to a typical or standard methylation profile of the genomic material of the CHO cell line displaying one or more of the phenotypic parameters selected from the group consisting of optimal glucose consumption, optimal growth rate, optimal lactic acid production, and optimal ammonia accumulation. The pre-determined reference profile may be obtained from one or more reference CHO cell lines each expressing one or more phenotypic parameter.
The method according to this aspect of the present invention may use a methylation profile for a CHO cell line that has the potential for optimal heterologous protein production as the cell line may exhibit cell survivability, fitness, low cell exhaustion and good metabolic readouts. In particular, the method according to this aspect of the present invention provides a prognostic methylation profile for ideal parental cell lines prior to transgene introduction. This methylation profile may be used as a control.
The cell according to any aspect of the present invention may be obtained from a biological sample selected from the group consisting of saliva, blood, brain, sperm and any other tissue or sample that provides genomic DNA.
The virus according to any aspect of the present invention may be from the Herpesviridae and/or Polyomaviridae genus. In particular, the virus from the Herpesviridae genus is selected from the group consisting of Simplexvirus, Mardivirus, Varicellovirus, Lymphocryptovirus , Cytomegalovirus, Muromegalovirus, Roseolovirus, Lymphocryptovirus and Rhadinovirus and the virus from the Polyomaviridae genus is selected from the group consisting of Alphapolyomavirus , Betapolyomavirus, Deltapolyomavirus, Epsilonpolyomavirus, Gammapolyomavirus and Zetapolyomavirus. The virus from the Herpesviridae genus is selected from the group consisting of Gallid alphaherpesvirus 2, Human Herpesvirus 1, Human Herpesvirus 2, Bovine Herpesvirus 1, Bovine Herpesvirus 5, Human Herpesvirus 3, Suid Herpesvirus 1, Human Herpesvirus 5, Murid Herpesvirus 1, Human Herpesvirus 6A, Human Herpesvirus 6B, Human Herpesvirus 7, Human Herpesvirus 4, Kaposi’s sarcoma-associated HV, and Murid Herpesvirus 4; and the virus from the Polyomavirus genus is selected from the group consisting of Human polyomavirus 5, Human polyomavirus 8, Human polyomavirus 9, Human polyomavirus 12, Human polyomavirus 13, Human polyomavirus 1, Human polyomavirus 2, Human polyomavirus 3, Human polyomavirus 4, Human polyomavirus 6, Human polyomavirus 7, Human polyomavirus 10, Human polyomavirus 11, and Human polyomavirus 14. In particular, the virus may be Human Herpesvirus 5 (also known as human cytomegalovirus), Murid Herpesvirus 1 (also known as murine cytomegalovirus), or Simian virus 40.
According to a further aspect of the present invention, there is provided a use of DNA-based array to determine:
(a) the presence of integrated viral DNA in a mammalian cell; and
(b) methylation status of at least one CpG site of the virus, thereby determining the status of the virus.
The mammalian cell may be an immortalised cell line derived from the mammal.
According to yet another aspect of the present invention, there is provided a DNA-based array for determining
(a) the presence of integrated viral DNA in a mammalian cell; and
(b) methylation status of at least one CpG site of the virus, thereby determining the status of the virus.
EXAMPLES
The foregoing describes preferred embodiments, which, as will be understood by those skilled in the art, may be subject to variations or modifications in design, construction or operation without departing from the scope of the claims. These variations, for instance, are intended to be covered by the scope of the claims.
Example 1
Detection of viral promoter in CHO
CHO cells have Cytomegalovirus Major Immediate-Early and enhancer (CMV) promoter and simian vacuolating virus 40 (SV40) promoters which are the most frequently used promoters for the production of recombinant proteins in CHO cells. CHO DG44 (Thermo Fisher) is a parental CHO cell line which does not contact any viral promoter integrated into the genome. Humira431 (A*STAR Bioprocessing Technology Institute) cell line is a transgenic cell line derived from CHO DG44 cells which express biosimilar Humira from the modified CMV promoter.
DNA Extraction
DNA was extracted using the PureLink Genomic DNA Isolation Minikit kit (Invitrogen), including RNAase treatment following the manufacturer's instructions. DNA quantity was measured by PicoGreen assay and DNA quality is assessed via NanoDrop (Thermo Scientific) to ensure the A260/280 ratio is < 1 .8. A small amount of sample was then also analysed using automated electrophoresis on TapeStation (Agilent) to ensure each sample contains high molecular weight DNA.
Bisulfite Conversion and BeadChip Analysis The genomic DNA samples were then subjected to bisulfite conversion using the EZ DNA Methylation-Gold™ Kit (Zymo Research). The methylation levels were then quantified using our customized methylation BeadChip kits (Illumina) which can analyze over 50,000 methylation sites quantitatively across the genome at single-nucleotide resolution.
Data processing:
The customized chip array data processing was performed in R version 4.1 .2 using sesame version 1 .14.2. DNA methylation level for each site was calculated as methylation p-value. Beta values were defined as methylated signal/(methylated signal + unmethylated signal). It can be computed using getBetas function. The SeSAMe pipeline (Zhou et al. 2018) was used to generate normalized p-values and for quality control. Low intensity- based detection calling and making (based on p-value) was done with pOOBAH. Background subtraction based on normal-exponential deconvolution using out-of-band probes noob (Triche et al. 2013) and optionally with extra bleed-through subtraction were also implemented.
Percentage of viral probes which pass the p-value form the data indicated the presence and type of viral integrated in the CHO genome.
Results
3 CHO DG44 samples and 3 CHO Humira431 samples were run on TALOS. CHO Humira431 cell line contains CMV vector while CHO DG44 does not. The result shows that only CHO Humira431 cells can be detected by the probes targeted to CMV. In total, there are 538 CMV probes on TALOS1 . Table 1 lists the number of viral probes that can be successfully detected.
Table 1 : Number of successfully detected CMV probes in 3 CHO DG44 samples and 3 CHO Humira431 samples
Figure imgf000018_0001
Example 2
Quantifying the methylation levels of viral promoters in CHO genomes
The protein production variability in CHO cells has been associated with DNA methylation mediated regulation of Cytomegalovirus Major Immediate-Early and enhancer (CMV) promoter and simian vacuolating virus 40 (SV40) promoters which are the most frequently used promoters for the production of recombinant proteins in CHO cells. An increase in DNA methylation on the CpG sites of the viral promoter results in transgene silencing at transcription levels DNA extraction, bisulphite conversion, BeadChip analysis, quality control, data processing and differential methylation analysis were carried out as outlined in Example 1 . Analysing the level of DNA methylation on the CpG sites on the viral promoter can predict the expression of a transgene in CHO cells.
3 CHO DG44 samples and 3 CHO Humira431 samples were run on TALOS. CHO Humira431 cell line contains CMV vector while CHO DG44 does not. Table 2 lists all the probes specific to CMV that can be used for CMV detection together with the beta values in CHO Humira431 samples.
Table 2: Beta value of CMV probes in 3 CHO Humira431 samples
Figure imgf000019_0001
Figure imgf000019_0002
Figure imgf000020_0001
Figure imgf000021_0001
Figure imgf000021_0002
Figure imgf000022_0001
Figure imgf000023_0002
Figure imgf000023_0001
Figure imgf000024_0002
Figure imgf000024_0001

Claims

1 . A method of identifying integrated viral DNA in the genome of at least one mammalian test cell, the method comprising the steps of:
(a) determining presence or absence of viral DNA in a DNA sample obtained from the mammalian test cell; and
(b) determining the methylation status of at least one CpG site in the viral DNA of the DNA sample obtained from the mammalian test cell, wherein the presence of the viral DNA after step (a) and hypermethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being inactive respectively; the presence of the viral DNA after step (a) and baseline methylation or hypomethylation in step (b) is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the viral DNA after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on an array.
2. The method according to claim 1 , wherein presence of specific CpG and non-CpG sites in step (a) is indicative of the viral strain or variant thereof.
3. A method of identifying integrated viral DNA in the genome of at least one mammalian test cell, the method comprising the steps of:
(a) determining presence or absence of one or more pre-selected CpG and/or non-CpG sites of the viral DNA in a DNA sample obtained from the mammalian test cell; and/or
(b) determining a test methylation profile of the pre-selected CpG and/or non-CpG sites within the DNA sample obtained from the mammalian test cell; and wherein the presence of one or more pre-selected CpG and/or non-CpG sites after step (a) and hypermethylation in (b) is indicative of the viral DNA being integrated in the mammalian test cell but the virus being inactive respectively; the presence of one or more pre-selected CpG and/or non-CpG sites after step (a) and baseline methylation or hypomethylation is indicative of the viral DNA being integrated in the mammalian test cell and the virus being active respectively; and the absence of the one or more pre-selected CpG and/or non-CpG sites after step (a) is indicative of the viral DNA not being integrated in the mammalian test cell respectively; and wherein the method is carried out on a DNA- based array.
4. The method according to any one of the preceding claims, wherein the mammal is a mouse, a rat, a guinea pig, a dog, a mini-pig, a human being, a cow, a sheep, a pig, a goat, a horse, a donkey, a mule, and a hamster.
5. The method according to any one of the preceding claims, wherein the mammalian cell is an immortalised cell line derived from the mammal.
6. The method according to claim 5, wherein the immortalised cell line is selected from the group consisting of CHO, BHK, Vero, HEK293, HEK 293T, HeLa cell, NSO cell, Sp2/0 cell, and derivatives thereof.
7. The method according to any one of the preceding claims, wherein the mammalian cell is a CHO cell line.
8. The method according to claim 7, wherein there is a further step (c):
(c) comparing the methylation status of the CpG site in the viral DNA from (b) with that of
(i) a first CHO control cell where the viral DNA is integrated in the genome of the control and the first CHO control cell has optimal heterologous protein production;
(ii) a second CHO control cell where the viral DNA is integrated in the genome of the control cell and the second CHO control cell has no optimal heterologous protein production; wherein significant similarity of the test methylation status of the CpG site with the first CHO control cell is indicative of the test CHO cell having optimal heterologous protein production, or with the second CHO control cell is indicative of the test CHO cell not having optimal heterologous protein production, wherein difference of the test methylation status of the CpG site with the first CHO control cell is indicative of the test CHO cell not having optimal heterologous protein production, or with the second or CHO control cell is indicative of the test CHO cell having optimal heterologous protein production.
9. The method according to any one of the preceding claims, wherein the integrated viral DNA is from at least one virus from the Herpesviridae and/or Polyomaviridae genus.
10. The method according to claim 9, wherein the virus from the Herpesviridae genus is selected from the group consisting of Simplexvirus, Mardivirus, Varicellovirus, Lymphocryptovirus , Cytomegalovirus, Muromegalovirus, Roseolovirus, Lymphocryptovirus and Rhadinovirus and the virus from the Polyomaviridae genus is selected from the group consisting of Alphapolyomavirus , Betapolyomavirus, Deltapolyomavirus, Epsilonpolyomavirus, Gammapolyomavirus and Zetapolyomavirus.
11 . The method according to claim 9 or 10, wherein
- the virus from the Herpesviridae genus is selected from the group consisting of Gallid alphaherpesvirus 2, Human Herpesvirus 1, Human Herpesvirus 2, Bovine Herpesvirus 1, Bovine Herpesvirus 5, Human Herpesvirus 3, Suid Herpesvirus 1, Human Herpesvirus 5, Murid Herpesvirus 1, Human Herpesvirus 6A, Human Herpesvirus 6B, Human Herpesvirus 7, Human Herpesvirus 4, Kaposi’s sarcoma-associated HV, and Murid Herpesvirus 4; and
- the virus from the Polyomavirus genus is selected from the group consisting of Human polyomavirus 5, Human polyomavirus 8, Human polyomavirus 9, Human polyomavirus 12, Human polyomavirus 13, Human polyomavirus 1, Human polyomavirus 2, Human polyomavirus 3, Human polyomavirus 4, Human polyomavirus 6, Human polyomavirus 7, Human polyomavirus 10, Human polyomavirus 11, and Human polyomavirus 14.
12. The method according to any one of the previous claims, wherein the virus is Human Herpesvirus 5 (also known as human cytomegalovirus), Murid Herpesvirus 1 (also known as murine cytomegalovirus), or Simian virus 40.
13. Use of DNA- bead based array in any one of the claims 1 to 12 to determine:
(a) the presence of integrated viral DNA in a mammalian cell; and
(b) methylation status of at least one CpG site of the virus, thereby determining the status of the virus.
14. Use of claim 13, wherein the mammalian cell is an immortalised cell line derived from the mammal.
15. DNA bead-based array for use in the method of any one of the claims 1 to 12, for determining
(a) the presence of integrated viral DNA in a mammalian cell; and
(b) methylation status of at least one CpG site of the virus, thereby determining the status of the virus.
PCT/EP2024/054932 2023-03-02 2024-02-27 Detection of viral presence in cells Pending WO2024180050A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
AU2024227981A AU2024227981A1 (en) 2023-03-02 2024-02-27 Detection of viral presence in cells
KR1020257032659A KR20250154495A (en) 2023-03-02 2024-02-27 Detection of virus presence in cells
CN202480029675.0A CN121039296A (en) 2023-03-02 2024-02-27 Detection of the presence of a virus in a cell

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP23159541.4 2023-03-02
EP23159541 2023-03-02

Publications (1)

Publication Number Publication Date
WO2024180050A1 true WO2024180050A1 (en) 2024-09-06

Family

ID=85415121

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2024/054932 Pending WO2024180050A1 (en) 2023-03-02 2024-02-27 Detection of viral presence in cells

Country Status (5)

Country Link
KR (1) KR20250154495A (en)
CN (1) CN121039296A (en)
AU (1) AU2024227981A1 (en)
TW (1) TW202503071A (en)
WO (1) WO2024180050A1 (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6023540A (en) 1997-03-14 2000-02-08 Trustees Of Tufts College Fiber optic sensor with encoded microspheres
US6200737B1 (en) 1995-08-24 2001-03-13 Trustees Of Tufts College Photodeposition method for fabricating a three-dimensional, patterned polymer microstructure
US6327410B1 (en) 1997-03-14 2001-12-04 The Trustees Of Tufts College Target analyte sensors utilizing Microspheres
US20020102578A1 (en) 2000-02-10 2002-08-01 Todd Dickinson Alternative substrates and formats for bead-based array of arrays TM
US6429027B1 (en) 1998-12-28 2002-08-06 Illumina, Inc. Composite arrays utilizing microspheres
WO2004110246A2 (en) 2003-05-15 2004-12-23 Illumina, Inc. Methods and compositions for diagnosing conditions associated with specific dna methylation patterns
WO2009062604A2 (en) * 2007-11-02 2009-05-22 Magnus Von Knebel Doeberitz Compounds and methods associated with differential methylation of human papilloma virus genomes in epithelial cells
US20100112595A1 (en) 2004-11-08 2010-05-06 Life Technologies Corporation Bisulfite Conversion Reagent
WO2011128377A1 (en) * 2010-04-14 2011-10-20 F. Hoffmann-La Roche Ag Method for the selection of a long-term producing cell
US20140235484A1 (en) * 2013-02-19 2014-08-21 The Johns Hopkins University Methods for detection and differentiation of origin of viral dna
WO2016205572A1 (en) * 2014-06-16 2016-12-22 JBS Science Inc. Method and kit for detecting hbv dna
WO2017019751A1 (en) * 2015-07-27 2017-02-02 The Johns Hopkins University Method to use viral and host methylation markers for cervical cancer screening and triage in liquid prep, serum/plasma, and urine: pcr and sequencing based process methods

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6200737B1 (en) 1995-08-24 2001-03-13 Trustees Of Tufts College Photodeposition method for fabricating a three-dimensional, patterned polymer microstructure
US6023540A (en) 1997-03-14 2000-02-08 Trustees Of Tufts College Fiber optic sensor with encoded microspheres
US6327410B1 (en) 1997-03-14 2001-12-04 The Trustees Of Tufts College Target analyte sensors utilizing Microspheres
US6429027B1 (en) 1998-12-28 2002-08-06 Illumina, Inc. Composite arrays utilizing microspheres
US20020102578A1 (en) 2000-02-10 2002-08-01 Todd Dickinson Alternative substrates and formats for bead-based array of arrays TM
WO2004110246A2 (en) 2003-05-15 2004-12-23 Illumina, Inc. Methods and compositions for diagnosing conditions associated with specific dna methylation patterns
US20100112595A1 (en) 2004-11-08 2010-05-06 Life Technologies Corporation Bisulfite Conversion Reagent
WO2009062604A2 (en) * 2007-11-02 2009-05-22 Magnus Von Knebel Doeberitz Compounds and methods associated with differential methylation of human papilloma virus genomes in epithelial cells
WO2011128377A1 (en) * 2010-04-14 2011-10-20 F. Hoffmann-La Roche Ag Method for the selection of a long-term producing cell
US20140235484A1 (en) * 2013-02-19 2014-08-21 The Johns Hopkins University Methods for detection and differentiation of origin of viral dna
WO2016205572A1 (en) * 2014-06-16 2016-12-22 JBS Science Inc. Method and kit for detecting hbv dna
WO2017019751A1 (en) * 2015-07-27 2017-02-02 The Johns Hopkins University Method to use viral and host methylation markers for cervical cancer screening and triage in liquid prep, serum/plasma, and urine: pcr and sequencing based process methods

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
HOELZER K. ET AL: "Presence and role of cytosine methylation in DNA viruses of animals", vol. 36, no. 9, 26 March 2008 (2008-03-26), GB, pages 2825 - 2837, XP093070920, ISSN: 0305-1048, Retrieved from the Internet <URL:https://academic.oup.com/nar/article-lookup/doi/10.1093/nar/gkn121> DOI: 10.1093/nar/gkn121 *
SUTTER DIANE ET AL: "Methylation of integrated adenovirus type 12 DNA sequences in transformed cells is inversely correlated with viral gene expression (differential DNA methylation/Hpa II/ Msp I isoschizomers/Southem blot technique)", vol. 77, no. 1, 1 January 1980 (1980-01-01), pages 253 - 256, XP093070893, Retrieved from the Internet <URL:https://www.pnas.org/doi/pdf/10.1073/pnas.77.1.253> DOI: 10.1073/pnas.77.1.253 *
TAKAI ET AL., PROC. NATL. ACAD. SCI. USA, vol. 99, 2002, pages 3740 - 3745
TÓTH RENÁTA ET AL: "Methylation Status of the Adeno-Associated Virus Type 2 (AAV2)", VIRUSES, vol. 11, no. 1, 9 January 2019 (2019-01-09), pages 38, XP093058222, DOI: 10.3390/v11010038 *
YAMADA ET AL., GENOME RESEARCH, vol. 14, 2004, pages 247 - 266
YIBIN LIU ET AL., NATURE BIOTECHNOLOGY, vol. 37, 2019, pages 424 - 429

Also Published As

Publication number Publication date
CN121039296A (en) 2025-11-28
TW202503071A (en) 2025-01-16
AU2024227981A1 (en) 2025-10-09
KR20250154495A (en) 2025-10-28

Similar Documents

Publication Publication Date Title
Wyman et al. Post-transcriptional generation of miRNA variants by multiple nucleotidyl transferases contributes to miRNA transcriptome complexity
US20230088159A1 (en) Compositions and methods for assessing immune response
EP2660331B1 (en) Method for single cell genome analysis and kit therefor
Ning et al. Quantitative assessment of single-cell whole genome amplification methods for detecting copy number variation using hippocampal neurons
EP3802856A1 (en) Dna methylation biomarker of aging for human ex vivo and in vivo studies
US11492658B2 (en) Sequence conversion and signal amplifier DNA cascade reactions and detection methods using same
Doerfler et al. Inheritable epigenetic response towards foreign DNA entry by mammalian host cells: a guardian of genomic stability
US20210277458A1 (en) Methods, systems, and aparatus for nucleic acid detection
Pichon et al. Analysis and annotation of DNA methylation in two nonhuman primate species using the Infinium Human Methylation 450K and EPIC BeadChips
Weber et al. Destabilization of the human epigenome: consequences of foreign DNA insertions
Oh et al. A digital PCR-based protocol to detect and quantify RNA editing events at hotspots
AU2024227981A1 (en) Detection of viral presence in cells
KR20250057855A (en) Method for assessing protein production in CHO cells
US9574188B2 (en) DNA restriction library tagging and analysis
Al-Turkmani et al. Molecular assessment of human diseases in the clinical laboratory
US20240117414A1 (en) Sequence conversion and signal amplifier dna having abasic nucleic acids, and detection methods using same
CN108330187B (en) Method and kit for quantitatively detecting X chromosome in cell
CN111278988A (en) Endosialin (CD248) as an epigenetic marker for the identification of immune cells, especially naive CD8+ T cells
Kouroukli Allele-specific epigenome editing: development and clinical application
WO2025108794A1 (en) Predicting the fitness of a cho cell
WO2016199094A1 (en) Mirna aberration(s) in squamous cell carcinoma of head and neck (hnscc) and applications thereof
TW202540441A (en) Predicting the fitness of a cho cell
Patiño-Guillén et al. Quantification of disease-associated RNA tandem repeats by nanopore sensing
TW202541042A (en) A method of classifying cho cells
CN105671186A (en) Gene for assessing human mathematical ability, and detection kit and detection method thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24707198

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: AU2024227981

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: KR1020257032659

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 202517094690

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2024707198

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2024227981

Country of ref document: AU

Date of ref document: 20240227

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 1020257032659

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 11202505887X

Country of ref document: SG

WWP Wipo information: published in national office

Ref document number: 11202505887X

Country of ref document: SG

WWP Wipo information: published in national office

Ref document number: 202517094690

Country of ref document: IN

ENP Entry into the national phase

Ref document number: 2024707198

Country of ref document: EP

Effective date: 20251002

ENP Entry into the national phase

Ref document number: 2024707198

Country of ref document: EP

Effective date: 20251002

ENP Entry into the national phase

Ref document number: 2024707198

Country of ref document: EP

Effective date: 20251002