[go: up one dir, main page]

WO2013113921A1 - Procédé permettant d'obtenir des informations génomiques, transcriptomiques, protéomiques et/ou métabolomiques à partir d'un échantillon biologique unique - Google Patents

Procédé permettant d'obtenir des informations génomiques, transcriptomiques, protéomiques et/ou métabolomiques à partir d'un échantillon biologique unique Download PDF

Info

Publication number
WO2013113921A1
WO2013113921A1 PCT/EP2013/052134 EP2013052134W WO2013113921A1 WO 2013113921 A1 WO2013113921 A1 WO 2013113921A1 EP 2013052134 W EP2013052134 W EP 2013052134W WO 2013113921 A1 WO2013113921 A1 WO 2013113921A1
Authority
WO
WIPO (PCT)
Prior art keywords
rna
data
biological sample
purifying
separating
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/EP2013/052134
Other languages
English (en)
Inventor
Paul WILMES
Nikolaos VLASSIS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Universite du Luxembourg
Original Assignee
Universite du Luxembourg
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Universite du Luxembourg filed Critical Universite du Luxembourg
Publication of WO2013113921A1 publication Critical patent/WO2013113921A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
    • G16B5/20Probabilistic models
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Definitions

  • the invention relates to methods for generating, analysing and/or modeling data in the framework of integrated molecular systems biology investigations of biological samples.
  • the invention also relates to methods for generating data representative of metabolite, protein, RNA and DNA components from single unique biological samples.
  • Heterogeneous cell populations are found in all biological systems, e.g. in mixed microbial communities or in cancerous human tumours. At the present time, it is not possible to study such heterogeneous cell populations in an integrated way to fully understand the biology at the system-level or to establish multifactorial molecular profiles (biomarkers) of such heterogeneous cell populations.
  • a method for combined metabolomic, proteomic and transcriptomic analysis from one single sample and suitable statistical evaluation and correlation analysis is known from EP1327883. According to this method, at least two compound classes from the group of metabolites, proteins and RNA are extracted from the same sample, identified and quantified. The data obtained are then combined to perform network analysis.
  • this document does not disclose how to provide concomitant data useful for genomic analysis from the same single biological sample. It has to be noted that access to concomitant genomic (genetic) information is of paramount importance when studying heterogeneous cell populations as in general the genetic context is unknown in such systems.
  • genomic information will be largely unknown for a mixed microbial community sampled from any eco-system.
  • genomic information is often also unknown in cancer tumours, i.e. the extent of genetic heterogeneity resulting from an accumulation of mutations is unknown, as well as the specific types of mutations are also often unknown.
  • the mere study of mRNA through transcriptomic techniques is not sufficient to establish good population profiles. Firstly, it is not sufficient because all RNA variant transcripts do not show the same transcription levels. Consequently, it is not possible to get information on the relative importance of each cell population in the sample or of the different mutations.
  • the invention solves at least one problem encountered in prior art in providing to the skilled man a method to obtain integrative molecular data from biomolecular components including genomic DNA, RNA, proteins and metabolites from single unique samples to identify meaningful associations between different biomolecules and/or genetic variants in heterogeneous cell populations.
  • the invention relates to a method for obtaining integrated genomic, transcriptomic, proteomic and metabolomic data from a biological sample characterised by the steps of:
  • biomolecular components comprising metabolites, genomic DNA, RNA and proteins
  • separating and purifying the biomolecular componentsfrom the biological sample comprises performing mechanical homogenization and partial lysis of the biological sample such that only a partial amount of the cells are lysed;
  • the step of separating and purifying biomolecular components comprises separating and purifying both polar and non-polar metabolites and/or a step of isolation of large and small RNA.
  • the transcriptomic analyses include a sequencing of small RNA in order to obtain information including microRNA.
  • the step of performing genomics includes obtaining epigenomic data.
  • the step of performing genomics further includes performing SNP genotyping to obtain SNP patterns.
  • the step of performing analysis and/or modeling of the molecular data includes a data preparation step, with preference said data preparation step includes at least one of (a) performing data filtering according to certain cut-offs, (b) performing missing value estimation (c) performing variable selection, and (d) any combination thereof.
  • the step of performing analysis and/or modeling of the molecular data includes at least one of (a) calculating linear or non-linear correlations between one or more of the datasets and/or variables identified within the datasets with preference by use of one or more correlation coefficients; (b) performing clustering;
  • PCA principal component analysis
  • ICA independent component analysis
  • SVD singular value decomposition
  • MDS multidimensional scaling
  • the step of performing analysis and/or modelling of the molecular data includes determining relationships between one or more of the datasets and/or variables identified within the datasets using alternative dependence measures with preference selected from mutual information and/or partial correlation.
  • the step of performing analysis and/or modeling of the molecular data includes determining relationships between one or more of the datasets and/or variables identified within the datasets using one of more of (a) probabilistic models for capturing the dependencies among set of variables with preference including determining probabilistic graphical models and/or copulas, (b) estimation techniques selected from likelihood maximisation and convex programming.
  • the single biological sample is collected and immediately snap-frozen at a temperature of below -70°C and with preference at a preferred temperature of - 196°C, and cryopreserved at said temperature until said step of separating and purifying biomolecular components.
  • the mechanical homogenization and partial lysis in the step of separating and purifying of biomolecular components is followed by a second mechanical lysis step in which all remaining intact cells are lysed.
  • the step of separating and purifying of biomolecular components from a single biological sample comprises the following sub-steps:
  • the partial lysis is halted when about 30 to 80 % and preferably about 50 % of cells have been lysed.
  • the steps of separating and purifying biomolecular components comprising metabolites, genomic DNA, RNA and proteins from a single biological sample; and the step of performing genomic, transcriptomic, proteomic and metabolomic analyses to identify and/or quantify said biomolecular components; are carried out taking several temporally and/or spatially-resolved samples.
  • the heterogeneous cell populations are selected from tumour cells and/or mixed microbial communities.
  • the method further comprises the step of comparing molecular data to databases of biological profiles to establish and/or quantify cell population profiles.
  • the invention relates to a method for obtaining integrated genomic, transcriptomic, proteomic and/or metabolomic data from single unique biological samples that entails the steps of:
  • biomolecular components including metabolites, genomic DNA, RNA and proteins from single unique biological samples
  • biomolecular components include intracellular components, or both intracellular and extracellular components.
  • the method includes performing genomic, transcriptomic, proteomic and metabolomic analyses on a single unique biological sample to identify and/or quantify said biomolecular components, resulting in molecular data.
  • the step of separating and purifying biomolecular components comprises separating and purifying both polar and non-polar metabolites.
  • the step of separating and purifying biomolecular components comprises a step of isolation of small RNA.
  • the single biological sample is collected and immediately snap-frozen at a temperature of less than -70°C and with preference at a preferred temperature of -196 °C, and cryopreserved at said temperature until the step of separating and purifying biomolecular components.
  • the step of separating and purifying biomolecular components from a single biological sample comprises the following sub-steps:
  • step (b) performing a metabolite extraction on the homogenised single unique biological sample from step (a) by addition of a phase separation solution, and centrifugation to form an upper phase, an interphase pellet and a lower phase; such that polar metabolites are in the upper phase, genomic DNA, RNA and proteins and the remaining cells not lysed by the mechanical lysis are in the interphase pellet, and non- polar metabolites are in the lower phase;
  • step (a) performing a sequential isolation of genomic DNA, RNA and proteins on the lysate.
  • the mechanical lysis of step (a) is halted when about 30 to 80 % and preferably about 50 % of cells have been lysed.
  • step (e) further comprises the steps of
  • step (e-2) Applying the solution of step (e-1 ) to a first chromatographic spin column under conditions for genomic DNA, large RNA and part of the proteins to bind, and for obtaining a flowthrough;
  • step (e-4) Applying the flowthrough of step (e-3) to a second chromatographic spin- column under conditions for small RNA to bind and for obtaining a flowthrough;
  • step (e-6) Collecting the flowthrough of step (e-4) and adjusting the pH with preference to pH 3,
  • step (e-7) Applying the pH adjusted flowthrough of step (e-4) to the first chromatographic spin-column;
  • the step of separating and purifying biomolecular components comprises separating and purifying non-polar metabolites and/or a step of isolation of small RNA.
  • the step of separating and purifying biomolecular components from a single biological sample comprises the following sub-steps: a) submiting the single biological sample, combined in an unhomogenised form with an extraction and partitioning solution containing amphipathic organic solvents, to alternating low and ultra-high hydrostatic pressure such that the majority of cells are lysed, with preference more than 90 % of the cells are lysed;
  • step (b) performing extraction by centrifugation of the sample of step (a) by centrifugation to form an upper phase, an interphase pellet and a lower phase; such that non-polar metabolites are in the upper phase, genomic DNA and RNA are in the interphase pellet, and proteins are in the lower phase;
  • amphipathic organic solvent is fluorinated alcohol such as hexafluoroisopropanol.
  • the steps of separating and purifying biomolecular components comprising metabolites, genomic DNA, RNA and proteins are carried out on a single biological sample; and the steps of performing at least one of genomic, transcriptomic, proteomic and/or metabolomic analyses to identify and/or quantify said biomolecular components; are carried out taking several time- and/or spatially- resolved samples.
  • a single or several biological samples comprise heterogeneous cell populations for which the genomic information and/or context is unknown.
  • Heterogeneous cell populations comprise for example tumour cells and/or mixed microbial communities.
  • several spatially and/or temporally resolved samples are collected from the same organism and/or environment.
  • the invention relates to a method for profiling a biological system comprising the steps of:
  • the plurality of datasets results from at least one of genomic, transcriptomic, proteomic and/or metabolomic analyses of biomolecular components separated and purified from at least two biological samples of a biological system.
  • biomolecular components include metabolites, genomic DNA, RNA and/or proteins from single unique biological samples.
  • metabolites include both polar and non-polar metabolites.
  • the step of performing transcriptomic analyses includes the sequencing of small RNA in order to obtain among others microRNA data.
  • the step of performing genomics may include the provision for obtaining epigenetic (epigenomic) information or data.
  • the step of performing genomics may further include performing SNP genotyping to obtain SNP patterns.
  • the step of performing transcriptomics may include performing analysis and quantification of post-transcriptional modifications of RNA transcripts.
  • the step of performing proteomics may further include performing analysis and quantification of proteolysis and/or of post-translational modifications.
  • the step of performing analysis and/or modeling of the molecular data includes one or more of the following approaches:
  • - performing a data preparation step that may involve filtering of the data according to certain cut-offs, estimating missing values and/or performing variable selection for identifying relevant features in the data;
  • PCA principal component analysis
  • ICA independent component analysis
  • SVD singular value decomposition
  • MDS multidimensional scaling
  • correlation coefficients for example the Pearson correlation coefficient, the Spearman correlation coefficient and/or the Kendall correlation coefficient. It is reminded that correlation refers to any of broad class of statistical relationships involving dependence between two or more variables, or two or more sets of data;
  • probabilistic graphical models allow one to model the stochastic dependencies of a set of variables in a form of a graph, while copulas allow one to model and estimate the joint probability distribution of sets of variables by separately estimating the marginals of each variable and their dependence structure;
  • the method further comprises the step of comparing molecular data to databases of biological information to establish and/or quantify cell population profiles.
  • Another object of the invention is an article of manufacture having computer- readable media with computer readable instructions embodied therein for performing the method according to first, second and/or third embodiment.
  • the method subject of the present invention represents a clear improvement over prior art techniques in providing data about all biomolecular components including polar and non-polar metabolites, genomic DNA (including the ability to resolve epigenetic imprints), small and large RNA (including posttranscriptional modifications) and proteins for network analyses.
  • the method subject of the present invention provides data about all biomolecular components including non-polar metabolites, genomic DNA (including the ability to resolve epigenetic imprints), small and large RNA (including posttranscriptional modifications) and proteins for network analyses. Since there was previously no possibility to extract concomitant genomic DNA,
  • RNA, proteins and metabolites in particular including both polar and non-polar metabolites
  • the data obtained by the method of the invention should facilitate meaningful data analysis and modelling.
  • any biomolecular associations e.g. between metabolite data, gene expression data and genetic information data, indeed reflect the actual processes in the cells under investigation.
  • the generation of integrative biomolecular data is an essential requirement for obtaining comprehensive views of biomolecular networks and understanding how perturbations, e.g. diseases, impact such networks.
  • the methodological framework according to the invention allows fine-scale patient stratification (an important requirement for personalised medicine) because of its ability to reflect all biomolecular information layers and may improve the discovery of new (multifactorial) biomarkers.
  • the methodological framework according to the invention forms the basis for the development of meaningful molecular models as well as inter- and intra-system comparative analyses.
  • the invention also allows the study of small non-coding RNA and in particular microRNA and to relate their abundances, to the protein level and/or to the underlying genetic information.
  • microRNA is involved in the normal functioning of eukaryotic cells, the dysregulation of microRNA complements has recently been associated with disease.
  • certain microRNAs have been linked to some forms of cancer, e.g. miRNA-21. Therefore it is of interest to be able to study from the same sample microRNA data in combination with information on the genomic DNA, RNA, proteins and/or metabolites.
  • the invention also allows the study of epigenomic modifications in relation to protein and/or metabolite expression levels.
  • Epigenetic modifications that are heritable and reversible, have been found to play an important role in gene expression and regulation.
  • Epigenetic modifications are also involved in numerous cellular processes such as cell differentiation and tumorigenesis.
  • the invention also allows the study of post-transcriptional modifications, i.e. modifications of eukaryotic mRNA, tRNA or other RNAs made after transcription.
  • the invention also allows the study of post-translational modifications, i.e. the chemical modifications of proteins following translation.
  • cryo-milling is equivalent to cryogenic grinding, freezer milling and/or freezer grinding and refers to the act of cooling or chilling a material and then reducing it to a small particle size.
  • partial lysis refers to a lysis process of cells conducted in order to be incomplete such that a significant fraction of the cells is not lysed.
  • incriminate cell type means that all cells are concerned independent of e.g. their morphology, identity, chemical composition, etc.
  • phase separation solution is a mixture comprising polar solvents and non-polar solvents.
  • phase separation is a process by which a single phase separates into two or more new phases, the new phases being liquid and/or solid.
  • genomic DNA refers to the genetic instruction given by the deoxyribonucleic acid chains in the cells.
  • genomic refers to the entirety of an organism's hereditary information.
  • epigenome refers to the set of epigenetic modification on the genetic material of a cell such as DNA methylations and/or histone modifications.
  • SNP single-nucleotide polymorphism
  • RNA refers to any ribonucleic acids that can be present in cells contained in the biological source material, including viral RNA.
  • small RNA refers to RNA below 200 nucleotides such as micro
  • RNA i.e. post-transcriptional regulators
  • tRNA transfer RNA that intervenes within the synthesis of protein molecules
  • large RNA refers to RNA above 200 nucleotides such as mRNA (i.e. transcripts from protein-encoding genes) and rRNA (i.e. forms part of ribosomes which are the site of protein synthesis in all living cells).
  • total RNA refers to a mixture of both small RNA ( ⁇ 200 nt) and large RNA (>200 nt).
  • large RNA fraction refers to an RNA fraction comprising mainly large RNA.
  • transcriptome refers to the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non-coding RNA produced in one or a population of cells.
  • proteins refers to any molecules comprising amino acids connected via peptide bonds that are present in the biological source material.
  • proteome refers to the entire set of proteins expressed by a genome, cell, tissue or organism at a given time.
  • metabolites refers to any intermediate or product resulting from metabolism, i.e. from physical and chemical processes involved in the maintenance and reproduction of life in which nutrients are broken down to generate energy and simpler molecules which themselves may be used to form more complex molecules.
  • polar metabolites refers to all hydrophilic metabolites showing a capacity to interact with polar solvents, in particular with water or with other polar groups, such as sugars, amino acids, organic acid, etc.
  • non-polar metabolites refers to all hydrophobic metabolites having a tendency to dissolve in non-polar solvents, such as lipophilic compounds, lipids, waxes, chlorophyll, etc.
  • metabolome refers to the complete set of metabolites, such as metabolic intermediates or end products of metabolism, hormones and other signaling molecules to be found within a biological sample.
  • integrated data or “integrated information” refers to data or information resulting from omic analyses and further submitted to a step of analysis and/or modeling.
  • biomolecular component refers to biomolecules and is used as a synonym of "biomolecular complement”.
  • FIG. 1 shows a flowchart highlighting the methodological framework according to the invention
  • FIG. 2 shows a flowchart of the biomolecular isolation method flowchart according to the invention
  • figure 3 is a Pearson correlation network of microbial genera and detected metabolite features across 16 distinct microbial community samples obtained according to the method of the invention.
  • the method for obtaining integrated genomic, transcriptomic, proteomic and/or metabolomic data from a biological sample according to the invention includes the steps of:
  • biomolecular components comprising metabolites, genomic DNA, RNA and proteins from a single biological sample
  • Step 1 separating and purifying biomolecular components
  • the method provides an access to all biomolecular components in a standardised way.
  • a biological sample is collected and is immediately snap-frozen at a temperature minimum of -70°C and preferably -196°C, in order to stop inherent enzymatic activity.
  • the biological sample comprises heterogeneous cell populations, and is selected with preference from e.g. tumour tissues and/or mixed microbial communities; and/or cell population for which the genomic information or context is unknown.
  • the sample is cryopreserved at said temperature until applying the biomolecular extraction protocol according to the invention.
  • the sample is then submitted to a cryogenic homogenisation and lysis step.
  • this step is performed at -80°C at least, in an oscillating mill at a frequency of about 30 Hz during at least 2 min.
  • Cells for example microbial cells
  • the cryo-milling step is performed in order that about 30 to 60 %, or 30 to 70%, or 30 to 80 % and with preference about 50 % of the cells are lysed indiscriminately.
  • the mechanical nature of the first lysis step does not involve chemical products such as chaotropic agents that may affect non-polar metabolites.
  • halting the mechanical step before the lysis of the cells is complete avoids excessive milling and helps to preserve high quality and high molecular weight DNA, so the later isolated DNA is usable for genomic analyses.
  • the partial nature of the lysis step allows obtaining a representative protein fraction by preserving a part of the cells intact, independent of their morphology or their identity, during the metabolite extraction.
  • proteins of the lysed cells that are released with the first mechanical step and which show hydrophobic properties may dissolve in the non- polar solvents used during the metabolite extraction, and may therefore be lost for proteomic analyses.
  • to perform the first mechanical lysis step at very low temperature helps in preserving the respective biomolecules as a molecular snapshot of the time of sampling.
  • Metabolites are first extracted in a phase separation step.
  • a phase separation solution comprising a mixture of methanol, chloroform and water in the proportion of one volume of methanol, one volume of water and two volumes of chloroform is added to the cryo-milled sample.
  • the ratio of chloroform (or other non-polar solvents considered) in the mixture can be lowered by the skilled man if the sample is not particularly rich in lipids. Conversely, where the sample is expected to be rich in polar metabolites, the ratio of methanol (or other polar solvent) may be adjusted accordingly.
  • the sample mixed with the solvent mixture is then homogenised and centrifuged. Polar and non-polar metabolites are extracted separately as they are solubilised either in the polar or in the non-polar phase.
  • the centrifugation step allows a separation into three phases: an upper phase comprising polar metabolites, an interphase pellet comprising genomic DNA, large and small RNA, proteins and non-lysed cells, and a lower phase comprising non-polar metabolites.
  • an upper phase comprising polar metabolites
  • an interphase pellet comprising genomic DNA, large and small RNA, proteins and non-lysed cells
  • a lower phase comprising non-polar metabolites.
  • metabolomic studies are performed on the lower and upper phases.
  • the addition of the phase solution and the following homogenisation step are performed at a temperature below 0°C in order to protect RNA and DNA from degradation.
  • a combined mechanical and chemical lysis step is performed on the interphase pellet, which contains all remaining cellular constituents, or components.
  • the lysis solution comprises with preference Tris-EDTA and a lysis buffer, ⁇ - mercaptoethanol, for example added at 10 ⁇ per ml of lysis buffer, is added to the interphase together with the lysis solution to preserve RNA integrity.
  • ⁇ - mercaptoethanol for example added at 10 ⁇ per ml of lysis buffer, is added to the interphase together with the lysis solution to preserve RNA integrity.
  • differential nucleic acid and protein isolation is carried out using chromatographic spin-columns.
  • the method involves for differential nucleic acid and protein isolation the commercially available "All-in-One Purification Kit” (Total RNA, microRNA, total proteins and genomic DNA) from Norgen Biotek Corp.
  • This kit is based on the differential binding of biomolecules and their subsequent elution from chromatographic spin columns. The skilled man will use this kit because it provides the possibility to separately isolate small RNA ( ⁇ 200 nt) comprising microRNA from the total RNA fraction. Access to the small RNA fraction is a requirement for full and comprehensive integrative omic measurements.
  • the lysate is mixed with ethanol (for example 100 ⁇ of pure ethanol) and is applied to a first mineral support (first column) under conditions for genomic DNA, large RNA and part of the proteins to bind.
  • ethanol for example 100 ⁇ of pure ethanol
  • first column first mineral support
  • genomic DNA, large RNA and part of the proteins to bind.
  • the enhanced quantity of ethanol added results in higher efficiency of nucleic acid binding.
  • Large RNA, genomic DNA and a small part of proteins bind to the column while the flowthrough fraction contains unbound proteins and small
  • RNA The flowthrough fraction is collected.
  • the flowthrough is then applied to a second mineral support (second column) under conditions for small RNA to bind, allowing the purification of RNA smaller than 200 nt (small RNA fraction).
  • the flowthrough of the second mineral support is collected since it contains the proteins. Genomic DNA and large RNA are sequentially washed off the first mineral support and eluted two times with dedicated solutions, resulting in large RNA and genomic
  • DNA fractions The flowthrough containing proteins collected from the second mineral support is adjusted to pH 3 and is applied to the first mineral support under conditions to have the remaining proteins to bind together with the first bound proteins. Then the bound proteins are finally washed and eluted two times from the first mineral support, resulting in the protein fraction.
  • the method involves for differential nucleic acid and protein isolation the commercially available AHPrep ® DNA/RNA/Protein Mini kit (Qiagen, Valencia, CA). This kit is also based upon differential binding properties of biomolecules and their subsequent elution from chromatographic spin columns.
  • RNA smaller than 200 nt small and microRNAs
  • the lysate is first passed through a QIAshredder column which allows selective binding of genomic DNA and this is eluted in a further step using a dedicated buffer, resulting in the DNA fraction.
  • Ethanol for example 400 ⁇ for 600 ⁇ of initially added buffer modified with ⁇ - mercaptoethanol
  • RNA is eluded using a dedicated buffer, resulting in the RNA fraction.
  • An aqueous protein precipitation solution is added to the flow-through for the isolation of the total protein fraction.
  • the protein pellet is then re-dissolved in the designated buffer, resulting in the protein fraction.
  • the method involves for differential nucleic acid and protein isolation known methods using chromatographic spin-columns allowing the isolation of a total RNA fraction comprising small RNA and large RNA. Additionally a small RNA fraction can be obtained separately by any known isolation procedure.
  • the first and second mineral support are porous or non-porous and comprised of metal oxides or mixed metal oxides, silica gel, silicon carbide resin, silica membrane, glass particle, powdered glass, quartz, Aluminia, Zeolite, Titanium Dioxide, or Zirconium Dioxide.
  • ethanol is added to the lysate or to the flowthrough.
  • ethanol can be replaced with a dipolar atropic solvent selected from acetone, acetonitrile, tetrahydrofuran (THF), methyl ethyl ketone, ⁇ , ⁇ -dimethylformamide (DMF) and dimethyl sulfoxide.
  • a dipolar atropic solvent selected from acetone, acetonitrile, tetrahydrofuran (THF), methyl ethyl ketone, ⁇ , ⁇ -dimethylformamide (DMF) and dimethyl sulfoxide.
  • the lysis solution includes a chaotropic salt, non-ionic detergent (i.e. non-ionic surfactant) and reducing agent.
  • non-ionic detergent i.e. non-ionic surfactant
  • said chaotropic salt is guanidine HCI.
  • said non-ionic agent is selected from triethyleneglycol monolauryl ether, (octylplhenoxy)polyethoxyethanol, sorbitari monolaurate, T-octylphenoxyployethoxyethanol, or a combination thereof.
  • the non-ionic detergent or combination thereof is in the range of 0,1 - 10 %.
  • the reducing agent is 2-aminoethanethiol, tris- carboxyethylphosphine (TCEP), or ⁇ -mercaptoethanol.
  • Step 2 performing at least one of genomic, transcriptomic, proteomic and/or metabolomic analyses to identify and/or quantify said biomolecular components.
  • Genomic analyses result in several datasets. Each dataset is related to one of the fractions of biomolecular components separated and purified in step 1.
  • Genomics is performed on the genomic DNA fraction to obtain the genomic DNA information and establish the genome of the cells.
  • Genomic techniques include but are not limited to Whole Genome Sequencing (WGS) analysis, Restriction Fragment Length Polymorphism (RFLP) analysis followed by Southern Blot analysis, Polymerase Chain Reaction (PCR) followed optionally by amplicon sequencing, Short Tandem Repeats (STR) analysis, and Amplified Fragment Length Polymorphism (AmpFLP) analysis.
  • Epigenomics is performed on the genomic DNA fraction to obtain epigenetic information and establish the epigenome of the cells.
  • epigenomic techniques include but are not limited to chromatin immunoprecipitation (ChIP) technology with DNA micro arrays (termed ChlP-Chip), coupled chromatin immunoprecipitation and serial analysis of gene expression (ChlP-SAGE), coupled chromatin immunoprecipitation and paired end ditag sequencing (ChlP-PET) and coupled chromatin immunoprecipitation and DNA sequencing (ChlP-seq).
  • epigenomic techniques include but are not limited to Restriction Landmark Genome Scanning (RLGS) technique and bisulfite sequencing. With preference genomic DNA information further includes obtaining SNP patterns.
  • SNP genotyping techniques include but are not limited to enzyme based method such as Whole Genome Sequencing (WGS) analysis, Restriction Fragment Length polymorphism (RFLP), Primer extension and sequencing, PCR based method, Temperature gradient gel electrophoresis (TGGL), etc.
  • enzyme based method such as Whole Genome Sequencing (WGS) analysis, Restriction Fragment Length polymorphism (RFLP), Primer extension and sequencing, PCR based method, Temperature gradient gel electrophoresis (TGGL), etc.
  • Transcriptomic techniques are performed on the large and small RNA fractions in order to establish the transcriptome of the cells.
  • Transcriptomic techniques include, but are not limited to, microarray-based approaches such as DNA microarray, differential displays approaches, sequencing-based approaches such as RNAseq, Serial Analysis of Gene Expression (SAGE) or Massively Parallel Signature
  • the transcriptome can vary with external environmental conditions. Because it includes all RNA transcripts in the cell, the transcriptome reflects the genes or non-coding regions that are actively being transcribed at any given time, with the exception of mRNA degradation phenomena such as transcriptional attenuation.
  • the transcriptomes of cancer cells are of particular interest to classify tumours and, thus, for patient stratification. The information derived from gene expression profiling often has an impact on predicting the patient's clinical outcome.
  • study of the transcriptome may include analysis and quantification of post-transcriptional modifications including methylations of RNA transcripts; and/or of post-transcriptional regulation processes including splicing and/or RNA editing. Characterization and quantification of RNA post-transcriptional modifications can be performed by known techniques using for example stable isotope labeling of RNA in conjunction with mass spectrometric analysis.
  • the protein fraction is submitted to proteomics.
  • the techniques involved include, but are not limited to, liquid chromatography followed by mass spectrometry (MS), two dimensional gel-electrophoresis (2DE), SDS Polyacrylamide-Gel- Electrophoresis (SDS-PAGE), Western blotting, MS, ion-exchange chromatography, etc.
  • proteomics allows measuring protein abundances as well as studying and establishing the structure and the function of the proteins. The variation seen within the proteome of a cell is generally larger than the corresponding transcriptome due to alternative splicing of genes and post- translational modifications.
  • study of the proteome may include analyzing and/or quantifying proteolysis and/or one or more of the post-translational modifications of the proteins including but not limited to: proteolytic cleavage, glycosylation, acetylation, alkylation, methylation, biotinylation, glutamylation, glycylation, isoprenylation, lipoylation, phosphopantetheinylation, phosphorylation, sulfation, selenation, C-terminal amidation and/or any combination thereof.
  • proteolysis and/or posttranslational modifications is conducted by known methods using for example PROTOMAP technology which allows the identification of posttranslational changes to proteins that are manifested in altered migration patterns when subjected to for example one-dimensional SDS-PAGE, or two- dimensional gel electrophoresis, and difference gel electrophoresis.
  • Metabolomics is performed on the polar and non-polar metabolite fractions obtained. Metabolomics allows the study of the chemical processes of cells involving small molecules (metabolites). Metabolites include, but are not limited to, sugars, amino acids, organic acids, peptides, steroids, lipids, lipophilic compounds, chlorophyll, etc.
  • the metabolite fraction can also comprise pharmacophore and drug breakdown compounds. According to the invention both polar and non-polar metabolites can be extracted from the same unique sample and studied to establish the metabolomes of different cell populations.
  • Analytical approaches include, but are not limited to nuclear magnetic resonance (NMR) spectrometry and mass spectrometry (MS), which is used to identify and to quantify metabolites after separation, by gas chromatography (GC), high performance liquid chromatography (HPLC) or capillary electrophoresis (CE), for example.
  • NMR nuclear magnetic resonance
  • MS mass spectrometry
  • GC gas chromatography
  • HPLC high performance liquid chromatography
  • CE capillary electrophoresis
  • the invention provides information of genomic DNA, RNA, proteins and metabolites from a single unique sample.
  • the invention is remarkable in that it facilitates the generation of data which is particularly suitable for analyses of biological samples comprising heterogeneous cell populations and/or a cell population for which the genomic information or context is unknown.
  • the steps of separating and purifying biomolecular components comprising metabolites, genomic DNA, RNA and proteins from a single unique biological sample; and the step of performing at least one of genomic, transcriptomic, proteomic and/or metabolomic analyses to identify and/or quantify said biomolecular components; are carried out on several samples.
  • the data are obtained for each sample, so that variation of the genomic, transcriptomic, proteomic and/or metabolomic components over time, space or cell populations or any other known difference between the biological samples collected can be assessed.
  • the several samples can comprise samples collected at different time points for the same organism.
  • Step 3 performing analysis and/or modeling of the molecular data to obtain integrated data
  • the molecular data obtained through at least one of genomic, transcriptomic, proteomic and/or metabolomics analyses are prepared for data analysis and/or subsequent modeling.
  • Preparation of the data for subsequent analysis involves for example normalisation of the different datasets that are the result of genomic, epigenomic, transcriptomic, proteomic, and/or metabolomic analyses to bring them onto the same scale and/or the combination of the different datasets into a single file or the storage of the individual datasets in a centralised database. Preparation of the data may also involve filtering of the data according to certain cut-offs and/or estimating missing values. Variable selection for identifying relevant features in the data, such as minimum-redundancy-maximum-relevance or correlation feature selection, may be part of the data preparation phase.
  • the prepared data obtained are then analysed using methods that are appropriate for semi-quantitative, quantitative and qualitative data.
  • the semi-quantitative and quantitative data represent the amount of features present in the sample either in absolute terms (e.g. weight or moles per weight sample) or in relative terms (i.e. normalised to a certain reference during the preparation step).
  • Analyses carried out on the semi-quantitative and quantitative data involve suitable statistical, correlation and partial correlation analyses, calculating Pearson correlation coefficients, mutual information, establishing rank correlation coefficients such as Spearman's rank correlation coefficient or Kendall's rank correlation coefficient (which allow the identification of non-linear relationships between variables), etc.
  • Additional analyses may include clustering and dimension reduction methods such principal component analysis (PCA), independent component analysis (ICA), singular value decomposition (SVD) or multidimensional scaling (MDS).
  • PCA principal component analysis
  • ICA independent component analysis
  • SVD singular value decomposition
  • MDS multidimensional scaling
  • Probabilistic models may include ensembles of dependency trees and mixtures of copulas. Estimation techniques may include likelihood maximisation and convex programming.
  • the invention it is possible to uncover associations between for example genetic information contained within genomes with information contained within the transcriptomes, proteomes and/or metabolomes. It is also possible to correlate epigenome information (epigenomic signatures, e.g. methylation patterns) with information from the transcriptome, proteome and/or metabolome.
  • epigenome information e.g. methylation patterns
  • the genome and proteome information taken together allow the identification of proteins associated with specific diseases. This information can be complemented by transcriptome information and in particular with levels and information about biomolecules showing gene regulatory effects such as those exhibited by microRNA. Understanding the proteome, the structure and function of each protein and the complexities of protein-protein interactions will be critical for developing the most effective diagnostic techniques and disease treatments.
  • the invention thus allows developing personalized drugs that are more effective for an individual.
  • the analyses performed can be completed by integration of metabolome information to give a more complete picture of the physiology of a cell.
  • the method according to the invention allows meaningful correlations/relationships between, e.g. microRNA abundance and certain metabolites, microRNA abundances and protein expression, etc.
  • the invention allows the identification of meaningful correlations between SNP patterns in genomic DNA and for example variant protein expression patterns, etc.
  • the invention allows the establishment and/or quantification of the cell population profiles of heterogeneous cell populations by comparing them to measured data to e.g. databases of previously established profiles. For example it allows evaluating the extent of genetic heterogeneity in cancer tumours, the specific types of mutations in an individual, as well as the abundance of the cell populations showing such mutation, in order to optimise the treatment to be administered. It is understood that all possible mutations may not be found in existing databases of profiles and that the invention allows identifying new mutations and performing related post-genomic analyses.
  • the method allows determining both type and abundance of the different organisms in the sample.
  • the invention further allows the identifying both known and unknown or unexpected organisms in mixed microbial communities.
  • FIG. 3 shows a Pearson correlation network of microbial genera (determined using polymerase chain reaction amplification from isolated genomic DNA and deep 454 pyrosequencing of the phylogenetically informative 16S rRNA gene) and detected metabolite features extracted across 16 unique microbial community samples dominated by lipid accumulating organisms (the correlation network was built using the following cut-offs: r > 0.8; statistical p-value ⁇ 0.0001 ).
  • nodes representing different genera are color-coded according to taxonomic affiliation (see insert), metabolite feature nodes are highlighted in dark grey and edges highlighting correlations between the nodes are color-coded in light grey. Distinct relationships are discernible from the network, e.g. the abundance of
  • Methylibium spp. correlates strongly with a number of unknown nonpolar metabolites and, thus, Methylibium spp. may represent a keystone genus in this microbial community. These potential linkages are only discernable thanks to the integrative molecular systems biology approach subject of the present filing.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Chemical & Material Sciences (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Immunology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Analytical Chemistry (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Biology (AREA)
  • Physiology (AREA)
  • Microbiology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • Biochemistry (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Cell Biology (AREA)
  • Genetics & Genomics (AREA)
  • General Engineering & Computer Science (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
PCT/EP2013/052134 2012-02-03 2013-02-04 Procédé permettant d'obtenir des informations génomiques, transcriptomiques, protéomiques et/ou métabolomiques à partir d'un échantillon biologique unique Ceased WO2013113921A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
LU91940 2012-02-03
LU91940A LU91940B1 (en) 2012-02-03 2012-02-03 Method for obtaining integrated genomic, transcriptomic, proteomic and/or metabolomic information from a single unique biological sample

Publications (1)

Publication Number Publication Date
WO2013113921A1 true WO2013113921A1 (fr) 2013-08-08

Family

ID=48904430

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2013/052134 Ceased WO2013113921A1 (fr) 2012-02-03 2013-02-04 Procédé permettant d'obtenir des informations génomiques, transcriptomiques, protéomiques et/ou métabolomiques à partir d'un échantillon biologique unique

Country Status (2)

Country Link
LU (1) LU91940B1 (fr)
WO (1) WO2013113921A1 (fr)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2876443A1 (fr) * 2013-11-15 2015-05-27 Nanjing University Procédé d'analyse de la toxicité de l'eau
WO2015148689A1 (fr) * 2014-03-25 2015-10-01 Five3 Genomics, Llc Systèmes et procédés pour l'analyse d'arn dans une confirmation fonctionnelle de mutations cancéreuses
CN115661216A (zh) * 2022-10-19 2023-01-31 上海鹿明生物科技有限公司 一种空间转录组与空间代谢组数据关联方法及系统
CN115690015A (zh) * 2022-10-19 2023-02-03 上海鹿明生物科技有限公司 一种空间转录组与空间代谢组点对点匹配方法及系统

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1327883A2 (fr) 2002-01-10 2003-07-16 Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. Analyses métabolomiques, protéomiques et transcriptomiques combinées du mème échantillon et évaluation appropriée des données
WO2013029919A1 (fr) * 2011-09-02 2013-03-07 Université Du Luxembourg Méthode et kit pour isoler l'adn, l'arn, les protéines génomiques et les métabolites d'un seul échantillon biologique

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8346482B2 (en) * 2003-08-22 2013-01-01 Fernandez Dennis S Integrated biosensor and simulation system for diagnosis and therapy

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1327883A2 (fr) 2002-01-10 2003-07-16 Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. Analyses métabolomiques, protéomiques et transcriptomiques combinées du mème échantillon et évaluation appropriée des données
WO2003058238A1 (fr) * 2002-01-10 2003-07-17 Max-Planck-Gesellschaft Zur Förderung Der Wissenschaften Analyse metabolomique, proteomique et transcriptomique combinee a partir d'un seul echantillon, et evaluation statistique adequate de donnees
WO2013029919A1 (fr) * 2011-09-02 2013-03-07 Université Du Luxembourg Méthode et kit pour isoler l'adn, l'arn, les protéines génomiques et les métabolites d'un seul échantillon biologique

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
GROSS V; CARLSON G; KWAN AT; SMEJKAL G; FREEMAN E; IVANOV AR ET AL.: "Tissue fractionation by hydrostatic pressure cycling technology: the unified sample preparation technique for systems biology studies", J BIOMOL TECH, vol. 3, 2008, pages 189 - 199
JAMES LAUGHARN: "Novel technology used in cancer sample preparation", 22 April 2009 (2009-04-22), XP055040586, Retrieved from the Internet <URL:http://covarisinc.com/wp-content/uploads/cvAACRposterForWeb.pdf> [retrieved on 20121010] *
LAUGHARN JAMES ET AL: "Novel technology used in cancer sample preparation", PROCEEDINGS OF THE AMERICAN ASSOCIATION FOR CANCER RESEARCH, AACR, US, vol. 50, 22 April 2009 (2009-04-22), pages 798 - 799, XP009163540, ISSN: 0197-016X *
ROUME HUGO ET AL: "A biomolecular isolation framework for eco-systems biology", ISME JOURNAL, vol. 7, no. 1, January 2013 (2013-01-01), pages 110 - 121, XP009168480, ISSN: 1751-7362, DOI: 10.1038/ismej.2012.72 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2876443A1 (fr) * 2013-11-15 2015-05-27 Nanjing University Procédé d'analyse de la toxicité de l'eau
US10041118B2 (en) 2013-11-15 2018-08-07 Nanjing University Method for analyzing water toxicity
WO2015148689A1 (fr) * 2014-03-25 2015-10-01 Five3 Genomics, Llc Systèmes et procédés pour l'analyse d'arn dans une confirmation fonctionnelle de mutations cancéreuses
CN106852176A (zh) * 2014-03-25 2017-06-13 凡弗3基因组有限公司 用于功能证实癌症突变的rna分析的系统和方法
US10192027B2 (en) 2014-03-25 2019-01-29 Five3 Genomics, Llc Systems and methods for RNA analysis in functional confirmation of cancer mutations
CN115661216A (zh) * 2022-10-19 2023-01-31 上海鹿明生物科技有限公司 一种空间转录组与空间代谢组数据关联方法及系统
CN115690015A (zh) * 2022-10-19 2023-02-03 上海鹿明生物科技有限公司 一种空间转录组与空间代谢组点对点匹配方法及系统

Also Published As

Publication number Publication date
LU91940B1 (en) 2013-08-05

Similar Documents

Publication Publication Date Title
Al-Amrani et al. Proteomics: Concepts and applications in human medicine
Shaffer et al. Standardized multi-omics of Earth’s microbiomes reveals microbial and metabolite diversity
Gavin et al. Functional organization of the yeast proteome by systematic analysis of protein complexes
Huber Is proteomics heading in the wrong direction?
Richards et al. Proteome sequencing goes deep
Fiehn et al. Integrated studies on plant biology using multiparallel techniques
Brunner et al. A high-quality catalog of the Drosophila melanogaster proteome
Aebersold et al. Quantitative proteome analysis: methods and applications
Wei et al. Charting gene regulatory networks: strategies, challenges and perspectives
Erady et al. Pan-cancer analysis of transcripts encoding novel open-reading frames (nORFs) and their potential biological functions
O’Connor et al. Systems Biology and immune aging
Jäger et al. Separation and identification of human heart proteins
Stewart et al. Omics technologies used in systems biology
WO2013113921A1 (fr) Procédé permettant d&#39;obtenir des informations génomiques, transcriptomiques, protéomiques et/ou métabolomiques à partir d&#39;un échantillon biologique unique
Hause et al. Targeted protein-omic methods are bridging the gap between proteomic and hypothesis-driven protein analysis approaches
Conrads et al. An enriched look at tyrosine phosphorylation
CN107832585A (zh) 一种RNAseq数据分析方法
Boggi et al. A novel integrated extraction protocol for multi-omic studies in heavily degraded samples
Di et al. A basic phosphoproteomic-DIA workflow integrating precise quantification of phosphosites in systems biology
Dobhal et al. Multi-omics Approach Towards Cancer Therapy
Bai Protein Folding Protocols
Witting et al. Transcriptome and metabolome data integration—Technical perquisites for successful data fusion and visualization
Pregizer et al. Multi-omic single cell sequencing: Overview and opportunities for kidney disease therapeutic development
Bandesh et al. A systems perspective of complex diseases: from reductionism to integration
Richards Techniques for gene expression profiling

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13702477

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13702477

Country of ref document: EP

Kind code of ref document: A1