WO2020142035A1 - Disease diagnosis system - Google Patents
Disease diagnosis system Download PDFInfo
- Publication number
- WO2020142035A1 WO2020142035A1 PCT/TR2019/051101 TR2019051101W WO2020142035A1 WO 2020142035 A1 WO2020142035 A1 WO 2020142035A1 TR 2019051101 W TR2019051101 W TR 2019051101W WO 2020142035 A1 WO2020142035 A1 WO 2020142035A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- metabolic
- values
- metabolites
- individual
- test
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B5/00—Measuring for diagnostic purposes; Identification of persons
- A61B5/72—Signal processing specially adapted for physiological signals or for diagnostic purposes
- A61B5/7271—Specific aspects of physiological measurement analysis
- A61B5/7275—Determining trends in physiological measurement data; Predicting development of a medical condition based on physiological measurements, e.g. determining a risk factor
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B5/00—ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
Definitions
- the invention relates to a computer-based disease diagnosis system and method in which a disease of said test individual is predicted based on metabolomic measurements taken from a test individual.
- Accurate and early diagnosis of diseases, such as cancer, is important for the successful continuation of the treatment process and ensuring the correct treatment.
- Accurate diagnosis reduces both the patient's time loss and the patient's loss of financial resources.
- Phenotypes of diseases often have implications for patients' metabolism.
- the present invention relates to a robot in order to eliminate the above-mentioned disadvantages and bring new advantages to the relevant technical field.
- the present invention is a disease diagnosis system in which the disease of the said test individual is predicted based on metabolomic measurements taken from a test individual in order to accomplish all the objects mentioned above and which will emerge from the following detailed description.
- the innovation comprises a processor unit, and the said processor unit is configured to perform the following steps:
- a metabolic network model stored in a memory unit in which a metabolic network is defined, consisting of a number of metabolic pathways formed by metabolites and metabolic reactions that produce and consume these metabolites,
- the main novelty of the method described in the present specification of this invention is that it performs a holistic analysis on the entire metabolic network rather than isolating each pathway separately.
- the main advantage of such an approach is that it allows the detection of key pathways that have no (or few) gene/metabolite measurements associated with them in the analyzed ohmic data for a given disease.
- Pathifier and Paradigm are capable of evaluating and scoring pathways with at least a few metabolites in the analyzed metabolomics data set. Therefore, they may miss many important key pathways.
- a preferred embodiment of the invention is characterized in that the processor unit is configured to display the disease with which the patient individual is associated if the test difference values are predicted to belong to a diseased individual.
- Another preferred embodiment of the invention is characterized in that the said classification algorithm is a machine-learning based classification algorithm.
- the said classification algorithm is a logistic regression algorithm. Thus, the accuracy of disease prediction is further increased.
- Another preferred embodiment of the invention is characterized in that it comprises an input/output unit for receiving the test data as input.
- the invention is also a computer-based disease diagnosing method in which the disease of the said test individual is predicted based on metabolomic measurements from a test individual. Accordingly, the innovation comprises the following steps performed by a processor unit:
- Figure 1 shows a representative view of the disease diagnosis system.
- Figure 2 shows a representative view of the memory unit.
- Figure 3 shows a representative thermal map of the difference values of metabolic pathways according to diseased and healthy individuals.
- Figure 4 shows a representative view of the metabolic network and its components.
- the disease diagnosis system (100) comprises a processor unit (1 10), a memory unit (120) with which the said processor unit (1 10) is associated to perform read/write operations.
- the said processor unit (1 10) may be a microprocessor.
- the memory unit (120) may comprise a memory capable of storing data permanently or a suitable combination of memories that can store data permanently/temporarily. It may comprise an input/output unit (130) associated with the processor unit (1 10) to input data to the processor unit (110) and to receive data from the processor unit (1 10).
- the disease diagnosis system (100) may also comprise a communication unit (150) for enabling the processor unit (110) to exchange data with a communication network (200) such as the internet.
- the disease diagnosis system (100) may comprise a bus (140) that enables processor unit (1 10), memory unit (120), input/output unit (130), communication unit (150), and other components known in the art but not mentioned herein to associate them with each other in such a way as to make data transfer appropriately.
- a metabolic network model (121) is stored in the memory unit (120).
- Figure 2 shows a representative view of a metabolic network (400).
- the metabolic network model (121) comprises a plurality of metabolic pathways (410) which are associated with each other.
- Metabolic pathways (410) comprise reactions (412) and the metabolites (41 1) (such as alene, glutamine, fatty acids, etc.) involved in these reactions (412).
- Each metabolic pathway (410) can be expressed as a linear equation in terms of reaction (412) and metabolites (41 1) involved in the reaction (412). Below are examples where metabolic pathways (410) are defined as linear equations.
- the metabolic network model (121) can also be defined in terms of these reactions (412). More specifically, the metabolic network model (121) can be defined as an S matrix.
- the said matrix may comprise, for example, metabolites (411) in the rows and reactions (412) in the columns. In the cells, there are values according to the role of the corresponding metabolite (411) in the corresponding reaction (412) in the cell. These values are the stoichiometry of the metabolite (411) in the reaction (412).
- Data on metabolomic measurements from healthy individuals and data on metabolomic measurements of diseased individual associated with at least one disease are stored in a reference database (122) in the memory unit (120).
- the processor unit (1 10) enables a disease detection module comprising command lines in the memory unit (120) to execute the following steps as an innovative aspect of the invention: It accesses the metabolic network model (121). It accesses the reference database (122). For each metabolic pathway (410) in the metabolic network model (121), an average, minimum, and maximum metabolic flux values are determined based on the metabolite fold-change values in the reference database (122). For each individual in the reference database (122), a set of metabolic flux values is generated including individual-specific minimum and maximum metabolic flux values for each metabolic pathway (410) in the metabolic network model (121). It generates a set of reference difference values for each individual based on the deviations of said metabolic flux value sets from the average metabolic flux values.
- Test data including the fold-change values of metabolites (411) in metabolomic measurements taken from a test individual are received as input. It is associated with metabolic pathways (410) which covers the reactions (412) with said metabolites (411) in the test data. A minimum and maximum metabolic flux value are determined for each of these associated metabolic pathways (410). Test difference values are determined based on the deviations of the determined metabolic flux values from the average metabolic flux values. A classification algorithm is applied to the generated test difference values to predict to which set of reference difference values they belong. If it is determined that the test difference values belong to a diseased individual's reference difference value sets, one or more diseases associated with the diseased individual are provided as output by the input/output unit (130). Thus, the metabolic network is evaluated holistically and the test individual is matched with a diseased individual or healthy individuals and the diagnosis of the disease is made with an increased accuracy.
- the steady-state assumptions of the optimization problem are represented as constants in the optimization problem; the objective function of the optimization problem is to maximize the metabolic flux values of the reactions (412) that produce metabolites that are observed to increase in the cell of the test individual specific to the test individual, and to minimize metabolic flux values of the reactions (412) that produce metabolites that are observed to decrease in the test individual. More specifically, the objective function of the optimization problem is as follows:
- the average densities of all metabolites (41 1) are calculated based on the metabolomic measurements of healthy individuals held in the memory unit (120). Then the fold-change values are determined for each metabolite (41 1) of each individual based on their deviation from the average density.
- FVA Flux Variability Analysis
- the value of the objective function is determined by solving the optimization problem.
- the objective function is reconstructed so as to minimize the metabolic flux value and the optimization problem is solved again.
- the calculated objective function value is recorded as the lower limit (minimum) of the metabolic flux value of the R reaction.
- the average of the lower and upper limits of the metabolic flux values are calculated and the mean metabolic flux values are calculated.
- the lower and upper limits of the metabolic flux values are determined based on the metabolomic measurements of a test individual; the average of these lower and upper limits is calculated and a metabolic flux value is determined for each metabolic pathway (410).
- Test difference values are generated based on the difference between the metabolic flux values determined for the test individual and the average metabolic flux value.
- a statistical significance analysis of these difference values is carried out based on the difference values of diseased and healthy individuals.
- the major metabolic pathway (410) changes causing the disease can be detected.
- the statistical significance is determined by matching the test difference value with an F-value and a P-value calculated using ANOVA.
- a classification algorithm based on machine-learning is taught using the difference values calculated for the diseased individuals and healthy individuals. This algorithm predicts whether the result of metabolomic analysis, ie, test difference values and the significance values associated with these test difference values, belong to a diseased or a healthy individual. More specifically, it is determined which reference difference value sets for diseased and healthy individuals are the most similar to the results. If the results belong to a diseased individual, the disease associated with the diseased individual is displayed in the input/output unit (130). This possible embodiment is based on the logistic regression algorithm.
- Figure 3 shows the thermal map of the difference values of the metabolic pathways (410) of healthy individuals and diseased individuals. Values for healthy individuals are indicated in a circumscribed symbol. It can be observed that diseased individuals have metabolic pathways which have distinct difference values compared to healthy individuals.
- the metabolomic measurements of each test individual are recorded in the disease database or reference database (122) according to the analysis result, therefore the reference database (122) is constantly updated.
- Table 1 shows the difference values of the metabolic pathways (410) of a patient associated with breast cancer and the F and P values for the statistical significance of the difference values.
- the disease diagnosis system (100) is provided on a server.
- the server is connected to a communication network (200) and it receives the test data from the client devices (300) connected to the same communication network (200) as input and as a result, determines whether the test data belongs to a diseased individual or a healthy individual.
- the invention can be used to diagnose diseases, as well as to determine possible diseases that may be incurred by the individual at whom the measurements are made, to identify the main metabolic mechanisms of the disease and to present them visually to health researchers and clinicians, to plan appropriate personalized treatments according to the main metabolic mechanism of the disease, to make comparative analysis of metabolic mechanisms of diseases and as a result, to make recommendations to the type and drug researchers for the same/similar diseases for the use of existing treatment for another disease, to make recommendations on which possible proteins can be metabolic drug targets for the drugs to be developed against diseases that are not yet medicated.
- the scope of the protection of the invention is set forth in the annexed claims and certainly cannot be limited to exemplary explanations in this detailed description. It is evident that one skilled in the technic can make similar embodiments in the light of the explanations above without moving away the main theme of the invention.
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Public Health (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Physiology (AREA)
- Pathology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Theoretical Computer Science (AREA)
- Surgery (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Psychiatry (AREA)
- Signal Processing (AREA)
- Evolutionary Biology (AREA)
- Biotechnology (AREA)
- Heart & Thoracic Surgery (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Animal Behavior & Ethology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Veterinary Medicine (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Primary Health Care (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
The invention is a disease diagnosis system (100), wherein the disease of a test individual is predicted based on metabolomic measurements taken from the said test individual. Accordingly, it comprises a processor unit (110), characterized in that the said processor unit (110) is configured to perform the following steps: accessing a metabolic network model (121) stored in a memory unit, wherein a metabolic network (400) comprising a plurality of metabolic pathways (410) identified by metabolites and metabolic reactions (412) in which metabolites and metabolites (411) are involved; accessing a reference database (122) in which the metabolite (411) fold-change values of said individuals are stored in relation to the reactions (412) involving the metabolites (411) for each of the healthy individuals and the diseased individuals associated with at least one disease; determining an average metabolic flux value for each metabolic pathway (410) in the metabolic network model according to the metabolite (411) fold change values in the reference database (122); generating a set of metabolic flux values for each individual in the reference database comprising individual-specific minimum and maximum metabolic flux values of each metabolic pathway (410) in the metabolic network model (121); generating a set of reference difference values for each individual according to the deviations of said metabolic flux value sets from the average metabolic flux values; receiving test data of the fold-change values of the metabolites (411) in the metabolomic measurements taken from said test individual as input; associating the metabolites (411) in the test data with the metabolic pathways (410) containing the reactions (412) of said metabolites and determining a minimum and maximum metabolic flux value for each of the metabolic pathways (410); generating test difference values according to the deviations of the determined metabolic flux values from the average metabolic flux values; applying a classification algorithm in which the generated test difference values belong to which of the set of reference difference values is predicted.
Description
DISEASE DIAGNOSIS SYSTEM
TECHNICAL FIELD
The invention relates to a computer-based disease diagnosis system and method in which a disease of said test individual is predicted based on metabolomic measurements taken from a test individual.
BACKGROUND ART
Accurate and early diagnosis of diseases, such as cancer, is important for the successful continuation of the treatment process and ensuring the correct treatment. Accurate diagnosis reduces both the patient's time loss and the patient's loss of financial resources.
Phenotypes of diseases often have implications for patients' metabolism.
With the effect of the disease, the activity of some metabolic pathways in the metabolism network increases and some of them decrease. These changes can give information about the etiology of the disease.
It is known in the art to detect irregularities in metabolic pathways in tumor specimens and to generate scores for each metabolic pathway and to relate certain specific metabolic pathways to certain diseases (Drier, Y., Sheffer, M., & Domany, E. (2013). Pathway-based personalized analysis of cancer. Proceedings of the National Academy of Sciences, 110 (16), 6388-6393.).
The calculation and analysis of metabolic flux in the metabolic network are also known in the art (Orth, JD, Thiele, I., & Palsson, B. 0. (2010). What is flux balance analysis? Nature biotechnology, 28(3), 245-248.).
It is known in the art that certain metabolic pathways are significantly altered in cancer patients. (Vaske, C. J., Benz, S. C., Sanborn, J. Z. et al. (2010)). Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. Bioinformatics, 26(12), Ϊ237-Ϊ245.)
In the patent application with number US2015006445A1 , it is determined whether the patient responses appropriately to the treatment by analyzing specific metabolic pathways.
The above-mentioned methods treat each pathway independently in an isolated way one by one. None of the pathway analysis methods in the literature (including Pathifier and Paradigm) takes into account the fact that pathways are part of a large biological network and interact with each other.
As a result, all issues mentioned above made it necessary to make an innovation in the relevant technical field.
BRIEF DESCRIPTION OF THE INVENTION
The present invention relates to a robot in order to eliminate the above-mentioned disadvantages and bring new advantages to the relevant technical field.
It is an object of the invention to provide a disease diagnosis system and method for diagnosing diseases with increased accuracy.
The present invention is a disease diagnosis system in which the disease of the said test individual is predicted based on metabolomic measurements taken from a test individual in order to accomplish all the objects mentioned above and which will emerge from the following detailed description. Accordingly, the innovation comprises a processor unit, and the said processor unit is configured to perform the following steps:
- accessing a metabolic network model stored in a memory unit in which a metabolic network is defined, consisting of a number of metabolic pathways formed by metabolites and metabolic reactions that produce and consume these metabolites,
- accessing to a reference database in which the metabolite fold-change values of said individuals are stored in relation to the reactions involving the metabolites for each of the healthy individuals and the diseased individuals associated with at least one disease,
- determining an average minimum and maximum metabolic flux value for each metabolic pathway in the metabolic network model according to the metabolite fold-change values in the reference database,
- creating a set of metabolic flux values for each individual in the reference database containing individual minimum and maximum metabolic flux values of each metabolic pathway in the metabolic network model,
- generating a set of reference difference values for each individual according to the deviations of said metabolic flux value sets from the average metabolic flux values,
- receiving test data of the fold-change values of the metabolites in the metabolomic measurements taken from said test subject as input,
- linking these said metabolites to metabolic pathways with the reactions in the database,
- determining a minimum and maximum metabolic flux value for each of the associated metabolic pathways,
- generating test difference values according to the deviations of the determined metabolic flux values from the average metabolic flux values,
- applying a classification algorithm in which the generated test difference values belong to which of the set of reference difference values is predicted.
The main novelty of the method described in the present specification of this invention is that it performs a holistic analysis on the entire metabolic network rather than isolating each pathway separately. The main advantage of such an approach is that it allows the detection of key pathways that have no (or few) gene/metabolite measurements associated with them in the analyzed ohmic data for a given disease. On the other hand, Pathifier and Paradigm are capable of evaluating and scoring pathways with at least a few metabolites in the analyzed metabolomics data set. Therefore, they may miss many important key pathways.
Diseases cause significant changes in specific metabolic pathways but also cause the metabolic network to change entirely. Detecting a disease by considering the metabolic network as a whole and analyzing changes in the metabolic network are not mentioned in the background art. By means of the system and method of the invention, it is determined whether a test individual falls into the category of sick or healthy individuals by considering the whole metabolic network of healthy and sick individuals.
A preferred embodiment of the invention is characterized in that the processor unit is configured to display the disease with which the patient individual is associated if the test difference values are predicted to belong to a diseased individual.
Another preferred embodiment of the invention is characterized in that the said classification algorithm is a machine-learning based classification algorithm.
Another preferred embodiment of the invention is characterized in that the said classification algorithm is a logistic regression algorithm. Thus, the accuracy of disease prediction is further increased.
Another preferred embodiment of the invention is characterized in that it comprises an input/output unit for receiving the test data as input.
The invention is also a computer-based disease diagnosing method in which the disease of the said test individual is predicted based on metabolomic measurements from a test individual. Accordingly, the innovation comprises the following steps performed by a processor unit:
- accessing to a metabolic network model stored in a memory unit in which a metabolic network of multiple metabolic pathways is identified by metabolites and metabolic reactions of metabolites,
- accessing to a reference database in which the metabolite fold-change values of said individuals are stored in relation to the reactions involving the metabolites for each of the healthy individuals and the diseased individuals associated with at least one disease,
- determining an average metabolic flux value for each metabolic pathway in the metabolic network model according to the metabolite fold-change values in the reference database,
- generating a set of metabolic flux values for each individual in the reference database containing individual-specific metabolic flux values of each metabolic pathway in the metabolic network model,
- generating a set of reference difference values for each individual according to the deviations of said metabolic flux value sets from the average metabolic flux values,
- receiving test data of the fold-change values of metabolites in metabolomic measurements taken from said test subject that are associated with metabolic pathways with the reactions of said metabolites, as input.
- determining a metabolic flux value for each of the metabolic pathways in the test data,
- generating test difference values according to the deviations of the determined metabolic flux values from the average metabolic flux values,
- applying a classification algorithm in which the generated test difference values belong to which of the set of reference difference values is predicted.
BRIEF DESCRIPTION OF THE FIGURE
Figure 1 shows a representative view of the disease diagnosis system.
Figure 2 shows a representative view of the memory unit.
Figure 3 shows a representative thermal map of the difference values of metabolic pathways according to diseased and healthy individuals.
Figure 4 shows a representative view of the metabolic network and its components.
DETAILED DESCRIPTION OF THE INVENTION
In this detailed description, the subject matter of the invention is described by using examples only for a better understanding, which will have no limiting effect.
With reference to Figure 1 , the subject matter of the invention, the disease diagnosis system (100), comprises a processor unit (1 10), a memory unit (120) with which the said processor unit (1 10) is associated to perform read/write operations. The said processor unit (1 10) may be a microprocessor. The memory unit (120) may comprise a memory capable of storing data permanently or a suitable combination of memories that can store data permanently/temporarily. It may comprise an input/output unit (130) associated with the processor unit (1 10) to input data to the processor unit (110) and to receive data from the processor unit (1 10).
The disease diagnosis system (100) may also comprise a communication unit (150) for enabling the processor unit (110) to exchange data with a communication network (200) such as the internet. The disease diagnosis system (100) may comprise a bus (140) that enables processor unit (1 10), memory unit (120), input/output unit (130), communication unit (150), and other components known in the art but not mentioned herein to associate them with each other in such a way as to make data transfer appropriately.
A metabolic network model (121) is stored in the memory unit (120). Figure 2 shows a representative view of a metabolic network (400). The metabolic network model (121) comprises a plurality of metabolic pathways (410) which are associated with each other. Metabolic pathways (410) comprise reactions (412) and the metabolites (41 1) (such as alene, glutamine, fatty acids, etc.) involved in these reactions (412).
Each metabolic pathway (410) can be expressed as a linear equation in terms of reaction (412) and metabolites (41 1) involved in the reaction (412).
Below are examples where metabolic pathways (410) are defined as linear equations.
A <® B + C Reaction #1
B + 2C D Reaction #2
The metabolic network model (121) can also be defined in terms of these reactions (412). More specifically, the metabolic network model (121) can be defined as an S matrix. The said matrix may comprise, for example, metabolites (411) in the rows and reactions (412) in the columns. In the cells, there are values according to the role of the corresponding metabolite (411) in the corresponding reaction (412) in the cell. These values are the stoichiometry of the metabolite (411) in the reaction (412).
Data on metabolomic measurements from healthy individuals and data on metabolomic measurements of diseased individual associated with at least one disease are stored in a reference database (122) in the memory unit (120).
The processor unit (1 10) enables a disease detection module comprising command lines in the memory unit (120) to execute the following steps as an innovative aspect of the invention: It accesses the metabolic network model (121). It accesses the reference database (122). For each metabolic pathway (410) in the metabolic network model (121), an average, minimum, and maximum metabolic flux values are determined based on the metabolite fold-change values in the reference database (122). For each individual in the reference database (122), a set of metabolic flux values is generated including individual-specific minimum and maximum metabolic flux values for each metabolic pathway (410) in the metabolic network model (121). It generates a set of reference difference values for each individual based on the deviations of said metabolic flux value sets from the average metabolic flux values. Test data including the fold-change values of metabolites (411) in metabolomic measurements taken from a test individual are received as input. It is associated with metabolic pathways (410) which covers the reactions (412) with said metabolites (411) in the test data. A minimum and maximum metabolic flux value are determined for each of these associated metabolic pathways (410). Test difference values are determined based on the deviations of the determined metabolic flux values from the average metabolic flux values. A classification algorithm is applied to the generated test difference values to predict to which set of reference difference values they belong. If it is determined that the test difference values belong to a diseased individual's reference difference value sets, one or more diseases associated with the diseased individual are provided as output
by the input/output unit (130). Thus, the metabolic network is evaluated holistically and the test individual is matched with a diseased individual or healthy individuals and the diagnosis of the disease is made with an increased accuracy.
Obtaining the values from the steps described in the innovative aspect of the invention and making the prediction is explained in detail using mathematical methods below: Flux variables of reactions in the metabolic network are represented by a V vector. Metabolic flux values are calculated by solving the equation of SxV = 0. During the solution, the equation SxV=0 is transformed into an optimization problem. The steady-state assumptions of the optimization problem are represented as constants in the optimization problem; the objective function of the optimization problem is to maximize the metabolic flux values of the reactions (412) that produce metabolites that are observed to increase in the cell of the test individual specific to the test individual, and to minimize metabolic flux values of the reactions (412) that produce metabolites that are observed to decrease in the test individual. More specifically, the objective function of the optimization problem is as follows:
M: Metabolomic measurement values provided as input (metabolite fold-change values), mR: All reactions that produce metabolite (41 1) m,
m c: (412), the measured fold-change value of metabolite (41 1) m,
Flux(Ri): The metabolic flux variable for the reaction (412) of R,,
Stoichiometry (R,, m): Stoichiometry of metabolite m in the reaction (412) of R,,
TotalStoichiometry (m): Total stoichiometry of metabolite (411) m in reactions (412) producing m.
In order to calculate the fold-changes, firstly, the average densities of all metabolites (41 1) are calculated based on the metabolomic measurements of healthy individuals held in the memory unit (120). Then the fold-change values are determined for each metabolite (41 1) of each individual based on their deviation from the average density.
Thus, the objective function changes dynamically for each individual.
Since the number of reactions (412) in the metabolic network is greater than the number of metabolites (411), the part of the optimization problem described above is incomplete. Therefore, there can be many alternative solutions. Each alternative solution can be obtained in such a way as to allow the cells to achieve different targets. Flux Variability Analysis (FVA) is performed to cover all alternative solutions. The following steps are taken for FVA:
- The value of the objective function is determined by solving the optimization problem.
- The value of the objective function is added as an additional constraint to the optimization problem.
- For an R reaction in the metabolic network, a new objective function is created so as to maximize the metabolic flux of the reaction and the optimization problem is solved again. The calculated objective function value is recorded as the upper limit (maximum) of the metabolic flux value of the R reaction.
- For the R reaction (412), the objective function is reconstructed so as to minimize the metabolic flux value and the optimization problem is solved again. The calculated objective function value is recorded as the lower limit (minimum) of the metabolic flux value of the R reaction.
These steps are repeated for each reaction (412) in the metabolic network and the metabolic flux value limits of the reactions (412) are determined.
After performing FVA analysis, the average of the lower and upper limits of the metabolic flux values are calculated and the mean metabolic flux values are calculated. The lower and upper limits of the metabolic flux values are determined based on the metabolomic measurements of a test individual; the average of these lower and upper limits is calculated and a metabolic flux value is determined for each metabolic pathway (410). Test difference values are generated based on the difference between the metabolic flux values determined for the test individual and the average metabolic flux value.
In a possible embodiment of the invention, a statistical significance analysis of these difference values is carried out based on the difference values of diseased and healthy individuals. Thus, the major metabolic pathway (410) changes causing the disease can be detected. For each metabolic pathway (410), the statistical significance is determined by matching the test difference value with an F-value and a P-value calculated using ANOVA.
Following these procedures, a classification algorithm based on machine-learning is taught using the difference values calculated for the diseased individuals and healthy individuals. This algorithm predicts whether the result of metabolomic analysis, ie, test difference values and the significance values associated with these test difference values, belong to a diseased or a healthy individual.
More specifically, it is determined which reference difference value sets for diseased and healthy individuals are the most similar to the results. If the results belong to a diseased individual, the disease associated with the diseased individual is displayed in the input/output unit (130). This possible embodiment is based on the logistic regression algorithm.
Figure 3 shows the thermal map of the difference values of the metabolic pathways (410) of healthy individuals and diseased individuals. Values for healthy individuals are indicated in a circumscribed symbol. It can be observed that diseased individuals have metabolic pathways which have distinct difference values compared to healthy individuals.
In a possible embodiment, the metabolomic measurements of each test individual are recorded in the disease database or reference database (122) according to the analysis result, therefore the reference database (122) is constantly updated. For example, Table 1 shows the difference values of the metabolic pathways (410) of a patient associated with breast cancer and the F and P values for the statistical significance of the difference values.
In a possible embodiment of the invention, the disease diagnosis system (100) is provided on a server. The server is connected to a communication network (200) and it receives the test
data from the client devices (300) connected to the same communication network (200) as input and as a result, determines whether the test data belongs to a diseased individual or a healthy individual. Using the obtained analysis values, the invention can be used to diagnose diseases, as well as to determine possible diseases that may be incurred by the individual at whom the measurements are made, to identify the main metabolic mechanisms of the disease and to present them visually to health researchers and clinicians, to plan appropriate personalized treatments according to the main metabolic mechanism of the disease, to make comparative analysis of metabolic mechanisms of diseases and as a result, to make recommendations to the type and drug researchers for the same/similar diseases for the use of existing treatment for another disease, to make recommendations on which possible proteins can be metabolic drug targets for the drugs to be developed against diseases that are not yet medicated. The scope of the protection of the invention is set forth in the annexed claims and certainly cannot be limited to exemplary explanations in this detailed description. It is evident that one skilled in the technic can make similar embodiments in the light of the explanations above without moving away the main theme of the invention.
REFERENCE NUMBERS IN THE FIGURE
100: Disease diagnosis system
1 10: Processor unit
120: Memory unit
121 : Metabolic network model
122: Reference database
123: Disease diagnosis module
130: Input/output units
140: Bus
150: Communication unit
200: Communication network
300: Client device
400: Metabolic network
410: Metabolic pathway
411 : Metabolite
412: Reaction
Claims
1. A disease diagnosis system (100) wherein the disease of a test individual is predicted based on metabolomic measurements taken from the test individual and characterized in that it comprises a processor unit (1 10), and the said processor unit (1 10) is configured to perform the following steps:
- accessing a metabolic network model (121) stored in a memory unit (120) in which a metabolic network (400) of multiple metabolic pathways (410) is identified by metabolites and metabolic reactions (412) of metabolites (41 1),
- accessing a reference database (122) in which the metabolite fold-change values of said individuals are stored in relation to the reactions (412) involving the metabolites (41 1) for each of the healthy individuals and the diseased individuals associated with at least one disease,
- determining an average minimum and maximum metabolic flux value for each metabolic pathway (410) in the metabolic network model according to the metabolite fold change values in the reference database (122),
- generating a set of metabolic flux values for each individual in the reference database comprising individual-specific metabolic flux values of each metabolic pathway (410) in the metabolic network model (121),
- generating a set of reference difference values for each individual according to the deviations of said metabolic flux value sets from the average metabolic flux values,
- receiving the test data of the fold-change values of metabolites (41 1) in metabolomic measurements taken from the said test subject that are associated with metabolic pathways (410) with the reactions (412) of said metabolites (41 1), as input,
- determining a metabolic flux value for each of the metabolic pathways (410) in the test data,
- generating test difference values according to the deviations of the determined metabolic flux values from the average metabolic flux values,
- applying a classification algorithm in which the generated test difference values belong to which of the set of reference difference values is predicted.
2. A disease diagnosis system (100) for predicting the disease of a patient based on metabolomic measurements in accordance with Claim 1 characterized in that the processor unit (1 10) is configured to display the disease(s) to which the diseased individual is associated, if the test difference values are predicted to belong to a diseased individual.
3. A disease diagnosis system (100) for predicting the disease of a patient based on metabolomic measurements in accordance with Claim 1 characterized in that said classification algorithm is a machine-learning based classification algorithm.
4. A disease diagnosis system (100) for predicting the disease of a patient based on metabolomic measurements in accordance with Claim 1 characterized in that the said classification algorithm is the logistic regression algorithm.
5. A disease diagnosis system (100) for predicting the disease of a patient based on metabolomic measurements in accordance with Claim 1 characterized in that it comprises an input/output unit (130) for receiving the test data as input.
6. A computer-based disease diagnosis method in which a test individual's disease is predicted based on metabolomic measurements taken from said test individual and characterized in that it comprises the following steps performed by a processor unit (1 10):
- accessing a metabolic network model (121) stored in a memory unit (120), wherein a metabolic network comprising a plurality of metabolic pathways (410) identified by metabolites and metabolic reactions (412) in which metabolites and metabolites (41 1) are involved,
- accessing to a reference database (122) in which the metabolite (411) fold-change values of said individuals are stored in relation to the reactions (412) involving the metabolites (41 1) for each of the healthy individuals and the diseased individuals associated with at least one disease,
- determining an average metabolic flux value for each metabolic pathway (410) in the metabolic network model according to the metabolite (41 1) fold change values in the reference database (122),
- generating a set of metabolic flux values for each individual in the reference database comprising individual-specific minimum and maximum metabolic flux values of each metabolic pathway (410) in the metabolic network model (121),
- generating a set of reference difference values for each individual according to the deviations of said metabolic flux value sets from the average metabolic flux values,
- receiving test data of the fold-change values of the metabolites (411) in the metabolomic measurements taken from said test individual as input,
- associating the metabolites (41 1) in the test data with the metabolic pathways (410) containing the reactions (412) of said metabolites and stored in the memory unit (120),
- determining a minimum and maximum metabolic flux value for each of the metabolic pathways (410) in the test data,
- generating test difference values according to the deviations of the determined metabolic flux values from the average metabolic flux values,
- applying a classification algorithm in which the generated test difference values belong to which of the set of reference difference values is predicted.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TR2018/21349 | 2018-12-31 | ||
| TR201821349 | 2018-12-31 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2020142035A1 true WO2020142035A1 (en) | 2020-07-09 |
Family
ID=71407228
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/TR2019/051101 Ceased WO2020142035A1 (en) | 2018-12-31 | 2019-12-18 | Disease diagnosis system |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2020142035A1 (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024095564A1 (en) * | 2022-10-31 | 2024-05-10 | 株式会社島津製作所 | Method for estimating metabolic flux distribution, analysis device, and program |
| CN119400283A (en) * | 2024-11-08 | 2025-02-07 | 宁夏大学 | A method and system for assessing safety risk level of cattle and sheep breeding feed |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2011119637A1 (en) * | 2010-03-22 | 2011-09-29 | Stemina Biomarker Discovery, Inc. | Predicting human developmental toxicity of pharmaceuticals using human stem-like cells and metabolomics |
| US8580490B1 (en) * | 2010-06-04 | 2013-11-12 | Colorado State University Research Foundation | Markers for screening anti-mycobacterial treatment efficacy |
| US20170241043A1 (en) * | 2016-02-18 | 2017-08-24 | The Regents Of The University Of California | Predicting metabolic side effects of transported drugs |
-
2019
- 2019-12-18 WO PCT/TR2019/051101 patent/WO2020142035A1/en not_active Ceased
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2011119637A1 (en) * | 2010-03-22 | 2011-09-29 | Stemina Biomarker Discovery, Inc. | Predicting human developmental toxicity of pharmaceuticals using human stem-like cells and metabolomics |
| US8580490B1 (en) * | 2010-06-04 | 2013-11-12 | Colorado State University Research Foundation | Markers for screening anti-mycobacterial treatment efficacy |
| US20170241043A1 (en) * | 2016-02-18 | 2017-08-24 | The Regents Of The University Of California | Predicting metabolic side effects of transported drugs |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2024095564A1 (en) * | 2022-10-31 | 2024-05-10 | 株式会社島津製作所 | Method for estimating metabolic flux distribution, analysis device, and program |
| CN119400283A (en) * | 2024-11-08 | 2025-02-07 | 宁夏大学 | A method and system for assessing safety risk level of cattle and sheep breeding feed |
| CN119400283B (en) * | 2024-11-08 | 2025-06-17 | 宁夏大学 | A method and system for assessing safety risk level of cattle and sheep breeding feed |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Mitchell et al. | National biobanks: clinical labor, risk production, and the creation of biovalue | |
| Hajirasouliha et al. | Precision medicine and artificial intelligence: overview and relevance to reproductive medicine | |
| EP4260340A1 (en) | Predicting fractional flow reserve from electrocardiograms and patient records | |
| AU2016280074B2 (en) | Systems and methods for patient-specific prediction of drug responses from cell line genomics | |
| Maguluri et al. | Big Data Solutions For Mapping Genetic Markers Associated With Lifestyle Diseases | |
| EP1839229A2 (en) | Methods, systems, and computer program products for developing and using predictive models for predicting a plurality of medical outcomes, for evaluating intervention strategies, and for simultaneously validating biomarker causality | |
| Yogeshappa | AI-driven Precision medicine: Revolutionizing personalized treatment plans | |
| Siristatidis et al. | Predicting IVF outcome: a proposed web-based system using artificial intelligence | |
| US20180039732A1 (en) | Dasatinib response prediction models and methods therefor | |
| CN116525116B (en) | A cardiogenic shock real-time risk early warning monitoring system, equipment and storage medium | |
| CN116864139A (en) | Disease risk assessment method, device, computer equipment and readable storage medium | |
| Bonazzola et al. | Unsupervised ensemble-based phenotyping enhances discoverability of genes related to left-ventricular morphology | |
| Gomes et al. | Genetic architecture of cardiac dynamic flow volumes | |
| Bruno et al. | Data reduction and data visualization for automatic diagnosis using gene expression and clinical data | |
| WO2020142035A1 (en) | Disease diagnosis system | |
| US20230253115A1 (en) | Methods and systems for predicting in-vivo response to drug therapies | |
| US20190189248A1 (en) | Methods, systems and apparatus for subpopulation detection from biological data based on an inconsistency measure | |
| Saab et al. | Comparison of machine learning algorithms for classifying adverse-event related 30-day hospital readmissions: potential implications for patient safety | |
| Qi et al. | DeepQA: A Unified Transcriptome‐Based Aging Clock Using Deep Neural Networks | |
| Natarajan et al. | A novel method for bioinformatics analysis in gene expression profiling framework for personalized healthcare applications | |
| Yao et al. | A network approach to quantifying radiotherapy effect on cancer: Radiosensitive gene group centrality | |
| Fahim et al. | Advancing cardiovascular health prediction: machine learning algorithm analysis | |
| CN119943261B (en) | An interpretation system, method and storage medium for precise drug use in tumors | |
| Blokh et al. | The utility of information theory based methods in the research of aging and longevity | |
| Katarína et al. | Innovative technologies in precision healthcare |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19907851 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 19907851 Country of ref document: EP Kind code of ref document: A1 |