WO2025009246A1 - Système d'analyse de composé - Google Patents
Système d'analyse de composé Download PDFInfo
- Publication number
- WO2025009246A1 WO2025009246A1 PCT/JP2024/015135 JP2024015135W WO2025009246A1 WO 2025009246 A1 WO2025009246 A1 WO 2025009246A1 JP 2024015135 W JP2024015135 W JP 2024015135W WO 2025009246 A1 WO2025009246 A1 WO 2025009246A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- compound
- information
- backbone
- compounds
- substances
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N27/00—Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
- G01N27/62—Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating the ionisation of gases, e.g. aerosols; by investigating electric discharges, e.g. emission of cathode
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/40—Searching chemical structures or physicochemical data
Definitions
- the present invention relates to a compound analysis system.
- Mass spectrometry data such as mass spectra obtained by analyzing a sample with a mass spectrometer, indicates the masses of the compounds present in the sample.
- mass spectrometry data can be used to search for compounds in the sample that have the same mass as a specific compound, such as a dangerous drug.
- the search for metabolites is essential in metabolism testing, and high-resolution mass spectrometers are used to comprehensively search for metabolites or to infer and search for similar compounds based on the structure of the active pharmaceutical ingredient.
- Compounds such as dangerous drugs have a basic structure in which a substance such as CH3 , H, Cl, or O is bound to each bond of a backbone having one or more bonds.
- the present invention was made in consideration of the above problems, and aims to make it easy to create comprehensive information on a large number of compounds that share a common backbone.
- the compound analysis system is a compound analysis system for analyzing a compound having a basic structure in which a substance is bound to each of one or more bonds of a backbone having one or more bonds, and includes a compound database that holds information on the backbone of known compounds and information on candidate substances that bind to each of the one or more bonds of the backbone of each of the known compounds, a data extraction unit configured to identify a target backbone that is the backbone to be analyzed based on information input by a user and extract candidate substances that bind to each of the one or more bonds of the target backbone from the compound database, and an information creation unit configured to use the information extracted from the compound database by the data extraction unit to comprehensively create information on compounds generated by each combination of candidate substances that bind to each of the one or more bonds while comprehensively changing the combinations of candidate substances that bind to each of the one or more bonds of the target backbone, thereby creating comprehensive compound information on compounds having the target backbone.
- the target backbone to be analyzed is identified and information about candidate substances that can bind to each bond of the target backbone is extracted from a compound database, and the combinations of candidate substances that bind to each bond of the target backbone are comprehensively changed to comprehensively create information such as the composition formula of the compound formed by each combination of candidate substances that bind to each bond, thereby creating comprehensive information about compounds that have the target backbone, making it easy to create comprehensive information about a large number of compounds that have a common backbone (target backbone).
- FIG. 1 is a schematic configuration diagram showing an embodiment of a compound analyzing system. This is an example of the backbone of a compound. 1 is an example of a compound list. 13 is a flowchart showing an example of the compound information creation operation in the embodiment. 13 is a flow chart showing an example of screening in the same embodiment.
- the compound analysis system 1 includes an information processing device 2 and a display 4.
- the information processing device 2 is realized by installing a dedicated computer program in a computer device such as a PC (personal computer) equipped with a CPU (central processing unit), an information storage device, etc.
- the display 4 is electrically connected to the information processing device 2, and displays information transmitted from the information processing device 2.
- the compound database 6 holds information about the basic structures of known compounds.
- Known compounds here are compounds that have a backbone with bonds for binding substances at one or more locations, and have a basic structure in which a substance is bound to each bond of the backbone.
- Compounds with a common backbone have the same chemical effect even if the substance bound to the backbone changes. By changing the combination of substances bound to each bond, compounds with the same chemical effect but different composition formulas (masses) and structures (similar compounds) are obtained.
- the compound database 6 holds information such as the composition formulas and/or structures of the backbones of such known compounds, and the composition formulas of candidate substances that may be bound to each bond of the backbone.
- a benzodiazepine drug is formed of a backbone having four bonds R 1 to R 4 and substances bonded to each of the four bonds R 1 to R 4.
- Various substances can be bonded to each of the bonds R 1 to R 4.
- substances such as CH 3 and H can be bonded to the bond R 1
- substances such as O can be bonded to the bond R 2
- substances such as Cl, F, Br can be bonded to the bond R 3
- substances such as NO 2 , Cl, F, Br can be bonded to the bond R 4.
- the data extraction unit 8 is configured to identify a trunk to be analyzed (target trunk) from among the trunks held in the compound database 6 based on information input by the user, and to extract information on candidate substances that bind to each bond of the identified target trunk from the compound database 6.
- Information input by the user includes the compositional formula of a known compound, the compositional formula of a trunk of a known compound, etc.
- the data extraction unit 8 searches the compound database 6 for information on the compound having the input compositional formula, and extracts from the compound database 6 information on the trunk of that compound and candidate substances that bind to each bond of that trunk.
- FIG. 3 A part of the compound list created by the information creation unit 10 is shown in FIG. 3.
- the leftmost column of the compound list in FIG. 3 shows a list of composition formulas of compounds generated by changing the combination of binding substances to each bond of the target backbone.
- the column to the right of the composition formula column shows the number of compounds having each composition formula but different structures as the number of isomers, and the column to the right of that shows the structural formulas of each compound summarized in each composition formula.
- R1, R2, R3, R4, and R5 are identification information indicating each bond of the backbone
- the information presentation unit 12 is configured to present compound information, such as a compound list, created by the information creation unit 10 to the user by displaying it on the display 4.
- the screening unit 14 is a function that searches for compounds having the target backbone in a sample by applying the compound information created by the information creation unit 10 to mass analysis data obtained by analyzing a sample with a mass spectrometer.
- Mass spectra are an example of mass analysis data to which compound information can be applied.
- Information on the masses of all similar compounds can be extracted from the composition formula of each compound contained in the compound information created by the information creation unit 10.
- the user inputs information about the compound to be analyzed into the information processing device 2 (step 101).
- the data extraction unit 8 identifies the target backbone based on the information input by the user (step 102), and extracts information about the target backbone (position of the bond, what substances may bind to each bond) from the compound database 6 (step 103).
- the information creation unit 10 uses the information extracted by the data extraction unit 8 to select a substance to be bound to each bond of the target backbone from among candidates for substances that can be bound to each bond (step 104), and generates a compositional formula and a structural formula of a compound obtained by binding the selected candidate substances to each bond (step 105).
- the information creation unit 10 repeats the operations of steps 104 and 105 until all patterns of combinations of substances to be bound to each bond of the target backbone are covered (step 106), thereby creating a compound list that covers the compositional formulas and structural formulas of all compounds having the target backbone (step 107).
- the information presentation unit 12 displays the created compound list on the display 4 (step 108).
- the screening unit 14 extracts mass information for each compound from the compositional formula of the compounds listed in the target compound list (step 202), and applies the extracted mass information for each compound to the target mass analysis data to search for compounds whose mass matches that of each compound in the compound list in the mass analysis data (step 203). If the search results in the presence of a compound in the mass analysis data whose mass matches that of a compound in the compound list, the screening unit 14 extracts information from the compound list about the compound with the matching mass, such as the compositional formula and/or structural formula, and the information presentation unit 12 displays the information about the extracted compound on the display 4 (step 204).
- One embodiment of the compound analyzing system is a compound analyzing system for analyzing a compound having a basic structure in which a backbone has one or more binding sites and a substance is bound to each of the one or more binding sites, the compound analyzing system comprising: a compound database that holds information on the backbones of known compounds and information on candidate substances that bind to each of the one or more binding sites of the backbones of each of the known compounds; a data extraction unit configured to identify a target backbone that is a backbone to be analyzed based on information input by a user, and to extract candidates for substances that bind to each of the one or more binding sites of the target backbone from the compound database; and an information creation unit configured to use the information extracted from the compound database by the data extraction unit to comprehensively create information on compounds generated by each combination of candidate substances that bind to each of the one or more binding parts while comprehensively changing the combinations of candidate substances that bind to each of the one or more binding parts of the target backbone, thereby creating comprehensive compound information on compounds having the
- the compound information includes the composition formula of each compound having the target backbone.
- the compound information includes a structural formula indicating, for each compound having the target backbone, what substance is bound to each of the one or more bonds of the target backbone, and the information creation unit is configured to associate the composition formula and the structural formula of the same compound with each other.
- the structural formula can include identification information indicating each of the one or more bonds in the backbone, and information on the substance bonded to each of the one or more bonds.
- the information creation unit may be configured to create a compound list that shows, as the compound information, a list of the compositional formulas of each compound having the target backbone, with compounds associated with the same compositional formula aggregated into one compositional formula, and that shows the structural formulas of the compounds associated with each compositional formula.
- a screening unit is provided that is configured to search for compounds having the target backbone in the sample by applying the mass information of each compound determined from the composition formula to mass analysis data acquired by a mass analyzer for the sample.
Landscapes
- Chemical & Material Sciences (AREA)
- Crystallography & Structural Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Electrochemistry (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
Abstract
L'invention concerne un système d'analyse de composé (1) pour analyser un composé ayant une structure de base dans laquelle une substance est liée au site de liaison ou à chacun des sites de liaison d'un squelette présentant un ou plusieurs sites de liaison, le système d'analyse de composé (1) comprenant : une base de données de composés (6) qui contient des informations sur des squelettes de composés connus et des informations sur des candidats pour des substances qui se lient au site de liaison ou à chacun des sites de liaison du squelette de chacun des composés connus ; une unité d'extraction de données (8) configurée de façon à identifier un squelette objet, qui est un squelette à analyser, sur la base d'informations entrées par un utilisateur, et à extraire, de la base de données de composés (6), des candidats pour des substances qui se lient au site de liaison ou à chacun des sites de liaison du squelette objet ; et une unité de création d'informations (10) configurée de façon à utiliser les informations extraites de la base de données de composés (6) par l'unité d'extraction de données (8) pour créer de manière exhaustive des informations concernant une partie de composé générée par chaque combinaison de candidats pour des substances qui se lient au site de liaison ou à chacun des sites de liaison tout en changeant de manière exhaustive une combinaison de candidats pour des substances qui se lient au site de liaison ou à chacun des sites de liaison du squelette objet, ce qui permet de créer des informations de composé qui comprennent le composé présentant le squelette objet.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2023-109935 | 2023-07-04 | ||
| JP2023109935 | 2023-07-04 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025009246A1 true WO2025009246A1 (fr) | 2025-01-09 |
Family
ID=94171684
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2024/015135 Pending WO2025009246A1 (fr) | 2023-07-04 | 2024-04-16 | Système d'analyse de composé |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2025009246A1 (fr) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001058962A (ja) * | 1999-08-20 | 2001-03-06 | Mitsubishi Chemicals Corp | 分子構造開発支援システム及び分子構造開発支援方法、並びに、分子構造抽出装置,分子構造抽出方法及び分子構造抽出プログラムを格納したコンピュータ読取可能な記録媒体 |
| WO2012095948A1 (fr) * | 2011-01-11 | 2012-07-19 | 株式会社島津製作所 | Procédé d'analyse de données de spectrométrie de masse, dispositif d'analyse de données de spectrométrie de masse et programme d'analyse de données de spectrométrie de masse |
| WO2013051148A1 (fr) * | 2011-10-07 | 2013-04-11 | 株式会社島津製作所 | Méthode et dispositif d'analyse de données d'analyse de masses |
| WO2022149395A1 (fr) * | 2021-01-07 | 2022-07-14 | 富士フイルム株式会社 | Dispositif de traitement d'informations, procédé de traitement d'informations, et programme de traitement d'informations |
-
2024
- 2024-04-16 WO PCT/JP2024/015135 patent/WO2025009246A1/fr active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001058962A (ja) * | 1999-08-20 | 2001-03-06 | Mitsubishi Chemicals Corp | 分子構造開発支援システム及び分子構造開発支援方法、並びに、分子構造抽出装置,分子構造抽出方法及び分子構造抽出プログラムを格納したコンピュータ読取可能な記録媒体 |
| WO2012095948A1 (fr) * | 2011-01-11 | 2012-07-19 | 株式会社島津製作所 | Procédé d'analyse de données de spectrométrie de masse, dispositif d'analyse de données de spectrométrie de masse et programme d'analyse de données de spectrométrie de masse |
| WO2013051148A1 (fr) * | 2011-10-07 | 2013-04-11 | 株式会社島津製作所 | Méthode et dispositif d'analyse de données d'analyse de masses |
| WO2022149395A1 (fr) * | 2021-01-07 | 2022-07-14 | 富士フイルム株式会社 | Dispositif de traitement d'informations, procédé de traitement d'informations, et programme de traitement d'informations |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Milman et al. | The chemical space for non-target analysis | |
| Cajka et al. | LC–MS-based lipidomics and automated identification of lipids using the LipidBlast in-silico MS/MS library | |
| Luedemann et al. | TagFinder: preprocessing software for the fingerprinting and the profiling of gas chromatography–mass spectrometry based metabolome analyses | |
| Clarke et al. | Peer reviewed: systematic LC/MS metabolite identification in drug discovery | |
| Ubukata et al. | Non-targeted analysis of electronics waste by comprehensive two-dimensional gas chromatography combined with high-resolution mass spectrometry: Using accurate mass information and mass defect analysis to explore the data | |
| Reichenbach et al. | Computer language for identifying chemicals with comprehensive two-dimensional gas chromatography and mass spectrometry | |
| Oberacher et al. | Compound identification in forensic toxicological analysis with untargeted LC–MS-based techniques | |
| CN103389345A (zh) | 色谱质谱分析用数据处理系统 | |
| Getzinger et al. | Illuminating the exposome with high-resolution accurate-mass mass spectrometry and nontargeted analysis | |
| Winkler | Processing metabolomics and proteomics data with open software: a practical guide | |
| JP5664667B2 (ja) | 質量分析データ解析方法、質量分析データ解析装置、及び質量分析データ解析用プログラム | |
| Fels et al. | Liquid chromatography‐quadrupole‐time‐of‐flight mass spectrometry screening procedure for urine samples in forensic casework compared to gas chromatography‐mass spectrometry | |
| Heinsvig et al. | Forensic drug screening by liquid chromatography hyphenated with high-resolution mass spectrometry (LC-HRMS) | |
| Goncalves et al. | Suitability of high-resolution mass spectrometry in analytical toxicology: focus on drugs of abuse | |
| Alanazi | Recent Advances in Liquid Chromatography–Mass Spectrometry (LC–MS) Applications in Biological and Applied Sciences | |
| WO2025009246A1 (fr) | Système d'analyse de composé | |
| CN115667912B (zh) | 复合测量整合阅览器及计算机程序产品 | |
| Loss et al. | Using NMR Data on GLYCOSCIENCES. de | |
| Blunt et al. | The role of databases in marine natural products research | |
| Nanni et al. | PTM MarkerFinder, a software tool to detect and validate spectra from peptides carrying post‐translational modifications | |
| JP6295910B2 (ja) | 質量分析データ処理装置 | |
| JP2018040655A (ja) | 質量分析用データ処理装置 | |
| Rosnack et al. | Screening solution using the software platform UNIFI: an integrated workflow by waters | |
| Lundgren et al. | Protein identification using TurboSEQUEST | |
| Oppermann et al. | High precision measurement and fragmentation analysis for metabolite identification |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24835768 Country of ref document: EP Kind code of ref document: A1 |
|
| ENP | Entry into the national phase |
Ref document number: 2025530985 Country of ref document: JP Kind code of ref document: A |