WO2013035904A1 - Système et procédé de traitement de pipeline d'analyse d'informations biométriques - Google Patents
Système et procédé de traitement de pipeline d'analyse d'informations biométriques Download PDFInfo
- Publication number
- WO2013035904A1 WO2013035904A1 PCT/KR2011/006686 KR2011006686W WO2013035904A1 WO 2013035904 A1 WO2013035904 A1 WO 2013035904A1 KR 2011006686 W KR2011006686 W KR 2011006686W WO 2013035904 A1 WO2013035904 A1 WO 2013035904A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- analysis
- information
- image
- pipeline
- task
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
- G16B50/30—Data warehousing; Computing architectures
Definitions
- the present invention relates to a bioinformation analysis pipeline processing system capable of generating a bioinformation analysis pipeline and processing the same according to a bioinformation analysis method desired by a user, and a processing method thereof.
- bioinformatics can give potentially meaningful answers to questions such as what parts of DNA control the various chemistries of life, what are the functions of new proteins, and can predict the structure of new proteins? have.
- the fields of biology in which bioinformatics is currently applied include all fields of molecular biology, such as genomics, transcriptome, proteomics, metabolism, and pharmacogenomics.
- Cloud computing services related to conventional bioinformation analysis are limited to providing a specific analysis tool or analysis pipeline or data storage device such as Amazon EC2 service. Therefore, there is a problem that each user cannot supply the desired analysis pipeline smoothly, and there is a problem that the user is difficult to use because the information on the analysis pipeline stored in each cloud service device is insufficient. In addition, there is a problem that can not fundamentally address the concern about the security of important research data distributed in each cloud service device.
- the present invention is to solve the above problems, the technical problem to be achieved by the present invention in the life information analysis pipeline processing system and method, without suffering the above-mentioned problems, life information analysis pipeline in the cloud computing base To make it workable.
- the life information analysis pipeline processing system is a cloud for installing a virtual machine (VM) image including an analysis task for analyzing the life information data on the cloud service device Resource supply apparatus; Receive meta information about the life analysis pipeline and VM image from the client, extract Identifier (ID) information of the VM image mapped with the meta information, and perform the analysis operation sequentially according to the execution position of the analysis operation in the VM image.
- a server that directs and executes an analysis pipeline in synchronization with a cloud resource supply device and synchronizes job status and results; And a database that stores meta information and ID information of a mapped VM image.
- the client writes meta information about the pipeline and the VM image, and instructs the server to generate an object model.
- the VM image includes at least one analysis task for analyzing the life information, is installed in the computing service device included in the cloud service device analysis work It characterized in that to perform.
- the pipeline in the life information analysis pipeline processing system converts at least one or more analysis tasks for analyzing life information, links for linking each analysis task, and the format of input / output data. It characterized in that it comprises a data conversion script.
- life information in the life information analysis pipeline processing system is characterized in that it comprises at least one of sequence data, sequence alignment data, protein structure or molecular interaction network.
- the meta information represents a connection relationship between analysis tasks included in the pipeline and a VM image executing the analysis task.
- the meta information may be selected from among the structure information of the pipeline, input / output data conversion script information, execution location information of an analysis operation in a VM image, or VM resource allocation information. It characterized in that it comprises at least one.
- the server extracts meta information from a database and generates a VM package by extracting a VM image mapped thereto through a cloud resource supply device. It is done.
- the server receives the VM package from the client, and stores the meta information contained therein in a database, the cloud resource supply device cloud the VM image Instructing the service device to install.
- the life information analysis pipeline processing method comprises the steps of installing a virtual machine (VM) image including an analysis operation for analyzing life information data on the cloud service device ; Receiving meta information about the life information analysis pipeline and the VM image; Extracting Identifier (ID) information of the VM image mapped with the meta information; Instructing an analysis task sequentially according to an execution position of the analysis task in the VM image; Running an analysis pipeline and synchronizing the work state and outputs; And storing the meta information and the ID information of the mapped VM image.
- VM virtual machine
- ID Identifier
- life information analysis pipeline processing method further includes the step of instructing to create the meta-information about the pipeline and the VM image and generate the object model.
- the VM image at least one analysis task for analyzing the life information, is installed in the computing service device included in the cloud service device analysis operation It characterized in that to perform.
- the pipeline converts at least one or more analysis tasks for analyzing life information, links for linking each analysis task, and a format of input / output data. It characterized in that it comprises a data conversion script.
- the life information includes at least one of sequence data, sequence alignment data, protein structure, or molecular interaction network.
- the meta information represents a connection relationship between analysis tasks included in the pipeline and a VM image executing the analysis task.
- the meta information may be selected from among the structure information of the pipeline, input / output data conversion script information, execution location information of an analysis operation in a VM image, or VM resource allocation information. It characterized in that it comprises at least one.
- life information analysis pipeline processing method further comprises the step of extracting the meta information and the VM image to generate a VM package.
- the life information analysis pipeline processing method further includes the step of receiving a VM package, storing meta information contained therein, and instructing to install the VM image on the cloud service device.
- backup and reinstallation to a cloud computing environment may be facilitated.
- FIG. 1 is a diagram illustrating a life information analysis tool object-oriented class diagram according to an embodiment of the present invention.
- FIG. 2 is a diagram illustrating a class diagram of a life information analysis pipeline object model according to an embodiment of the present invention.
- FIG. 3 is a diagram illustrating a general life information analysis pipeline model according to an embodiment of the present invention.
- FIG. 4 is a diagram illustrating a life information analysis pipeline processing system according to an embodiment of the present invention.
- FIG. 5 is a flowchart illustrating a life information analysis pipeline processing method according to an embodiment of the present invention.
- FIG. 6 is a flowchart illustrating a VM package export method in the life information analysis pipeline processing method according to an embodiment of the present invention.
- FIG. 7 is a flowchart illustrating a VM package importing method of the life information analysis pipeline processing method according to an embodiment of the present invention.
- the present invention addresses the bioinformation analysis pipeline that integrates and analyzes various and heterogeneous bioinformation.
- the bioinformation pipeline includes a plurality of analysis tools that receive bioinformation data, perform analysis to output the result data, links linking the plurality of analysis tools, and / or result data output from each analysis tool.
- Data conversion scripts are included to convert the life information data types to corresponding analysis tools. Since the above analysis tool refers to an entity executing a task of analyzing life information data, the following analysis tool and analysis work may be used in the same sense.
- bioinformatics tools vary in their purpose and practice, they need to be effectively integrated.
- One thing to consider to effectively integrate bioinformation analysis tools is that there are many ways to implement even analytical tools that do the same.
- an analysis tool that obtains a single sequence of information from a genetic database may have various execution methods such as command line execution, scripting, and remote invocation methods such as web services, XML-RPC, and REST. Therefore, to integrate effective bioinformatics analysis tools, a flexible integration schema must be implemented considering these various execution methods.
- life information analysis requires a large amount of computing, so it cannot be analyzed with a personal computer, and it is appropriate to use a super computer or a cloud service device capable of distributed computing. Therefore, the present invention proposes a device and method for analyzing life information using a cloud service device.
- FIG. 1 is a diagram illustrating a life information analysis tool object-oriented class diagram according to an embodiment of the present invention.
- the present invention is a subclass of the analysis tool (BioTool, 1010) class for each execution method of the analysis tool, command line based execution method (CommandLineBioTool, 1011), web service execution method (WebServicesBioTool, 1012), script-based execution method (ScriptBioTool). 1013) is defined.
- ConandLineBioTool 1011
- WebServicesBioTool web service execution method
- ScriptBioTool script-based execution method
- the analysis tools run in different ways, so integrating them requires scalability and flexibility.
- the present invention may generate an analysis tool using XML (eXtensible Markup Language).
- XML eXtensible Markup Language
- XML has advantages such as portability, reusability, extensibility, and efficient data exchange, and thus is suitable for integrating various life information analysis tools in the present invention to form a life information analysis pipeline.
- the life information analysis pipeline according to the present invention is used as an input of another analysis tool in which a plurality of analysis tools are interconnected by a link and the output of one analysis tool is linked. Therefore, semantic-based hierarchical data schema can be defined in order to be freely linked between analysis tools having heterogeneous input / output data formats.
- OWL Ontology Web Language
- the input and output data of most bioinformatics analysis tools are in the form of strings, semantically diverse and can be classified hierarchically. Therefore, OWL can be used to define the input and output data format of the bioinformation analysis tool.
- Input and output data formats may include strings, numbers, Boolean values, as well as bioinformation data such as sequence data, sequence alignment data, protein structures, molecular interaction networks, and the like. Whether or not the two analysis tools can be linked to I / O is determined by determining whether the form class for the output data is a subclass of the form class for the input data in the OWL data schema.
- the analysis tool is XML and the input / output data is OWL.
- the flexibility and extensibility of the input / output data linkage between the analysis tools can be obtained for each analysis tool.
- One life information analysis pipeline 2010 may include a plurality of analysis tool execution tasks 2020 in a lower layer.
- Each analysis tool execution task 2020 may include a plurality of input data 2030 and output data 2040 in a lower layer.
- each of the input data 2030 and the output data 2040 includes a plurality of links 2050 in a lower layer, and the links 2050 correspond one-to-one with the input / output data conversion script 2060.
- Each analysis tool execution task 2020 above is executed in each analysis tool and may be referred to as an analysis tool below.
- the life information analysis pipeline 2010 may include a plurality of analysis tools 2020 that perform analysis work therein, and each analysis tool may include various types of data as input / output data 2030 and 2040. have.
- each link 2050 linking the analysis tools 2020 a plurality of links 2050 may be linked to one analysis tool, and each link 2050 converts the data when the data formats are different between the analysis tools. May involve data conversion script 2060.
- One bioinformatics analysis pipeline is represented as a directed acyclic graph with the main task as the peak of the BioTask corresponding to the unit analysis task that processes the specific task using the input data, and the link that links the input / output between the BioTasks. Can be.
- FIG. 3 is a diagram illustrating a general life information analysis pipeline model according to an embodiment of the present invention.
- BioTask 1 3010 is the initial performance analysis task and corresponds to the highest vertex.
- BioTask 1 (3010) has BioTask 2 (3020), BioTask 3 (3030), BioTask 4 (3040), BioTask 5 (3050) as child tasks, and BioTask 1 (3010) becomes parent task to them.
- a task executes in parallel when all of its parent tasks are terminated and its output is set appropriately for the input of that task. That is, in the case of BioTask 6 (3060), both BioTask 3 (3030) and BioTask 4 (3040), which are parent tasks, are terminated, and their output values are transmitted to BioTask 6 (3060) for execution.
- BioTask 3 (3030) or BioTask 4 (3040) is not suitable as the input data of BioTask 6 (3060)
- the data conversion script accompanying the link is converted into a suitable input data format and the BioTask 6 (3060) is used. ) Is entered.
- the present invention proposes a life information analysis pipeline system using a cloud service device.
- FIG. 4 is a diagram illustrating a life information analysis pipeline processing system according to an embodiment of the present invention.
- the life information analysis pipeline processing system may include a client 4010, a server 4020, a cloud service device 4030, and / or a database 4040.
- the client 4010 may include a pipeline designer 4011, an execution status monitor 4012, and / or a data uploader 4013.
- the pipeline execution server 4020 includes a request processing unit 4021, a pipeline VM package handler 4022, a DB object accessor 4023, a pipeline execution scheduler 4024, a job execution unit 4025, and a cloud resource supply device. 4026, object-relation mapper (ORM) engine 4027, and / or pipeline VM package 4028.
- the cloud resource supply device 4026 and / or the ORM engine 4027 may exist independently of the pipeline execution server 4020.
- the cloud service device 4030 may include a storage service device 4031 and / or a computing service device 4032.
- the life information analysis pipeline processing system may generate a pipeline VM package and use the same to provide analysis of life information in a cloud service device.
- the client 4010 may generate a pipeline model for analyzing life information to be analyzed.
- the pipeline designer 4011 may create meta information of the pipeline used for life information analysis, generate an object model of the server 4020, and store it in the database 4040.
- the meta information may indicate a connection relationship between the analysis tasks included in the pipeline and the VM image that executes the analysis tasks.
- the meta information may include information necessary for the server 4020 to execute a pipeline for analyzing life information, and may include information required for pipeline computing using the cloud service device 4030.
- the meta information may include at least one of pipeline structure information, input / output data conversion script information, execution location information of each analysis tool in the VM, or VM resource allocation information.
- the client may monitor the execution status of the pipeline at the server 4020 using the execution status monitor 4012.
- the created meta information can be delivered to the server by the data uploader 4013.
- the server 4020 may receive a request of the client 4010 through the request processor 4021.
- the connection between the client 4010 and the server 4020 may be based on web services or COMET.
- the pipeline VM package handler 4022 may extract ID information of a VM image mapped to specific pipeline meta information and store the ID information in the database 4040.
- the pipeline VM package handler 4022 is involved in the pipeline VM package 4028 import / export operation, which is described again below.
- DB object accessor 4023 may be used to import specific data from database 4040 to server 4020.
- the operation of storing the database 4040 is performed through the ORM engine 4027 connected to the server 4020.
- OR object-relation
- object-oriented data models expressed in XML can be mapped to relational databases.
- the components of the pipeline processing system can be implemented by referring to the object-oriented data model of pipeline meta information, and as a result, it is possible to construct the entire system object-oriented.
- Meta information of a specific pipeline stored above and ID information of a VM image mapped thereto may be used to utilize the cloud service device 4030 when analyzing the corresponding pipeline.
- the pipeline execution scheduler 4024 When executing life information analysis, the pipeline execution scheduler 4024 requests the cloud resource provider 4026 to create a VM image mapped to the pipeline as a VM instance, and when the VM instance is created, the pipeline meta information
- Each defined analysis tool can be used to direct sequential analysis tasks.
- the task executor 4025 may receive such an instruction and generate an analysis tool execution task in a corresponding VM in synchronization with the cloud resource supply device 4026 and synchronize the task state and the result.
- the above-mentioned analysis work and the result storage are performed by using the cloud service device 4030.
- the analysis operation is performed by the computing service device 4032 including the VM instance, which may include a VM image manager and a VM instance activated and scheduled to run in memory.
- Data generated by the analysis operation is made by the storage service device 4031.
- the storage service device 4031 may include a common data manager, a big data manager, and a distributed file manager.
- the user using the present invention through the life information analysis work using the cloud service device has the advantage that can quickly and accurately analyze complex and vast life information through large-scale distributed computing.
- life information analysis pipeline processing system may provide import / export of the life information pipeline VM package 4028.
- the client 4010 may request the export of the specific pipeline VM package 4028 through the request processor 4021.
- the pipeline VM package handler 4022 of the server 4020 may extract meta information of the pipeline from the database 4040 and generate the XML.
- the DB object accessor 4023 is used.
- the pipeline VM package handler 4022 may generate a VM package 4028 by taking a VM image mapped through the cloud resource supply device 4026 and merging it with meta information.
- the client 4010 may download the generated pipeline VM package image and export it to a specific local storage space.
- the user can export the VM package 4028 image of the life information analysis pipeline designed by the user from the server and save it on the personal computer, which can be analyzed again using any other server anytime, anywhere. can do.
- the client 4010 may upload a specific pipeline VM package image to the server 4020 using the data uploader 4013.
- the pipeline VM package handler 4022 of the server 4020 extracts the pipeline meta information from the uploaded VM package image and stores it in the database 4040, and the VM image is a cloud service through the cloud resource provider 4026.
- the computing service device 4032 may be installed in the device 4030.
- FIG. 5 is a flowchart illustrating a life information analysis pipeline processing method according to an embodiment of the present invention.
- the cloud resource supply device 4026 assigns an ID to the VM image and installs the VM image in the cloud service device in operation S5010. Each VM image is installed in a computing service device 4032 in a cloud service device 4030.
- the client 4010 writes meta information of the pipeline through the program designer 4011 and generates an object model of the server 4020 (S5020).
- the meta information may include at least one of pipeline structure information, input / output data conversion script information, execution location information of each analysis tool in the VM, or VM resource allocation information.
- the pipeline VM package handler 4022 extracts meta information of a pipeline to be analyzed and ID information of a VM image mapped thereto (S5030).
- the ID information of the VM image may be used to execute the VM image including the analysis tool when the analysis tools in the pipeline are to be executed. This is because it is necessary to access a specific computing service device 4032 in the cloud service device 4030 and execute a VM image corresponding to a desired pipeline.
- the server stores the object model and meta information in the database 4040 (S5040).
- the operation of storing the database 4040 is performed through the ORM engine 4027 connected to the server 4020.
- OR object-relation
- object-oriented data models expressed in XML can be mapped to relational databases.
- the storing step (S5040) is not an essential component in the embodiment of the present invention, it may be excluded according to the embodiment.
- the pipeline execution scheduler 4024 may request the cloud resource provider 4026 to create a VM instance for the VM image mapped to the pipeline.
- the analysis operation is instructed sequentially using the execution position in each analysis tool VM defined in the pipeline meta information (S5050).
- the job executor 4025 executes an analysis tool in the VM in association with the cloud resource supply device 4026 and synchronizes the job status and the result (S5060).
- FIG. 6 is a flowchart illustrating a VM package export method in the life information analysis pipeline processing method according to an embodiment of the present invention.
- the client 4010 requests the export of a specific pipeline VM package (S6010).
- the pipeline VM package handler 4022 included in the server 4020 may extract meta information of the corresponding pipeline from a database and generate the XML.
- the VM image to be mapped is extracted using the cloud resource supply apparatus (S6020).
- the pipeline VM package handler 4022 generates the VM package image by merging the extracted meta information and the VM image (S6030).
- the client 4010 downloads the generated pipeline VM package image and exports it to a specific local storage space (S6040).
- the user can export the VM package image of the life information analysis pipeline designed by the user from the server and save it on the personal computer. . How to import back to the server is described below.
- FIG. 7 is a flowchart illustrating a VM package importing method of the life information analysis pipeline processing method according to an embodiment of the present invention.
- the client 4010 uploads a specific pipeline VM package image to the server 4020 (S7010).
- the client 4010 may use the data uploader 4013.
- the pipeline VM package handler extracts the VM image and pipeline meta information from the uploaded VM package image (S7020).
- the pipeline VM package handler 4022 stores the extracted meta information in the database 4040 (S7030).
- the storage of meta information is via object-relational mapping via the ORM engine 4027.
- the VM image is installed in the cloud service device 4030 through the cloud resource supply device 4026 (S7040). At this time, the installed place is the computing service device 4032.
- the present invention enables the user to secure his / her life information analysis pipeline model through the pipeline VM package import / export operation described above, and execute the analysis task at any time through any server.
- the present invention may be applied, in whole or in part, to a life information analysis pipeline processing system.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Theoretical Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Databases & Information Systems (AREA)
- Bioethics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
La présente invention porte sur un système de traitement de pipeline d'analyse d'informations biométriques apte à générer un pipeline d'analyse d'informations biométriques conformément à un procédé d'analyse d'informations biométriques souhaité par un utilisateur et de traiter le pipeline d'analyse d'informations biométriques généré, et sur un procédé de traitement correspondant. Un système de traitement de pipeline d'analyse d'informations biométriques selon un mode de réalisation de la présente invention comprend : un fournisseur de ressources en nuage pour installer une image de machine virtuelle (VM), qui comprend une tâche d'analyse pour analyser des données d'informations biométriques, dans un dispositif de service en nuage ; un serveur pour recevoir des méta-informations sur un pipeline d'analyse d'informations biométriques et l'image VM en provenance d'un client, extraire des informations d'identification (ID) de l'image VM qui est mappée aux méta-informations, diriger séquentiellement la tâche d'analyse conformément à une position d'exécution de la tâche d'analyse d'image VM, exécuter un pipeline d'analyse en association avec le fournisseur de ressources en nuage, et synchroniser un état de tâche avec un résultat ; et une base de données pour stocker les méta-informations et les informations ID mappées de l'image VM.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/KR2011/006686 WO2013035904A1 (fr) | 2011-09-08 | 2011-09-08 | Système et procédé de traitement de pipeline d'analyse d'informations biométriques |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/KR2011/006686 WO2013035904A1 (fr) | 2011-09-08 | 2011-09-08 | Système et procédé de traitement de pipeline d'analyse d'informations biométriques |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2013035904A1 true WO2013035904A1 (fr) | 2013-03-14 |
Family
ID=47832331
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/KR2011/006686 Ceased WO2013035904A1 (fr) | 2011-09-08 | 2011-09-08 | Système et procédé de traitement de pipeline d'analyse d'informations biométriques |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2013035904A1 (fr) |
Cited By (25)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105812175A (zh) * | 2016-03-07 | 2016-07-27 | 联想(北京)有限公司 | 一种资源管理方法及资源管理设备 |
| US9558321B2 (en) | 2014-10-14 | 2017-01-31 | Seven Bridges Genomics Inc. | Systems and methods for smart tools in sequence pipelines |
| US9851993B2 (en) | 2013-09-24 | 2017-12-26 | International Business Machines Corporation | Virtual machine template optimization |
| US9898575B2 (en) | 2013-08-21 | 2018-02-20 | Seven Bridges Genomics Inc. | Methods and systems for aligning sequences |
| US9904763B2 (en) | 2013-08-21 | 2018-02-27 | Seven Bridges Genomics Inc. | Methods and systems for detecting sequence variants |
| US10053736B2 (en) | 2013-10-18 | 2018-08-21 | Seven Bridges Genomics Inc. | Methods and systems for identifying disease-induced mutations |
| US10055539B2 (en) | 2013-10-21 | 2018-08-21 | Seven Bridges Genomics Inc. | Systems and methods for using paired-end data in directed acyclic structure |
| US10078724B2 (en) | 2013-10-18 | 2018-09-18 | Seven Bridges Genomics Inc. | Methods and systems for genotyping genetic samples |
| US10262102B2 (en) | 2016-02-24 | 2019-04-16 | Seven Bridges Genomics Inc. | Systems and methods for genotyping with graph reference |
| US10275567B2 (en) | 2015-05-22 | 2019-04-30 | Seven Bridges Genomics Inc. | Systems and methods for haplotyping |
| US10364468B2 (en) | 2016-01-13 | 2019-07-30 | Seven Bridges Genomics Inc. | Systems and methods for analyzing circulating tumor DNA |
| US10460829B2 (en) | 2016-01-26 | 2019-10-29 | Seven Bridges Genomics Inc. | Systems and methods for encoding genetic variation for a population |
| US10545792B2 (en) | 2016-09-12 | 2020-01-28 | Seven Bridges Genomics Inc. | Hashing data-processing steps in workflow environments |
| US10584380B2 (en) | 2015-09-01 | 2020-03-10 | Seven Bridges Genomics Inc. | Systems and methods for mitochondrial analysis |
| US10672156B2 (en) | 2016-08-19 | 2020-06-02 | Seven Bridges Genomics Inc. | Systems and methods for processing computational workflows |
| US10678613B2 (en) | 2017-10-31 | 2020-06-09 | Seven Bridges Genomics Inc. | System and method for dynamic control of workflow execution |
| US10724110B2 (en) | 2015-09-01 | 2020-07-28 | Seven Bridges Genomics Inc. | Systems and methods for analyzing viral nucleic acids |
| US10793895B2 (en) | 2015-08-24 | 2020-10-06 | Seven Bridges Genomics Inc. | Systems and methods for epigenetic analysis |
| US10832797B2 (en) | 2013-10-18 | 2020-11-10 | Seven Bridges Genomics Inc. | Method and system for quantifying sequence alignment |
| US11049587B2 (en) | 2013-10-18 | 2021-06-29 | Seven Bridges Genomics Inc. | Methods and systems for aligning sequences in the presence of repeating elements |
| US11055135B2 (en) | 2017-06-02 | 2021-07-06 | Seven Bridges Genomics, Inc. | Systems and methods for scheduling jobs from computational workflows |
| CN113674798A (zh) * | 2020-05-15 | 2021-11-19 | 复旦大学 | 蛋白质组学数据的分析系统 |
| US11250931B2 (en) | 2016-09-01 | 2022-02-15 | Seven Bridges Genomics Inc. | Systems and methods for detecting recombination |
| US11347704B2 (en) | 2015-10-16 | 2022-05-31 | Seven Bridges Genomics Inc. | Biological graph or sequence serialization |
| US12431217B2 (en) | 2014-01-10 | 2025-09-30 | Seven Bridges Genomics Inc. | Systems and methods for use of known alleles in read mapping |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090089128A1 (en) * | 2007-09-27 | 2009-04-02 | Microsoft Corporation | Service-oriented pipeline based architecture |
| KR20100001350A (ko) * | 2008-06-26 | 2010-01-06 | 한국생명공학연구원 | 파이프라인 역 탐색을 이용한 프로세스 설계시스템 및 방법 |
| KR20100011825A (ko) * | 2008-07-25 | 2010-02-03 | 한국생명공학연구원 | 정보 분석프로세스 추천 설계시스템 및 방법 |
-
2011
- 2011-09-08 WO PCT/KR2011/006686 patent/WO2013035904A1/fr not_active Ceased
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090089128A1 (en) * | 2007-09-27 | 2009-04-02 | Microsoft Corporation | Service-oriented pipeline based architecture |
| KR20100001350A (ko) * | 2008-06-26 | 2010-01-06 | 한국생명공학연구원 | 파이프라인 역 탐색을 이용한 프로세스 설계시스템 및 방법 |
| KR20100011825A (ko) * | 2008-07-25 | 2010-02-03 | 한국생명공학연구원 | 정보 분석프로세스 추천 설계시스템 및 방법 |
Non-Patent Citations (1)
| Title |
|---|
| BYEONG-CHUL, KANG ET AL.: "Web-based Research Assistant Tools for Analysis of Microbial Diversity", JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, vol. 14, no. 5, 2004, pages 545 - 550 * |
Cited By (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9898575B2 (en) | 2013-08-21 | 2018-02-20 | Seven Bridges Genomics Inc. | Methods and systems for aligning sequences |
| US9904763B2 (en) | 2013-08-21 | 2018-02-27 | Seven Bridges Genomics Inc. | Methods and systems for detecting sequence variants |
| US9851993B2 (en) | 2013-09-24 | 2017-12-26 | International Business Machines Corporation | Virtual machine template optimization |
| US10832797B2 (en) | 2013-10-18 | 2020-11-10 | Seven Bridges Genomics Inc. | Method and system for quantifying sequence alignment |
| US11447828B2 (en) | 2013-10-18 | 2022-09-20 | Seven Bridges Genomics Inc. | Methods and systems for detecting sequence variants |
| US10053736B2 (en) | 2013-10-18 | 2018-08-21 | Seven Bridges Genomics Inc. | Methods and systems for identifying disease-induced mutations |
| US11049587B2 (en) | 2013-10-18 | 2021-06-29 | Seven Bridges Genomics Inc. | Methods and systems for aligning sequences in the presence of repeating elements |
| US10078724B2 (en) | 2013-10-18 | 2018-09-18 | Seven Bridges Genomics Inc. | Methods and systems for genotyping genetic samples |
| US10204207B2 (en) | 2013-10-21 | 2019-02-12 | Seven Bridges Genomics Inc. | Systems and methods for transcriptome analysis |
| US10055539B2 (en) | 2013-10-21 | 2018-08-21 | Seven Bridges Genomics Inc. | Systems and methods for using paired-end data in directed acyclic structure |
| US12431217B2 (en) | 2014-01-10 | 2025-09-30 | Seven Bridges Genomics Inc. | Systems and methods for use of known alleles in read mapping |
| US9558321B2 (en) | 2014-10-14 | 2017-01-31 | Seven Bridges Genomics Inc. | Systems and methods for smart tools in sequence pipelines |
| US10083064B2 (en) | 2014-10-14 | 2018-09-25 | Seven Bridges Genomics Inc. | Systems and methods for smart tools in sequence pipelines |
| US10275567B2 (en) | 2015-05-22 | 2019-04-30 | Seven Bridges Genomics Inc. | Systems and methods for haplotyping |
| US11697835B2 (en) | 2015-08-24 | 2023-07-11 | Seven Bridges Genomics Inc. | Systems and methods for epigenetic analysis |
| US10793895B2 (en) | 2015-08-24 | 2020-10-06 | Seven Bridges Genomics Inc. | Systems and methods for epigenetic analysis |
| US12365933B2 (en) | 2015-08-24 | 2025-07-22 | Seven Bridges Genomics Inc. | Systems and methods for epigenetic analysis |
| US12173374B2 (en) | 2015-09-01 | 2024-12-24 | Seven Bridges Genomics Inc. | Systems and methods for analyzing viral nucleic acids |
| US11702708B2 (en) | 2015-09-01 | 2023-07-18 | Seven Bridges Genomics Inc. | Systems and methods for analyzing viral nucleic acids |
| US10724110B2 (en) | 2015-09-01 | 2020-07-28 | Seven Bridges Genomics Inc. | Systems and methods for analyzing viral nucleic acids |
| US10584380B2 (en) | 2015-09-01 | 2020-03-10 | Seven Bridges Genomics Inc. | Systems and methods for mitochondrial analysis |
| US11649495B2 (en) | 2015-09-01 | 2023-05-16 | Seven Bridges Genomics Inc. | Systems and methods for mitochondrial analysis |
| US11347704B2 (en) | 2015-10-16 | 2022-05-31 | Seven Bridges Genomics Inc. | Biological graph or sequence serialization |
| US10364468B2 (en) | 2016-01-13 | 2019-07-30 | Seven Bridges Genomics Inc. | Systems and methods for analyzing circulating tumor DNA |
| US11560598B2 (en) | 2016-01-13 | 2023-01-24 | Seven Bridges Genomics Inc. | Systems and methods for analyzing circulating tumor DNA |
| US10460829B2 (en) | 2016-01-26 | 2019-10-29 | Seven Bridges Genomics Inc. | Systems and methods for encoding genetic variation for a population |
| US10262102B2 (en) | 2016-02-24 | 2019-04-16 | Seven Bridges Genomics Inc. | Systems and methods for genotyping with graph reference |
| CN105812175B (zh) * | 2016-03-07 | 2020-05-26 | 联想(北京)有限公司 | 一种资源管理方法及资源管理设备 |
| CN105812175A (zh) * | 2016-03-07 | 2016-07-27 | 联想(北京)有限公司 | 一种资源管理方法及资源管理设备 |
| US10672156B2 (en) | 2016-08-19 | 2020-06-02 | Seven Bridges Genomics Inc. | Systems and methods for processing computational workflows |
| US11250931B2 (en) | 2016-09-01 | 2022-02-15 | Seven Bridges Genomics Inc. | Systems and methods for detecting recombination |
| US12482535B2 (en) | 2016-09-01 | 2025-11-25 | Seven Bridges Genomics Inc. | Systems and methods for detecting recombination |
| US10545792B2 (en) | 2016-09-12 | 2020-01-28 | Seven Bridges Genomics Inc. | Hashing data-processing steps in workflow environments |
| US11055135B2 (en) | 2017-06-02 | 2021-07-06 | Seven Bridges Genomics, Inc. | Systems and methods for scheduling jobs from computational workflows |
| US10678613B2 (en) | 2017-10-31 | 2020-06-09 | Seven Bridges Genomics Inc. | System and method for dynamic control of workflow execution |
| CN113674798A (zh) * | 2020-05-15 | 2021-11-19 | 复旦大学 | 蛋白质组学数据的分析系统 |
| CN113674798B (zh) * | 2020-05-15 | 2024-04-26 | 复旦大学 | 蛋白质组学数据的分析系统 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2013035904A1 (fr) | Système et procédé de traitement de pipeline d'analyse d'informations biométriques | |
| KR101282798B1 (ko) | 생명 정보 분석 파이프라인 처리 시스템 및 방법 | |
| Aziz et al. | SEED servers: high-performance access to the SEED genomes, annotations, and metabolic models | |
| Alpdemir et al. | Service-based distributed querying on the grid | |
| Turilli et al. | Middleware building blocks for workflow systems | |
| Fjukstad et al. | A review of scalable bioinformatics pipelines | |
| Mohr et al. | qPortal: A platform for data-driven biomedical research | |
| KR101279392B1 (ko) | 생명 정보 분석 파이프라인 처리 시스템 및 방법 | |
| Ferrés et al. | Protocol for post-processing of bacterial pangenome data using Pagoo pipeline | |
| WO2011065660A2 (fr) | Système de simulation de calcul et son procédé | |
| Senk et al. | A collaborative simulation-analysis workflow for computational neuroscience using HPC | |
| Woollard et al. | Scientific software as workflows: From discovery to distribution | |
| WO2020222407A1 (fr) | Système et procédé de plateforme de gestion d'apprentissage automatique | |
| WO2022139325A1 (fr) | Système informatique pour apprentissage adaptatif multi-domaine basé sur un réseau neuronal unique sans sur-apprentissage, et procédé associé | |
| Skidmore et al. | iPlant atmosphere: a gateway to cloud infrastructure for the plant sciences | |
| Sommerfeld et al. | AUGUSTUS at MediGRID: adaption of a bioinformatics application to grid computing for efficient genome analysis | |
| Cieślik et al. | A lightweight, flow-based toolkit for parallel and distributed bioinformatics pipelines | |
| WO2022107995A1 (fr) | Procédé et dispositif de traitement intégré de données pour modèle croisé de mégadonnées | |
| Shields et al. | A Java/CORBA‐based visual program composition environment for PSEs | |
| Gnimpieba et al. | RNA-Seq gene and transcript expression analysis using the BioExtract Server and iPlant Collaborative | |
| Kelly | Applying functional programming theory to the design of workflow engines | |
| Feagan et al. | Bioinformatics process management: information flow via a computational journal | |
| Türker et al. | B-fabric: an open source life sciences data management system | |
| Antezana et al. | ONTO-ToolKit: enabling bio-ontology engineering via Galaxy | |
| WO2025037801A1 (fr) | Appareil et procédé de construction d'un modèle de résumé qui résume un document de brevet, et appareil de résumé de document de brevet utilisant un modèle de résumé |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11872103 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 11872103 Country of ref document: EP Kind code of ref document: A1 |