[go: up one dir, main page]

CN109326330A - The production method of analysis of biological information tool, device and can storage medium - Google Patents

The production method of analysis of biological information tool, device and can storage medium Download PDF

Info

Publication number
CN109326330A
CN109326330A CN201811004190.9A CN201811004190A CN109326330A CN 109326330 A CN109326330 A CN 109326330A CN 201811004190 A CN201811004190 A CN 201811004190A CN 109326330 A CN109326330 A CN 109326330A
Authority
CN
China
Prior art keywords
analysis
biological information
order
analysis module
output
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811004190.9A
Other languages
Chinese (zh)
Other versions
CN109326330B (en
Inventor
袁晓辉
付玉华
李东野
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanya Boruiyuan Technology Co.,Ltd.
Original Assignee
Wuhan Ancient Gene Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan Ancient Gene Technology Co Ltd filed Critical Wuhan Ancient Gene Technology Co Ltd
Priority to CN201811004190.9A priority Critical patent/CN109326330B/en
Publication of CN109326330A publication Critical patent/CN109326330A/en
Application granted granted Critical
Publication of CN109326330B publication Critical patent/CN109326330B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of production methods of analysis of biological information tool, device and can storage medium.Wherein method includes the analysis process for parsing a biological information, establishes the implementation order of at least two mutually independent analytical procedures and any analytical procedure;According to the input traffic of analytical procedure Allocation Analysis module and analysis module, output stream and Parameter File;The execution program of Allocation Analysis module implementation analytical procedure;Implementing between two analysis modules adjacent in order, the output stream for configuring the analysis module of first order and the input traffic of the analysis module in rear order have same interface standard, establish the interactive link of all analysis modules according to different interface standards and execute sequence.According to the execution program of analysis module, Parameter File, and interactive link, the execution sequentially built analysis of biological information tool of any analysis module.The present invention can fast, efficiently establish the modularization analysis of biological information tool with good reusability.

Description

The production method of analysis of biological information tool, device and can storage medium
Technical field
The present invention relates to biology information technology analysis fields, in particular to a kind of system of analysis of biological information tool Make method, apparatus and can storage medium.
Background technique
With the development of sequencing technologies, related sequencing data is increased with the rate for surpassing More's law, service sequencing number According to bioinformatics software be widely deployed.
Since sequencing data to be treated is more, the operation flow of current biological information software and the realization of source code are caused It is second-rate;So, it is very heavy to provide the analysis of biological information tool with good readable, reusability and portability It wants.
Summary of the invention
The embodiment of the present invention at least provides a kind of production method of analysis of biological information tool, can be according to different biologies Information fast, efficient foundation have the analysis tool of good reusability.
The specific implementation of above-described embodiment, as described below.
The described method includes:
The analysis process for parsing a biological information establishes at least two mutually independent analytical procedures and any analysis The implementation order of step;
According to the analytical procedure Allocation Analysis module and input traffic, output stream and the ginseng of the analysis module Number file;
Configure the execution program that the analysis module implements the analytical procedure;
Between two analysis modules adjacent in the implementation order, the defeated of the analysis module of first order is configured The input traffic of data flow and the analysis module in rear order has same interface standard out, according to the different interface marks Standard establishes the interactive link of all analysis modules and executes sequence.
According to the execution program of the analysis module, Parameter File and all analysis modules interactive link, hold Analysis of biological information tool described in row sequentially built.
Preferred version is to establish several independent implementations time respectively according to the analysis process in the present embodiment Sequence.
Preferred version is that the input traffic includes at least initial data, sample names, connector letter in the present embodiment One of breath and journal file are a variety of;
Preferred version is that the output stream includes at least outgoing route and/or cache path in the present embodiment.
Preferred version is that the Parameter File is to be stored in config interface in the analysis module in the present embodiment Build one of library type, minimum length and sliding window size or a variety of.
Preferred version is in the present embodiment, which comprises
The analysis process for parsing a biological information establishes at least two mutually independent analytical procedures and any analysis The implementation order of step;
According to the analytical procedure Allocation Analysis module and input traffic, output stream and the ginseng of the analysis module Number file;
One workflow frame is established according to the implementation order between any analysis module using nextflow;
Match the process of different analysis modules respectively in the workflow frame;
Implement the nextflow shell script of the process according to analytical procedure configuration;
Parameter is implemented in config interface setting according to Parameter File in the process;
In the process according to input data stream configuration input, output is configured according to output data stream configuration, Implement to configure the output of the process of first order and in rear order between two adjacent process of order The input of process has same-interface standard;
The interactive link between process described in any two is established according to interface standard identical input and output And execute sequence;
The execution program and implementation for the process, the process configuration for including according to the workflow frame Analysis of biological information tool described in parameter, the interactive connection of any process and execution sequentially built.
The embodiment of the present invention separately disclose one kind can storage medium, for storing instruction, which is characterized in that described instruction is located Reason device realizes above-mentioned steps when executing.
The embodiment of the present invention separately discloses a kind of producing device of analysis of biological information tool,
Described device includes:
Parsing module parses the analysis process of a biological information, establishes at least two mutually independent analytical procedures and appoints The implementation order for the analytical procedure of anticipating;According to the analytical procedure Allocation Analysis module and the input data of the analysis module Stream, output stream and Parameter File;
Module is made, one workflow is established according to the implementation order between any analysis module using nextflow Frame;Match the process of different analysis modules respectively in the workflow frame;It is configured according to the analytical procedure real Apply the nextflow shell script of the process;Config interface according to Parameter File in the process is arranged in fact Apply parameter;In the process according to input data stream configuration input, output is configured according to output data stream configuration, in reality It applies between two adjacent process of order, configures the output and the process in rear order of the process of first order Input have same-interface standard;It is established described in any two according to interface standard identical input and output Interactive link between process and execute sequence;
Package module, the execution of the process for including according to the workflow frame, the process configuration Analysis of biological information tool described in program and implementation parameter, the interactive connection of any process and execution sequentially built.
For above scheme, the present invention is by being referring to the drawings described in detail disclosed exemplary embodiment, also The other feature and its advantage for making the embodiment of the present invention understand.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below will be to needed in the embodiment attached Figure is briefly described, it should be understood that the following drawings illustrates only certain embodiments of the present invention, therefore is not construed as pair The restriction of range for those of ordinary skill in the art without creative efforts, can also be according to this A little attached drawings obtain other relevant attached drawings.
Fig. 1 is the flow chart of embodiment method;
Fig. 2 is the structure chart of embodiment device.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the present invention, instead of all the embodiments.The present invention being usually described and illustrated herein in the accompanying drawings is implemented The component of example can be arranged and be designed with a variety of different configurations.
Therefore, the detailed description of the embodiment of the present invention provided in the accompanying drawings is not intended to limit below claimed The scope of the present invention, but be merely representative of selected embodiment of the invention.Based on the embodiments of the present invention, this field is common Technical staff's every other embodiment obtained without creative efforts belongs to the model that the present invention protects It encloses.
The present embodiment provides a kind of production methods of analysis of biological information tool.Method through this embodiment, this implementation Example will decompose the analysis process of biological information, make the biggish execution program of a scale of construction in fact according to biological information Analytical procedure be divided into several different analysis modules;It is relatively independent and have a single function between analysis module, it can be independent Operation executes program, is easily managed and uses.Between different analysis modules, it is only necessary to application such as nextflow and binding analysis stream Journey bonds different analysis modules by defining interface type, realizes the continuous of analysis module.
In order to realize above content, Fig. 1 is please referred to.The present embodiment makes modularization for carrying out data analysis to mRNA Analysis of biological information tool, specifically include:
Step100, parsing are number Quality Control (QC), genome alignment to the mRNA analysis process for implementing data analysis (Mapping), quantitative (Quantify), variance analysis (Diff), variable sheer (AS), variation detection (SNV) and function enrichment (GO/KEGG) the implementation order between independent analytical procedure and analytical procedure such as.
Step210, according to above-mentioned analytical procedure respectively establish matching QC, Mapping, AS, Quantify, SNV, Diff and The analysis module of the analytical procedures such as GO/KEGG.
Step220, input, processing and output according to analytical procedure to data configure config interface in analysis module And abstract input traffic, output stream and other parameters.
For example, the corresponding input traffic of config interface of QC analysis module include at least initial data, sample names, Joint information and journal file, output stream include at least outgoing route, cache path, and other parameters include at least and build library class Type, minimum length and sliding window size etc..
Step310, a workflow frame is established using biological information process Customization Tool nextflow and in workflow Independent process is established according to each analysis module in frame.
Nextflow is mainly the interface standard for passing through definition input and output, different implementation project, that is, process It connects, to realize the analysis process for carrying out data analysis to biological information.
Step320, configuration realize the nextflow script of process, include at least actuator, environmental variance and process ginseng Number etc..The language of nextflow script is preferably the superset of groovy language.
Step330, other above-mentioned ginsengs that each process corresponding with analysis module is respectively configured according to config interface Number.
Step340, according to implement order process configuration meet above-mentioned input interface standard and with abstract input number According to the corresponding input of stream;Configuration meets above-mentioned output interface standard and output corresponding with abstract output stream.
I.e. between adjacent two process for implementing order, upper one implements the output of order process and next reality The interface standard for applying the input of order process is identical, realizes interactive link;
The interactive link that the present embodiment is formed by input between any two process and output is established all Interaction between process and sequence is executed, and then establishes the analysis process of analysis of biological information tool.
Step350, in the analysis process of analysis of biological information tool and the interactive link of all process and sequence is executed After the completion of foundation, that is, analysis of biological information tool can be used to execute nextflow_run order realization to the analysis work of biological information Make.
Through the above scheme, the interface standard that the present embodiment application nextflow is defined quickly connects different process' Input and output stream;Above-mentioned process can individually be developed under different language environment according to specific analytical procedure, Simplify production process;Meanwhile the analysis of biological information tool of the present embodiment method production has reusability, effective solution Development difficulty existing for current biological information analysis tool is big, reusability is poor and is difficult to problem difficult in maintenance.
Further, the present embodiment separately provide a kind of computer can storage medium, executed based on above-mentioned steps by storing The instruction of calculation machine.
Referring to FIG. 2, the present embodiment separately discloses a kind of producing device of analysis of biological information tool, including parsing module, Make module and package module.
Parsing module parses the analysis process of a biological information, establishes at least two mutually independent analytical procedures and any The implementation order of analytical procedure;According to the input traffic of analytical procedure Allocation Analysis module and analysis module, output stream And Parameter File.It makes module application nextflow and one workflow frame is established according to the implementation order between any analysis module Frame;Match the process of different analysis modules respectively in workflow frame;Implemented according to analytical procedure configuration The nextflow shell script of process;Parameter is implemented in config interface setting according to Parameter File in process;? Process configures output according to input data stream configuration input, according to output data stream configuration, and implementing, order is adjacent Between two process, the input for configuring the output and the process in rear order of the process of first order has phase Same interface standard;According to interface standard identical input and output establish interactive link between any two process and Execute sequence.Package module, the execution program for process, process configuration for including according to workflow frame and implementation are joined The interactive connection and execution sequentially built analysis of biological information tool of several, any process.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the invention may be variously modified and varied.All within the spirits and principles of the present invention, made any to repair Change, equivalent replacement, improvement etc., should all be included in the protection scope of the present invention.

Claims (8)

1. a kind of production method of analysis of biological information tool, which is characterized in that
The described method includes:
The analysis process for parsing a biological information establishes at least two mutually independent analytical procedures and any analytical procedure Implementation order;
According to the input traffic of the analytical procedure Allocation Analysis module and the analysis module, output stream and parameter text Part;
Configure the execution program that the analysis module implements the analytical procedure;
Between two analysis modules adjacent in the implementation order, the output number of the analysis module of first order is configured There is same interface standard according to stream and the input traffic of the analysis module in rear order, built according to the different interface standards It founds the interactive link of all analysis modules and executes sequence.
It is suitable according to the execution program of the analysis module, the interactive link of Parameter File and all analysis modules, execution Sequence makes the analysis of biological information tool.
2. the production method of analysis of biological information tool as described in claim 1, which is characterized in that
Several independent implementation order are established respectively according to the analysis process.
3. the production method of analysis of biological information tool as described in claim 1, which is characterized in that the input data is flow to It less include one of initial data, sample names, joint information and journal file or a variety of.
4. the production method of analysis of biological information tool as described in claim 1, which is characterized in that the output data is flow to It less include outgoing route and/or cache path.
5. the production method of analysis of biological information tool as described in claim 1, which is characterized in that the Parameter File is to deposit Storage config interface in the analysis module builds one of library type, minimum length and sliding window size or a variety of.
6. the production method of analysis of biological information tool as described in claim 1, which is characterized in that the described method includes:
The analysis process for parsing a biological information establishes at least two mutually independent analytical procedures and any analytical procedure Implementation order;
According to the input traffic of the analytical procedure Allocation Analysis module and the analysis module, output stream and parameter text Part;
One workflow frame is established according to the implementation order between any analysis module using nextflow;
Match the process of different analysis modules respectively in the workflow frame;
Implement the nextflow shell script of the process according to analytical procedure configuration;
Parameter is implemented in config interface setting according to Parameter File in the process;
In the process according to input data stream configuration input, output is configured according to output data stream configuration, is being implemented Between two adjacent process of order, the output of the process of first order is configured with the process's in rear order Input has same-interface standard;
According to the interface standard identical input and output interactive link established between process described in any two and hold Row sequence;
The execution program and implementation ginseng for the process, the process configuration for including according to the workflow frame Analysis of biological information tool described in number, the interactive connection of any process and execution sequentially built.
7. one kind can storage medium, for storing instruction, which is characterized in that realize when described instruction is executed by processor such as right It is required that the step of 1 to 6 any one the method.
8. a kind of producing device of analysis of biological information tool, which is characterized in that
Described device includes:
Parsing module parses the analysis process of a biological information, establishes at least two mutually independent analytical procedures and any institute State the implementation order of analytical procedure;According to the input traffic of the analytical procedure Allocation Analysis module and the analysis module, Output stream and Parameter File;
Module is made, one workflow frame is established according to the implementation order between any analysis module using nextflow Frame;Match the process of different analysis modules respectively in the workflow frame;It is configured and is implemented according to the analytical procedure The nextflow shell script of the process;It is arranged according to Parameter File in the config interface of the process and is implemented Parameter;In the process according to input data stream configuration input, output is configured according to output data stream configuration, is being implemented Between two adjacent process of order, the output of the process of first order is configured with the process's in rear order Input has same-interface standard;Process described in any two is established according to interface standard identical input and output Between interactive link and execute sequence;
Package module, the execution program of the process for including according to the workflow frame, the process configuration And implements parameter, the interactive connection of any process and execute analysis of biological information tool described in sequentially built.
CN201811004190.9A 2018-08-30 2018-08-30 Method and device for manufacturing biological information analysis tool and storage medium Active CN109326330B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811004190.9A CN109326330B (en) 2018-08-30 2018-08-30 Method and device for manufacturing biological information analysis tool and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811004190.9A CN109326330B (en) 2018-08-30 2018-08-30 Method and device for manufacturing biological information analysis tool and storage medium

Publications (2)

Publication Number Publication Date
CN109326330A true CN109326330A (en) 2019-02-12
CN109326330B CN109326330B (en) 2020-10-16

Family

ID=65264268

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811004190.9A Active CN109326330B (en) 2018-08-30 2018-08-30 Method and device for manufacturing biological information analysis tool and storage medium

Country Status (1)

Country Link
CN (1) CN109326330B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110750362A (en) * 2019-12-19 2020-02-04 深圳华大基因科技服务有限公司 Analysis method, device and storage medium for biological information
CN115757520A (en) * 2022-11-28 2023-03-07 北京元码医学检验实验室有限公司 Flow control processing method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020168664A1 (en) * 1999-07-30 2002-11-14 Joseph Murray Automated pathway recognition system
CN107004069A (en) * 2015-04-30 2017-08-01 株式会社Xcoo Genome analysis device and genome visualization method
CN107122626A (en) * 2017-03-13 2017-09-01 上海海云生物科技有限公司 The method and system of the bioinformatic analysis of two generations sequencing DNA mutation detection
CN107563152A (en) * 2017-08-03 2018-01-09 北京百迈客生物科技有限公司 The data analysis application system that methylates based on biological cloud platform
CN107590365A (en) * 2017-08-22 2018-01-16 武汉古奥基因科技有限公司 Application of the Docker technologies in high-flux sequence data analysis
CN107657150A (en) * 2017-09-07 2018-02-02 北京荣之联科技股份有限公司 One kind visualization analysis of biological information instrument generation method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020168664A1 (en) * 1999-07-30 2002-11-14 Joseph Murray Automated pathway recognition system
CN107004069A (en) * 2015-04-30 2017-08-01 株式会社Xcoo Genome analysis device and genome visualization method
CN107122626A (en) * 2017-03-13 2017-09-01 上海海云生物科技有限公司 The method and system of the bioinformatic analysis of two generations sequencing DNA mutation detection
CN107563152A (en) * 2017-08-03 2018-01-09 北京百迈客生物科技有限公司 The data analysis application system that methylates based on biological cloud platform
CN107590365A (en) * 2017-08-22 2018-01-16 武汉古奥基因科技有限公司 Application of the Docker technologies in high-flux sequence data analysis
CN107657150A (en) * 2017-09-07 2018-02-02 北京荣之联科技股份有限公司 One kind visualization analysis of biological information instrument generation method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110750362A (en) * 2019-12-19 2020-02-04 深圳华大基因科技服务有限公司 Analysis method, device and storage medium for biological information
CN115757520A (en) * 2022-11-28 2023-03-07 北京元码医学检验实验室有限公司 Flow control processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN109326330B (en) 2020-10-16

Similar Documents

Publication Publication Date Title
Sim et al. ChemOS 2.0: An orchestration architecture for chemical self-driving laboratories
CN114424257A (en) Automatic rendering and extraction of form data using machine learning
Ahmad et al. A framework for architecture-driven migration of legacy systems to cloud-enabled software
JP2005521947A5 (en)
CN119558340B (en) Methods and related devices for handling explicit result acquisition tasks based on multi-agent collaboration
CN105653900A (en) Reference-genome-free transcriptome analysis system and reference-genome-free transcriptome analysis method
CN109522005A (en) Cross-platform GRAPHICAL PROGRAMMING method
CN107657150A (en) One kind visualization analysis of biological information instrument generation method and device
US20250252376A1 (en) Workflow Optimization Leveraging Generative AI and Quantum Simulation
Bates et al. Unified modeling language code generation from diagram images using multimodal large language models
CN109326330A (en) The production method of analysis of biological information tool, device and can storage medium
Petrakis et al. Enhancing DevOps Practices in the IoT–Edge–Cloud Continuum: Architecture, Integration, and Software Orchestration Demonstrated in the COGNIFOG Framework
Ko et al. ModelCenter MBSE for OpenMBEE: MBSE analysis integration for distributed development
Han Bioworks: a workflow system for automation of bioinformatics analysis processes
Prakash et al. Systematic Analysis of Factors Influencing Modulith Architecture Adoption over Microservices
CN114327580B (en) Management method, system, device and storage medium based on software warehouse
Lücke-Tieke et al. Lowering the barrier for successful replication and evaluation
CN118034672A (en) Spark-based visual data processing analysis method and system
JP2024132871A (en) How to automatically optimize tasks in production and research facilities
CN117762404A (en) Configurable operator processing method and device for data mining
Frolov et al. Digital Transformation in the Biopharmaceutical Industry: Rebuilding the Way We Discover Complex Therapeutics
Mileff Design and development of a web-based graph editor and simulator application
CN105956033A (en) Plate form generation method and apparatus
Maldonado et al. Towards a Domain-Specific Language for Provisioning Multiple Cloud Testing Environments for Mobile Applications
de Luz Palomino Valdivia et al. Guided visual analysis of multivariate time series

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230406

Address after: No. 303, 3rd Floor, Building 9, Youyou Industrial Park, Yazhou Bay Science and Technology City, Yazhou District, Sanya City, Hainan Province, 572025

Patentee after: Sanya Boruiyuan Technology Co.,Ltd.

Address before: Room 3, 4th Floor, Building 6, Phase 1, Yinjiu Technology Industrial Park, No. 35 Guanggu Avenue, Donghu New Technology Development Zone, Wuhan City, Hubei Province, 430000

Patentee before: WUHAN GOOAL GENE TECHNOLOGY CO.,LTD.

TR01 Transfer of patent right