CN111755116A - Method for judging sex of sample and device for implementing method - Google Patents

Info

Publication number
CN111755116A
Authority
CN
China
Prior art keywords
chrx
sample
specific
sites
sequencing data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910252113.3A
Other languages
Chinese (zh)
Inventor
王晶
李川
蒋立坤
侯光远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Euroimmun Medizinische Labordiagnostika AG
Original Assignee
Euroimmun Medizinische Labordiagnostika AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Euroimmun Medizinische Labordiagnostika AG
Priority to CN201910252113.3A
Publication of CN111755116A
Legal status: Pending

Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876 Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6879 Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for sex determination
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Public Health (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Data Mining & Analysis (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Wood Science & Technology (AREA)
  • Epidemiology (AREA)
  • Biomedical Technology (AREA)
  • Analytical Chemistry (AREA)
  • Primary Health Care (AREA)
  • Databases & Information Systems (AREA)
  • Zoology (AREA)
  • Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a method for judging the sex of a sample, which comprises the following steps: (1) obtaining sequencing data of a sample; (2) calculating whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and (3) judging the sex of the sample according to the result of the step (2), wherein if one or more specific sites are heterozygous sites, the sample is female, otherwise, the sample is male. The invention also relates to a device and an apparatus for carrying out the method.

Description

Method for judging sex of sample and device for implementing method
Technical Field
The invention relates to a method for analyzing sequencing data. More particularly, the present invention relates to a method for determining the sex of a sample by analyzing sequencing data and an apparatus for performing the method.
Background
In recent years, second-generation sequencing has developed rapidly and high-throughput gene sequencing has steadily matured. Continued innovation in sequencing principles and breakthroughs in engineering materials have allowed high-throughput genome sequencing to meet individualized needs and to move from basic research laboratories into clinical medicine. Since its advent, second-generation sequencing has offered great promise for clinical application because of its high efficiency and low cost per base. Compared with traditional Sanger sequencing, the volume of data generated per run is enormous, and the continued development of information science has made effective processing of such large data sets possible.
More recently, second-generation sequencing has been used to detect the causative genes of complex diseases such as cancer and genetic diseases, and is gradually entering clinical diagnosis. In the analysis of genetic diseases, sex is one of the key factors. This is especially true of sex-linked genetic diseases, including sex-linked dominant diseases (anhidrosis, vitamin D-resistant rickets, hereditary nephritis, lipoma, syringomyelia, etc.) and sex-linked recessive diseases (red-green color blindness, hemophilia, favism, familial hereditary optic atrophy, hemangioma, testicular feminization syndrome, renal diabetes, congenital cataract, anophthalmia, etc.), where sex information is a main reference factor in the detection and analysis of pathogenic genes. Sex information is therefore of great importance in the clinical analysis of genetic diseases: only when the sex is known can the disease analysis be carried out accurately and support clinical practice.
Currently, when genetic diseases are analyzed by next-generation sequencing, the sex information is usually entered by the patient or the doctor and then passed directly to the data analyst. During this transfer, however, manual entries are often wrong or missing, which biases or invalidates subsequent analysis. In addition, samples can be swapped during laboratory handling, so that the sequencing analysis result no longer matches the sample source and may even lead to a wrong diagnosis. Sex information therefore needs to be quality-controlled. Sex must also be determined in cases where morphological methods cannot be used, such as severely damaged bodies in forensic work and disasters, early fetal diagnosis, and skeletonized remains in archaeology.
A simple and effective sex determination method is therefore needed to provide quality control of sex information and to detect sample swaps during laboratory handling promptly, avoiding wasted experiments and ensuring the accuracy of subsequent analysis.
Disclosure of Invention
The inventors have unexpectedly found that the presence of certain specific heterozygous sites on the X chromosome can be used to judge sex. Specifically, since female samples have two X chromosomes while male samples have only one, these specific sites can be heterozygous only in female samples (whereas they can be homozygous in either male or female samples).
Accordingly, a first aspect of the present invention relates to a method of determining the sex of a sample, comprising the steps of:
(1) obtaining sequencing data of a sample;
(2) calculating whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and
(3) judging the sex of the sample according to the result of step (2), wherein the sample is judged to be female if one or more of the specific sites are heterozygous, and male otherwise;
wherein the specific site on the X chromosome is selected from the sites shown in Table 1 below:
Table 1: [published as images in the original (Figures BDA0002012656320000021 and BDA0002012656320000031); the sites are also listed in claim 1]
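Steps (1) to (3) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation used by the applicant: the function names `parse_site` and `call_sex`, the dictionary input format, and the decision to treat uncovered sites as not heterozygous are all hypothetical. The three site identifiers are copied from the list in claim 1 and stand in for the full panel.

```python
from typing import Iterable, Mapping, Tuple

# Three identifiers copied from the site list in claim 1; a real panel
# would hold all of the listed sites (assumption: a subset illustrates the idea).
EXAMPLE_PANEL = ("ChrX_31138253", "ChrX_32503194", "ChrX_119605603")


def parse_site(site_id: str) -> Tuple[str, int]:
    """Split an identifier such as 'ChrX_31138253' into (chromosome, position)."""
    chrom, _, pos = site_id.partition("_")
    if chrom.lower() != "chrx" or not pos.isdigit():
        raise ValueError(f"not an X-chromosome site identifier: {site_id!r}")
    return chrom, int(pos)


def call_sex(het_calls: Mapping[str, bool],
             panel: Iterable[str] = EXAMPLE_PANEL) -> str:
    """Return 'female' if any panel site is heterozygous, otherwise 'male'.

    het_calls maps site identifiers to the outcome of step (2). Sites that
    are absent from het_calls count as not heterozygous (assumption), and
    sites outside the panel are ignored.
    """
    return "female" if any(het_calls.get(site, False) for site in panel) else "male"
```

Note that a sample with no covered panel sites is reported as male under this sketch; a production pipeline might prefer to return an "undetermined" result in that case, a situation the text does not address.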
In a second aspect, the invention also relates to a device for determining the sex of a sample, comprising:
- data extraction means for obtaining sequencing data of the sample;
- computing means for computing whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and
- determining means for determining the sex of the sample, wherein the sample is judged to be female if one or more of said specific sites are heterozygous, and male otherwise;
wherein the specific site is selected from the sites shown in Table 1 above.
In a third aspect, the invention also relates to an apparatus for determining the sex of a sample, comprising:
a memory configured to store one or more programs;
a processing unit coupled to the memory and configured to execute the one or more programs to cause a management system to perform a plurality of actions, the actions comprising:
(1) inputting sequencing data of a sample;
(2) calculating whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and
(3) judging the sex of the sample, wherein the sample is judged to be female if one or more of the specific sites are heterozygous, and male otherwise;
wherein the specific site is selected from the sites shown in Table 1 above.
In a fourth aspect, the invention also relates to a computer readable storage medium having stored thereon machine executable instructions which, when executed, cause a machine to perform the steps of the method of determining gender according to the invention.
As used herein, the term "heterozygous site" means that two different bases are detected at the site, each at a frequency of 0.5. As used herein, "base frequency" refers to the ratio of the number of occurrences of the most abundant base at a site to the total number of bases detected at that site. For example, if 10 sequences cover a site after sequence alignment and all of them show base A at the site, the base frequency of the site is 1; if 9 of the sequences show base A and the remaining sequence shows base C, the base frequency of the site is 0.9. The base frequency of a site can be calculated, and whether the site is heterozygous can be determined, by any method known to those skilled in the art. Theoretically, a site with a base frequency of 0.5 is judged to be a "heterozygous site", and a site with a base frequency of 1 is judged to be a "homozygous site".
Of course, those skilled in the art will understand that the base frequency of a "heterozygous site" is not always exactly the theoretical value of 0.5, because the measured base frequency may deviate within a reasonable margin owing to experimental error and similar factors. For example, the base frequency of a detected heterozygous site may vary within the range 0.5 ± 0.1. The reasonable error range can be determined by one skilled in the art using routine methods.
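The definition of base frequency and the 0.5 ± 0.1 window can be illustrated with a short Python sketch. It assumes the bases observed at one site are supplied as a plain string with one character per covering read; the function names, that input format, and the small floating-point epsilon are assumptions made for illustration rather than details given in the text.

```python
from collections import Counter


def base_frequency(bases: str) -> float:
    """Fraction of observed bases that equal the most abundant base at one site."""
    if not bases:
        raise ValueError("no reads cover this site")
    counts = Counter(bases.upper())
    return counts.most_common(1)[0][1] / len(bases)


def is_heterozygous(bases: str, expected: float = 0.5,
                    tolerance: float = 0.1) -> bool:
    """True if the base frequency lies within expected +/- tolerance."""
    # A tiny epsilon keeps boundary values such as 0.6 from being rejected
    # because of floating-point rounding.
    return abs(base_frequency(bases) - expected) <= tolerance + 1e-9
```

With the two cases described above, `base_frequency("AAAAAAAAAA")` gives 1.0 and `base_frequency("AAAAAAAAAC")` gives 0.9; neither falls inside the window, so both sites would be treated as homozygous.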
It should be further appreciated that the present disclosure may be embodied as methods, apparatus, systems, and/or computer program products. The computer program product may include a computer-readable storage medium having computer-readable program instructions embodied thereon for carrying out various aspects of the present disclosure.
The computer readable storage medium may be a tangible device that can hold and store the instructions for use by the instruction execution device. The computer readable storage medium may be, for example, but not limited to, an electronic memory device, a magnetic memory device, an optical memory device, an electromagnetic memory device, a semiconductor memory device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a Static Random Access Memory (SRAM), a portable compact disc read-only memory (CD-ROM), a Digital Versatile Disc (DVD), a memory stick, a floppy disk, a mechanical coding device, such as punch cards or in-groove projection structures having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media as used herein is not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission medium (e.g., optical pulses through a fiber optic cable), or electrical signals transmitted through electrical wires.
The computer-readable program instructions described herein may be downloaded from a computer-readable storage medium to a respective computing/processing device, or to an external computer or external storage device via a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. The network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards the computer-readable program instructions for storage in a computer-readable storage medium in the respective computing/processing device.
The computer program instructions for carrying out operations of the present disclosure may be assembly instructions, Instruction Set Architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state-setting data, or source code or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Python, Smalltalk, C++ or the like, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, aspects of the present disclosure are implemented by personalizing an electronic circuit, such as a programmable logic circuit, a Field Programmable Gate Array (FPGA), or a Programmable Logic Array (PLA), that can execute computer-readable program instructions using state information of the computer-readable program instructions.
These computer-readable program instructions may be provided to a processing unit of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processing unit of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart block or blocks. These computer-readable program instructions may also be stored in a computer-readable storage medium that can direct a computer, programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer-readable medium storing the instructions comprises an article of manufacture including instructions which implement the various aspects of the function/act specified in the flowchart block or blocks.
The advantages of the invention are as follows. The sex of a sample can be determined accurately by calculating the base frequencies of the specific sites, using only existing sequencing data and without additional experiments, so the method is low-cost and efficient. Mismatches between sample and data that arise during data entry or laboratory handling can thus be detected and corrected promptly, avoiding wasted expense and time. The method can also be used where sex cannot be determined morphologically, for example in disaster events and forensic investigations.
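The quality-control use described here, detecting a mismatch between the recorded sex and the sequencing data, could look roughly like the following Python sketch. The function name `find_sex_mismatches`, the dictionary inputs, and the rule that a missing record is also flagged are hypothetical choices for illustration; the text does not specify how the comparison is carried out.

```python
from typing import Dict, List, Optional


def find_sex_mismatches(recorded: Dict[str, Optional[str]],
                        inferred: Dict[str, str]) -> List[str]:
    """Return IDs of samples whose recorded sex is missing or disagrees
    with the sex inferred from the sequencing data."""
    flagged = []
    for sample_id, called in sorted(inferred.items()):
        entry = recorded.get(sample_id)
        # Normalise case and whitespace so 'Male ' and 'male' agree.
        if entry is None or entry.strip().lower() != called:
            flagged.append(sample_id)
    return flagged
```

Flagged samples would then be reviewed by hand, since a disagreement may come from a data-entry error, a sample swap, or an incorrect call.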
Having described various embodiments of the present disclosure, the foregoing description is illustrative and is not intended to limit the invention in any way. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments.
The invention will be further illustrated with reference to specific examples.
Detailed Description
Example 1 identification of specific sites on the X chromosome
Second-generation sequencing data from 50 male samples and 50 female samples were obtained and aligned to the hg19 human reference genome. From the aligned sites, those located on the X chromosome and common to all 100 samples were selected, excluding sites in regions homologous between the X and Y chromosomes, and the base frequency of each remaining site was calculated. A sex classification model was then built by logistic regression from the base frequencies of all remaining sites and the sex labels of the samples, and the contribution of each site to correct sex determination was calculated. The 102 sites with the largest contribution values, whose base frequencies effectively distinguish the sex of a sample, were taken as the specific sites. These 102 specific sites are shown in Table 1.
Statistical analysis of the base frequencies of the 102 specific sites showed that, in female samples, at least one of the specific sites on the X chromosome is heterozygous.
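The site-selection idea can be sketched in Python with a small logistic regression fitted by gradient descent. Everything specific here is an assumption: the text does not say how the contribution values were computed, so the absolute value of each fitted coefficient is used as a stand-in, and the six samples, three site names, and base frequencies are made-up toy values rather than data from the study.

```python
import math
from typing import Dict, Sequence


def rank_sites(freqs: Sequence[Sequence[float]], labels: Sequence[int],
               site_names: Sequence[str], steps: int = 500,
               lr: float = 0.5) -> Dict[str, float]:
    """Fit logistic regression by gradient descent; return one weight per site."""
    n, m = len(freqs), len(site_names)
    # Centre each column so that a site with no variation receives zero weight.
    means = [sum(row[j] for row in freqs) / n for j in range(m)]
    x = [[row[j] - means[j] for j in range(m)] for row in freqs]
    w, b = [0.0] * m, 0.0
    for _ in range(steps):
        grad_w, grad_b = [0.0] * m, 0.0
        for xi, yi in zip(x, labels):
            z = b + sum(wj * xj for wj, xj in zip(w, xi))
            residual = 1.0 / (1.0 + math.exp(-z)) - yi  # predicted minus actual
            grad_b += residual
            for j in range(m):
                grad_w[j] += residual * xi[j]
        b -= lr * grad_b / n
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
    return dict(zip(site_names, w))


# Made-up base frequencies for six samples at three hypothetical sites.
TOY_FREQS = [
    [0.5, 1.0, 0.5],  # female
    [0.5, 1.0, 1.0],  # female
    [0.5, 1.0, 1.0],  # female
    [1.0, 1.0, 1.0],  # male
    [1.0, 1.0, 1.0],  # male
    [1.0, 1.0, 1.0],  # male
]
TOY_LABELS = [1, 1, 1, 0, 0, 0]  # 1 = female, 0 = male
TOY_SITES = ["site_A", "site_B", "site_C"]

weights = rank_sites(TOY_FREQS, TOY_LABELS, TOY_SITES)
# Sites sorted from largest to smallest absolute coefficient.
ranking = sorted(TOY_SITES, key=lambda s: abs(weights[s]), reverse=True)
```

In this toy data `site_A` separates the two groups completely and therefore ranks first, while `site_B`, which never varies, receives a weight of zero. With real data one would use a tested library implementation with regularisation and keep the highest-ranked sites.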
Example 2 determination of the sex of a sample according to the method of the invention
Sequencing data from 32 samples were obtained, the base frequencies of the specific sites on the X chromosome shown in Table 1 were calculated according to the method of the present invention, and the sex of each sample was judged from the specific sites and their base frequencies (based on the statistical analysis, a site with a base frequency of 0.5 ± 0.1 was judged to be heterozygous). The results are shown in Table 2 below (only the detected heterozygous sites and their base frequencies are shown).
Table 2. [published as images in the original (Figures BDA0002012656320000071, BDA0002012656320000081, BDA0002012656320000091 and BDA0002012656320000101); lists the detected heterozygous sites and their base frequencies]
These results show that the method according to the present invention determined the sex of the samples with an accuracy of 100%.
The above description is only an example of the present invention and is not intended to limit the present invention, and it is obvious to those skilled in the art that the present invention may be modified and changed. Any modification, equivalent replacement, or improvement made without departing from the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (6)

1. A method of determining the gender of a sample, comprising the steps of:
(1) obtaining sequencing data of a sample;
(2) calculating whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and
(3) judging the sex of the sample according to the result of the step (2), wherein if one or more specific sites are heterozygous sites, the sample is female, otherwise, the sample is male;
wherein the specific sites on the X chromosome are selected from the sites shown in the following table:
ChrX_31138253 ChrX_32503194 ChrX_119605603 ChrX_32380996 ChrX_31138589 ChrX_32534883 ChrX_119605614 ChrX_32383469 ChrX_31191589 ChrX_32535002 ChrX_119605988 ChrX_32383865 ChrX_31224684 ChrX_32535353 ChrX_135227615 ChrX_32384608 ChrX_31286408 ChrX_32563263 ChrX_135229144 ChrX_32384863 ChrX_31289217 ChrX_32835845 ChrX_135229627 ChrX_32384973 ChrX_31289364 ChrX_32836961 ChrX_135247580 ChrX_32385008 ChrX_31289621 ChrX_32837739 ChrX_135248311 ChrX_32385390 ChrX_31289944 ChrX_32869873 ChrX_135250115 ChrX_32386206 ChrX_31526639 ChrX_32870932 ChrX_135250315 ChrX_32387761 ChrX_31527607 ChrX_32871492 ChrX_135275056 ChrX_77360724 ChrX_31528051 ChrX_32871630 ChrX_135275279 ChrX_77361215 ChrX_31529248 ChrX_32871715 ChrX_135275808 ChrX_77361669 ChrX_31676096 ChrX_33150918 ChrX_135275923 ChrX_77373524 ChrX_31697636 ChrX_33358758 ChrX_135276283 ChrX_119560140 ChrX_31854782 ChrX_33362295 ChrX_135277100 ChrX_119560599 ChrX_31856533 ChrX_71817252 ChrX_135292022 ChrX_119571073 ChrX_31856950 ChrX_71817653 ChrX_135293082 ChrX_119571541 ChrX_31859793 ChrX_71817654 ChrX_149531210 ChrX_119572584 ChrX_31893307 ChrX_71819207 ChrX_149680221 ChrX_119572586 ChrX_31986430 ChrX_71819691 ChrX_149680554 ChrX_32173896 ChrX_71896025 ChrX_149681925 ChrX_32305619 ChrX_71936623 ChrX_149734031 ChrX_32305961 ChrX_77357999 ChrX_149734343 ChrX_32305968 ChrX_77359322 ChrX_149826503 ChrX_32307265 ChrX_77360306 ChrX_153609616 ChrX_32310076 ChrX_77360526 ChrX_153609617
2. The method of claim 1, wherein step (2) comprises calculating the base frequency of the specific site.
3. An apparatus for determining the gender of a sample, comprising:
- data extraction means for obtaining sequencing data of the sample;
- computing means for computing whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and
- determining means for determining the sex of the sample, wherein a base frequency of 1 or more of said specific sites being less than or equal to a predetermined value indicates that the sample is female, otherwise it is male;
wherein the specific site is as defined in claim 1.
4. An apparatus for determining the gender of a sample, comprising:
a memory configured to store one or more programs;
a processing unit coupled to the memory and configured to execute the one or more programs to cause a management system to perform a plurality of actions, the actions comprising:
(1) inputting sequencing data of a sample;
(2) calculating whether a specific locus on the X chromosome is a heterozygous locus based on the sequencing data; and
(3) judging the sex of the sample, wherein if one or more specific sites are heterozygous sites, the sample is female, otherwise, the sample is male;
wherein the specific site is as defined in claim 1.
5. A computer readable storage medium having stored thereon machine executable instructions which, when executed, cause a machine to perform the steps of the method of claim 1.
6. Use of the method of claim 1 or 2, the apparatus of claim 3, the device of claim 4 or the computer-readable storage medium of claim 5 for gender information control, forensics, catastrophic events, archaeology or early fetal diagnosis.
CN201910252113.3A 2019-03-29 2019-03-29 Method for judging sex of sample and device for implementing method Pending CN111755116A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910252113.3A CN111755116A (en) 2019-03-29 2019-03-29 Method for judging sex of sample and device for implementing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910252113.3A CN111755116A (en) 2019-03-29 2019-03-29 Method for judging sex of sample and device for implementing method

Publications (1)

Publication Number Publication Date
CN111755116A true CN111755116A (en) 2020-10-09

Family

ID=72672542

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910252113.3A Pending CN111755116A (en) 2019-03-29 2019-03-29 Method for judging sex of sample and device for implementing method

Country Status (1)

Country Link
CN (1) CN111755116A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119517158A (en) * 2024-11-04 2025-02-25 广州金域医学检验中心有限公司 Gender detection method, device, equipment and medium based on second-generation sequencing technology

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150337373A1 (en) * 2008-07-04 2015-11-26 Axial Biotech, Inc. Genetic Markers Associated with Degenerative Disc Disease and Uses Thereof
CN105354442A (en) * 2015-11-25 2016-02-24 广州金域检测科技股份有限公司 High-throughput sequencing data preprocessing method
WO2017023148A1 (en) * 2015-08-06 2017-02-09 이원 다이애그노믹스 게놈센타(주) Novel method capable of differentiating fetal sex and fetal sex chromosome abnormality on various platforms

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG I.L. et al.: "Using genomic information to predict sex in dairy cattle", Proceedings of the New Zealand Society of Animal Production *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
Inventor after: Li Chuan; Jiang Likun; Hou Guangyuan
Inventor before: Wang Jing; Li Chuan; Jiang Likun; Hou Guangyuan