[go: up one dir, main page]

CN106676178A - System and method for tumor heterogeneity assessment - Google Patents

System and method for tumor heterogeneity assessment Download PDF

Info

Publication number
CN106676178A
CN106676178A CN201710043183.9A CN201710043183A CN106676178A CN 106676178 A CN106676178 A CN 106676178A CN 201710043183 A CN201710043183 A CN 201710043183A CN 106676178 A CN106676178 A CN 106676178A
Authority
CN
China
Prior art keywords
variation
cnv
ctdna
sequencing
equipotential
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710043183.9A
Other languages
Chinese (zh)
Other versions
CN106676178B (en
Inventor
吴爱伟
常连鹏
李进
龚玉华
管彦芳
易鑫
杨玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jiyingjia Technology Co.,Ltd.
SUZHOU JIYINJIA BIOMEDICAL ENGINEERING Co.,Ltd.
Original Assignee
Beijing Gene+ Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gene+ Technology Co Ltd filed Critical Beijing Gene+ Technology Co Ltd
Priority to CN201710043183.9A priority Critical patent/CN106676178B/en
Publication of CN106676178A publication Critical patent/CN106676178A/en
Application granted granted Critical
Publication of CN106676178B publication Critical patent/CN106676178B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Organic Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Biophysics (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Zoology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention discloses a system and a method for tumor heterogeneity assessment and particularly provides a molecular clone analysis method. On the basis of various types of variation detection results of high-throughput sequencing in circulating tumor DNA, all variations are divided into different molecular clones to realize tumor heterogeneity assessment by molecular clone levels. By adoption of the method, tumor heterogeneity assessment based on ctDNA high-throughput variation detection is realized to effectively assist in making of tumor prognosis and treatment schemes.

Description

A kind of method and system of assessment Tumor Heterogeneity
Technical field
The invention belongs to biological technical field, particularly relates to assess the method for Tumor Heterogeneity and is System.
Background technology
Tumour is a kind of disease caused by gene alteration.Tumour often relates to various Genetic Variations types, including Single nucleotide variations (Single Nucleotide Variations, SNV), short insertion and deletion (small insertions And deletions, indel), copy number variation (Copy Number Variations, CNV), structure variation (Structure Variations, SV) etc..From the beginning of forming first variation from tumour cell, the mistake for accumulating that makes a variation just is started Journey.Favourable maintenance condition is built in propulsion and the evolution of tumour over time, the variation that the harmful variation for first occurring occurs after being, The ability for making tumour cell constantly obtain or strengthen the aspects such as suppression apoptosis, infinite copy, immunologic escape, therefore tumour The speed of the variation accumulation of cell is more faster than normal cell.The tumour for ultimately forming is in fact have different genetics characteristics The mixing of cell colony:Some cells only carry early stage variation, and some then also carry later stage variation simultaneously;In these tumour cells In, the involved cell proportion of variation also with its time of origin by morning to evening from large to small;Occur simultaneously in a cell Variation share a common destiny, life or death in tumour evolutionary process, the cell proportion being related to is identical.The cell proportion distribution of the variation in tumour Complexity can reflect Tumor Heterogeneity, and the latter is the most direct and most important embodiment of tumour complexity, and itself and tumour are suffered from Person's prognosis and life span are closely bound up.
The method for adopting the multidigit point sampling and high-flux sequence of same tumor patient assessment Tumor Heterogeneity at present more, i.e., Carry out after pathology sampling, by the method for high-flux sequence each being analyzed by the multiple positions of tissue to patient or multiple focuses The variation of sampling point, and be described and hierarchical statistics to having the corresponding cell proportion of variation and variation.The method has Following shortcoming:(1) there is bias in the clinical sampling in many sites, can only represent the molecular variant feature at taken position, it is impossible to generation The complexity of table entirety tumour;(2) with certain clinical risk;(3) MET of some types is difficult to obtain, such as pleuroperitoneum Metastatic lesion;(4) inaccuracy, by having the heterogeneous analysis method that variation is carried out, by total variation same layer is recognized as Level, is not specifically divided, so as to cause partial analysis result inaccurate (Gerlinger, M.et to total variation al.Intratumor heterogeneity and branched evolution revealed by multiregion sequencing.The New England journal of medicine 366,883-892,doi:10.1056/ NEJMoa1113205(2012);Hao,J.J.et al.Spatial intratumoral heterogeneity and temporal clonal evolution in esophageal squamous cell carcinoma.Nature genetics,doi:10.1038/ng.3683(2016)).Additionally, also there is the copy number variation knot that method is only sampled by single-point Fruit is estimated (Oesper, L., Satas, G.&Raphael, B.J.Quantifying tumor to Tumor Heterogeneity heterogeneity in whole-genome and whole-exome sequencing data.Bioinformatics 30,3532-3540,doi:10.1093/bioinformatics/btu651 (2014)), except there is sampling bias in the method Property shortcoming outside, also have the shortcomings that crowd's coverage is low, i.e., can only covering part exist a large amount of copy number variations cancerous swellings or Crowd.
Therefore, there is a need in the art for the heterogeneous analysis method of more accurate evaluation tumour, pre- with effectively adjuvant therapy Formulate with therapeutic scheme afterwards.
The content of the invention
For the heterogeneity of more accurate evaluation tumour, the present invention propose a kind of molecular cloning (Molecular Clone, MClone) analysis method, the method based on high-flux sequence Circulating tumor DNA (circulating tumor DNA, All variations are divided into different molecular clonings, using molecular cloning layer by the polytype variation testing result in ctDNA) The heterogeneity of level assessment tumour.The method of the present invention realizes that the Tumor Heterogeneity based on the variation detection of ctDNA high fluxs is commented Estimate, formulated with effective adjuvant therapy prognosis and therapeutic scheme.
Therefore, in a first aspect, the invention provides a kind of method of assessment Tumor Heterogeneity, methods described includes:
1) (preferred high-flux sequence) is sequenced to the dissociative DNA (cell-free DNA, cfDNA) of patient, is surveyed Sequence information;
2) determine that ctDNA makes a variation using the sequencing information, become according to the sequencing information and the ctDNA of the determination It is different, variation allelic frequency is calculated, actual total copy number of definitive variation region calculates the ratio that ctDNA accounts for cfDNA;
3) according to the step 2) in determine ratio and ctDNA variation sequencing information and copy number information to described CtDNA variations are clustered, and are clustered each cluster for obtaining and are defined as a molecular cloning, obtain the clone's level for clustering;
4) its Tumor Heterogeneity is estimated according to clone's level of the patient, clone's level of the patient is got over Many, its Tumor Heterogeneity is higher.
In second aspect, the invention provides a kind of heterogeneous method of the different patient tumors of comparison, methods described includes:
Using first aspect present invention method the step of 1) -3) calculate described each patient molecular cloning level, it is right In different patients, clone's level is more, and its Tumor Heterogeneity is higher.
In an embodiment of first or second aspect of the present invention, step 2) include:
2.1) variation V (the variation V is selected from SNV, indel and SV) (V is obtained using the sequencing informationi, i=1 ..., N) reference equipotential sequencing depth (Ri), variation equipotential sequencing depth (Mi), and calculate variation allelic frequency (Variant Allele Fraction, VAFi),
Wherein, depth (R is sequenced with reference to equipotentiali) it is that the normal sequence of the variation does not occur in corresponding site in sequencing result Bar number;Variation equipotential sequencing depth (Mi) it is that the bar number of the series of variation of the variation occurs in corresponding site in sequencing result;
2.2) using variation ViCNV (the CNV of regioni, i=1 ..., n), calculating variation ViThe reference of region is copied Shellfish number (rCNi) and actual total copy number (CNi),
If in step 1) used in accurate CNV detection methods (such as using SNP chip detection), for not in male sex's property Variation on chromosome, can obtain the special copy number variation (CNV of equipotential on two chromosomei,major, CNVi,minor, CNVi,major≥CNVi,minor) information, the copy number (CN special so as to obtain actual equipotentiali,major, CNi,minor),
2.3) ctDNA ratios assessment:With maximum variation allelic frequency to assess cfDNA in ctDNA proportions (CTF),
CTF=max (VAFi), i=1 ..., n (formula 5)
In one embodiment, the step of method of the present invention 3) in, by the mutant ratio predicted, to variation Clustered, for example with PyClone (v0.13, such as current latest edition, non-specifically are indicated, and below refer both to the version) software.
In one embodiment, the step of method of the present invention 3) in, the reference and change of the V (SNV/indel/SV) that makes a variation Different equipotential depth data (Ri, Mi):For assessing variation tumour cell ratio with mono- piece of CTF and CNV.In one embodiment, The step of method of the present invention 3) in, all tumour cells are accounted for variation place cell mass each described using PyClone softwares Ratio be predicted, software parameters can set as follows:Overall nodule cell proportion (CTF)=variation gene frequency Peak;Iterations=20000;Other specification is acquiescence.
In one embodiment, the step of method of the present invention 3) in, using PyClone to n the detected V that makes a variation (SNV/indel/SV) clustered, in addition to following parameter, using default parameters:
(a)--tumour_contentsCTF;
(b)--num_iters 20000;
(c) -- prior total_copy_number, when using the special CNV data of equipotential as input
When, the parameter is set to parental_copy_number;
(d) -- density pyclone_beta_binomial, when step 1) use the relatively low full base of sequencing depth During because of group sequencing technologies, the parameter is set to pyclone_binomial;
E () -- in_files patient.tsv, patient.tsv file is the file with tab as decollator;Remove Outside header line, often row includes the information of variation V (SNV/indel/SV);Six row are included, is followed successively by:mutation_id、 Ref_counts, var_counts, normal_cn, minor_cn and major_cn.
In the third aspect, the invention provides a kind of system of assessment Tumor Heterogeneity, the system includes:
1) for the module of the cfDNA of (preferred high-flux sequence) patient to be sequenced;
2) for performing the module of following steps:
A) receive from module 1) sequencing information;
B) compared by the sequence information with normal gene sequence, obtain the ctDNA variations in cfDNA;
C) made a variation according to the sequencing information and the ctDNA, calculate variation allelic frequency, definitive variation region Actually the special copy number of the equipotential of total copy number or reality, calculates the ratio that ctDNA accounts for cfDNA;
D) sequencing information for being made a variation according to the ratio and ctDNA that determine in the step c) and copy number information are to described CtDNA variations are clustered, and determine molecular cloning, and calculate molecular cloning level;
3) result output module:
The result of Tumor Heterogeneity is exported according to the molecular cloning level of the patient, clone's level of the patient is got over Many, its Tumor Heterogeneity is higher.
In fourth aspect, the invention provides a kind of heterogeneous system of the different patient tumors of comparison, the system includes:
1) for the module of the cfDNA of (preferred high-flux sequence) patient to be sequenced;
2) for performing the module of following steps:
A) receive from module 1) sequencing information;
B) compared by the sequence information with normal gene sequence, obtain the ctDNA variations in cfDNA;
C) according to the sequencing information and the variation result, variation allelic frequency is calculated, definitive variation region Actually the special copy number of the equipotential of total copy number or reality, calculates the ratio that ctDNA accounts for cfDNA;
D) sequencing information for being made a variation according to the ratio and ctDNA that determine in the step c) and copy number information are to described CtDNA variations are clustered, and determine molecular cloning, and calculate molecular cloning level;
3) result output module:
The molecular cloning level of different patients is compared, the Tumor Heterogeneity result of the comparison of the different patients of output, Clone's level of patient is more, and its Tumor Heterogeneity is higher.
The present invention the 3rd or fourth aspect an embodiment in, module 2) the step of c) include step:
C.1) variation V (the variation V is selected from SNV, indel and SV) (V is obtained using the sequencing informationi, i=1 ..., N) reference equipotential sequencing depth (Ri), variation equipotential sequencing depth (Mi), and calculate variation allelic frequency (Variant Allele Fraction, VAFi),
Wherein, depth (R is sequenced with reference to equipotentiali) it is that the normal sequence of the variation does not occur in corresponding site in sequencing result Bar number;Variation equipotential sequencing depth (Mi) it is that the bar number of the series of variation of the variation occurs in corresponding site in sequencing result;
C.2) using variation ViCNV (the CNV of regioni, i=1 ..., n), calculating variation ViThe reference of region is copied Shellfish number (rCNi) and actual total copy number (CNi),
If in step 1) used in accurate CNV detection methods (such as using SNP chip detection), for not in male sex's property Variation on chromosome, can obtain the special copy number variation (CNV of equipotential on two chromosomei,major, CNVi,minor, CNVi,major≥CNVi,minor) information, the copy number (CN special so as to obtain actual equipotentiali,major, CNi,minor),
C.3) ctDNA ratios assessment:With maximum variation allelic frequency to assess cfDNA in ctDNA proportions (CTF),
CTF=max (VAFi), i=1 ..., n (formula 5)
In an embodiment of the present invention the 3rd or fourth aspect, module 2) it is a plurality of instruction for performing the step Computer-readable medium.Module 3) be the instruction for performing the step computer-readable medium.
In one embodiment, the module 2 of system of the invention) the step of d) in, make a variation V's (SNV/indel/SV) With reference to and variation equipotential depth data (Ri, Mi):For assessing variation tumour cell ratio with mono- piece of CTF and CNV.In a reality In applying scheme, the present invention system module 2) the step of d) in, using PyClone softwares to variation place cell each described Group accounts for the ratio of all tumour cells and is predicted, and software parameters can set as follows:Overall nodule cell proportion (CTF)=change The peak of heteroallele frequency;Iterations=20000;Other specification is acquiescence.
In one embodiment, the module 2 of system of the invention) the step of d) in, by the mutant ratio predicted Example, clusters, for example with PyClone softwares to variation.
In one embodiment, the module 2 of system of the invention) the step of d) in, using PyClone to the n that detects Individual variation V (SNV/indel/SV) is clustered, in addition to following parameter, using default parameters:
(a)--tumour_contentsCTF;
(b)--num_iters 20000;
(c) -- prior total_copy_number, when using the special CNV data of equipotential as input
When, the parameter is set to parental_copy_number;
(d) -- density pyclone_beta_binomial, when module 1) in use sequencing relatively low complete of depth During genomic sequencing technique, the parameter is set to pyclone_binomial;
E () -- in_files patient.tsv, patient.tsv file is the file with tab as decollator;Remove Outside header line, often row includes the information of variation V (SNV/indel/SV);Six row are included, is followed successively by:mutation_id、 Ref_counts, var_counts, normal_cn, minor_cn and major_cn.
Theory and the high flux variation detection technique of ctDNA that the present invention is evolved based on tumour, swell from clone's level analysis Knurl makes a variation, there is provided more meet the heterogeneous appraisal procedure of tumor development rule.
It is a discovery of the invention that higher Tumor Heterogeneity has higher tumour progression risk.
Relative to other analysis methods, the advantage of the present invention is as follows:
1) information is comprehensive:Relative to unit point or the sample of tissue bias in many sites, ctDNA can react more fully Tumor cells feature;
2) convenience is sampled:Sample of tissue is typically derived from operation or punctures, compared to sample of tissue, especially many sites Sample of tissue, the detection of ctDNA only needs noninvasive blood sampling, clinically easily feasible;
3) high degree of accuracy:Make full use of variation information, cover SNV, indel and SV, retain variation concrete frequency and The bi-values that non-used detect/is not detected, based on tumour Evolution Theory, from clone's aspect, unmanifest aspect is carried out to heterogeneity Assessment.
3 points by more than, the method for the present invention and system can the more accurate and rational heterogeneities for assessing tumour.
Description of the drawings
By the following drawings, the present invention will be described.
Fig. 1 mClone analysis process figures, band * is that each patient is carried out respectively the step of indicate.
Fig. 2 survival analysises, left-most curve is high heterogeneous, and the right curve is low heterogeneity.
Specific embodiment
In the present invention, Gene Name is adopted using official's name (Official Symbol) in NCBI-Gene Representation generally in the art represents gene mutation and protein mutant.For example, c.518T>C (p.V173A) represents missense mutation, The T sequence changes that presentation code area is the 518th are C bases, so as to the amino acid for causing 173 sports smart ammonia by histidine V Sour A;C.2235_2249delGGAATTAAGAGAAGC (p.E746_A750del) represents small fragment disappearance, presentation code area the The bases G GAATTAAGAGAAGC disappearance of 2235 to 2249, so as to 5 amino acid for causing the 746th to 750 lack Lose;c.2663+1G>A represents shearing mutation, and the 2663rd place in presentation code area exon 3 end is close to first of introne Base changes into A by G;c.7081C>T (p.Q2361*) represents nonsense mutation, and the C sequence changes that code area is the 7081st are T alkali Base, so as to the Q for causing the 2361st is changed into terminator codon.
In the present invention, mathematic sign ceil is referred to and rounded up.
In the present invention, cfDNA can also be blood (blood plasma), saliva, pleural effusions and ascites, urine, excrement equal samples DNA。
In the present invention, the tumour is selected from, but not limited to,:Lung cancer, colorectal cancer, cancer of the stomach, breast cancer, kidney, pancreas Cancer, oophoroma, carcinoma of endometrium, thyroid cancer, cervical carcinoma, the cancer of the esophagus and liver cancer.It is described in a specific embodiment Tumour is lung cancer, and the variation is the variation listed in table 1.
Method of the present invention flow chart to ctDNA using high-flux sequence as shown in figure 1, to every tested patients, become It is different detected after, according to the sequencing result of ctDNA variations, assess ctDNA and account for the ratio of cfDNA;Aforementioned proportion and detection Variation together, as input, is clustered to variation, is clustered each cluster for obtaining and is defined as a molecular cloning, Ran Houji Clone's level is calculated, the Tumor Heterogeneity of every patient is estimated finally according to clone's level of all patients.The present inventor It was found that, for lung cancer, it is heterogeneous high patient more than 3.5 to clone level, and clone's level is low for heterogeneity less than 3.5 Patient.
It is below the main technical flows and principles and methods of the method for the present invention:
1. high-flux sequence detection ctDNA variations
First, to selecting several same cancer kind patients as study subject, row variation detection and parameter meter are entered to each patient Calculate:
1) by high throughput sequencing technologies and corresponding informatics such as the capture sequencings of full-length genome, full extron group or probe Analysis method, is sequenced to experimenter cfDNA, the variation included in ctDNA is obtained, including SNV, indel, SV, CNV etc.;
2) according to step 1) in sequencing result, obtain variation V (variation V be selected from SNV, indel and SV) (Vi, i= 1 ..., reference equipotential n) is sequenced depth (Ri), variation equipotential sequencing depth (Mi), and calculate variation allelic frequency (Variant Allele Fraction, VAFi),
Wherein, depth (R is sequenced with reference to equipotentiali) it is that the normal sequence of the variation does not occur in corresponding site in sequencing result Bar number;Variation equipotential sequencing depth (Mi) it is that the bar number of the series of variation of the variation occurs in corresponding site in sequencing result;
3) using variation ViCNV (the CNV of regioni, i=1 ..., n), calculating variation ViThe reference copies of region Number (rCNi) and actual total copy number (CNi),
If the accurate CNV detection methods (such as being detected using SNP chip) used in 1), for not in the dyeing of male sex's property Variation on body, can obtain the special copy number variation (CNV of equipotential on two chromosomei,major, CNVi,minor, CNVi,major ≥CNVi,minor) information, the copy number (CN special so as to obtain actual equipotentiali,major, CNi,minor),
Accurate CNV detections refer to the special copy number variation of the equipotential for obtaining two chromosomes, such as using SNP chip Detection.
2. variation cluster and clone's level are calculated
Then, to every patient, the variation for detecting is carried out cluster analysis and clone's layer by the parameter obtained in foundation 1 Level is calculated:
1) ctDNA ratios assessment:With maximum variation allelic frequency to assess cfDNA in ctDNA proportions (CTF),
CTF=max (VAFi), i=1 ..., n (formula 5)
2) variation cluster:
For arbitrary variation (SNV/indel/SV), the derived cell of cfDNA is divided into three classes:Normal cell (N), The tumour cell (C0) for not carrying the variation is, the tumour cell (C1) for carrying the variation, carries the tumour cell (C1) of the variation The ratio referred to as variation tumour cell ratio of all tumour cells (C1+C0) is accounted for, if the variation tumour of two or more variations Cell proportion is suitable, then the time that they occur is approximate, can be endowed identical cluster label, is clustered into cluster, i.e., one point Son clone.
Therefore, the cluster that makes a variation is needed to use data below:
A) reference of variation V (SNV/indel/SV) and variation equipotential depth data (Ri, Mi):For with CTF and CNV mono- Block assessment variation tumour cell ratio;
B) step 1.3) in reference copies number (rCNi) and actual total copy number (CNi) or the equipotential of reality special copy Shellfish number (CNi,major, CNi,minor):For a certain variation, the copy number amplification of the variation equipotential or disappearance can cause the tumour that makes a variation The false rising or false reduction of cell proportion estimate, therefore add copy number delta data more accurately to judge C1 cells Genotype, correct variation frequency, correct assessment variation tumour cell ratio;
c)CTF:To the composition for estimating cfDNA derived cells, i.e., tumour cell (C0+ in all cells (N+C0+C1) C1 the ratio shared by), the accurate setting of the parameter contributes to being computed correctly from the reference equipotential of normal cell and from tumour The quantitative proportion of the reference equipotential of cell.
For example, n variation V (SNV/indel/SV) for detecting is entered using PyClone v0.13 (current latest edition) Row cluster, in addition to following parameter, using default parameters:
(a)--tumour_contentsCTF;
(b)--num_iters 20000;
(c) -- prior total_copy_number, when using the special CNV data of equipotential as input
When, the parameter is set to parental_copy_number;
D () -- density pyclone_beta_binomial, when 1.1) using the relatively low full genome of depth is sequenced During group sequencing technologies, the parameter is set to pyclone_binomial;
E () -- in_files patient.tsv, patient.tsv file is the file with tab as decollator;Remove Outside header line, often row includes the information of variation V (SNV/indel/SV);Six row are included, is followed successively by:mutation_id、 Ref_counts, var_counts, normal_cn, minor_cn and major_cn.
PyClone(Roth,A.et al.PyClone:statistical inference of clonal
population structure in cancer.Nature methods 11,396-398,
doi:10.1038/nmeth.2883 (2014) .) V is estimated according to variation V (SNV/indel/SV) and CNV informationi The cell at place accounts for the ratio of all tumour cells, and according to this to each variation one cluster label (C of impartingi, i=1 ..., n, Ci ∈ 1 ..., and c }, c is the number of cluster).
Other versions or other variation clustering softwares of PyClone can also be adopted to variation cluster.
3) clone level to calculate:
Clone's level, that is, make a variation the molecular cloning number c being polymerized to.Tumour during constantly progress, evolve and send out by tumour The structure of raw tree also gradually becomes huge and complexity, and molecular cloning also can be more, and clone's level is constantly deepened, therefore clones level Size it is closely related with Tumor Heterogeneity.
3. Tumor Heterogeneity assessment
Take clone's level median of all tested patients threshold value high/low as every patient tumors heterogeneity is judged; , less than the patient of the threshold value, its Tumor Heterogeneity is relatively low, otherwise then Tumor Heterogeneity is higher for clone's level.
Because genome mutation situation has notable difference, therefore the method for the present invention it is not recommended that across cancer kind ratio between cancer kind It is more heterogeneous.
In the method for the invention, in addition to sequencing steps, other steps can be in the form of instruction in calculating In machine computer-readable recording medium, it is only necessary to by the sequencing result input calculating equipment, the computing device can just read the calculating Instruction in machine computer-readable recording medium, completes other steps of the inventive method.The computing device includes but is not limited to computer, just Take formula computer, PAD, smart mobile phone, intelligent wrist etc..
Embodiment
In the present embodiment, by taking 10 patients with lung cancer as an example, the present invention will be described.It should be noted that the enforcement Example is merely to illustrate that purpose, and can not by any way be construed to the restriction to the application.
The variation list that 1.ctDNA high-flux sequences are detected
1) make a variation V (SNV/indel/SV)
10 patients with lung cancer detect respectively 2-8 variation, and variation V (SNV/indel/SV) detection list is shown in Table 1.
Variation V of table 1 (SNV/indel/SV) detection list
2)CNV
In 10 patients with lung cancer, only S5 detection EGFR amplifications, amplification times are 1.73, are shown in Table 2.Therefore, detect in S5 The EGFR Deletion total copy numbers of corresponding reality that make a variation be estimated as 4.
Table 2CNV detects list
Sample number into spectrum Gene Copy number variation state Copy number variation multiple
S5 EGFR gain 1.73
2.mClone analysis results are counted
Pyclone is clustered
The variation for detecting is clustered using PyClone v0.13, in addition to following parameter, using acquiescence ginseng Number:
a)--tumour_contents
b)--num_iters 20000
c)--prior total_copy_number
d)--density pyclone_beta_binomial
e)--in_files
Parameter a) and e) respectively specify that CTF and input file.The CTF of each patient and the content of input file are shown in Table 3:
Table 3Pyclone input datas
Wherein, mutation_id represents variation numbering, and ref_counts represents reference count, and var_counts represents change Different counting, normal_cn represents normal copy number, i.e. CNi, minor_cn represents little copy number, i.e. CNi,minor, major_cn tables Show big copy number, i.e. CNi,major
The result and follow-up follow up data of the mClone analyses carried out using the method for the present invention as shown in table 4, is taken all The median of clone's level, i.e. cut-off=3.5, it is heterogeneous high patient to clone level more than 3.5, and clone's level is less than 3.5 are heterogeneous low patient.
Table 4mClone analysis results and the clinical information table of comparisons
Sample number into spectrum Clone's level Tumor Heterogeneity Progression free survival phase (week)
S1 2 It is low 54
S2 1 It is low 49
S3 4 It is high 11
S4 4 It is high 27
S5 6 It is high 9
S6 6 It is high 17
S7 3 It is low 17
S8 3 It is low 34
S9 5 It is high 22
S10 2 It is low 36
Survival analysis (see Fig. 2) is carried out to this batch of sample, the Tumor Heterogeneity result using clone's level assessment is found, it is right Patient's prognosis (progression free survival phase) has significant prediction effect, and (p, 0.044), higher Tumor Heterogeneity enters with higher Exhibition risk (Hazard ratio is 9.386).The result verification using mClone analytical technologies assess Tumor Heterogeneity validity and Accuracy.
The molecular cloning level that the molecular cloning mClone analysis methods of the present invention are obtained can be used for assessing the different of tumour Matter, and the heterogeneity of tumour represents the developing stage of tumour, in the heterogeneous bigger more later stage for representing patient in tumour, suffers from It is bigger that the tumour of person continues in the recent period developing risk.Above experimental data confirms this point.

Claims (11)

1. a kind of method of assessment Tumor Heterogeneity, methods described includes:
1) cfDNA of patient is sequenced (preferred high-flux sequence), is obtained sequencing information;
2) determine that ctDNA makes a variation using the sequencing information, made a variation according to the sequencing information and the ctDNA of the determination, meter Variation allelic frequency is calculated, actual total copy number of definitive variation region calculates the ratio that ctDNA accounts for cfDNA;
3) according to the step 2) in determine ratio and ctDNA variation sequencing information and copy number information to described CtDNA variations are clustered, and are clustered each cluster for obtaining and are defined as a molecular cloning, obtain the clone's level for clustering;
4) its Tumor Heterogeneity is estimated according to clone's level of the patient, clone's level of the patient is more, its Tumor Heterogeneity is higher.
2. a kind of heterogeneous method of the multiple patient tumors of comparison, methods described includes:
1) cfDNA of the plurality of Patient Sample A is sequenced (preferred high-flux sequence), is obtained sequencing information;
2) for each Patient Sample A, determine that ctDNA makes a variation using the sequencing information, according to the sequencing information and it is described really Fixed ctDNA variations, calculate variation allelic frequency, and actual total copy number of definitive variation region calculates ctDNA and accounts for The ratio of cfDNA;
3) according to the step 2) in determine ratio and ctDNA variation sequencing information and copy number information to described CtDNA variations are clustered, and are clustered each cluster for obtaining and are defined as a molecular cloning, obtain the cluster of each Patient Sample A Clone's level;
4) its Tumor Heterogeneity is compared according to clone's level of the plurality of patient, clones patient its tumour more than level It is heterogeneous high.
3. method according to claim 1 and 2, the step 2) include:
2.1) variation V is obtained using the sequencing informationi,=1 ..., reference equipotential sequencing depth Ri, variation equipotential sequencing it is deep Degree Mi, and calculate variation allelic frequency VAFi,
VAF i = M i M i + R i * 100 % , i = 1 ... n
Wherein, depth R is sequenced with reference to equipotentialiIt is that the bar number of the normal sequence of the variation does not occur in corresponding site in sequencing result; Variation equipotential sequencing depth MiIt is that the bar number of the series of variation of the variation occurs in corresponding site in sequencing result;
2.2) using variation ViThe CNV of regioni,=1 ..., calculate variation ViThe reference copies number rCN of regioniAnd reality The total copy number CN in borderi,
CN i = c e i l ( CNV i &times; rCN i ) , CNV i &GreaterEqual; 1 1 , CNV i < 1 , i = 1 , ... , n
If in step 1) used in accurate CNV detection methods (such as being detected using SNP chip), for not in the dyeing of male sex's property Variation on body, can obtain the special copy number variation (CNV of equipotential on two chromosomei,major, CNVi,minor, CNVi,major ≥CNVi,minor) information, the copy number (CN special so as to obtain actual equipotentiali,major, CNi,minor),
CN i , min o r = c e i l ( CNV i , min o r ) , CNV i , min o r &GreaterEqual; 1 0 , CNV i , min o r < 1 , i = 1 , ... , n
CN i , m a j o r = c e i l ( CNV i , m a j o r ) , CNV i , m a j o r &GreaterEqual; 1 1 , CNV i , m a j o r < 1 , i = 1 , ... , n
2.3) ctDNA ratios assessment:With the V that makes a variationi,=1 ..., maximum variation allelic frequency to assess cfDNA in shared by ctDNA Ratio CTF,
CTF=max (VAFi) ,=1 ...,.
4. method according to claim 1 and 2, the step 3) in by the mutant ratio of prediction, to make a variation into Row cluster, for example with PyClone softwares.
5. method according to claim 4, the variation Vi,=1 ..., reference and variation equipotential depth data RiAnd Mi With mono- piece of assessment variation tumour cell ratio of CTF and CNV.
6. a kind of system of assessment Tumor Heterogeneity, the system includes:
1) for the module of the cfDNA of (preferred high-flux sequence) patient to be sequenced;
2) for performing the module of following steps:
A) receive from module 1) sequencing information;
B) compared by the sequence information with normal gene sequence, obtain the ctDNA variations in cfDNA;
C) made a variation according to the sequencing information and the ctDNA, calculate variation allelic frequency, the reality of definitive variation region Always the special copy number of the equipotential of copy number or reality, calculates the ratio that ctDNA accounts for cfDNA;
D) sequencing information for being made a variation according to the ratio and ctDNA that determine in the step c) and copy number information are to described CtDNA variations are clustered, and determine molecular cloning, and calculate molecular cloning level;
3) result output module:
The result of Tumor Heterogeneity is exported according to the molecular cloning level of the patient, clone's level of the patient is more, its Tumor Heterogeneity is higher.
7. a kind of heterogeneous system of the different patient tumors of comparison, the system includes:
1) for the module of the cfDNA of (preferred high-flux sequence) patient to be sequenced;
2) for performing the module of following steps:
A) receive from step 1) module sequencing information;
B) compared by the sequence information with normal gene sequence, obtain the ctDNA variations in cfDNA;
C) according to the sequencing information and the variation result, variation allelic frequency, the reality of definitive variation region are calculated Always the special copy number of the equipotential of copy number or reality, calculates the ratio that ctDNA accounts for cfDNA;
D) sequencing information for being made a variation according to the ratio and ctDNA that determine in the step c) and copy number information are to described CtDNA variations are clustered, and determine molecular cloning, and calculate molecular cloning level;
3) result output module:
The molecular cloning level of different patients is compared, the Tumor Heterogeneity result of the comparison of the different patients of output, patient Clone's level it is more, its Tumor Heterogeneity is higher.
8. the system according to claim 6 or 7, the module 2) the step of c) include step:
C.1) variation V is obtained using the sequencing informationi,=1 ..., reference equipotential sequencing depth Ri, variation equipotential sequencing it is deep Degree Mi, and calculate variation allelic frequency VAFi,
VAF i = M i M i + R i * 100 % , i = 1 ... n
Wherein, depth R is sequenced with reference to equipotentialiIt is that the bar number of the normal sequence of the variation does not occur in corresponding site in sequencing result; Variation equipotential sequencing depth MiIt is that the bar number of the series of variation of the variation occurs in corresponding site in sequencing result;
C.2) using variation ViThe CNV of regioni,=1 ..., calculate variation ViThe reference copies number rCN of regioniAnd reality The total copy number CN in borderi,
CN i = c e i l ( CNV i &times; rCN i ) , CNV i &GreaterEqual; 1 1 , CNV i < 1 , i = 1 , ... , n
If in step 1) used in accurate CNV detection methods (such as being detected using SNP chip), for not in the dyeing of male sex's property Variation on body, can obtain the special copy number variation CNV of equipotential on two chromosomei,majorAnd CNVi,minorInformation, wherein CNVi,major≥CNVi,minor, the copy number CN special so as to obtain actual equipotentiali,majorAnd CNi,minor,
CN i , min o r = c e i l ( CNV i , min o r ) , CNV i , min o r &GreaterEqual; 1 0 , CNV i , min o r < 1 , i = 1 , ... , n
CN i , m a j o r = c e i l ( CNV i , m a j o r ) , CNV i , m a j o r &GreaterEqual; 1 1 , CNV i , m a j o r < 1 , i = 1 , ... , n
C.3) ctDNA ratios assessment:With the V that makes a variationi,=1 ..., maximum variation allelic frequency to assess cfDNA in ctDNA institutes Accounting example CTF,
CTF=max (VAFi) ,=1 ...,.
9. the system according to claim 6 or 7, the module 2) and/or module 3) it is a plurality of finger for performing the step The computer-readable medium of order.
10. the system according to claim 6 or 7, the module 2) the step of d) in, make a variation Vi,=1 ..., reference and Variation equipotential depth data RiAnd MiWith mono- piece of assessment variation tumour cell ratio of CTF and CNV.
11. systems according to claim 6 or 7, the module 2) the step of d) in, by predict mutant ratio Example, clusters, for example with PyClone softwares to variation.
CN201710043183.9A 2017-01-19 2017-01-19 Method and system for evaluating tumor heterogeneity Active CN106676178B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710043183.9A CN106676178B (en) 2017-01-19 2017-01-19 Method and system for evaluating tumor heterogeneity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710043183.9A CN106676178B (en) 2017-01-19 2017-01-19 Method and system for evaluating tumor heterogeneity

Publications (2)

Publication Number Publication Date
CN106676178A true CN106676178A (en) 2017-05-17
CN106676178B CN106676178B (en) 2020-03-24

Family

ID=58860011

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710043183.9A Active CN106676178B (en) 2017-01-19 2017-01-19 Method and system for evaluating tumor heterogeneity

Country Status (1)

Country Link
CN (1) CN106676178B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109390034A (en) * 2018-09-20 2019-02-26 成都中珠健联基因科技有限责任公司 A kind of method of normal tissue content and tumour copy number in detection tumor tissues
CN110289047A (en) * 2019-05-15 2019-09-27 西安电子科技大学 Method and system for predicting tumor purity and absolute copy number based on sequencing data
CN110853706A (en) * 2018-08-01 2020-02-28 中国科学院深圳先进技术研究院 A method and system for constructing tumor clone composition integrating epigenetics
WO2020124625A1 (en) * 2018-12-20 2020-06-25 北京优迅医学检验实验室有限公司 Ctdna-based gene detection method and apparatus, storage medium, and computer system
CN111402952A (en) * 2020-03-27 2020-07-10 深圳裕策生物科技有限公司 Method and system for detecting tumor heterogeneity degree
CN111684078A (en) * 2018-02-12 2020-09-18 豪夫迈·罗氏有限公司 Methods for predicting response to therapy by assessing tumor genetic heterogeneity
CN112802548A (en) * 2021-01-07 2021-05-14 深圳吉因加医学检验实验室 Method for predicting allele-specific copy number variation of single-sample whole genome
CN112863594A (en) * 2021-03-31 2021-05-28 中国工商银行股份有限公司 Tumor purity estimation method and device
CN114242172A (en) * 2021-07-12 2022-03-25 广州燃石医学检验所有限公司 Method for assessing intratumoral heterogeneity based on blood sequencing and use thereof for predicting the response to immunotherapy
CN116434830A (en) * 2023-04-13 2023-07-14 深圳市睿法生物科技有限公司 Tumor focus position identification method based on ctDNA multi-site methylation
CN117604086A (en) * 2023-11-17 2024-02-27 苏州吉因加生物医学工程有限公司 A quantitative method for plasma ctDNA levels in subjects

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1687455A (en) * 2005-04-04 2005-10-26 上海奇诺肿瘤生物高新技术有限公司 Reagent and method for separating and determining dissociative DNA in blood
CN103710452A (en) * 2013-12-27 2014-04-09 朱运峰 Kit and oligonucleotides for detecting free DNA (deoxyribonucleic acid) of peripheral blood
CN105574361A (en) * 2015-11-05 2016-05-11 上海序康医疗科技有限公司 Method for detecting variation of copy numbers of genomes
CN105602938A (en) * 2016-01-22 2016-05-25 北京圣谷同创科技发展有限公司 Plasma cfDNA extracting method
CN105603062A (en) * 2006-05-03 2016-05-25 人口诊断股份有限公司 Method of evaluating genetic disorders
CN105760712A (en) * 2016-03-01 2016-07-13 西安电子科技大学 Copy number variation detection method based on next generation sequencing
CN106055923A (en) * 2016-05-13 2016-10-26 万康源(天津)基因科技有限公司 Method for gene copy number variation analysis

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1687455A (en) * 2005-04-04 2005-10-26 上海奇诺肿瘤生物高新技术有限公司 Reagent and method for separating and determining dissociative DNA in blood
CN105603062A (en) * 2006-05-03 2016-05-25 人口诊断股份有限公司 Method of evaluating genetic disorders
CN103710452A (en) * 2013-12-27 2014-04-09 朱运峰 Kit and oligonucleotides for detecting free DNA (deoxyribonucleic acid) of peripheral blood
CN105574361A (en) * 2015-11-05 2016-05-11 上海序康医疗科技有限公司 Method for detecting variation of copy numbers of genomes
CN105602938A (en) * 2016-01-22 2016-05-25 北京圣谷同创科技发展有限公司 Plasma cfDNA extracting method
CN105760712A (en) * 2016-03-01 2016-07-13 西安电子科技大学 Copy number variation detection method based on next generation sequencing
CN106055923A (en) * 2016-05-13 2016-10-26 万康源(天津)基因科技有限公司 Method for gene copy number variation analysis

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111684078A (en) * 2018-02-12 2020-09-18 豪夫迈·罗氏有限公司 Methods for predicting response to therapy by assessing tumor genetic heterogeneity
US12071661B2 (en) 2018-02-12 2024-08-27 Roche Sequencing Solutions, Inc. Method of predicting response to therapy by assessing tumor genetic heterogeneity
CN111684078B (en) * 2018-02-12 2024-04-19 豪夫迈·罗氏有限公司 Methods for predicting response to treatment by assessing tumor genetic heterogeneity
CN110853706B (en) * 2018-08-01 2022-07-22 中国科学院深圳先进技术研究院 A method and system for constructing tumor clone composition integrating epigenetics
CN110853706A (en) * 2018-08-01 2020-02-28 中国科学院深圳先进技术研究院 A method and system for constructing tumor clone composition integrating epigenetics
CN109390034A (en) * 2018-09-20 2019-02-26 成都中珠健联基因科技有限责任公司 A kind of method of normal tissue content and tumour copy number in detection tumor tissues
WO2020124625A1 (en) * 2018-12-20 2020-06-25 北京优迅医学检验实验室有限公司 Ctdna-based gene detection method and apparatus, storage medium, and computer system
CN110289047A (en) * 2019-05-15 2019-09-27 西安电子科技大学 Method and system for predicting tumor purity and absolute copy number based on sequencing data
CN111402952A (en) * 2020-03-27 2020-07-10 深圳裕策生物科技有限公司 Method and system for detecting tumor heterogeneity degree
CN112802548B (en) * 2021-01-07 2021-10-22 深圳吉因加医学检验实验室 A single-sample genome-wide approach to predict allele-specific copy number variation
CN112802548A (en) * 2021-01-07 2021-05-14 深圳吉因加医学检验实验室 Method for predicting allele-specific copy number variation of single-sample whole genome
CN112863594A (en) * 2021-03-31 2021-05-28 中国工商银行股份有限公司 Tumor purity estimation method and device
CN114242172A (en) * 2021-07-12 2022-03-25 广州燃石医学检验所有限公司 Method for assessing intratumoral heterogeneity based on blood sequencing and use thereof for predicting the response to immunotherapy
WO2023284260A1 (en) * 2021-07-12 2023-01-19 广州燃石医学检验所有限公司 Method for evaluating intra-tumor heterogeneity on basis of blood sequencing, and application thereof to prediction of response to immunotherapy
CN114242172B (en) * 2021-07-12 2025-04-25 广州燃石医学检验所有限公司 Blood sequencing-based method for assessing intratumor heterogeneity and its use in predicting response to immunotherapy
CN116434830A (en) * 2023-04-13 2023-07-14 深圳市睿法生物科技有限公司 Tumor focus position identification method based on ctDNA multi-site methylation
CN116434830B (en) * 2023-04-13 2024-01-23 深圳市睿法生物科技有限公司 Tumor focus position identification method based on ctDNA multi-site methylation
CN117604086A (en) * 2023-11-17 2024-02-27 苏州吉因加生物医学工程有限公司 A quantitative method for plasma ctDNA levels in subjects

Also Published As

Publication number Publication date
CN106676178B (en) 2020-03-24

Similar Documents

Publication Publication Date Title
CN106676178A (en) System and method for tumor heterogeneity assessment
US20240347131A1 (en) Cancer detection model and construction method therefor, and reagent kit
CN106778073B (en) A method and system for assessing changes in tumor burden
US12191000B2 (en) Systems and methods for classifying patients with respect to multiple cancer classes
AU2019277698A1 (en) Convolutional neural network systems and methods for data classification
US20200219587A1 (en) Systems and methods for using fragment lengths as a predictor of cancer
WO2018223066A1 (en) Methods and systems for identifying or monitoring lung disease
US20240249798A1 (en) Systems and methods for enriching for cancer-derived fragments using fragment size
Tan et al. The prognostic value of circulating cell-free DNA in breast cancer: A meta-analysis
CN112218957A (en) System and method for determining tumor fraction in cell-free nucleic acid
CN103324846A (en) Screening method of colorectal cancer treatment prognosis biomarkers
CN114694750B (en) Single-sample tumor somatic mutation distinguishing and TMB (tumor necrosis factor) detecting method based on NGS (Next Generation broadcasting) platform
CN105132407A (en) Method for low-frequency mutant-enriched sequencing of DNA of exfoliative cells
US20240401035A1 (en) Fragment size characterization of cell-free dna mutations from clonal hematopoiesis
EP3788173A1 (en) Surrogate marker and method for tumor mutation burden measurement
Fussey et al. The clinical utility of cell-free DNA measurement in differentiated thyroid cancer: a systematic review
Lucas et al. Characterizing the evolutionary dynamics of cancer proliferation in single-cell clones with SPRINTER
Williams et al. Tracking clonal evolution of drug resistance in ovarian cancer patients by exploiting structural variants in cfDNA
CN110004229A (en) Application of polygenes as EGFR monoclonal antibody drug resistance markers
Yang et al. A novel signature based on anoikis associated with BCR-free survival for prostate cancer
Wilmott et al. Tumour procurement, DNA extraction, coverage analysis and optimisation of mutation-detection algorithms for human melanoma genomes
Livesey et al. Transforming RNA-Seq gene expression to track cancer progression in the multi-stage early to advanced-stage cancer development
Li et al. Liquid biopsy in diffuse large B-cell lymphoma: utility in cell origin determination and survival prediction in Chinese patients
CN116312789A (en) Tumor malignancy evaluation method, electronic equipment and storage medium
JP2024527142A (en) Methods for mutation detection in liquid biopsy

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20210610

Address after: Room 501, 5 / F, building 2, area 1, No.8, life Garden Road, Zhongguancun Life Science Park, Longguan Town, Changping District, Beijing

Patentee after: Beijing Jiyingjia Technology Co.,Ltd.

Patentee after: SUZHOU JIYINJIA BIOMEDICAL ENGINEERING Co.,Ltd.

Address before: 102206 Room 501, 5 / F, building 2, area 1, No.8, shengshengyuan Road, Zhongguancun Life Science Park, Huilongguan town, Changping District, Beijing

Patentee before: Beijing Jiyingjia Technology Co.,Ltd.

TR01 Transfer of patent right