[go: up one dir, main page]

WO2019010775A1 - Étiquette moléculaire, jonction et procédé de détermination de séquence nucléotidique contenant une mutation à basse fréquence - Google Patents

Étiquette moléculaire, jonction et procédé de détermination de séquence nucléotidique contenant une mutation à basse fréquence Download PDF

Info

Publication number
WO2019010775A1
WO2019010775A1 PCT/CN2017/100421 CN2017100421W WO2019010775A1 WO 2019010775 A1 WO2019010775 A1 WO 2019010775A1 CN 2017100421 W CN2017100421 W CN 2017100421W WO 2019010775 A1 WO2019010775 A1 WO 2019010775A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
molecular tag
acid sequence
molecular
linker
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2017/100421
Other languages
English (en)
Chinese (zh)
Inventor
曾晓静
高晓峘
韩颖鑫
张印新
何哲
王佳伟
夏伟成
李胜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Jingke Dx Co Ltd
Original Assignee
Guangzhou Jingke Dx Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Jingke Dx Co Ltd filed Critical Guangzhou Jingke Dx Co Ltd
Publication of WO2019010775A1 publication Critical patent/WO2019010775A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the present invention relates to the field of nucleic acid sequencing technology. Specifically, the present invention relates to a molecular tag and a composition thereof, a molecular tag-containing linker and a composition thereof, and a method for determining a target region of a sample to be tested containing a low frequency mutant nucleic acid sequence.
  • High-throughput sequencing is currently the most widely used sequencing technology.
  • the incidence is 0.1%-0.2% or higher, and the DNA polymerase used in the PCR process is also wrong.
  • the rate and error rate are 10-7 to 10-5, especially as the number of PCR cycles increases, the error rate also increases.
  • Each position of the molecular tag can be one of four bases A, T, C, and G.
  • the length of the molecular tag is selected according to actual experimental needs. According to the length of the molecular tag and the change of four bases, the molecular tag There can be 4 n-th power types. If the molecular tags of the original template are completely randomly distributed, the diversity of the molecular tags ensures that each original template is unique after the molecular tag is attached to the original library.
  • each original template will act as The initial template forms a cluster of "molecular clusters". If there are no sequencing errors and PCR errors, the molecular sequences in each cluster are the error-free "replication strands" of the original template positive and negative strands.
  • the base sequences at each position of the molecular tag are completely randomly distributed.
  • the same amount of A, T, The four bases of C and G because the energy or synthesis efficiency required for the synthesis of these four bases is different, the frequency of occurrence of the four bases A, T, C, and G at each position is not completely equal.
  • the object of the present invention is to provide a molecular tag which has a completely random distribution of bases by optimizing the design of the molecular tag, and a molecular tag composition having a ratio of 0.95-1.05:1 for each molecular tag, and using the molecular tag and The linker synthesized by the composition was constructed and sequenced to effectively distinguish between sequencing errors and low frequency mutations.
  • the invention provides a molecular tag having up to two consecutive identical bases.
  • Another aspect of the present invention also provides a molecular tag composition comprising the above molecular tag, and the ratio of each molecular tag is from 0.95 to 1.05:1.
  • Another aspect of the present invention also provides a linker comprising the above molecular tag, And the molecular tag is located at any position other than the overhang "T" of the linker and the 20 bp base of the non-overhanging end.
  • Another aspect of the present invention also provides a linker composition comprising the above-described linker, and the ratio of each linker is from 0.95 to 1.05:1.
  • Another aspect of the present invention also provides a method for determining a target region of a sample to be tested containing a low frequency mutant nucleic acid sequence, comprising the steps of:
  • the molecular tag provided by the present invention does not have a plurality of identical bases in succession, thereby avoiding a situation in which the sequencing quality is poor due to the appearance of a plurality of consecutive bases. And various kinds of molecular tags inside The proportion of the labels is the same, avoiding the situation of the dominant labels, and maximizing the performance of the molecular labels.
  • FIG. 1 is a schematic view showing the structure of a molecular tag in a fully complementary double link head according to an embodiment of the present invention.
  • FIG. 2 is a schematic view showing the structure of a molecular tag in a Y-type connector with one end complementary to one end at the complementary end in the embodiment of the present invention.
  • FIG. 3 is a schematic view showing the structure of a molecular tag in a Y-type connector with one end complementary to one end in an open end.
  • Figure 4 is a schematic illustration of a Y-type structure in which a molecular tag is not located on a linker, but a linker can be introduced by PCR, in accordance with an embodiment of the present invention.
  • FIG. 5 is a flow chart of a method for determining a target region of a sample to be tested containing a low frequency mutant nucleic acid sequence according to an embodiment of the present invention.
  • the present invention provides a molecular tag having up to two consecutive identical bases on the molecular tag.
  • the molecular tag is a single strand or a reverse complementary double strand.
  • the number of bases of the molecular tag is 6-24 bp.
  • the present invention also provides a molecular tag composition
  • a molecular tag composition comprising the molecular tag as described above, and the ratio of each molecular tag is from 0.95 to 1.05:1.
  • the ratio includes at least one of a molar ratio, a molecular mass ratio, and a molecular ratio.
  • the number of species of the molecular tag comprises 4n and n is equal to 6-24.
  • n is equal to 6-24.
  • 4096, 16384, 65536, 262144, 16777216, 268435456, and even more types can be designed.
  • the molecular tag when the molecular tag is a single-stranded structure, the molecular tag is in a ratio of a molar number of 0.95-1.05:1, or a molecular mass of 0.95-1.05:1, or a molecular number of 0.95-1.05:1.
  • the ratio is mixed.
  • the molecular tag when the molecular tag is a single-stranded structure, the molecular tag is mixed in a ratio of 1:1 molar ratio, or a molecular mass ratio of 1:1, or a molecular number of 1:1.
  • the single-stranded molecular tag is firstly matched according to the ratio of the molar number of 0.95-1.05:1, or the molecular mass of 0.95-1.05:1, or the ratio of the number of molecules of 0.95-1.05:1.
  • the reverse complementary sequences are annealed to form a molecular tag of a double-stranded structure, and these double-stranded molecular tags are then mixed in a ratio of 0.95-1.05:1.
  • the single-stranded molecular tag is firstly reversed according to a molar ratio of 1:1, or a molecular mass of 1:1, or a molecular number of 1:1.
  • the complementary sequences are annealed to form a molecular tag of the double-stranded structure, and the double-stranded molecular tags are then mixed in a ratio of 1:1.
  • the invention also provides for the use of the molecular tag composition for correcting sequencing errors and PCR errors, detecting low frequency mutations, de-redundancy, and calculating the number of specific molecules or cells carrying a particular molecule.
  • Another aspect of the invention provides a linker comprising a molecule as described above A tag, and the molecular tag is located at any position other than the overhang "T" of the linker and the 20 bp base of the non-overhang end.
  • the molecular tag "NNN...NNN” may be located at the 3' end, 5' of the fully complementary duplex of the linker. End or middle, anywhere except the overhang "T” and non-overhanging end blocks, which are 20 bp base length.
  • the molecular label "NNN...NNN” may be located at one end of the joint Y-shaped structure, open One end or the middle, except for the protruding end "T” and any position other than the 20 bp base of the non-overhanging end.
  • the molecular tag may not be located on the linker, but may be introduced into the Y-type structure of the linker by PCR.
  • the molecular tag may also be located at two or more positions of the joint.
  • the linker further comprises a library tag linked to the 3' or 5' end of the molecular tag.
  • the library tag is used to distinguish different sample libraries, and after PCR amplification, PCR products of multiple samples are mixed and sequenced, and samples of each sequence are further based on library tags. Sources are distinguished.
  • the linker further comprises an identifying characteristic sequence, the identifying characteristic sequence being 4 non-repeating bases, for example: "ATCG” or “TGAC”, the identifying feature sequence and the The 3' or 5' end of the molecular tag is linked.
  • Another aspect of the present invention also provides a linker composition comprising a linker as described above, and the ratio of each linker is from 0.95 to 1.05:1.
  • the types of the joints include:
  • the one end of the molecular tag is complementary to the Y-type open end
  • the molecular tag is not located on the linker, but can be introduced into the Y-type structure of the linker by PCR.
  • a joint having a molecular tag at two or more positions is also included.
  • the ratio of the joint is at least one of a molar ratio, a molecular mass ratio, and a molecular ratio of each type of joint.
  • a further aspect of the present invention provides a method for determining a target region of a sample to be tested containing a low frequency mutant nucleic acid sequence, as shown in FIG. 5, comprising the steps of:
  • said step S6 further comprises filtering sequencing errors brought in by PCR and sequencing.
  • the data analysis can be performed using statistical analysis methods well known to those skilled in the art, such as Z test, T test, run test, and the like.
  • Molecular tags are designed according to the possibility of random distribution of 4 bases per position, and the molecular tag contains up to 2 consecutive identical bases. According to the needs of the experiment, different kinds of molecular labels M were designed.
  • the number of species of the molecular tag sequence includes 4n, and n is equal to 6-24. As shown in Table 1, 16 molecular labels:
  • a linker containing any of the above molecular tags is designed, wherein the molecular tag can be located at any position other than the overhang "T" of the linker and the 20 bp base of the non-overhang end.
  • NNN...NNN represents a molecular tag
  • the type of the linker may be a fully complementary double-stranded structure, a Y-type structure in which one end is complementary to one end, or may be PCR-
  • the molecular tag is introduced into the Y-type structure of the linker.
  • the molecular tag may be located at either or both ends of the linker, or may be distributed at two or more positions.
  • the number of N represents the number of bases of the molecular tag, and the number of molecular tags required increases the number of bases at the position.
  • the number for example, the number of bases of 8 bp, 12 bp, 16 bp, 24 bp or more.
  • 16 kinds of linkers containing different molecular tags As shown in Table 2, 16 kinds of linkers containing different molecular tags:
  • Identification signature sequences and/or library tags can also be added at the 3' or 5' end of the molecular tag as desired for the experiment. For example, when sequencing using the illumina platform, an index sequence that identifies different samples can be added to it.
  • the designed molecular tag or its corresponding reverse complement sequence and its sequence at the 3' end and the 5' end are synthesized to obtain a linker containing the molecular tag.
  • synthetic methods can be employed in methods well known in the art or can be commissioned by a primer synthesis company.
  • the synthesized molecular tag-containing linkers were mixed at a molar ratio of 1:1 for different types.
  • each type of joint is mixed in a ratio of 1:1.
  • the patient's peripheral EDTA anticoagulation was taken 10 ml, and the plasma was separated by fresh centrifugation, and plasma DNA was extracted according to a method well known to those skilled in the art.
  • the extracted DNA solution and the end-repaired reagent mixture are mixed, and the reaction is carried out according to a method of terminal repair well known to those skilled in the art, and the reaction is separated and purified.
  • reaction system After mixing at room temperature, after slight centrifugation, the reaction system was placed in a PCR machine, and reacted at 20 ° C for 30 minutes. After the reaction was completed, it was purified using AMpure XP magnetic beads.
  • terminal-repaired DNA solution and the "A"-added reagent mixture are mixed, and the reaction is carried out according to the method of adding "A" at the end well known to those skilled in the art, and the reaction is separated and purified.
  • reaction system After mixing at room temperature, after slight centrifugation, the reaction system was placed in a PCR machine, and reacted at 37 ° C for 30 minutes. After the reaction was completed, it was purified using AMpure XP magnetic beads.
  • Magnetic bead purification was carried out by the method shown in 5.2, except that 75 ul of magnetic beads were added to the 50 ul system reaction product.
  • the DNA solution after the addition of "A” is mixed with the molecular tag-containing linker and the reaction reagent mixture obtained in the step S3, and the reaction is carried out according to a method of adding a linker well known to those skilled in the art, and after the completion of the reaction, separation and purification are carried out.
  • reaction system After mixing at room temperature, after slight centrifugation, the reaction system was placed in a PCR machine, and reacted at 20 ° C for 15 minutes. After the reaction was completed, it was purified using AMpure XP magnetic beads.
  • Magnetic bead purification was carried out using the method shown in 5.2, except that 75 ul of magnetic beads were added to the 50 ul system reaction product.
  • the DNA after the addition of the linker and the PCR reaction reagent mixture are mixed, and the PCR reaction is carried out according to a method well known to those skilled in the art. After the reaction is completed, the separation and purification are carried out. After the completion of the library construction, the library is subjected to QC detection, and the test is waited after passing the test. Sequencing.
  • Reagent Volume /ul DNA 32 10 ⁇ Pfx amplification buffer 5 dNTP solution (10nM) 2 MgSO 4 (50nM) 2 PCR primer PE1.0 (10pmol/ul) 4 index-X (10pmol/ul) 4 Pfx DNA polymerase 1 Total volume / ul 50
  • reaction system After mixing at room temperature, after slight centrifugation, the reaction system was placed in a PCR machine and reacted according to the following conditions:
  • Magnetic bead purification was carried out using the method shown in 5.2, except that 50 ul of magnetic beads were added to the 50 ul system reaction product. The library construction is over.
  • the library was subjected to QPCR and Agilent 2100 detection, and the quality-qualified library was arranged on the machine.
  • the library can be sequenced using a second generation sequencer such as Hiseq2000, Hiseq 2500, Proton, Miseq, NS500.
  • a second generation sequencer such as Hiseq2000, Hiseq 2500, Proton, Miseq, NS500.
  • the sequencing results of the DNA obtained after sequencing are analyzed, and the obtained DNA sequences are classified according to molecular tags, and the sequence carrying the same molecular tag is taken as a "molecular cluster" which is the initial one DNA molecule.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Biotechnology (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

La présente invention concerne une étiquette moléculaire et une composition de cette dernière, une jonction contenant l'étiquette moléculaire et une composition de cette dernière, ainsi qu'un procédé de détermination de séquence nucléotidique contenant la mutation à basse fréquence dans une région cible d'un échantillon à détecter. L'étiquette moléculaire contient au plus deux bases continues et identiques.
PCT/CN2017/100421 2017-07-14 2017-09-04 Étiquette moléculaire, jonction et procédé de détermination de séquence nucléotidique contenant une mutation à basse fréquence Ceased WO2019010775A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710573052.1A CN107385030A (zh) 2017-07-14 2017-07-14 分子标签、接头及确定含有低频突变核酸序列的方法
CN201710573052.1 2017-07-14

Publications (1)

Publication Number Publication Date
WO2019010775A1 true WO2019010775A1 (fr) 2019-01-17

Family

ID=60339624

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/100421 Ceased WO2019010775A1 (fr) 2017-07-14 2017-09-04 Étiquette moléculaire, jonction et procédé de détermination de séquence nucléotidique contenant une mutation à basse fréquence

Country Status (2)

Country Link
CN (1) CN107385030A (fr)
WO (1) WO2019010775A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109295049A (zh) * 2018-09-26 2019-02-01 刘强 血液微量cfDNA文库的标签特异性接头、引物组和建库方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009140191A2 (fr) * 2008-05-12 2009-11-19 Biomedomics, Inc. Détection d'une maladie liée à des mutations de gène par une amplification de molécule unique avec des dosages de ligature multiplex
CN106048009A (zh) * 2016-06-03 2016-10-26 人和未来生物科技(长沙)有限公司 一种用于超低频基因突变检测的标签接头及其应用
CN106367485A (zh) * 2016-08-29 2017-02-01 厦门艾德生物医药科技股份有限公司 一种用于检测基因突变的多定位双标签接头组及其制备方法和应用
CN106676182A (zh) * 2017-02-07 2017-05-17 北京诺禾致源科技股份有限公司 一种低频率基因融合的检测方法及装置
CN106811460A (zh) * 2015-11-30 2017-06-09 安诺优达基因科技(北京)有限公司 用于低频突变检测的二代测序文库的构建方法及试剂盒
CN106834275A (zh) * 2017-02-22 2017-06-13 天津诺禾医学检验所有限公司 ctDNA超低频突变检测文库的构建方法、试剂盒及文库检测数据的分析方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7198900B2 (en) * 2003-08-29 2007-04-03 Applera Corporation Multiplex detection compositions, methods, and kits
CN102115789B (zh) * 2010-12-15 2013-03-13 厦门大学 用于第二代高通量测序的核酸标签及其设计方法
US20160017410A1 (en) * 2014-07-17 2016-01-21 Jay Shendure Highly multiplex single amino acid mutagenesis for massively parallel functional analysis
CN106086162B (zh) * 2015-11-09 2020-02-21 厦门艾德生物医药科技股份有限公司 一种用于检测肿瘤突变的双标签接头序列及检测方法

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009140191A2 (fr) * 2008-05-12 2009-11-19 Biomedomics, Inc. Détection d'une maladie liée à des mutations de gène par une amplification de molécule unique avec des dosages de ligature multiplex
CN106811460A (zh) * 2015-11-30 2017-06-09 安诺优达基因科技(北京)有限公司 用于低频突变检测的二代测序文库的构建方法及试剂盒
CN106048009A (zh) * 2016-06-03 2016-10-26 人和未来生物科技(长沙)有限公司 一种用于超低频基因突变检测的标签接头及其应用
CN106367485A (zh) * 2016-08-29 2017-02-01 厦门艾德生物医药科技股份有限公司 一种用于检测基因突变的多定位双标签接头组及其制备方法和应用
CN106676182A (zh) * 2017-02-07 2017-05-17 北京诺禾致源科技股份有限公司 一种低频率基因融合的检测方法及装置
CN106834275A (zh) * 2017-02-22 2017-06-13 天津诺禾医学检验所有限公司 ctDNA超低频突变检测文库的构建方法、试剂盒及文库检测数据的分析方法

Also Published As

Publication number Publication date
CN107385030A (zh) 2017-11-24

Similar Documents

Publication Publication Date Title
CN106367485B (zh) 一种用于检测基因突变的多定位双标签接头组及其制备方法和应用
CN106048009B (zh) 一种用于超低频基因突变检测的标签接头及其应用
EP2631336A1 (fr) Bibliothèque d'adn et procédé de préparation de celle-ci, procédé et dispositif de détection de snp
CN108138228B (zh) 用于下一代测序的高分子量dna样品追踪标签
US20210403991A1 (en) Sequencing Process
CN102690809B (zh) Dna标签及其在构建和测序配对末端标签文库中的应用
CN106086162A (zh) 一种用于检测肿瘤突变的双标签接头序列及检测方法
CN111808854B (zh) 带有分子条码的平衡接头及快速构建转录组文库的方法
CN108517567B (zh) 用于cfDNA建库的接头、引物组、试剂盒和建库方法
CN105861710A (zh) 测序接头、其制备方法及其在超低频变异检测中的应用
CN107002080B (zh) 一种基于多重pcr的目标区域富集方法和试剂
CN103205420B (zh) 用于扩增T细胞受体β链CDR3编码序列的引物组合物及其用途
CN102409043B (zh) 高通量低成本Fosmid文库构建的方法及其所使用标签和标签接头
WO2021253372A1 (fr) Construction de bibliothèque sans pcr à haute compatibilité et procédé de séquençage
KR20170133270A (ko) 분자 바코딩을 이용한 초병렬 시퀀싱을 위한 라이브러리 제조방법 및 그의 용도
CN103571822B (zh) 一种用于新一代测序分析的多重目的dna片段富集方法
CN104232626A (zh) 简化基因组测序文库中条码物及其设计方法
CN115612746A (zh) 一种用于肉鸡亲缘关系鉴定的靶向捕获测序芯片
CN105087560A (zh) 一种基于高通量测序构建猪bcr重链文库的多重pcr引物和方法
CN111440846B (zh) 一种用于纳米孔测序建库的位置锚定条码系统
CN110218811A (zh) 一种筛选水稻突变体的方法
WO2019010775A1 (fr) Étiquette moléculaire, jonction et procédé de détermination de séquence nucléotidique contenant une mutation à basse fréquence
US20190218606A1 (en) Methods of reducing errors in deep sequencing
WO2019010776A1 (fr) Étiquette combinée, connecteur et procédé pour déterminer qu'une séquence d'acide nucléique de mutation à basse fréquence est incluse
CN110892079A (zh) 用于检测核酸标识符污染的测定方法和组合物

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17917625

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17917625

Country of ref document: EP

Kind code of ref document: A1