WO2024138762A1

WO2024138762A1 - Recombinant protein and use thereof

Info

Publication number: WO2024138762A1
Application number: PCT/CN2022/144377
Authority: WO
Inventors: 黄思谦; 刘月鹏; 杨丽媛; 宋承威; 徐崇钧; 刘芬
Original assignee: MGI Tech Co Ltd
Current assignee: MGI Tech Co Ltd
Priority date: 2022-12-30
Filing date: 2022-12-30
Publication date: 2024-07-04
Anticipated expiration: 2025-06-30
Also published as: CN120225667A

Abstract

Provided are a recombinant protein having improved amplification performance, a nucleic acid encoding the recombinant protein, a vector comprising the nucleic acid, a kit comprising the protein, the nucleic acid or the vector, a method for preparing the recombinant protein, and a method for amplifying a target DNA by using the recombinant protein. As a chimeric family A DNA polymerase, the recombinant protein has improved processivity, the amplification performance of the recombinant protein is remarkably better than that of a wild-type family A DNA polymerase, and the thermal stability thereof is higher than that of a family B DNA polymerase.

Description

Recombinant proteins and their applications

Technical Field

本发明涉及生物技术领域，具体涉及重组蛋白，编码该重组蛋白的核酸，包含该核酸的载体，包含上述蛋白、核酸或载体的试剂盒，制备上述重组蛋白的方法，以及利用该重组蛋白扩增目的DNA的方法。The present invention relates to the field of biotechnology, and in particular to a recombinant protein, a nucleic acid encoding the recombinant protein, a vector comprising the nucleic acid, a kit comprising the protein, nucleic acid or vector, a method for preparing the recombinant protein, and a method for amplifying a target DNA using the recombinant protein.

Background technique

DNB-SEQ测序方法中通常使用phi29 DNA聚合酶进行滚环扩增(RCA)反应和多重置换扩增(MDA)反应。phi29 DNA聚合酶是B家族聚合酶，尽管具有极高的链置换活性和可持续合成能力，但是phi29 DNA聚合酶的热稳定性较差以致容易失活，因此需要将其在严苛条件下低温保存且保质期较短。此外，phi29 DNA聚合酶的非特异结合能力强且难以洗脱，导致了phi29DNA聚合酶在测序中产生较强荧光背景。In the DNB-SEQ sequencing method, phi29 DNA polymerase is usually used for rolling circle amplification (RCA) and multiple displacement amplification (MDA) reactions. Although phi29 DNA polymerase is a B family polymerase with extremely high chain displacement activity and sustainable synthesis ability, it has poor thermal stability and is easily inactivated. Therefore, it needs to be stored at low temperatures under harsh conditions and has a short shelf life. In addition, phi29 DNA polymerase has strong non-specific binding ability and is difficult to elute, resulting in a strong fluorescence background of phi29 DNA polymerase during sequencing.

Bst聚合酶是A家族聚合酶，具有耐高温的特性，其最适反应温度在60-65℃。因此无论在其制备还是应用过程中，A家族聚合酶对温度具有更好的耐受性，且同样具有较高的链置换活性。在测序中，A家族聚合酶相比于phi29聚合酶更容易被洗脱，并且在复杂体系表现更好。然而，相比于B家族的DNA聚合酶(如phi29、Pfu、Kod聚合酶)，A家族聚合酶的拇指结构域与DNA模板结合的区域较小，导致包括Bst聚合酶在内的A家族聚合酶的可持续扩增能力较弱。Bst polymerase is a family A polymerase with high temperature resistance, and its optimal reaction temperature is 60-65°C. Therefore, whether in its preparation or application, the A family polymerase has better tolerance to temperature and also has higher chain displacement activity. In sequencing, the A family polymerase is easier to be eluted than the phi29 polymerase and performs better in complex systems. However, compared with the B family DNA polymerases (such as phi29, Pfu, and Kod polymerases), the thumb domain of the A family polymerases has a smaller area that binds to the DNA template, resulting in weaker sustainable amplification capabilities of the A family polymerases, including the Bst polymerase.

因此，需要研究开发具有高持续合成能力的A家族聚合酶。Therefore, it is necessary to research and develop A family polymerases with high processivity.

发明内容Summary of the invention

本发明旨在至少在一定程度上解决相关技术中的技术问题之一。The present invention aims to solve one of the technical problems in the related art at least to a certain extent.

为此，本发明的实施方案提供了具有在扩增反应中加强DNA聚合酶与目的DNA相互作用的氨基酸序列的重组蛋白，编码该重组蛋白的核酸，包含该核酸的载体，包含上述蛋白、核酸或载体的试剂盒，制备上述重组蛋白的方法，以及利用该重组蛋白扩增目的DNA的方法。本发明实施方案提供的重组蛋白为具有提高的持续合成能力以及加强的热稳定性、聚合活性和DNA双链亲和力的嵌合型A家族DNA聚合酶，可用于扩增目的DNA。To this end, the embodiments of the present invention provide a recombinant protein having an amino acid sequence that enhances the interaction between a DNA polymerase and a target DNA in an amplification reaction, a nucleic acid encoding the recombinant protein, a vector comprising the nucleic acid, a kit comprising the above protein, nucleic acid or vector, a method for preparing the above recombinant protein, and a method for amplifying a target DNA using the recombinant protein. The recombinant protein provided by the embodiments of the present invention is a chimeric A family DNA polymerase with improved processivity and enhanced thermal stability, polymerization activity and DNA double-strand affinity, which can be used to amplify the target DNA.

第一方面，本发明的实施方案提供了一种重组蛋白，该重组蛋白的氨基酸序列是用在扩增反应中能够加强DNA聚合酶与目的DNA相互作用的氨基酸序列替换野生型的A家族DNA聚合酶中的同源序列获得的氨基酸序列。In a first aspect, an embodiment of the present invention provides a recombinant protein whose amino acid sequence is obtained by replacing a homologous sequence in a wild-type A family DNA polymerase with an amino acid sequence that can enhance the interaction between the DNA polymerase and the target DNA in an amplification reaction.

在一些实施例中，A家族DNA聚合酶选自由大肠杆菌DNA聚合酶I、T3 DNA聚合酶、 T5 DNA聚合酶、T7 DNA聚合酶、Taq DNA聚合酶、Bsu DNA聚合酶和Bst DNA聚合酶组成的组。In some embodiments, the A family DNA polymerase is selected from the group consisting of Escherichia coli DNA polymerase I, T3 DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, Taq DNA polymerase, Bsu DNA polymerase and Bst DNA polymerase.

在一些实施例中，A家族DNA聚合酶选自具有SEQ ID NO:1所示氨基酸序列的Bst DNA聚合酶、具有SEQ ID NO:9所示氨基酸序列的Taq DNA聚合酶、具有SEQ ID NO:10所示氨基酸序列的大肠杆菌DNA聚合酶和具有SEQ ID NO:11所示氨基酸序列的Bsu DNA聚合酶中的至少一种。In some embodiments, the A family DNA polymerase is selected from at least one of a Bst DNA polymerase having the amino acid sequence shown in SEQ ID NO:1, a Taq DNA polymerase having the amino acid sequence shown in SEQ ID NO:9, an Escherichia coli DNA polymerase having the amino acid sequence shown in SEQ ID NO:10, and a Bsu DNA polymerase having the amino acid sequence shown in SEQ ID NO:11.

在一些实施例中，A家族DNA聚合酶中的同源序列来源于拇指结构域。In some embodiments, the homologous sequence in the A-family DNA polymerase is derived from the thumb domain.

在一些实施例中，A家族DNA聚合酶中的同源序列选自以下中的至少一种：In some embodiments, the homologous sequence in the A family DNA polymerase is selected from at least one of the following:

如SEQ ID NO:2所示的Bst DNA聚合酶中的第546位至第554位氨基酸序列；The amino acid sequence from position 546 to position 554 in the Bst DNA polymerase as shown in SEQ ID NO:2;

如SEQ ID NO:12所示的Taq DNA聚合酶中的第199位至第209位氨基酸序列；The amino acid sequence from position 199 to position 209 of the Taq DNA polymerase as shown in SEQ ID NO: 12;

如SEQ ID NO:13所示的大肠杆菌DNA聚合酶中的第276位至第285位氨基酸序列；或The amino acid sequence from position 276 to position 285 of the Escherichia coli DNA polymerase as shown in SEQ ID NO:13; or

如SEQ ID NO:14所示的大肠杆菌Bsu聚合酶中的第249位至第257位氨基酸序列。The amino acid sequence from position 249 to 257 in the Escherichia coli Bsu polymerase as shown in SEQ ID NO:14.

在一些实施例中，A家族DNA聚合酶中的同源序列被替换为包含SEQ ID NO:3所示氨基酸序列的序列、基本上由SEQ ID NO:3所示氨基酸序列组成的序列、或由SEQ ID NO:3所示氨基酸序列组成的序列。In some embodiments, the homologous sequence in the A family DNA polymerase is replaced by a sequence comprising the amino acid sequence shown in SEQ ID NO:3, a sequence essentially consisting of the amino acid sequence shown in SEQ ID NO:3, or a sequence consisting of the amino acid sequence shown in SEQ ID NO:3.

在一些实施例中，重组蛋白的氨基酸序列选自SEQ ID NO:4、SEQ ID NO:15、SEQ ID NO:16和SEQ ID NO:17中的至少一种。In some embodiments, the amino acid sequence of the recombinant protein is selected from at least one of SEQ ID NO:4, SEQ ID NO:15, SEQ ID NO:16 and SEQ ID NO:17.

在一些实施例中，重组蛋白还包含额外的保守突变、添加和缺失中的任一种。In some embodiments, the recombinant protein further comprises any of additional conservative mutations, additions, and deletions.

在一些实施例中，重组蛋白作为嵌合型DNA聚合酶的持续合成能力高于野生型的所述A家族DNA聚合酶。In some embodiments, the processivity of the recombinant protein as a chimeric DNA polymerase is higher than that of the wild-type A-family DNA polymerase.

第二方面，本发明的实施方案提供了一种核酸，该核酸编码如第一方面任一实施方案所述的重组蛋白。In a second aspect, an embodiment of the present invention provides a nucleic acid encoding a recombinant protein as described in any embodiment of the first aspect.

在一些实施例中，所述核酸分子的核苷酸序列选自SEQ ID NO:5、SEQ ID NO:18、SEQ ID NO:19和SEQ ID NO:20中的至少一种。In some embodiments, the nucleotide sequence of the nucleic acid molecule is selected from at least one of SEQ ID NO:5, SEQ ID NO:18, SEQ ID NO:19 and SEQ ID NO:20.

第三方面，本发明的实施方案提供了一种载体，该载体包含如第二方面任一实施方案所述的核酸。In a third aspect, an embodiment of the present invention provides a vector comprising a nucleic acid as described in any embodiment of the second aspect.

第四方面，本发明的实施方案提供了一种试剂盒，包含如第一方面任一实施方案所述的重组蛋白、如第二方面任一实施方案所述的核酸、或如第三方面任一实施方案所述的载体。In a fourth aspect, an embodiment of the present invention provides a kit comprising a recombinant protein as described in any embodiment of the first aspect, a nucleic acid as described in any embodiment of the second aspect, or a vector as described in any embodiment of the third aspect.

第五方面，本发明的实施方案提供了一种制备第一方面任一实施方案所述的重组蛋白的方法，该方法包括：用在扩增反应中能够加强DNA聚合酶与目的DNA相互作用的氨基酸序列替换野生型的A家族DNA聚合酶中的同源序列获得的氨基酸序列，以获得所述重组蛋白。In the fifth aspect, an embodiment of the present invention provides a method for preparing the recombinant protein described in any embodiment of the first aspect, the method comprising: replacing the amino acid sequence obtained by the homologous sequence in the wild-type A family DNA polymerase with an amino acid sequence that can enhance the interaction between the DNA polymerase and the target DNA in the amplification reaction to obtain the recombinant protein.

在一些实施例中，所述方法包括：对来自所述野生型A家族DNA聚合酶的多个氨基酸序列进行序列比对，以确定嵌合序列；和用所述嵌合序列替换所述野生型A家族DNA聚合酶序列中的同源序列，以获得候选重组蛋白序列。In some embodiments, the method comprises: performing sequence alignment on multiple amino acid sequences from the wild-type A family DNA polymerase to determine a chimeric sequence; and replacing a homologous sequence in the wild-type A family DNA polymerase sequence with the chimeric sequence to obtain a candidate recombinant protein sequence.

在一些实施例中，所述方法还包括：对所述候选重组蛋白序列进行计算机模拟分析，以确定重组蛋白序列，其中所示重组蛋白序列表现出减小的拇指结构域与手指结构域之间的空间构型。In some embodiments, the method further comprises: performing computer simulation analysis on the candidate recombinant protein sequence to determine the recombinant protein sequence, wherein the recombinant protein sequence exhibits a reduced spatial configuration between the thumb domain and the finger domain.

在一些实施例中，对所述候选重组蛋白序列进行计算机模拟分析，以获得重组蛋白序列，包括：对所述候选重组蛋白序列进行结构预测，以获得重组蛋白的三级结构；对所述重组蛋白的三级结构进行模拟，以获得所述重组蛋白的分子对接结构；对所述分子对接结构进行基于配体的结构优化，以获得优化的重组蛋白结构；和对所述优化的重组蛋白结构进行动力学模拟，以确定重组蛋白序列。In some embodiments, the candidate recombinant protein sequence is subjected to computer simulation analysis to obtain the recombinant protein sequence, including: performing structural prediction on the candidate recombinant protein sequence to obtain the tertiary structure of the recombinant protein; simulating the tertiary structure of the recombinant protein to obtain the molecular docking structure of the recombinant protein; performing ligand-based structural optimization on the molecular docking structure to obtain an optimized recombinant protein structure; and performing kinetic simulation on the optimized recombinant protein structure to determine the recombinant protein sequence.

在一些实施例中，所述方法还包括：表达并纯化所述重组蛋白序列。In some embodiments, the method further comprises: expressing and purifying the recombinant protein sequence.

在一些实施例中，A家族DNA聚合酶选自由大肠杆菌DNA聚合酶I、T3 DNA聚合酶、T5 DNA聚合酶、T7 DNA聚合酶、Taq DNA聚合酶、Bsu DNA聚合酶和Bst DNA聚合酶组成的组。In some embodiments, the A family DNA polymerase is selected from the group consisting of Escherichia coli DNA polymerase I, T3 DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, Taq DNA polymerase, Bsu DNA polymerase and Bst DNA polymerase.

在一些实施例中，A家族DNA聚合酶选自具有SEQ ID NO:1所示氨基酸序列的Bst DNA聚合酶具有SEQ ID NO:9所示的氨基酸序列的Taq DNA聚合酶、具有SEQ ID NO:10所示氨基酸序列的大肠杆菌DNA聚合酶和具有SEQ ID NO:11所示氨基酸序列的Bsu DNA聚合酶中的至少一种。In some embodiments, the A family DNA polymerase is selected from at least one of a Bst DNA polymerase having the amino acid sequence shown in SEQ ID NO:1, a Taq DNA polymerase having the amino acid sequence shown in SEQ ID NO:9, an Escherichia coli DNA polymerase having the amino acid sequence shown in SEQ ID NO:10, and a Bsu DNA polymerase having the amino acid sequence shown in SEQ ID NO:11.

在一些实施例中，所述A家族DNA聚合酶中的同源序列来源于拇指结构域序列。In some embodiments, the homologous sequence in the A family DNA polymerase is derived from the thumb domain sequence.

在一些实施例中，所述A家族DNA聚合酶中的同源序列选自以下中的至少一种：In some embodiments, the homologous sequence in the A family DNA polymerase is selected from at least one of the following:

第六方面，本发明的实施方案提供了一种扩增目的DNA的方法，使用第一方面任一实施方案所述的重组蛋白对目的DNA进行扩增。In a sixth aspect, an embodiment of the present invention provides a method for amplifying a target DNA, using the recombinant protein described in any embodiment of the first aspect to amplify the target DNA.

在一些实施例中，所述扩增是等温扩增，所述等温扩增选自滚环扩增RCA、多重置换扩增MDA、重组酶聚合酶扩增反应RPA、链置换扩增SDA、环介导等温扩增LAMP。In some embodiments, the amplification is isothermal amplification, and the isothermal amplification is selected from rolling circle amplification RCA, multiple displacement amplification MDA, recombinase polymerase amplification reaction RPA, strand displacement amplification SDA, and loop-mediated isothermal amplification LAMP.

在一些实施例中，所述扩增用于构建DNA文库或测序。In some embodiments, the amplification is used to construct a DNA library or for sequencing.

本发明实施方案中的重组蛋白作为嵌合型A家族DNA聚合酶具有提高的持续合成能力，其扩增性能显著优于野生型A家族DNA聚合酶。此外，与B家族DNA聚合酶相比，本发明实施方案中的重组蛋白还具有以下优势：热稳定性高、方便洗脱且易于储存。The recombinant protein in the embodiment of the present invention as a chimeric A family DNA polymerase has improved processivity, and its amplification performance is significantly better than that of the wild-type A family DNA polymerase. In addition, compared with the B family DNA polymerase, the recombinant protein in the embodiment of the present invention also has the following advantages: high thermal stability, convenient elution and easy storage.

BRIEF DESCRIPTION OF THE DRAWINGS

为了更清楚地说明本公开实施例的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本公开的一些实施例，对于本领域普通技术人员来讲，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions of the embodiments of the present disclosure, the drawings required for use in the embodiments or the description of the prior art will be briefly introduced below. Obviously, the drawings described below are only some embodiments of the present disclosure, and a person skilled in the art can also obtain other drawings based on these drawings.

图1为Bst DNA聚合酶大片段的结构示意图。Figure 1 is a schematic diagram of the structure of the large fragment of Bst DNA polymerase.

图2为野生型Bst DNA聚合酶大片段与根据本发明实施例的嵌合型DNA聚合酶的序列比对示意图。Figure 2 is a schematic diagram of the sequence comparison between the large fragment of wild-type Bst DNA polymerase and the chimeric DNA polymerase according to an embodiment of the present invention.

图3为根据本发明实施例的嵌合型DNA聚合酶Taq-HS-1(a)、Ecoli-HS-1(b)、Bsu-HS-1(c)和Bst-HS-1(d)与DNA分子对接后的结构示意图。3 is a schematic diagram of the structures of chimeric DNA polymerases Taq-HS-1 (a), Ecoli-HS-1 (b), Bsu-HS-1 (c) and Bst-HS-1 (d) after docking with DNA molecules according to an embodiment of the present invention.

图4为根据本发明实施例的嵌合型DNA聚合酶与DNA分子复合物的动力学模拟后的结构示意图。FIG. 4 is a schematic structural diagram of a complex of a chimeric DNA polymerase and a DNA molecule after kinetic simulation according to an embodiment of the present invention.

Detailed ways

下面详细描述本发明的实施例，所述实施例的示例在附图中示出。下面通过参考附图描述的实施例是示例性的，旨在用于解释本发明，而不能理解为对本发明的限制。Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the accompanying drawings. The embodiments described below with reference to the accompanying drawings are exemplary and intended to be used to explain the present invention, but should not be understood as limiting the present invention.

A家族DNA聚合酶包括诸如大肠杆菌聚合酶I、Bst DNA聚合酶、T7 DNA聚合酶和Taq DNA聚合酶，后者在PCR中已被广泛应用。来自嗜热脂肪地芽孢杆菌(Geobacillus stearothermophilus)的Bst DNA聚合酶是一种多功能酶，以耐高温、强大的链置换活性和可实现环介导的等温扩增为特点，然而其可持续扩增能力仍有待改进。Bst DNA聚合酶属于A家族DNA聚合酶，具有典型的右手结构，分为拇指结构域、手掌结构域、手指结构域和5’-3’外切酶结构域，其中在聚合反应中目的DNA和部分新合成的DNA夹在拇指结构域和手指结构域之间。在生物技术应用中，Bst DNA聚合酶的5’-3’外切酶结构域通常会被敲除，只保留包含拇指结构域、手掌结构域、手指结构域的Bst聚合酶大片段(Bst-LF)，其结构如图1所示。A family DNA polymerases include E. coli polymerase I, Bst DNA polymerase, T7 DNA polymerase and Taq DNA polymerase, the latter of which has been widely used in PCR. Bst DNA polymerase from Geobacillus stearothermophilus is a multifunctional enzyme characterized by high temperature resistance, strong strand displacement activity and loop-mediated isothermal amplification, but its sustainable amplification ability still needs to be improved. Bst DNA polymerase belongs to the A family DNA polymerase and has a typical right-hand structure, which is divided into thumb domain, palm domain, finger domain and 5'-3' exonuclease domain, in which the target DNA and part of the newly synthesized DNA are sandwiched between the thumb domain and the finger domain in the polymerization reaction. In biotechnology applications, the 5'-3' exonuclease domain of Bst DNA polymerase is usually knocked out, leaving only the Bst polymerase large fragment (Bst-LF) containing the thumb domain, palm domain and finger domain, and its structure is shown in Figure 1.

目前应用于测序的DNA聚合酶主要是以phi29聚合酶为主的B家族链置换酶。phi29聚合酶是来源于枯草芽孢杆菌(Bacillus subtilis)的噬菌体phi29中的聚合酶，具有较高的连续合成能力和链置换活性，常应用于基于RCA的DNB制备及基于MDA的单细胞全基因组扩增。phi29聚合酶是一种中温恒温酶，最适反应温度在30℃左右，但在65℃孵育10分钟即可完全失活。phi29聚合酶具有强链置换能力及高持续合成能力主要是因为它的TPR2(terminal protein region 2)结构域。TPR2结构域在链合成过程中可以将模板的双链解链，只允许模板链进入并会把模板一直夹在Exo结构域和TPR2结构域之间，使酶和模板不能分离，从而使phi29聚合酶具有>70000nt的持续扩增能力。但是，phi29聚合酶存在保存条件严苛、保质期较短、测序时荧光背景较大等缺陷。The DNA polymerase currently used for sequencing is mainly the B family strand displacement enzyme, mainly phi29 polymerase. The phi29 polymerase is a polymerase from the bacteriophage phi29 of Bacillus subtilis. It has high continuous synthesis ability and strand displacement activity and is often used in RCA-based DNB preparation and MDA-based single-cell whole genome amplification. The phi29 polymerase is a mesothermal isothermal enzyme with an optimal reaction temperature of about 30°C, but it can be completely inactivated by incubation at 65°C for 10 minutes. The strong strand displacement ability and high continuous synthesis ability of the phi29 polymerase are mainly due to its TPR2 (terminal protein region 2) domain. The TPR2 domain can unwind the double-stranded chain of the template during the chain synthesis process, allowing only the template chain to enter and sandwiching the template between the Exo domain and the TPR2 domain, so that the enzyme and the template cannot be separated, thus enabling the phi29 polymerase to have a continuous amplification ability of >70,000nt. However, phi29 polymerase has defects such as strict storage conditions, short shelf life, and large fluorescence background during sequencing.

相比于B家族的DNA聚合酶(如phi29、Pfu、Kod聚合酶)，A家族聚合酶的拇指结构域与DNA模板结合的区域较小，且与新生DNA链的相互作用也较小，导致包括Bst聚合酶在内的A家族聚合酶的可持续扩增能力较弱。此外，由于5’-3’外切结构域具有DNA结合作用，能阻止聚合酶与模板的彻底解离，在敲除5’-3’外切结构域的情况下，A家族聚合酶大片段的持续合成能力会下降，通常不能很好的合成长度大于10kb的DNA链。Compared with the DNA polymerases of the B family (such as phi29, Pfu, and Kod polymerases), the thumb domain of the A family polymerases has a smaller area that binds to the DNA template and has a smaller interaction with the nascent DNA chain, resulting in weaker sustainable amplification capabilities of the A family polymerases, including Bst polymerase. In addition, since the 5'-3' exodomain has a DNA binding effect, it can prevent the complete dissociation of the polymerase from the template. When the 5'-3' exodomain is knocked out, the continuous synthesis ability of the large fragments of the A family polymerases will decrease, and they usually cannot synthesize DNA chains longer than 10kb very well.

基于以上问题，发明人以Bst聚合酶、Taq DNA聚合酶、大肠杆菌DNA聚合酶和Bsu DNA聚合酶为例对A家族聚合酶的结构进行了探索，以期提供一种对DNA双链具有高亲和力且持续合成能力高的聚合酶变体。根据本申请实施方案提供的重组蛋白，即嵌合型A家族聚合酶，其在结构方面实现了使得拇指结构域向手指结构域方向延伸的改造，缩小了拇指结构域与手指结构域之间的空间，使模板链和部分新合成的DNA链保持在拇指结构域和手指结构域之间，从而起到类似于phi29聚合酶中TPR2结构域的效果，加强了拇指结构域与DNA双链的相互作用并且形成环绕DNA模板链的环形滑动夹，提高了A家族DNA聚合酶的持续合成能力。Based on the above problems, the inventors explored the structure of A family polymerases using Bst polymerase, Taq DNA polymerase, Escherichia coli DNA polymerase and Bsu DNA polymerase as examples, in order to provide a polymerase variant with high affinity for double-stranded DNA and high processivity. The recombinant protein provided in the implementation scheme of the present application, i.e., the chimeric A family polymerase, has achieved a structural transformation that allows the thumb domain to extend in the direction of the finger domain, reduces the space between the thumb domain and the finger domain, and keeps the template chain and part of the newly synthesized DNA chain between the thumb domain and the finger domain, thereby playing a role similar to the TPR2 domain in phi29 polymerase, strengthening the interaction between the thumb domain and the DNA double strand and forming a ring-shaped sliding clamp around the DNA template chain, thereby improving the processivity of A family DNA polymerases.

根据本发明的一个方面，本发明提供了一种重组蛋白，该重组蛋白的氨基酸序列是用在扩增反应中能够加强DNA聚合酶与目的DNA相互作用的氨基酸序列替换野生型的A家族DNA聚合酶中的同源序列获得的氨基酸序列。According to one aspect of the present invention, the present invention provides a recombinant protein, the amino acid sequence of which is obtained by replacing the homologous sequence in the wild-type A family DNA polymerase with an amino acid sequence that can enhance the interaction between the DNA polymerase and the target DNA in an amplification reaction.

需要说明的是，本发明的实施例提供的重组蛋白的拇指结构域中替换为一段嵌合蛋白序列。该序列加强了拇指结构域与DNA双链的互作，并且形成环绕DNA模板链的环形滑动夹，使聚合酶对DNA双链的亲和力提高，从而提高持续合成能力的效果。该重组蛋白也由此具有提高的持续合成能力，加强的热稳定性、聚合活性、DNA双链亲和力，可用于建库和测序中的多种DNA等温扩增反应。It should be noted that the thumb domain of the recombinant protein provided in the embodiment of the present invention is replaced with a chimeric protein sequence. This sequence strengthens the interaction between the thumb domain and the DNA double strand, and forms a circular sliding clamp around the DNA template strand, so that the affinity of the polymerase for the DNA double strand is increased, thereby improving the effect of processivity. The recombinant protein also has improved processivity, enhanced thermal stability, polymerization activity, and DNA double strand affinity, and can be used for a variety of DNA isothermal amplification reactions in library construction and sequencing.

需要说明的是，聚合酶的持续合成能力可定义为在一次结合中处理的核苷酸数量。DNA聚合酶的合成能力通常反应了合成率和合成速度，以及酶与底物的亲和力。合成能力高的DNA聚合酶适用于扩增长模板、具有二级结构和富含GC的序列，以及存在肝素、木聚糖和腐殖酸等PCR抑制剂的血液和植物组织样品。It should be noted that the processivity of a polymerase can be defined as the number of nucleotides processed in one incorporation. The processivity of a DNA polymerase generally reflects the rate and speed of synthesis, as well as the affinity of the enzyme for the substrate. DNA polymerases with high processivity are suitable for amplifying long templates, sequences with secondary structures and GC-rich sequences, as well as blood and plant tissue samples in the presence of PCR inhibitors such as heparin, xylan, and humic acid.

SEQ ID NO:1的序列为：The sequence of SEQ ID NO:1 is:

(加粗下划线为同源序列SEQ ID NO:2)

(The bold underline is the homologous sequence SEQ ID NO: 2)

SEQ ID NO:9的序列为：The sequence of SEQ ID NO:9 is:

(加粗下划线为同源序列SEQ ID NO:12)

(The bold underline is the homologous sequence SEQ ID NO: 12)

SEQ ID NO:10的序列为：The sequence of SEQ ID NO: 10 is:

(加粗下划线为同源序列SEQ ID NO:13)

(The bold underline is the homologous sequence SEQ ID NO: 13)

SEQ ID NO:11的序列为：The sequence of SEQ ID NO:11 is:

(加粗下划线为同源序列SEQ ID NO:14)

(The bold underlined sequence is SEQ ID NO: 14)

如SEQ ID NO:12所示的Taq DNA聚合酶大片段中的第199位至第209位氨基酸序列；The amino acid sequence from position 199 to position 209 in the large fragment of Taq DNA polymerase as shown in SEQ ID NO: 12;

如SEQ ID NO:13所示的大肠杆菌DNA聚合酶大片段中的第276位至第285位氨基酸序列；或The amino acid sequence from position 276 to position 285 of the large fragment of Escherichia coli DNA polymerase as shown in SEQ ID NO:13; or

如SEQ ID NO:14所示的大肠杆菌Bsu聚合酶大片段中的第249位至第257位氨基酸序列。The amino acid sequence from position 249 to 257 in the large fragment of Escherichia coli Bsu polymerase as shown in SEQ ID NO:14.

SEQ ID NO:2的序列为：VLKKTKTGY。The sequence of SEQ ID NO:2 is: VLKKTKTGY.

SEQ ID NO:12的序列为：AIGKTEKTGKRThe sequence of SEQ ID NO:12 is: AIGKTEKTGKR

SEQ ID NO:13的序列为：PLKKTPGGAPThe sequence of SEQ ID NO:13 is: PLKKTPGGAP

SEQ ID NO:14的序列为：VVKKTKTGYThe sequence of SEQ ID NO:14 is: VVKKTKTGY

在一些实施例中，A家族DNA聚合酶中的同源序列被替换为包含SEQ ID NO:3所示氨基酸序列的序列、基本上由SEQ ID NO:3所示氨基酸序列组成的序列、或由SEQ ID NO:3所示氨基酸序列组成的序列。SEQ ID NO:3的序列为：PNREMKNQGSKKTLGSTRRGIDNGRKLRLGRQF。In some embodiments, the homologous sequence in the A family DNA polymerase is replaced with a sequence comprising, consisting essentially of, or consisting of the amino acid sequence shown in SEQ ID NO: 3. The sequence of SEQ ID NO: 3 is: PNREMKNQGSKKTLGSTRRGIDNGRKLRLGRQF.

SEQ ID NO:4的序列为：The sequence of SEQ ID NO:4 is:

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

SEQ ID NO:15的序列为：The sequence of SEQ ID NO:15 is:

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

SEQ ID NO:16的序列为：The sequence of SEQ ID NO:16 is:

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

SEQ ID NO:17的序列为：The sequence of SEQ ID NO:17 is:

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

根据本发明的又一个方面，本发明提供了一种核酸，该核酸编码如第一方面任一实施方案所述的重组蛋白。According to yet another aspect of the present invention, the present invention provides a nucleic acid encoding the recombinant protein as described in any embodiment of the first aspect.

在一些实施例中，该核酸分子的核苷酸序列选自SEQ ID NO:5、SEQ ID NO:18、SEQ ID NO:19和SEQ ID NO:20中的至少一种。In some embodiments, the nucleotide sequence of the nucleic acid molecule is selected from at least one of SEQ ID NO:5, SEQ ID NO:18, SEQ ID NO:19 and SEQ ID NO:20.

SEQ ID NO:5的序列为：The sequence of SEQ ID NO:5 is:

SEQ ID NO:18的序列为：The sequence of SEQ ID NO: 18 is:

SEQ ID NO:19的序列为：The sequence of SEQ ID NO:19 is:

SEQ ID NO:20的序列为：The sequence of SEQ ID NO:20 is:

根据本发明的又一个方面，本发明的实施方案提供了一种载体，该载体包含如第二方面任一实施方案所述的核酸。According to still another aspect of the present invention, an embodiment of the present invention provides a vector comprising the nucleic acid as described in any embodiment of the second aspect.

根据本发明的又一个方面，本发明的实施方案提供了一种试剂盒，包含如第一方面任一实施方案所述的重组蛋白、第二方面任一实施方案所述的核酸、或第三方面任一实施方案所述的载体。According to another aspect of the present invention, an embodiment of the present invention provides a kit comprising the recombinant protein as described in any embodiment of the first aspect, the nucleic acid as described in any embodiment of the second aspect, or the vector as described in any embodiment of the third aspect.

根据本发明的又一个方面，本发明的实施方案提供了一种制备第一方面任一实施方案所述的重组蛋白的方法，该方法包括：用在扩增反应中能够加强DNA聚合酶与目的DNA相互作用的氨基酸序列替换野生型的A家族DNA聚合酶中的同源序列获得的氨基酸序列，以获得所述重组蛋白。According to another aspect of the present invention, an embodiment of the present invention provides a method for preparing the recombinant protein described in any embodiment of the first aspect, the method comprising: replacing the amino acid sequence obtained by replacing the homologous sequence in the wild-type A family DNA polymerase with an amino acid sequence that can enhance the interaction between the DNA polymerase and the target DNA in an amplification reaction to obtain the recombinant protein.

在一些实施例中，所述方法还包括表达并纯化所述重组蛋白序列。In some embodiments, the method further comprises expressing and purifying the recombinant protein sequence.

在一些具体实施例中，该方法具体包括：通过A家族聚合酶的序列数据库搜寻和同源序列对比，以确定候选嵌合序列用于替换野生型聚合酶中的同源序列；利用Alphafold、Autodock和Gromacs等工具对根据本发明实施例的具有嵌合序列的重组蛋白进行(1)结构预测、(2)DNA双链与重组蛋白的分子对接模拟和(3)动力学模拟分析；利用原核表达系统对模拟分析确定的重组蛋白进行表达和纯化，并测定其活性；并且上机测序确认蛋白序列。In some specific embodiments, the method specifically includes: searching through the sequence database of A family polymerase and comparing homologous sequences to determine a candidate chimeric sequence for replacing the homologous sequence in the wild-type polymerase; using tools such as Alphafold, Autodock and Gromacs to perform (1) structural prediction, (2) molecular docking simulation of DNA double strands and recombinant protein and (3) kinetic simulation analysis on the recombinant protein with a chimeric sequence according to an embodiment of the present invention; using a prokaryotic expression system to express and purify the recombinant protein determined by the simulation analysis, and determine its activity; and confirming the protein sequence by sequencing.

可以理解的是，该方法中的序列比对包括全局比对、局部比对、双重序列比对、多重序列比对等多种比对策略。It can be understood that the sequence alignment in this method includes various alignment strategies such as global alignment, local alignment, double sequence alignment, and multiple sequence alignment.

可以理解的是，A家族DNA聚合酶的结构较为保守相似，通过所述制备重组蛋白的方法，能够应用于多种A家族DNA聚合酶的同源序列替换，并制备所述重组蛋白。It can be understood that the structures of A family DNA polymerases are relatively conservative and similar, and the method for preparing recombinant proteins can be applied to replace homologous sequences of various A family DNA polymerases and prepare the recombinant proteins.

在本发明的实施例中，术语“分子对接”是指通过计算机模拟将小分子(配体)放置于大分子靶标(受体)的结合区域，使其通过调整物理化学参数计算配体与受体的结合力和结合方式，能够获得二者在其活性区域相结合时能量最低的构象。在本发明中，分子对接的配体与受体分别为嵌合聚合酶以及DNA。模拟分子和蛋白质在原子水平上的相互作用，有助于发现对嵌合聚合酶进行进一步改造的策略。In the embodiments of the present invention, the term "molecular docking" refers to placing a small molecule (ligand) in the binding area of a macromolecular target (receptor) by computer simulation, so that the binding force and binding mode of the ligand and the receptor can be calculated by adjusting the physicochemical parameters, and the lowest energy conformation can be obtained when the two are combined in their active areas. In the present invention, the ligand and receptor of molecular docking are chimeric polymerase and DNA, respectively. Simulating the interaction between molecules and proteins at the atomic level helps to discover strategies for further modification of chimeric polymerases.

在本发明的实施例中，术语“动力学模拟”是指用牛顿经典力学计算分子在空间中的轨迹，求解系统中的分子或原子间作用势能和系统外加约束共同作用的分子或原子的牛顿方程，模拟系统随时间推进的微观过程。蛋白质和配体的柔性使它们在相互结合时会寻求最吻合、能量最低的构象。通常情况下，对接完成后，通过打分函数得到的配体蛋白复合物准确度不是很高时，需要借助动力学模拟来计算结合能，验证蛋白与筛选出的小分子配体的有效结合情况。In an embodiment of the present invention, the term "dynamic simulation" refers to the use of Newton's classical mechanics to calculate the trajectory of molecules in space, solve the Newtonian equations of molecules or atoms in the system under the combined action potential energy and system constraints, and simulate the microscopic process of the system advancing over time. The flexibility of proteins and ligands enables them to seek the most consistent and lowest energy conformation when binding to each other. Usually, after docking is completed, when the accuracy of the ligand-protein complex obtained by the scoring function is not very high, it is necessary to use dynamic simulation to calculate the binding energy and verify the effective binding of the protein with the screened small molecule ligand.

在一些实施例中，所述A家族DNA聚合酶选自由大肠杆菌DNA聚合酶I、T3 DNA聚合酶、T5 DNA聚合酶、T7 DNA聚合酶、Taq DNA聚合酶、Bsu DNA聚合酶和Bst DNA聚合酶组成的组。In some embodiments, the A family DNA polymerase is selected from the group consisting of Escherichia coli DNA polymerase I, T3 DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, Taq DNA polymerase, Bsu DNA polymerase and Bst DNA polymerase.

在一些实施例中，所述A家族DNA聚合酶选自具有SEQ ID NO:1所示氨基酸序列的Bst DNA聚合酶、具有SEQ ID NO:9所示的氨基酸序列的Taq DNA聚合酶、具有SEQ ID NO:10所示氨基酸序列的大肠杆菌DNA聚合酶和具有SEQ ID NO:11所示氨基酸序列的Bsu DNA聚合酶中的至少一种。In some embodiments, the A family DNA polymerase is selected from at least one of a Bst DNA polymerase having an amino acid sequence as shown in SEQ ID NO:1, a Taq DNA polymerase having an amino acid sequence as shown in SEQ ID NO:9, an Escherichia coli DNA polymerase having an amino acid sequence as shown in SEQ ID NO:10, and a Bsu DNA polymerase having an amino acid sequence as shown in SEQ ID NO:11.

在一些实施例中，所述A家族DNA聚合酶中的同源序列来源于其拇指结构域。In some embodiments, the homologous sequence in the A family DNA polymerase is derived from its thumb domain.

在一些实施例中，所述A家族DNA聚合酶中的同源序列被替换为包含SEQ ID NO:3所示氨基酸序列的序列、基本上由SEQ ID NO:3所示氨基酸序列组成的序列、或由SEQ ID NO:3所示氨基酸序列组成的序列。In some embodiments, the homologous sequence in the A family DNA polymerase is replaced by a sequence comprising the amino acid sequence shown in SEQ ID NO:3, a sequence essentially consisting of the amino acid sequence shown in SEQ ID NO:3, or a sequence consisting of the amino acid sequence shown in SEQ ID NO:3.

在一些实施例中，所述重组蛋白的氨基酸序列选自SEQ ID NO:4、SEQ ID NO:15、SEQ ID NO:16和SEQ ID NO:17中的至少一种。In some embodiments, the amino acid sequence of the recombinant protein is selected from at least one of SEQ ID NO:4, SEQ ID NO:15, SEQ ID NO:16 and SEQ ID NO:17.

根据本发明的又一个方面，本发明的实施方案提供了一种扩增目的DNA的方法，使用前述第一方面任一实施方案所述的重组蛋白对目的DNA进行扩增。According to yet another aspect of the present invention, an embodiment of the present invention provides a method for amplifying a target DNA, wherein the target DNA is amplified using the recombinant protein described in any embodiment of the first aspect.

实施例Example

实施例1Example 1

本实施例基于A家族DNA聚合酶中Bst聚合酶、Taq DNA聚合酶、大肠杆菌DNA聚合酶和Bsu DNA聚合酶的序列，进行数据库搜寻和MSA同源序列对比，以确定候选嵌合序列用于替换野生型聚合酶中的同源序列；利用Alphafold、Autodock和Gromacs等工具对本发明实施例中的具有嵌合序列的重组蛋白进行结构预测，DNA双链与重组蛋白的分子对接和动力学模拟分析等计算模拟。This embodiment is based on the sequences of Bst polymerase, Taq DNA polymerase, Escherichia coli DNA polymerase and Bsu DNA polymerase in the A family DNA polymerase, and performs database search and MSA homologous sequence comparison to determine candidate chimeric sequences for replacing homologous sequences in wild-type polymerases; uses tools such as Alphafold, Autodock and Gromacs to perform structural prediction on the recombinant protein with chimeric sequence in the embodiment of the present invention, and performs computational simulations such as molecular docking and kinetic simulation analysis of the DNA double chain and the recombinant protein.

1.1确定嵌合序列1.1 Determination of chimeric sequences

利用Protein-blast工具在NCBI蛋白质序列库中比对A家族聚合酶的同源序列，确定出了嵌合序列(SEQ ID NO:3)与Bst DNA聚合酶拇指结构域中氨基酸第546-554位(SEQ ID NO:2)、Taq DNA聚合酶大片段拇指结构域中氨基酸第199-209位(SEQ ID NO:12)、大肠杆菌DNA聚合酶大片段拇指结构域中氨基酸第276-285位(SEQ ID NO:13)和Bsu DNA聚合酶大片段拇指结构域中氨基酸第249-257位(SEQ ID NO:14)同源，比对结果如图2所示。该嵌合序列来自人类DNA聚合θ(Human pol theta，POLθ)，并且SEQ ID NO:3比SEQ ID NO:2长约24个氨基酸、比SEQ ID NO:12长约XX个氨基酸、比SEQ ID NO:13长约XX个氨基酸、比SEQ ID NO:14长约XX个氨基酸。Protein-blast tool was used to compare homologous sequences of A family polymerases in the NCBI protein sequence library, and it was determined that the chimeric sequence (SEQ ID NO: 3) was homologous to amino acids 546-554 in the thumb domain of Bst DNA polymerase (SEQ ID NO: 2), amino acids 199-209 in the thumb domain of Taq DNA polymerase large fragment (SEQ ID NO: 12), amino acids 276-285 in the thumb domain of Escherichia coli DNA polymerase large fragment (SEQ ID NO: 13), and amino acids 249-257 in the thumb domain of Bsu DNA polymerase large fragment (SEQ ID NO: 14). The comparison results are shown in Figure 2. The chimeric sequence is derived from human DNA polymerase θ (Human pol theta, POLθ), and SEQ ID NO:3 is approximately 24 amino acids longer than SEQ ID NO:2, approximately XX amino acids longer than SEQ ID NO:12, approximately XX amino acids longer than SEQ ID NO:13, and approximately XX amino acids longer than SEQ ID NO:14.

1.2用嵌合序列替换野生型A家族DNA聚合酶中的同源序列1.2 Replacement of homologous sequences in wild-type A-family DNA polymerases with chimeric sequences

1.2.1根据上述同源序列比对，将野生型Bst DNA聚合酶大片段(Bst-LF WT)中的同源序列SEQ ID NO:2替换为嵌合序列SEQ ID NO:3，从而得到嵌合型Bst DNA聚合酶，具有SEQ ID NO:4所述的序列。1.2.1 According to the above homologous sequence alignment, the homologous sequence SEQ ID NO:2 in the wild-type Bst DNA polymerase large fragment (Bst-LF WT) was replaced with the chimeric sequence SEQ ID NO:3 to obtain a chimeric Bst DNA polymerase having the sequence described in SEQ ID NO:4.

野生型Bst DNA聚合酶大片段(Bst-LF WT)的序列SEQ ID NO:1：The sequence of wild-type Bst DNA polymerase large fragment (Bst-LF WT) SEQ ID NO: 1:

>Bst-LF WT[Geobacillus stearothermophilus]氨基酸序列(293-831)>Bst-LF WT[Geobacillus stearothermophilus]amino acid sequence (293-831)

(加粗下划线为同源序列SEQ ID NO:2)

(The bold underline is the homologous sequence SEQ ID NO: 2)

嵌合型Bst DNA聚合酶(Bst-HS-1)的氨基酸序列(SEQ ID NO:4)：Amino acid sequence of chimeric Bst DNA polymerase (Bst-HS-1) (SEQ ID NO: 4):

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

1.2.2根据上述同源序列比对，将野生型Taq DNA聚合酶大片段(Taq-LF WT)中的同源序列SEQ ID NO:12替换为嵌合序列SEQ ID NO:3，从而得到嵌合型Taq DNA聚合酶，具有SEQ ID NO:15所述的序列。1.2.2 According to the above homologous sequence alignment, the homologous sequence SEQ ID NO:12 in the wild-type Taq DNA polymerase large fragment (Taq-LF WT) was replaced with the chimeric sequence SEQ ID NO:3, thereby obtaining a chimeric Taq DNA polymerase having the sequence described in SEQ ID NO:15.

野生型Taq DNA聚合酶大片段(Taq-LF WT)的序列SEQ ID NO:9：The sequence of wild-type Taq DNA polymerase large fragment (Taq-LF WT) is SEQ ID NO:9:

>Taq-LF WT[Thermus aquaticus]氨基酸序列(1-529)>Taq-LF WT[Thermus aquaticus]amino acid sequence (1-529)

(加粗下划线为同源序列SEQ ID NO:12)

(The bold underline is the homologous sequence SEQ ID NO: 12)

嵌合型Taq DNA聚合酶(Taq-HS-1)的氨基酸序列(SEQ ID NO:15)：Amino acid sequence of chimeric Taq DNA polymerase (Taq-HS-1) (SEQ ID NO: 15):

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

1.2.3根据上述同源序列比对，将野生型大肠杆菌DNA聚合酶大片段(Ecoli-LF WT)中的同源序列SEQ ID NO:13替换为嵌合序列SEQ ID NO:3，从而得到嵌合型大肠杆菌DNA聚合酶，具有SEQ ID NO:16所述的序列。1.2.3 According to the above homologous sequence alignment, the homologous sequence SEQ ID NO:13 in the wild-type Escherichia coli DNA polymerase large fragment (Ecoli-LF WT) was replaced with the chimeric sequence SEQ ID NO:3, thereby obtaining a chimeric Escherichia coli DNA polymerase having the sequence described in SEQ ID NO:16.

野生型大肠杆菌DNA聚合酶大片段(Ecoli-LF WT)的序列SEQ ID NO:10：The sequence of wild-type Escherichia coli DNA polymerase large fragment (Ecoli-LF WT) SEQ ID NO: 10:

>Ecoli-LF WT[Escherichia coli]氨基酸序列(1-606)>Ecoli-LF WT[Escherichia coli]amino acid sequence (1-606)

(加粗下划线为同源序列SEQ ID NO:13)

(The bold underline is the homologous sequence SEQ ID NO: 13)

嵌合型大肠杆菌DNA聚合酶(Ecoli-HS-1)的氨基酸序列(SEQ ID NO:16)：Amino acid sequence of chimeric Escherichia coli DNA polymerase (Ecoli-HS-1) (SEQ ID NO: 16):

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

1.2.4根据上述同源序列比对，将野生型Bsu DNA聚合酶大片段(Bsu-LF WT)中的同源序列SEQ ID NO:14替换为嵌合序列SEQ ID NO:3，从而得到嵌合型Bsu DNA聚合酶，具有SEQ ID NO:17所述的序列。1.2.4 According to the above homologous sequence alignment, the homologous sequence SEQ ID NO:14 in the wild-type Bsu DNA polymerase large fragment (Bsu-LF WT) was replaced with the chimeric sequence SEQ ID NO:3 to obtain a chimeric Bsu DNA polymerase having the sequence described in SEQ ID NO:17.

野生型Bsu DNA聚合酶大片段(Bsu-LF WT)的序列SEQ ID NO:11：The sequence of wild-type Bsu DNA polymerase large fragment (Bsu-LF WT) is SEQ ID NO: 11:

>Bsu-LF WT[Bacillus subtilis]氨基酸序列(1-603)>Bsu-LF WT[Bacillus subtilis] amino acid sequence (1-603)

(加粗下划线为同源序列SEQ ID NO:14)

(The bold underlined sequence is SEQ ID NO: 14)

嵌合型Bsu DNA聚合酶(Bsu-HS-1)的氨基酸序列(SEQ ID NO:17)：Amino acid sequence of chimeric Bsu DNA polymerase (Bsu-HS-1) (SEQ ID NO: 17):

(加粗下划线为嵌合序列SEQ ID NO:3)

(The bold underline is the chimeric sequence SEQ ID NO: 3)

1.3结构预测1.3 Structure prediction

利用Alphafold2对嵌合型A家族DNA聚合酶Bst-HS-1、Taq-HS-1、Ecoli-HS-1和Bsu-HS-1的序列进行重组蛋白三级结构预测。Alphafold2 was used to predict the tertiary structure of recombinant proteins of the chimeric A family DNA polymerases Bst-HS-1, Taq-HS-1, Ecoli-HS-1 and Bsu-HS-1.

1.4分子对接模拟1.4 Molecular docking simulation

基于所预测的结构，利用Autodock工具在dNTP的存在下对DNA双链与嵌合型A家族DNA聚合酶Bst-HS-1、Taq-HS-1、Ecoli-HS-1和Bsu-HS-1的进行分子对接模拟，确认改造后具有延长的拇指结构域的嵌合型DNA聚合酶与目的DNA的结合强度。如图3所示，分子对接模拟显示了嵌合序列不仅缩小了拇指结构域与手指结构域之间的空间，从而使模板链保持在拇指结构域和手指结构域之间，而且能够与新合成的双链DNA持续保持紧密结合，从而稳定了聚合酶-DNA结构。Based on the predicted structure, the molecular docking simulation of the DNA double-stranded with the chimeric A family DNA polymerase Bst-HS-1, Taq-HS-1, Ecoli-HS-1 and Bsu-HS-1 was performed using the Autodock tool in the presence of dNTP to confirm the binding strength of the chimeric DNA polymerase with an extended thumb domain after modification to the target DNA. As shown in Figure 3, the molecular docking simulation shows that the chimeric sequence not only reduces the space between the thumb domain and the finger domain, thereby keeping the template chain between the thumb domain and the finger domain, but also can maintain a tight binding with the newly synthesized double-stranded DNA, thereby stabilizing the polymerase-DNA structure.

1.5配体结构优化1.5 Ligand structure optimization

用Gromacs工具对接后的嵌合型A家族DNA聚合酶Bst-HS-1、Taq-HS-1、Ecoli-HS-1和Bsu-HS-1的与DNA分子的复合物利进行配体结构优化。The complexes of chimeric A family DNA polymerases Bst-HS-1, Taq-HS-1, Ecoli-HS-1 and Bsu-HS-1 with DNA molecules were docked using the Gromacs tool for ligand structure optimization.

1.6动力学模拟1.6 Dynamic simulation

将经过优化的结构进行25ns动力学模拟，模拟温度为330K。如图4所示，嵌合型Bst DNA聚合酶中手指结构域中的第515至517位氨基酸序列NFN与拇指结构域中的K268可持续保持在4-6埃的距离，使DNA模板链能被夹在二者形成的口袋结构中，同时延长的区域也能和新合成的双链DNA持续保持紧密结合。The optimized structure was subjected to 25 ns kinetic simulation at a simulation temperature of 330 K. As shown in Figure 4, the amino acid sequence NFN at positions 515 to 517 in the finger domain of the chimeric Bst DNA polymerase and K268 in the thumb domain can be continuously maintained at a distance of 4-6 angstroms, allowing the DNA template chain to be clamped in the pocket structure formed by the two, while the extended region can also maintain a tight bond with the newly synthesized double-stranded DNA.

此外，实验结构表明，还可以对嵌合序列中的数个位点进行额外突变，以加强嵌合体的热稳定性及聚合活性，并能进一步提高嵌合型聚合酶对DNA双链的亲和力及持续合成能力。In addition, experimental structures have shown that additional mutations can be made at several sites in the chimeric sequence to enhance the thermal stability and polymerization activity of the chimera, and to further improve the affinity of the chimeric polymerase for double-stranded DNA and its ability to continuously synthesize.

对于所有A家族聚合酶，如Bst、Taq、Ecoli、BSU等，都可以在相应的拇指区域插入该嵌合片段，即可起到与Bst上改造结构类似作用，加强与DNA双链的亲和力及持续合成能力。For all A family polymerases, such as Bst, Taq, Ecoli, BSU, etc., the chimeric fragment can be inserted into the corresponding thumb region, which can play a similar role to the modified structure on Bst, enhancing the affinity with the DNA double helix and the continuous synthesis ability.

下列数据为4种A家族聚合酶与其嵌合突变体和DNA双链结合能模拟计算的对比，可见所有聚合酶在加入嵌合片段后都能加强与DNA双链的亲和力：The following data is a comparison of the binding energy simulation calculations of four A-family polymerases and their chimeric mutants to double-stranded DNA. It can be seen that all polymerases can enhance their affinity with double-stranded DNA after adding chimeric fragments:

1.Bst-LF WT：-164.2kJ/mol1.Bst-LF WT: -164.2 kJ/mol

Bst-HS-1：-272.0kJ/molBst-HS-1: -272.0 kJ/mol

ΔΔG＝-107.8kJ/molΔΔG＝-107.8kJ/mol

2.Taq-LF WT：-150.2kJ/mol2.Taq-LF WT: -150.2 kJ/mol

Taq-HS-1：-277.6kJ/molTaq-HS-1: -277.6 kJ/mol

ΔΔG＝-127.4kJ/molΔΔG＝-127.4kJ/mol

3.Ecoli-LF WT：-186.3kJ/mol3.Ecoli-LF WT: -186.3 kJ/mol

Ecoli-HS-1：-327.9kJ/molEcoli-HS-1: -327.9 kJ/mol

ΔΔG＝-141.6kJ/molΔΔG＝-141.6 kJ/mol

4.BSU-LF WT：-178.4kJ/mol4.BSU-LF WT: -178.4 kJ/mol

BSU-HS-1：-283.3kJ/molBSU-HS-1: -283.3 kJ/mol

ΔΔG＝-104.9kJ/molΔΔG＝-104.9 kJ/mol

实施例2Example 2

本实施例基于合理设计的嵌合型聚合酶Bst-HS-1，利用原核表达系统对其进行蛋白的表达和纯化，以获得用于实际应用的Bst-HS嵌合型DNA聚合酶。In this example, based on the rationally designed chimeric polymerase Bst-HS-1, a prokaryotic expression system was used to express and purify the protein to obtain the Bst-HS chimeric DNA polymerase for practical application.

2.1制备嵌合型DNA聚合酶质粒2.1 Preparation of chimeric DNA polymerase plasmid

针对实施例1确定的嵌合型DNA聚合酶序列(即SEQ ID NO:4)设计用于引入嵌入序列(SEQ ID NO:3)的上游引物和下游引物，以包含编码野生型Bst DNA聚合酶(SEQ ID NO:1)的核酸序列(SEQ ID NO:6)质粒为模板，使用诺唯赞点突变试剂盒(C112-01) 进行点突变PCR，以获得编码DNA聚合酶序列(即SEQ ID NO:4)的核酸序列(SEQ ID NO:5)。For the chimeric DNA polymerase sequence determined in Example 1 (i.e., SEQ ID NO:4), upstream primers and downstream primers for introducing the embedded sequence (SEQ ID NO:3) were designed. Using the plasmid containing the nucleic acid sequence (SEQ ID NO:6) encoding the wild-type Bst DNA polymerase (SEQ ID NO:1) as a template, point mutation PCR was performed using the Novezum Point Mutation Kit (C112-01) to obtain the nucleic acid sequence (SEQ ID NO:5) encoding the DNA polymerase sequence (i.e., SEQ ID NO:4).

编码野生型Bst DNA聚合酶的核酸序列(SEQ ID NO:6)的序列为：The sequence of the nucleic acid sequence encoding the wild-type Bst DNA polymerase (SEQ ID NO:6) is:

2.1.1按照如表1所示的PCR反应体系和如表2所示的PCR反应条件进行上述点突变PCR，以获得包含扩增产物的反应溶液。2.1.1 The above-mentioned point mutation PCR was performed according to the PCR reaction system shown in Table 1 and the PCR reaction conditions shown in Table 2 to obtain a reaction solution containing an amplified product.

表1 PCR反应体系Table 1 PCR reaction system

上游引物(SEQ ID NO:7)：TACTTTCAGGGCGCCGAGGGTGAAAAACCTCTGUpstream primer (SEQ ID NO:7): TACTTTCAGGGCGCCGAGGGTGAAAAACCTCTG

下游引物(SEQ ID NO:8)：GGGTTAACCTTATTTAGCATCATACCAGGTCGGGDownstream primer (SEQ ID NO:8): GGGTTAACCTTATTTAGCATCATACCAGGTCGGG

表2 PCR反应条件Table 2 PCR reaction conditions

2.1.2向体积为20μl的包含扩增产物的反应溶液添加0.4μl dpn1酶，在37℃的温度对扩增产物消化3小时，以获得包含消化产物的溶液。2.1.2 Add 0.4 μl dpn1 enzyme to a 20 μl reaction solution containing the amplified product, and digest the amplified product at 37°C for 3 hours to obtain a solution containing the digested product.

2.1.3取体积为5μl的包含消化产物的溶液对100μl感受态细胞进行转化，并将转化后的感受态细胞涂板于固体LB培养基，在4℃过夜孵育。2.1.3 Take 5 μl of the solution containing the digestion product to transform 100 μl of competent cells, and then plate the transformed competent cells on solid LB medium and incubate at 4°C overnight.

2.1.4挑取单克隆菌落，在37℃对其摇菌后送样测序，以获得测序结果。2.1.4 Pick a single clone, shake it at 37°C, and send it for sequencing to obtain the sequencing results.

2.1.5将测序结果进行比对，以确定含有正确突变的质粒。2.1.5 Compare the sequencing results to determine the plasmid containing the correct mutation.

2.2 Bst-HS嵌合型突变体的诱导表达2.2 Inducible expression of Bst-HS chimeric mutants

2.2.1利用Bst-HS嵌合突变体质粒对BL21感受态细胞进行转化，以获得转化有 Bst-HS嵌合突变体的细胞。2.2.1 Use the Bst-HS chimeric mutant plasmid to transform BL21 competent cells to obtain cells transformed with the Bst-HS chimeric mutant.

2.2.2向50ml离心管加入转化有Bst-HS嵌合突变体的细胞和20mL新鲜培养基，并在37℃的温度以220rpm振摇培养过夜，以获得第一菌液。2.2.2 Add cells transformed with the Bst-HS chimeric mutant and 20 mL of fresh culture medium to a 50 ml centrifuge tube, and culture overnight at 37° C. with shaking at 220 rpm to obtain the first bacterial liquid.

2.2.3将20ml第一菌液转移到含1L培养基的5L锥形瓶中，在37℃以220rpm振摇培养，直到OD值为0.8-1获得第二菌液。2.2.3 Transfer 20 ml of the first bacterial solution to a 5 L conical flask containing 1 L of culture medium, and culture at 37 °C with shaking at 220 rpm until the OD value reaches 0.8-1 to obtain the second bacterial solution.

2.2.4向第二菌液加入IPTG诱导剂达到IPTG的终浓度为0.2mM，并在16℃的温度进行孵育，持续16小时，以获得第三菌液，该第三菌液包含表达Bst-HS嵌合突变体的菌体。2.2.4 Add IPTG inducer to the second bacterial solution to reach a final IPTG concentration of 0.2 mM, and incubate at 16° C. for 16 hours to obtain a third bacterial solution containing bacteria expressing the Bst-HS chimeric mutant.

2.2.5将第三菌液以12000rpm离心10min，收集作为沉淀物的菌体。2.2.5 Centrifuge the third bacterial solution at 12000 rpm for 10 min and collect the bacterial cells as the precipitate.

2.3 Bst-HS嵌合突变体蛋白纯化2.3 Purification of Bst-HS chimeric mutant protein

2.3.1将沉淀物混悬于40ml A溶液，并在700bar的压力下以高压破碎机对菌体进行破碎，持续5分钟，以获得第四菌液。2.3.1 Suspend the precipitate in 40 ml of solution A and crush the bacteria with a high-pressure crusher at a pressure of 700 bar for 5 minutes to obtain the fourth bacterial solution.

A溶液(pH7.4)Solution A (pH 7.4)

2.3.2在60℃温度的水浴中对第四菌液进行热处理，持续15分钟，以获得终溶液。将终溶液以13000rpm离心30分钟，然后用0.22μm滤膜对上清液进行过滤并收集滤液。2.3.2 Heat-treat the fourth bacterial solution in a water bath at 60°C for 15 minutes to obtain a final solution. Centrifuge the final solution at 13,000 rpm for 30 minutes, and then filter the supernatant with a 0.22 μm filter membrane and collect the filtrate.

2.3.3将滤液加载到镍柱上进行纯化，其中用50％B溶液进行洗脱，以获得目的蛋白。2.3.3 The filtrate was loaded onto a nickel column for purification, wherein 50% B solution was used for elution to obtain the target protein.

B溶液(pH7.4)Solution B (pH 7.4)

2.3.4通过透析的方式，将包含目的蛋白的B溶液置换为透析液，以获得纯化后的目的蛋白。2.3.4 The solution B containing the target protein is replaced with dialysate by dialysis to obtain the purified target protein.

透析液(pH7.6)Dialysate (pH 7.6)

2.3.5将纯化后的目的蛋白分装储存，以用于后续的活性测定。2.3.5 The purified target protein is packaged and stored for subsequent activity determination.

实施例3Example 3

本实施例中，对纯化后的嵌合型DNA聚合酶Bst-HS与野生型DNA聚合酶大片段Bst-LF-WT进行聚合活性测试，对比二者的持续合成能力。具有His6标签的野生型Bst DNA聚合酶溶液及实施例2制备的具有His6标签的嵌合型DNA聚合酶Bst DNA溶液为待测酶溶液。In this example, the purified chimeric DNA polymerase Bst-HS and the wild-type DNA polymerase large fragment Bst-LF-WT were tested for polymerization activity to compare their processivity. The wild-type Bst DNA polymerase solution with a His6 tag and the chimeric DNA polymerase Bst DNA solution with a His6 tag prepared in Example 2 were the enzyme solutions to be tested.

3.1嵌合型DNA聚合酶在RCA扩增中的合成能力3.1 Synthetic capacity of chimeric DNA polymerase in RCA amplification

3.1.1制备文库3.1.1 Library preparation

将40μl的初始单链环状DNA(自合成Ecoli V3单链环状文库，1.82ng/μl)与1.06μl的文库互补引物(5μM)以1:1的摩尔比混合，并按照表2所述的条件进行退火，以获得文库+引物的混合物(浓度约为1.77ng/μl，)。40 μl of initial single-stranded circular DNA (self-synthesized Ecoli V3 single-stranded circular library, 1.82 ng/μl) was mixed with 1.06 μl of library complementary primer (5 μM) at a molar ratio of 1:1 and annealed according to the conditions described in Table 2 to obtain a mixture of library + primer (concentration of approximately 1.77 ng/μl).

3.1.2 RCA反应3.1.2 RCA reaction

将文库+引物的混合物配制在如下表3的RCA反应体系中，并在65℃的温度孵育1小时。使用0.1μl的0.5M EDTA终止扩增反应，以获得RCA反应产物。Prepare the library + primer mixture in the RCA reaction system as shown in Table 3 below and incubate at 65°C for 1 hour. Use 0.1 μl of 0.5 M EDTA to terminate the amplification reaction to obtain the RCA reaction product.

表3 RCA反应体系Table 3 RCA reaction system

3.1.3 RCA产物浓度检测3.1.3 RCA product concentration detection

使用qubit ssDNA检测试剂盒，按说明书操作，使用Qubit fluorometor 3.0检测RCA反应产物浓度。结果如下：Use the Qubit ssDNA detection kit according to the instructions and use Qubit fluorometor 3.0 to detect the concentration of RCA reaction products. The results are as follows:

Bst-LF-WT:33.6ng/uLBst-LF-WT:33.6ng/uL

Bst-HS-1:39.9ng/uLBst-HS-1:39.9ng/uL

由此可见，在RCA等温扩增中，重组蛋白(即嵌合型A家族DNA聚合酶)的合成能力高于野生型A家族DNA聚合酶。This shows that in RCA isothermal amplification, the synthesis capacity of the recombinant protein (i.e., the chimeric A family DNA polymerase) is higher than that of the wild-type A family DNA polymerase.

3.2嵌合型DNA聚合酶在MDA扩增中的合成能力3.2 Synthetic capacity of chimeric DNA polymerase in MDA amplification

3.2.1 DNB的制备3.2.1 Preparation of DNB

3.2.1.1将单链环状DNA(自合成Ecoli V3单链环状文库，浓度为c)与DNB制备缓冲液、和TE缓冲液混合在PCR管中，并将PCR管置于冰盒上约0.5小时。使用漩涡振荡器对PCR管震荡5秒以使混合物混合均匀，短暂离心并置于冰上备用。将PCR管置于PCR仪中进行如下程序：95℃持续1min，65℃持续1min，40℃持续1min，热盖温度为102℃，4℃保持。3.2.1.1 Mix the single-stranded circular DNA (self-synthesized Ecoli V3 single-stranded circular library, concentration c) with DNB preparation buffer and TE buffer in a PCR tube, and place the PCR tube on an ice box for about 0.5 hours. Use a vortex oscillator to shake the PCR tube for 5 seconds to mix the mixture evenly, centrifuge briefly and place on ice for later use. Place the PCR tube in a PCR instrument and perform the following program: 95℃ for 1min, 65℃ for 1min, 40℃ for 1min, the hot cover temperature is 102℃, and 4℃ is maintained.

3.2.1.2反应后，将PCR管置转移至冰上，并向PCR管中加入如下表所示体积的DNB聚合酶混合液Ⅰ和DNB聚合酶混合液Ⅱ。将PCR管再次置于PCR仪中，进行如下程序：30℃持续30分钟，热盖温度35℃，4℃保持。3.2.1.2 After the reaction, transfer the PCR tube to ice and add the volumes of DNB polymerase mixture I and DNB polymerase mixture II as shown in the table below. Place the PCR tube in the PCR instrument again and perform the following program: 30℃ for 30 minutes, hot cover temperature 35℃, 4℃ hold.

3.2.1.3向PCR管加入1微升DNB终止缓冲液，震荡混匀以获得DNB。3.2.1.3 Add 1 μl of DNB stop buffer to the PCR tube and vortex to mix to obtain DNB.

3.2.2 MDA反应3.2.2 MDA reaction

3.2.2.1取3.2.1制备的DNB，按下述反应体系在PCR管中混匀，进行如下程序：95℃ 持续1分钟，65℃持续1分钟，40℃持续1分钟，热盖温度为102℃，4℃保持。3.2.2.1 Take the DNB prepared in 3.2.1, mix them in a PCR tube according to the following reaction system, and carry out the following procedure: 95℃ for 1 minute, 65℃ for 1 minute, 40℃ for 1 minute, the hot cover temperature is 102℃, and maintained at 4℃.

3.2.2.2反应后，将PCR管转移至冰上，并向PCR管中加入下表所示成分。混匀后，将PCR管再次置于PCR仪中，进行如下程序：65℃持续30分钟；热盖温度70℃，4℃保持。3.2.2.2 After the reaction, transfer the PCR tube to ice and add the ingredients shown in the table below to the PCR tube. After mixing, place the PCR tube in the PCR instrument again and perform the following program: 65℃ for 30 minutes; hot cover temperature 70℃, maintained at 4℃.

3.2.3MDA产物浓度检测3.2.3 MDA product concentration detection

采用Qubit dsDNA Assay Kit按说明书操作，使用Qubit fluorometor 3.0检测MDA产物浓度。The Qubit dsDNA Assay Kit was used according to the instructions, and the MDA product concentration was detected using Qubit fluorometor 3.0.

检测结果如下：The test results are as follows:

Bst-LF-WT:29.6ng/uLBst-LF-WT:29.6ng/uL

Bst-HS-1:34.8ng/uLBst-HS-1:34.8ng/uL

由此可见，在MDA等温扩增中，重组蛋白(即嵌合型A家族DNA聚合酶)的合成能力高于野生型A家族DNA聚合酶。This shows that in MDA isothermal amplification, the synthesis capacity of the recombinant protein (i.e., chimeric A family DNA polymerase) is higher than that of the wild-type A family DNA polymerase.

在本发明中，术语“第一”、“第二”仅用于描述目的，而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此，限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。在本发明的描述中，“多个”的含义是至少两个，例如两个，三个等，除非另有明确具体的限定。In the present invention, the terms "first" and "second" are used for descriptive purposes only and cannot be understood as indicating or implying relative importance or implicitly indicating the number of the indicated technical features. Therefore, the features defined as "first" and "second" may explicitly or implicitly include at least one of the features. In the description of the present invention, the meaning of "plurality" is at least two, such as two, three, etc., unless otherwise clearly and specifically defined.

在本发明中，术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中，对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且，描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外，在不相互矛盾的情况下，本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the present invention, the terms "one embodiment", "some embodiments", "examples", "specific examples", or "some examples" etc. mean that the specific features, structures, materials or characteristics described in conjunction with the embodiment or example are included in at least one embodiment or example of the present invention. In this specification, the schematic representations of the above terms do not necessarily refer to the same embodiment or example. Moreover, the described specific features, structures, materials or characteristics may be combined in any one or more embodiments or examples in a suitable manner. In addition, those skilled in the art may combine and combine different embodiments or examples and features of different embodiments or examples described in this specification without contradiction.

尽管上面已经示出和描述了本发明的实施例，可以理解的是，上述实施例是示例性的，不能理解为对本发明的限制，本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。Although the embodiments of the present invention have been shown and described above, it is to be understood that the above embodiments are exemplary and are not to be construed as limitations of the present invention. A person skilled in the art may change, modify, replace and vary the above embodiments within the scope of the present invention.

Claims

A recombinant protein, characterized in that the amino acid sequence of the recombinant protein is obtained by replacing the homologous sequence in the wild-type A family DNA polymerase with an amino acid sequence that can enhance the interaction between the DNA polymerase and the target DNA in an amplification reaction.

The recombinant protein according to claim 1 is characterized in that the A family DNA polymerase is selected from the group consisting of Escherichia coli DNA polymerase I, T3 DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, Taq DNA polymerase, Bsu DNA polymerase and Bst DNA polymerase.

The recombinant protein according to claim 1 or 2 is characterized in that the A family DNA polymerase is selected from at least one of Bst DNA polymerase having the amino acid sequence shown in SEQ ID NO: 1, Taq DNA polymerase having the amino acid sequence shown in SEQ ID NO: 9, Escherichia coli DNA polymerase having the amino acid sequence shown in SEQ ID NO: 10 and Bsu DNA polymerase having the amino acid sequence shown in SEQ ID NO: 11.

The recombinant protein according to any one of claims 1 to 3, characterized in that the homologous sequence in the A family DNA polymerase is derived from its thumb domain.

The recombinant protein according to any one of claims 1 to 4, characterized in that the homologous sequence in the A family DNA polymerase is selected from at least one of the following:

The amino acid sequence from position 546 to position 554 in the Bst DNA polymerase as shown in SEQ ID NO:2;

The amino acid sequence from position 199 to position 209 of the Taq DNA polymerase as shown in SEQ ID NO: 12;

The amino acid sequence from position 276 to position 285 of the Escherichia coli DNA polymerase as shown in SEQ ID NO:13; or

The amino acid sequence from position 249 to 257 in the Escherichia coli Bsu polymerase as shown in SEQ ID NO:14.

The recombinant protein according to any one of claims 1 to 5, characterized in that the homologous sequence in the A family DNA polymerase is replaced by a sequence comprising the amino acid sequence shown in SEQ ID NO: 3, a sequence essentially consisting of the amino acid sequence shown in SEQ ID NO: 3, or a sequence consisting of the amino acid sequence shown in SEQ ID NO: 3.

The recombinant protein according to any one of claims 1 to 6, characterized in that the amino acid sequence of the recombinant protein is selected from at least one of SEQ ID NO: 4, SEQ ID NO: 15, SEQ ID NO: 16 and SEQ ID NO: 17.

The recombinant protein according to any one of claims 1 to 7, characterized in that the recombinant protein further comprises any one of additional conservative mutations, additions and deletions.

The recombinant protein according to any one of claims 1 to 8, characterized in that the processivity of the recombinant protein as a chimeric DNA polymerase is higher than that of the wild-type A family DNA polymerase.

A nucleic acid, characterized in that the nucleic acid encodes the recombinant protein according to any one of claims 1 to 9.

The nucleic acid according to claim 10 is characterized in that the nucleotide sequence of the nucleic acid molecule is selected from at least one of SEQ ID NO: 5, SEQ ID NO: 18, SEQ ID NO: 19 and SEQ ID NO: 20.

A vector, characterized in that the vector comprises the nucleic acid according to claim 11.

A kit comprising the recombinant protein according to any one of claims 1 to 9, the nucleic acid according to claim 10 or 11, or the vector according to claim 12.

A method for preparing a recombinant protein according to any one of claims 1 to 9, characterized in that the method comprises: replacing an amino acid sequence obtained by replacing a homologous sequence in a wild-type A family DNA polymerase with an amino acid sequence that can enhance the interaction between the DNA polymerase and the target DNA in an amplification reaction, so as to obtain the recombinant protein.

The method for preparing a recombinant protein according to claim 14, characterized in that the method comprises:

performing a sequence alignment on a plurality of amino acid sequences from the wild-type A-family DNA polymerase to identify chimeric sequences; and

The chimeric sequence is used to replace the homologous sequence in the wild-type A family DNA polymerase sequence to obtain a candidate recombinant protein sequence.

The method according to claim 15, characterized in that the method further comprises:

Performing computer simulation analysis on the candidate recombinant protein sequence to determine the recombinant protein sequence,

The recombinant protein sequence shown therein exhibits a reduced spatial configuration between the thumb domain and the finger domains.

The method according to claim 16, characterized in that the computer simulation analysis of the candidate recombinant protein sequence to obtain the recombinant protein sequence comprises:

Performing structural prediction on the candidate recombinant protein sequence to obtain the tertiary structure of the recombinant protein;

Simulating the tertiary structure of the recombinant protein to obtain a molecular docking structure of the recombinant protein;

performing ligand-based structural optimization on the molecular docking structure to obtain an optimized recombinant protein structure; and

The optimized recombinant protein structure is subjected to dynamic simulation to determine the recombinant protein sequence.

The method according to claim 16 or 17 is characterized in that it also includes: expressing and purifying the recombinant protein sequence.

The method according to any one of claims 14 to 18 is characterized in that the A family DNA polymerase is selected from the group consisting of Escherichia coli DNA polymerase I, T3 DNA polymerase, T5 DNA polymerase, T7 DNA polymerase, Taq DNA polymerase, Bsu DNA polymerase and Bst DNA polymerase.

The method according to any one of claims 14 to 19 is characterized in that the A family DNA polymerase is selected from at least one of a Bst DNA polymerase having an amino acid sequence shown in SEQ ID NO: 1, a Taq DNA polymerase having an amino acid sequence shown in SEQ ID NO: 9, an Escherichia coli DNA polymerase having an amino acid sequence shown in SEQ ID NO: 10, and a Bsu DNA polymerase having an amino acid sequence shown in SEQ ID NO: 11.

The method according to any one of claims 14 to 20, characterized in that the homologous sequence in the A family DNA polymerase is derived from its thumb domain.

The method according to any one of claims 14 to 21, characterized in that the homologous sequence in the A family DNA polymerase is selected from at least one of the following:

The method according to any one of claims 14 to 22, characterized in that the homologous sequence in the A family DNA polymerase is replaced by a sequence comprising the amino acid sequence shown in SEQ ID NO:3, a sequence essentially consisting of the amino acid sequence shown in SEQ ID NO:3, or a sequence consisting of the amino acid sequence shown in SEQ ID NO:3.

The method according to any one of claims 14 to 23, characterized in that the amino acid sequence of the recombinant protein is selected from at least one of SEQ ID NO: 4, SEQ ID NO: 15, SEQ ID NO: 16 and SEQ ID NO: 17.

A method for amplifying a target DNA, characterized in that the target DNA is amplified using the recombinant protein described in any one of claims 1 to 9.

The method according to claim 25 is characterized in that the amplification is isothermal amplification, and the isothermal amplification is selected from rolling circle amplification RCA, multiple displacement amplification MDA, recombinase polymerase amplification reaction RPA, strand displacement amplification SDA or loop-mediated isothermal amplification LAMP.

The method according to claim 25 or 26 is characterized in that the amplification is used for constructing a DNA library or sequencing.