WO2024056038A1 - Modified capif1 helicase and use thereof - Google Patents
Modified capif1 helicase and use thereof Download PDFInfo
- Publication number
- WO2024056038A1 WO2024056038A1 PCT/CN2023/118905 CN2023118905W WO2024056038A1 WO 2024056038 A1 WO2024056038 A1 WO 2024056038A1 CN 2023118905 W CN2023118905 W CN 2023118905W WO 2024056038 A1 WO2024056038 A1 WO 2024056038A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- helicase
- capif1
- target polynucleotide
- construct
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12R—INDEXING SCHEME ASSOCIATED WITH SUBCLASSES C12C - C12Q, RELATING TO MICROORGANISMS
- C12R2001/00—Microorganisms ; Processes using microorganisms
- C12R2001/01—Bacteria or Actinomycetales ; using bacteria or Actinomycetales
- C12R2001/185—Escherichia
- C12R2001/19—Escherichia coli
Definitions
- the present invention relates to a modified CaPif1 helicase, in particular to a helicase that can control the movement of target polynucleotides through biological nanopores, and is particularly helpful in sequencing polynucleotides, and belongs to genetic engineering. and the field of genetic engineering.
- Nanopore sequencing technology developed in recent years is a new type of single-molecule sequencing technology that uses electric field force to drive single-stranded polynucleotides through nanoscale biopore proteins embedded in insulating phospholipid membranes. Due to different bases size, each nucleotide molecule will generate a characteristic obstruction current when passing through the biological nanopore, and these characteristic current signals recorded correspond to the sequence of the target nucleic acid.
- strand sequencing In the "strand sequencing” method, a single polynucleotide strand is passed through the biological nanopore to enable identification of the nucleotide sequence.
- advantages of this sequencing technology are low cost, no need for PCR amplification, fast and real-time convenience, long sequencing reads (more than 150kb), and the ability to directly sequence RNA and epigenetic modifications. It is considered a revolutionary technology in the field of sequencing and has immeasurable application value.
- nanopore sequencing technology also has serious limitations. Under the influence of an electric field, single-stranded polynucleotides pass through the nanopore so quickly that it is difficult to distinguish the current-blocking signal of a single nucleotide from the system noise. Therefore, to achieve identification of nucleotides, strand sequencing uses polynucleotide-bound molecular motors to control the movement of polynucleotides through the biological nanopore.
- the combination of molecular motors and polynucleotides is not static. When controlling the movement of polynucleotides, especially when facing very long nucleotide sequences, the molecular motors may fall off from the polynucleotides.
- the present invention overcomes the problem of too fast translocation speed of polynucleotides through nanopores in the prior art, and provides a DNA-dependent ATPase (CaPif1) helicase, which can Controlling polynucleotide punch movement is very useful.
- CaPif1 DNA-dependent ATPase
- the core of CaPif1 helicase contains five domains: 1A (RecA-like motor) domain, 2A (RecA-like motor) domain (RecA-like motor) domain, 1B (wedge domain) domain, 2B (SH3-like) domain and 2C domain (an additional structure unique to yeast).
- the structural information of the corresponding CaPif1 (PDB:7OTJ) protein can be obtained from the Protein Data Bank (PDB).
- the 1A and 2A domains are mainly involved in the binding and hydrolysis of ATP.
- the 1B domain consists of a short helix and an extended loop structure, which is called the "wedge region".
- the 2B domain adopts a SH-3-like folding manner, and the orbital-like ⁇ hairpin structure on it is close to the 1B domain.
- the 2C domain has not yet been resolved in the crystal structure.
- the present invention focuses on the two domains 1B and 2B, and tends to reduce or close the opening through covalent connection.
- connection There are two ways of connection. One is through its own naturally occurring amino acids and the insertion of new amino acids, such as cysteine and unnatural amino acids, to achieve connection. The other is through linker molecules, which tend to connect cysteines. Linker molecules include: BMOE, Bis(PEG)2, Bis(PEG)3, etc.
- the present invention introduces cysteine into the two structural domains 1B and 2B respectively.
- the above-mentioned connecting molecules usually contain two functional terminals, which will be covalently connected to the cysteine to realize the connection of the two parts 1B and 2B, so that The bound polynucleotide will not dissociate from the helicase.
- one aspect of the present invention provides a modified CaPif1 helicase, which modifies five natural cysteines in the amino acid sequence of wild-type CaPif1 (368-883). Cystine is C426, C507, C584, C592 and C662 respectively.
- the five natural cysteines are replaced with alanine (A) or serine (S).
- Another aspect of the present invention provides a modified CaPif1 helicase, which introduces new cysteine for connection, and the introduction sites include Q443, N448, K452, R455, I633, L634, P635, Q638, Q641, V642, D795, E796, D797 and T799.
- preferred mutation combinations include R455C, E796C, C426A, C507A, C584A, C592A and C662A.
- amino acid sequence of the helicase is shown in SEQ ID NO.1.
- preferred mutation combinations include Q443C, L634C, C426A, C507A, C584A, C592A and C662A.
- amino acid sequence of the helicase is shown in SEQ ID NO. 3.
- the helicase binds to internal nucleotides of a single-stranded polynucleotide or a double-stranded polynucleotide.
- Another aspect of the invention provides a nucleotide sequence encoding a helicase of the invention.
- nucleotide sequence is shown in SEQ ID NO.2 or SEQ ID NO.4.
- Another aspect of the present invention provides a complex formed between the helicase of the present invention and bismaleimidoethane or bismaleimide PEG3.
- Another aspect of the invention provides a construct comprising a helicase according to the invention and a binding moiety for binding to a polynucleotide.
- the binding moiety is selected from eukaryotic single-chain binding proteins, bacterial single-chain binding proteins, archaeal single-chain binding proteins, viral single-chain binding proteins, or double-chain binding proteins.
- Another aspect of the present invention provides the helicase of the present invention or the nucleotide sequence encoding the helicase or the complex comprising the helicase or the construct comprising the helicase in characterizing the target polynucleoside. Application of acids or controlled passage of target polynucleotides through nanopores.
- kits for characterizing a target polynucleotide or controlling the passage of a target polynucleotide through a nanopore comprising the helicase of the present invention or a nucleoside encoding the helicase. acid sequence or a complex comprising the helicase or a construct comprising the helicase.
- Another aspect of the invention provides a device for characterizing a target polynucleotide or controlling the passage of a target polynucleotide through a nanopore, the device comprising the helicase of the invention or a nucleotide sequence encoding the helicase. or a complex containing the helicase or Constructs comprising said helicase.
- Another aspect of the invention provides a method for characterizing a target polynucleotide or controlling the passage of a target polynucleotide through a nanopore, comprising the following steps:
- the construct includes a helicase and a helicase for binding the polynucleotide the combined part.
- said one or more characteristics are selected from the group consisting of origin, length, identity, sequence, secondary structure of the target polynucleotide or whether the target polynucleotide is modified.
- said one or more characteristics are carried out by electrical measurements and/or optical measurements.
- the target polynucleotide is single-stranded, double-stranded, or at least part of it is double-stranded.
- the nanopores are transmembrane pores, and the transmembrane pores are biological pores, solid pores or hybrid pores between biology and solid.
- Another aspect of the invention provides a vector comprising the nucleotide sequence of the helicase of the invention.
- Yet another aspect of the present invention provides a host cell comprising a nucleotide sequence of the helicase of the present invention or a vector comprising the nucleotide sequence.
- the present invention has demonstrated that the modified CaPif1 helicase can control the movement of polynucleotides through biological nanopores, especially under the action of electric field force.
- the helicase enables target polynucleotides to move through the nanopore in a controlled and stepwise manner.
- the specific CaPif1 helicase mutants provided by the present invention have improved ability to control the translocation of polynucleotides through nanopores. These mutants usually have one or more modifications on the 1B or 2B domain. . Therefore, the modified CaPif1 helicase provided by the present invention has at least one cysteine or unnatural amino acid inserted, and still maintains its ability to control the movement of polynucleotides.
- the present invention also provides a modified CaPif1 helicase that covalently connects 1B and 2B domains through a linker molecule, which improves the stability of the CaPif1 helicase binding to polynucleotides, especially when polynucleotides are used.
- the helicase of the present invention can still stably control the movement of the polynucleotide without falling off from the polynucleotide.
- Figure 1 is a diagram showing the results of SDS-PAGE gel electrophoresis purification of the CaPif1 helicase of the present invention
- Figure 2 is a schematic diagram of fluorescence analysis for detecting helicase enzyme activity.
- Figure 3 is a graph showing the changes in fluorescence value generated by CaPif1 helicase unwinding the fluorescent substrate over time.
- Figure 4 is an SDS-PAGE gel electrophoresis picture of the cross-linking results of the CaPif1 mutant.
- Figure 5 is a gel shift image of CaPif1 binding to DNA before and after modification.
- Figure 6 is a schematic diagram of DNA construct X used in the present invention.
- H SEQ ID NO: 14 with a cholesterol tag at the 3’ end.
- Figure 7 is a schematic diagram of DNA construct Y used in the present invention.
- Figure 8 is a schematic diagram of the current trajectories of mutants CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane when controlling the movement of the complete DNA construct X through the nanopore.
- Figure 9 is an enlarged view of an area showing the movement of the DNA construct X via the helicase shown in Figure 8.
- Figure 10 is a schematic diagram of the current trajectory when CaPif1 helicase controls the displacement of DNA construct X through the nanopore.
- Figure 11 is an enlarged view of an area showing the movement of the via hole of DNA construct X controlled by the helicase shown in Figure 10.
- the recombinant plasmid was transformed into the BL21(DE3) E. coli expression host by heat shock method.
- the host bacteria containing the expression plasmid were first cultured overnight at 37°C in LB medium with kana resistance, and then amplified and cultured at 37°C at a ratio of 1:100, until the OD (600 ) value reaches 0.4-0.6, stop culturing and place it at 4°C for 1 hour of cooling treatment, then add isopropyl ⁇ -D-Thiogalactoside (IPTG) at a final concentration of 0.5mM to induce expression at 16°C12 -16h.
- IPTG isopropyl ⁇ -D-Thiogalactoside
- Figure 1 shows the SDS-PAGE gel electrophoresis pattern of purified CaPif1 helicase.
- Example 2 Fluorescence experiment to analyze the unwinding activity of CaPif1 helicase
- Figure 2 is a schematic diagram of fluorescence analysis for detecting helicase enzyme activity.
- the fluorescent substrate chain (final concentration 100 nM, d, SEQ ID NO: 6) has a single-stranded DNA portion of 20 bases at the 5' end and a double-stranded DNA portion of 18 bases hybridized, and its 3' end has a fluorescent group (Cy3, f).
- BHQ-1, e fluorescent quenching group
- CaPif1 helicase will bind to the 5'-end single-stranded DNA portion of the fluorescent substrate, and after adding ATP (2mM) and MgCl 2 (2mM), Shift in the 5'-3' direction and unwind the double-stranded portion.
- the excess capture chain (b, SEQ ID NO: 8) is preferentially complementary to the short chain (c) to prevent re-annealing between the initial substrates, and the released substrates
- the substance backbone (d) emits fluorescence.
- Figure 3 shows the change in fluorescence value generated by the unwinding of the fluorescent substrate by CaPif1 helicase over time in a 300mM NaCl buffer (25mM Tris-HCl pH 7.5, 2mM ATP, 2mM MgCl 2 , 300mM NaCl). The results show that the substrate ranges from essentially non-fluorescent to fluorescent.
- Example 3 Preparation of CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutation combinations and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A mutation combinations
- CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A are SEQ ID NO: 1 and bismaleimidoethane with mutation combinations R455C, E796C, C426A, C507A, C584A, C592A and 662A. Alkane connection;
- CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A is SEQ ID NO: 3 with the mutation combination Q443C, L634C, C426A, C507A, C584A, C592A and C662A linked to bismaleimide PEG3.
- the recombinant expression plasmid of CaPif1 helicase was finally obtained. Then, site-directed mutagenesis was performed by overlapping PCR to obtain the nucleic acid sequences encoding mutation combinations R455C, E796C, C426A, C507A, C584A, C592A, and 662A (SEQ ID NO. 2). and the nucleic acid sequence encoding mutation combinations Q443C, L634C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO. 4).
- the mutated recombinant plasmid was transformed into the BL21 (DE3) E. coli expression host by heat shock method.
- the host bacteria containing the expression plasmid were first cultured overnight at 37°C in LB medium with kana resistance, and then amplified and cultured at 37°C at a ratio of 1:100, until the OD (600 ) value reaches 0.4-0.6, stop culturing and place it at 4°C for 1 hour of cooling treatment, then add isopropyl ⁇ -D-Thiogalactoside (IPTG) at a final concentration of 0.5mM to induce expression at 16°C12 -16h.
- IPTG isopropyl ⁇ -D-Thiogalactoside
- the buffer was replaced with PBS buffer (pH7.5) through a 0.5ml Zeba desalting column (7k MWCO) to obtain a 100 ⁇ l sample.
- PBS buffer pH7.5
- Zeba desalting column 7k MWCO
- Cross-linking results were analyzed on a 4-10% polyacrylamide gel.
- Example 4 Using gel shift assay to measure the ability of modified CaPif1 helicase to bind DNA
- the DNA substrate required for gel shift experiments was prepared by annealing (SEQ ID NO: 9 was hybridized with SEQ ID NO: 10 containing a 5' end Cy3 label), and then it was mixed with SEQ ID NO: 10 in a molar ratio of 1:1.
- Wild-type CaPif1 (PDB ID: 7OTJ, related website: RCSB PDB-7OTJ: Crystal structure of Pif1 helicase from Candida albicans), CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (with mutation combinations R455C, E796C, SEQ ID NO: 1) for C426A, C507A, C584A, C592A and C662A and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A (with mutation combination Q443C, L634C, C426A, C507A, C584A, C592 A and C662A SEQ ID NO: 3) Incubate in buffer (25mM Tris-HCl pH 7.5, 300mM NaCl) for one hour at room temperature to obtain a final concentration of CaP
- Example 5 CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-Bismaleimidoethane and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A-Bismaleimidoethane Imine PEG3 has the ability to control the passage of intact DNA constructs through nanopores ability to move
- CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane are SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A.
- CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A, and C662A-Bismaleimide PEG3 is a protein with mutations Q443C, L634C, C426A, C507A, C584A, C592A, and SEQ ID NO: 3 of C662A is linked to bismaleimide PEG3.
- DNA construct X as shown in Figure 6 was prepared.
- design primers which contain sequences A, C, D, E, and F, and then use them to amplify a 1000-base sequence (G) on lambda DNA.
- the obtained PCR product is purified and matched with sequence H at a ratio of 1:1.1 Annealing hybridization is performed at the molar ratio to obtain the final DNA construct X.
- the prepared DNA construct C507A, C584A, C592A, and C662A-bismaleimide PEG3 (final concentration 10 nM) were preincubated in buffer (10 mM Hepes, pH 8.0, 100 Mm KCl, 10% glycerol) for 30 min at room temperature.
- buffer 600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ] ⁇ 3H 2 O, 100mM Hepes, pH 8.0
- phospholipids embedded in the DPhPC phospholipid bilayer were Electrical signal measurements were obtained from Csgg nanopores.
- 2 ml buffer 600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ] ⁇ 3H 2 O, 100mM Hepes, pH 8.0 ) is flowed through the system to remove residual excess nanopores.
- the pre-incubated sample, ATP (final concentration 2mM) and MgCl 2 (final concentration 10mM) were then flowed together into a single nanopore experimental system (total volume 100 ⁇ L), and the signal was measured at a constant voltage of +180mV for 6h (including potential 2s -180mV voltage inversion).
- Figure 9 shows an enlarged view of part of the region where CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutants control DNA movement.
- Example 6 CaPif1 helicase controls movement of entire DNA construct X through a single Csgg nanopore
- This example takes CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) as an example to verify CaPif1 unwinding How the enzyme controls the movement of the entire DNA construct X through a single Csgg nanopore.
- DNA construct X as shown in Figure 6 was prepared.
- design primers which contain sequences A, C, D, E, and F, and then use them to amplify a 1000-base sequence (G) on lambda DNA.
- the obtained PCR product is purified and matched with sequence H at a ratio of 1:1.1 Annealing hybridization is performed at the molar ratio to obtain the final DNA construct X.
- buffer 600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ] ⁇ 3H 2 O, 100mM Hepes, pH 8.0
- phospholipids embedded in the DPhPC phospholipid bilayer were Electrical signal measurements were obtained from Csgg nanopores.
- 2 ml buffer 600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ] ⁇ 3H 2 O, 100mM Hepes, pH 8.0 ) is flowed through the system to remove residual excess nanopores.
- the pre-incubated sample, ATP (final concentration 2mM) and MgCl 2 (final concentration 10mM) were then flowed together into a single nanopore experimental system (total volume 100 ⁇ L), and the signal was measured at a constant voltage of +180mV for 6h (including potential 2s -180mV voltage inversion).
- CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) was observed to unwind
- the enzyme controls the movement of DNA construct X through the nanopore.
- the duration of helicase-controlled DNA movement of 14 seconds corresponds to the movement of nearly 1000 bp of DNA construct through the Csgg nanopore.
- Figure 11 shows CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) helicase control A magnified view of part of the DNA movement.
- Example 7 Ability of wild-type CaPifl and CaPifl-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane to control movement of intact DNA construct Y through the nanopore
- CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane are SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A. Connected with bismaleimidoethane.
- DNA construct Y as shown in Figure 7 was prepared.
- the upstream primer contains A, C, D, E and J sequences.
- the J sequence is as shown in SEQ ID NO:16
- the downstream primer is as shown in SEQ ID NO:17.
- They were then used to amplify the 4115 base length sequence (K) on ⁇ DNA, so that a poly T sequence (I) was added to the 3' end of the K sequence (shown as SEQ ID NO: 15), and the obtained After purification, the PCR product is annealed and hybridized with sequence H at a molar ratio of 1:1.1 to obtain the final DNA construct Y.
- the prepared DNA construct Y (final concentration 0.1nM) was combined with wild-type CaPif1 (PDB ID: 7OTJ, related website: RCSB PDB-7OTJ: Crystal structure of Pif1 helicase from Candida albicans) and CaPif1-R455C, E796C, C426A respectively.
- C507A, C584A, C592A, and C662A-bismaleimidoethane final concentration 10 nM were preincubated in buffer (10mM Hepes, pH 8.0, 100mM KCl, 10% glycerol) at room temperature for 30 minutes.
- buffer 600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ] ⁇ 3H 2 O, 100mM Hepes, pH 8.0
- phospholipids embedded in the DPhPC phospholipid bilayer were Electrical signal measurements were obtained from Csgg nanopores.
- 2 ml buffer 600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ] ⁇ 3H 2 O, 100mM Hepes, pH 8.0 ) is flowed through the system to remove residual excess nanopores.
- the pre-incubated sample, ATP (final concentration 2mM) and MgCl 2 (final concentration 10mM) were then flowed together into a single nanopore experimental system (total volume 100 ⁇ L), and the signal was measured at a constant voltage of +180mV for 6h (including potential 2s -180mV voltage inversion).
- the results show that adding the complex formed by CaPif1 helicase and DNA construct to the Csgg nanopore system can generate a typical nucleic acid through-hole current signal.
- the wild-type CaPif1 helicase monomer and CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane can control the pore movement of DNA construct Y.
- SEQ ID NO: 1 which is the amino acid sequence of CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutants. The specific sequence is:
- SEQ ID NO: 2 which is the coding sequence of SEQ ID NO: 1.
- the specific sequence is:
- SEQ ID NO: 3 which is the amino acid sequence of CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A mutants.
- the specific sequence is:
- SEQ ID NO: 4 which is the coding sequence of SEQ ID NO: 3.
- the specific sequence is:
- SEQ ID NO: 5 which is the optimized nucleic acid sequence of CaPif1 protein. The specific sequence is:
- SEQ ID NO: 6 which is the sequence of the fluorescent substrate chain. The specific sequence is:
- SEQ ID NO: 7 which is a short chain sequence that is complementary to the fluorescent substrate chain sequence.
- the specific sequence is:
- SEQ ID NO: 8 which is the capture strand sequence.
- the specific sequence is:
- SEQ ID NO: 9 which is the DNA substrate sequence.
- the specific sequence is:
- SEQ ID NO: 10 which is a sequence that hybridizes with SEQ ID NO: 9. The specific sequence is:
- SEQ ID NO: 11 which is the D sequence of DNA construct X and construct Y. The specific sequence is:
- SEQ ID NO: 12 which is the F sequence of DNA construct X and construct Y. The specific sequence is:
- SEQ ID NO: 13 which is the G sequence of DNA construct X. The specific sequence is:
- SEQ ID NO: 14 which is the H sequence of DNA construct X. The specific sequence is:
- SEQ ID NO: 15 which is the K sequence of DNA construct Y.
- the specific sequence is:
- SEQ ID NO: 16 which is the J sequence of DNA construct Y. The specific sequence is:
- SEQ ID NO: 17 which is the downstream primer sequence of DNA construct Y.
- the specific sequence is:
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Description
本申请要求享有于2022年9月16日向中国国家知识产权局提交的,专利申请号为202211132258.8,名称为“一种经修饰的CaPif1解旋酶及其应用”的在先申请的优先权。该在先申请的全文通过引用的方式结合于本申请中。This application claims the priority of the earlier application submitted to the State Intellectual Property Office of China on September 16, 2022, with patent application number 202211132258.8, titled "A modified CaPif1 helicase and its application". The entirety of this prior application is incorporated by reference into this application.
本发明涉及一种经修饰的CaPif1解旋酶,特别是涉及一种能够控制目标多核苷酸穿过生物纳米孔移动的解旋酶,并且特别有助于对多核苷酸进行测序,属于基因工程和遗传工程领域。The present invention relates to a modified CaPif1 helicase, in particular to a helicase that can control the movement of target polynucleotides through biological nanopores, and is particularly helpful in sequencing polynucleotides, and belongs to genetic engineering. and the field of genetic engineering.
近些年发展起来的纳米孔测序技术是一种新型的单分子测序技术,其利用电场力驱动单链多核苷酸穿过嵌在绝缘磷脂膜上的纳米级生物孔道蛋白,由于不同的碱基大小,每个核苷酸分子在通过生物纳米孔时会产生特征性的阻碍电流,记录下的这些特征电流信号对应于目标核酸的序列。Nanopore sequencing technology developed in recent years is a new type of single-molecule sequencing technology that uses electric field force to drive single-stranded polynucleotides through nanoscale biopore proteins embedded in insulating phospholipid membranes. Due to different bases size, each nucleotide molecule will generate a characteristic obstruction current when passing through the biological nanopore, and these characteristic current signals recorded correspond to the sequence of the target nucleic acid.
在“链测序”方法中,单个多核苷酸链通过所述生物纳米孔从而实现核苷酸序列的鉴定。相比于其他测序技术,这种测序技术的优势在于成本低,无需PCR扩增,快速实时便捷,测序读长长(超过150kb),以及RNA及表观遗传修饰直接测序的能力。其被认为是测序领域的革命性技术,具有不可估量的应用价值。In the "strand sequencing" method, a single polynucleotide strand is passed through the biological nanopore to enable identification of the nucleotide sequence. Compared with other sequencing technologies, the advantages of this sequencing technology are low cost, no need for PCR amplification, fast and real-time convenience, long sequencing reads (more than 150kb), and the ability to directly sequence RNA and epigenetic modifications. It is considered a revolutionary technology in the field of sequencing and has immeasurable application value.
但是,纳米孔测序技术也存在严重的限制。在电场作用下,单链多核苷酸穿过纳米孔的速度非常快以至于难以从系统噪音中分辨出单个核苷酸的电流阻碍信号。因此,为了实现对核苷酸的鉴定,链测序使用多核苷酸结合的分子马达来控制多核苷酸穿过所述生物纳米孔的移动。然而,分子马达和多核苷酸的结合并非一成不变,在其控制多核苷酸移动时,尤其是面对很长的核苷酸序列时,分子马达会出现从多核苷酸上发生脱落的问题。此外,在链测序这种单分子水平的测序技术中,分子马达存在许多出乎意料的单分子动态行为,比如某些马达蛋白由于活性差,会出现停滞的现象,从而引起孔道的阻塞,降低了测序的通量。其次,马达蛋白在多核苷酸上的移位会出现向前滑动、倒退以及步长不均一的情况,这些不规则的单分子行为导致马达蛋白无法完美地控制单链多核苷酸以棘轮的方式穿过纳米孔,从而降低碱基识别的准确度。因此,目前亟需一种能够稳定、高效地控制多核苷酸穿过生物纳米孔的技术。However, nanopore sequencing technology also has serious limitations. Under the influence of an electric field, single-stranded polynucleotides pass through the nanopore so quickly that it is difficult to distinguish the current-blocking signal of a single nucleotide from the system noise. Therefore, to achieve identification of nucleotides, strand sequencing uses polynucleotide-bound molecular motors to control the movement of polynucleotides through the biological nanopore. However, the combination of molecular motors and polynucleotides is not static. When controlling the movement of polynucleotides, especially when facing very long nucleotide sequences, the molecular motors may fall off from the polynucleotides. In addition, in single-molecule level sequencing technology such as strand sequencing, molecular motors have many unexpected single-molecule dynamic behaviors. For example, some motor proteins may stagnate due to poor activity, causing channel obstruction and reducing Sequencing throughput. Secondly, the displacement of motor proteins on polynucleotides will slide forward, backward, and have uneven step sizes. These irregular single-molecule behaviors result in the inability of motor proteins to perfectly control single-stranded polynucleotides in a ratcheting manner. pass through the nanopore, thereby reducing the accuracy of base calling. Therefore, there is an urgent need for a technology that can stably and efficiently control polynucleotides to pass through biological nanopores.
发明内容Contents of the invention
针对上述技术问题,本发明克服了现有技术中多核苷酸穿过纳米孔移位速度过快的问题,提供了一种DNA依赖的ATPase(CaPif1)解旋酶,其在链测序过程中对于控制多核苷酸穿孔运动是非常有用的。In view of the above technical problems, the present invention overcomes the problem of too fast translocation speed of polynucleotides through nanopores in the prior art, and provides a DNA-dependent ATPase (CaPif1) helicase, which can Controlling polynucleotide punch movement is very useful.
CaPif1解旋酶核心共包含五个结构域:1A(RecA式(RecA-like)马达)域、2A(RecA式 (RecA-like)马达)域、1B(wedge domain)域、2B(SH3式(SH3-like))域和2C域(酵母特有的额外结构)。相应的CaPif1(PDB:7OTJ)蛋白的结构信息可以从蛋白质数据库(PDB)中获得。The core of CaPif1 helicase contains five domains: 1A (RecA-like motor) domain, 2A (RecA-like motor) domain (RecA-like motor) domain, 1B (wedge domain) domain, 2B (SH3-like) domain and 2C domain (an additional structure unique to yeast). The structural information of the corresponding CaPif1 (PDB:7OTJ) protein can be obtained from the Protein Data Bank (PDB).
其中,1A和2A域主要参与ATP的结合和水解。1B域由一个短的螺旋和一段延伸的环结构构成,被称为“楔形区域”。2B域采取SH-3样的折叠方式,其上的轨道样β发卡结构靠近1B域。2C域在晶体结构中尚未解析出来。Among them, the 1A and 2A domains are mainly involved in the binding and hydrolysis of ATP. The 1B domain consists of a short helix and an extended loop structure, which is called the "wedge region". The 2B domain adopts a SH-3-like folding manner, and the orbital-like β hairpin structure on it is close to the 1B domain. The 2C domain has not yet been resolved in the crystal structure.
由此,本发明在通过修饰连接缩小开口尺寸时,重点放在1B和2B这两个结构域上,倾向于通过共价连接来缩小或者关闭开口。Therefore, when reducing the size of the opening through modified connection, the present invention focuses on the two domains 1B and 2B, and tends to reduce or close the opening through covalent connection.
连接方式有两种,一种是通过其自身的天然存在的氨基酸,通过替换插入新的氨基酸,比如半胱氨酸和非天然氨基酸等,来实现连接。另一种是通过连接分子(linker molecules),倾向于半胱氨酸之间的连接,连接分子包括:BMOE、Bis(PEG)2以及Bis(PEG)3等。There are two ways of connection. One is through its own naturally occurring amino acids and the insertion of new amino acids, such as cysteine and unnatural amino acids, to achieve connection. The other is through linker molecules, which tend to connect cysteines. Linker molecules include: BMOE, Bis(PEG)2, Bis(PEG)3, etc.
本发明分别在1B和2B两个结构域上引入半胱氨酸,上述的连接分子通常含有两个功能末端,会与半胱氨酸发生共价连接从而实现1B和2B两部分的连接,使得结合的多核苷酸不会从解旋酶上解离。The present invention introduces cysteine into the two structural domains 1B and 2B respectively. The above-mentioned connecting molecules usually contain two functional terminals, which will be covalently connected to the cysteine to realize the connection of the two parts 1B and 2B, so that The bound polynucleotide will not dissociate from the helicase.
为此,本发明一方面提供了一种修饰的CaPif1解旋酶,其对野生型CaPif1(368-883)氨基酸序列中的五个天然的半胱氨酸进行修饰,所述五个天然的半胱氨酸分别是C426,C507,C584,C592和C662。To this end, one aspect of the present invention provides a modified CaPif1 helicase, which modifies five natural cysteines in the amino acid sequence of wild-type CaPif1 (368-883). Cystine is C426, C507, C584, C592 and C662 respectively.
在本发明优选的实施方案中,用丙氨酸(A)或者丝氨酸(S)取代所述五个天然的半胱氨酸。In a preferred embodiment of the invention, the five natural cysteines are replaced with alanine (A) or serine (S).
本发明另一方面提供了一种修饰的CaPif1解旋酶,其引入新的用于实现连接的半胱氨酸,引入位点包括Q443、N448、K452、R455、I633、L634、P635、Q638、Q641、V642、D795、E796、D797和T799。Another aspect of the present invention provides a modified CaPif1 helicase, which introduces new cysteine for connection, and the introduction sites include Q443, N448, K452, R455, I633, L634, P635, Q638, Q641, V642, D795, E796, D797 and T799.
在本发明优选的实施方案中,优选的突变组合包括R455C、E796C、C426A、C507A、C584A、C592A和C662A。In preferred embodiments of the invention, preferred mutation combinations include R455C, E796C, C426A, C507A, C584A, C592A and C662A.
在本发明进一步优选的实施方案中,所述的解旋酶的氨基酸序列如SEQ ID NO.1所示。In a further preferred embodiment of the present invention, the amino acid sequence of the helicase is shown in SEQ ID NO.1.
在本发明优选的实施方案中,优选的突变组合包括Q443C、L634C、C426A、C507A、C584A、C592A和C662A。In preferred embodiments of the invention, preferred mutation combinations include Q443C, L634C, C426A, C507A, C584A, C592A and C662A.
在本发明进一步优选的实施方案中,所述的解旋酶的氨基酸序列如SEQ ID NO.3所示。In a further preferred embodiment of the present invention, the amino acid sequence of the helicase is shown in SEQ ID NO. 3.
在本发明优选的实施方案中,所述的解旋酶结合到单链多核苷酸或双链多核苷酸的内部核苷酸。In preferred embodiments of the invention, the helicase binds to internal nucleotides of a single-stranded polynucleotide or a double-stranded polynucleotide.
本发明另一方面提供了一种编码本发明的解旋酶的核苷酸序列。Another aspect of the invention provides a nucleotide sequence encoding a helicase of the invention.
在本发明优选的实施方案中,所述核苷酸序列如SEQ ID NO.2或SEQ ID NO.4所示。In a preferred embodiment of the present invention, the nucleotide sequence is shown in SEQ ID NO.2 or SEQ ID NO.4.
本发明另一方面提供了一种本发明所述的解旋酶与双马来酰亚胺基乙烷或双马来酰亚胺PEG3形成的复合体。Another aspect of the present invention provides a complex formed between the helicase of the present invention and bismaleimidoethane or bismaleimide PEG3.
本发明另一方面提供了一种构建体,其包含本发明所述的解旋酶和用于结合多核苷酸的结合部分。Another aspect of the invention provides a construct comprising a helicase according to the invention and a binding moiety for binding to a polynucleotide.
在本发明优选的实施方案中,所述结合部分选自真核单链结合蛋白、细菌单链结合蛋白、古生单链结合蛋白、病毒单链结合蛋白会或双链结合蛋白。In a preferred embodiment of the invention, the binding moiety is selected from eukaryotic single-chain binding proteins, bacterial single-chain binding proteins, archaeal single-chain binding proteins, viral single-chain binding proteins, or double-chain binding proteins.
本发明另一方面提供了本发明所述的解旋酶或编码该解旋酶的核苷酸序列或包含该解旋酶的复合体或包含所述解旋酶的构建体在表征目标多核苷酸或控制目标多核苷酸穿过纳米孔中的应用。Another aspect of the present invention provides the helicase of the present invention or the nucleotide sequence encoding the helicase or the complex comprising the helicase or the construct comprising the helicase in characterizing the target polynucleoside. Application of acids or controlled passage of target polynucleotides through nanopores.
本发明另一方面提供了一种表征目标多核苷酸或控制目标多核苷酸穿过纳米孔的试剂盒,所述试剂盒包含本发明所述的解旋酶或编码该解旋酶的核苷酸序列或包含该解旋酶的复合体或包含所述解旋酶的构建体。Another aspect of the present invention provides a kit for characterizing a target polynucleotide or controlling the passage of a target polynucleotide through a nanopore, the kit comprising the helicase of the present invention or a nucleoside encoding the helicase. acid sequence or a complex comprising the helicase or a construct comprising the helicase.
本发明另一方面提供了一种表征目标多核苷酸或控制目标多核苷酸穿过纳米孔的装置,所述装置包含本发明所述的解旋酶或编码该解旋酶的核苷酸序列或包含该解旋酶的复合体或 包含所述解旋酶的构建体。Another aspect of the invention provides a device for characterizing a target polynucleotide or controlling the passage of a target polynucleotide through a nanopore, the device comprising the helicase of the invention or a nucleotide sequence encoding the helicase. or a complex containing the helicase or Constructs comprising said helicase.
本发明另一方面提供了一种表征目标多核苷酸或控制目标多核苷酸穿过纳米孔的方法,包括以下步骤:Another aspect of the invention provides a method for characterizing a target polynucleotide or controlling the passage of a target polynucleotide through a nanopore, comprising the following steps:
1、将目标多核苷酸与纳米孔,和本发明所述的解旋酶或包含该解旋酶的复合体或包含该解旋酶的构建体接触,使得所述解旋酶、复合体或构建体控制所述目标多核苷酸穿过所述纳米孔;和1. Contact the target polynucleotide with the nanopore, and the helicase of the present invention or the complex containing the helicase or the construct containing the helicase, so that the helicase, complex or construct The construct controls passage of the target polynucleotide through the nanopore; and
2、获取目标多核苷酸中的核苷酸与所述纳米孔相互作用的一个或多个特征,以表征所述目标多核苷酸,所述构建体包含解旋酶和用于结合多核苷酸的结合部分。2. Obtain one or more characteristics of the interaction between the nucleotides in the target polynucleotide and the nanopore to characterize the target polynucleotide, and the construct includes a helicase and a helicase for binding the polynucleotide the combined part.
在本发明优选的实施方案中,所述一个或多个特征选自目标多核苷酸的来源、长度、同一性、序列、二级结构或目标多核苷酸是否被修饰。In a preferred embodiment of the invention, said one or more characteristics are selected from the group consisting of origin, length, identity, sequence, secondary structure of the target polynucleotide or whether the target polynucleotide is modified.
在本发明优选的实施方案中,所述的一个或多个特征通过电测量和/或光测量进行。In a preferred embodiment of the invention, said one or more characteristics are carried out by electrical measurements and/or optical measurements.
在本发明优选的实施方案中,所述的目标多核苷酸为单链、双链、或至少一部分是双链的。In a preferred embodiment of the present invention, the target polynucleotide is single-stranded, double-stranded, or at least part of it is double-stranded.
在本发明优选的实施方案中,所述的纳米孔为跨膜孔,所述的跨膜孔为生物孔、固态孔或生物与固态交杂的孔。In a preferred embodiment of the present invention, the nanopores are transmembrane pores, and the transmembrane pores are biological pores, solid pores or hybrid pores between biology and solid.
本发明另一方面提供了一种包含本发明所述解旋酶的核苷酸序列的载体。Another aspect of the invention provides a vector comprising the nucleotide sequence of the helicase of the invention.
本发明再一方面提供了一种包含本发明所述解旋酶的核苷酸序列或包含该核苷酸序列的载体的宿主细胞。Yet another aspect of the present invention provides a host cell comprising a nucleotide sequence of the helicase of the present invention or a vector comprising the nucleotide sequence.
本发明已证明经修饰的CaPif1解旋酶能够控制多核苷酸穿过生物纳米孔进行移动,特别在有电场力的作用下。所述解旋酶能够使得目标多核苷酸以可控和逐步地方式穿过纳米孔进行移动。The present invention has demonstrated that the modified CaPif1 helicase can control the movement of polynucleotides through biological nanopores, especially under the action of electric field force. The helicase enables target polynucleotides to move through the nanopore in a controlled and stepwise manner.
本发明提供的特异的CaPif1解旋酶突变体,其在控制多核苷酸穿过纳米孔移位的能力上得到了提高,这些突变体通常会在1B或者2B结构域上存在一个或者多个修饰。因此,本发明所提供的经修饰的CaPif1解旋酶,其至少有1个半胱氨酸或者非天然氨基酸的插入,而且仍然保持其控制多核苷酸移动的能力。The specific CaPif1 helicase mutants provided by the present invention have improved ability to control the translocation of polynucleotides through nanopores. These mutants usually have one or more modifications on the 1B or 2B domain. . Therefore, the modified CaPif1 helicase provided by the present invention has at least one cysteine or unnatural amino acid inserted, and still maintains its ability to control the movement of polynucleotides.
本发明还提供了通过连接分子将1B和2B结构域共价连接起来的经修饰的CaPif1解旋酶,提高了本发明所述CaPif1解旋酶与多核苷酸结合的稳定性,尤其当多核苷酸链长度增加时,本发明所述解旋酶依然可以稳定控制多核苷酸移动而不会从多核苷酸上发生脱落。The present invention also provides a modified CaPif1 helicase that covalently connects 1B and 2B domains through a linker molecule, which improves the stability of the CaPif1 helicase binding to polynucleotides, especially when polynucleotides are used. When the length of the acid chain increases, the helicase of the present invention can still stably control the movement of the polynucleotide without falling off from the polynucleotide.
图1是SDS-PAGE凝胶电泳纯化本发明的CaPif1解旋酶的结果图;Figure 1 is a diagram showing the results of SDS-PAGE gel electrophoresis purification of the CaPif1 helicase of the present invention;
M:Marker;M: Marker;
1:CaPif1解旋酶的电泳结果图。1: Electrophoresis results of CaPif1 helicase.
图2是检测解旋酶酶活性的荧光分析示意图。Figure 2 is a schematic diagram of fluorescence analysis for detecting helicase enzyme activity.
图3是CaPif1解旋酶随时间的解旋荧光底物所产生荧光值的变化结果图。Figure 3 is a graph showing the changes in fluorescence value generated by CaPif1 helicase unwinding the fluorescent substrate over time.
图4是CaPif1突变体交联结果的SDS-PAGE凝胶电泳图。Figure 4 is an SDS-PAGE gel electrophoresis picture of the cross-linking results of the CaPif1 mutant.
图5是修饰前后CaPif1结合DNA的凝胶迁移胶图。Figure 5 is a gel shift image of CaPif1 binding to DNA before and after modification.
1:未加蛋白的单纯的DNA底物;1: Pure DNA substrate without added protein;
2:野生型CaPif1结合DNA的结果;2: Results of wild-type CaPif1 binding to DNA;
3:经修饰的CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A(具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1) 结合DNA的结果。3: Modified CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) The result of combining DNA.
图6是本发明中使用的DNA构建体X的示意图。Figure 6 is a schematic diagram of DNA construct X used in the present invention.
A:50个T;A: 50 T;
B:本发明使用的解旋酶;B: Helicase used in the present invention;
C:1个iSpC18 spacer;C: 1 iSpC18 spacer;
D:SEQ ID NO:11;D: SEQ ID NO: 11;
E:2个iSpC18 spacers;E: 2 iSpC18 spacers;
F:SEQ ID NO:12;F: SEQ ID NO: 12;
G:SEQ ID NO:13;G: SEQ ID NO: 13;
H:3’端带有胆固醇标记的SEQ ID NO:14。H: SEQ ID NO: 14 with a cholesterol tag at the 3’ end.
图7是本发明中使用的DNA构建体Y的示意图。Figure 7 is a schematic diagram of DNA construct Y used in the present invention.
A:50个T;A: 50 T;
B:本发明使用的解旋酶;B: Helicase used in the present invention;
C:1个iSpC18 spacer;C: 1 iSpC18 spacer;
D:SEQ ID NO:11;D: SEQ ID NO: 11;
E:2个iSpC18 spacers;E: 2 iSpC18 spacers;
F:SEQ ID NO:12;F: SEQ ID NO: 12;
K:SEQ ID NO:15;K: SEQ ID NO: 15;
I:30个T;I: 30 T;
J:SEQ ID NO:16。J: SEQ ID NO: 16.
图8是突变体CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷控制完整DNA构建体X穿过纳米孔移动时的电流轨迹示意图。Figure 8 is a schematic diagram of the current trajectories of mutants CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane when controlling the movement of the complete DNA construct X through the nanopore.
X轴:时间(s);X-axis: time (s);
Y轴:电流(nA)。Y-axis: current (nA).
图9是显示图8所示解旋酶控制DNA构建体X过孔移动的区域放大图。Figure 9 is an enlarged view of an area showing the movement of the DNA construct X via the helicase shown in Figure 8.
X轴:时间(s);X-axis: time (s);
Y轴:电流(nA)。Y-axis: current (nA).
图10是CaPif1解旋酶控制DNA构建体X穿过纳米孔移位时的电流轨迹示意图。Figure 10 is a schematic diagram of the current trajectory when CaPif1 helicase controls the displacement of DNA construct X through the nanopore.
图11是显示图10所示解旋酶控制DNA构建体X过孔移动的区域放大图。Figure 11 is an enlarged view of an area showing the movement of the via hole of DNA construct X controlled by the helicase shown in Figure 10.
X轴:时间(s);X-axis: time (s);
Y轴:电流(nA)。Y-axis: current (nA).
为了更好的说明本方法的目的和优点,结合附图及具体实施例对本发明具体实施内容做进一步详细说明。In order to better explain the purpose and advantages of this method, the specific implementation content of the present invention will be further described in detail with reference to the accompanying drawings and specific embodiments.
实施例1:CaPif1解旋酶蛋白的制备Example 1: Preparation of CaPif1 helicase protein
根据野生型CaPif1蛋白的氨基酸序列(PDB ID:7OTJ,相关网址:RCSB PDB-7OTJ:Crystal structure of Pif1 helicase from Candida albicans),通过体外基因合成的方法获得其核酸序列,再以大肠杆菌为宿主进行相关密码子的优化,更换为大肠杆菌常用的密码子,获得优化的CaPif1蛋白的核酸序列,其序列如SEQ ID NO.5所示。然后通过NdeⅠ和XhoⅠ两个限制性酶切位点将其连接插入到pET28表达载体中,经测序验证序列正确后,最终获得CaPif1解旋酶的重组表达质粒。 According to the amino acid sequence of wild-type CaPif1 protein (PDB ID: 7OTJ, related website: RCSB PDB-7OTJ: Crystal structure of Pif1 helicase from Candida albicans), its nucleic acid sequence was obtained through in vitro gene synthesis, and then E. coli was used as the host. The relevant codons were optimized and replaced with codons commonly used in E. coli to obtain the optimized nucleic acid sequence of CaPif1 protein, whose sequence is shown in SEQ ID NO.5. Then it was ligated and inserted into the pET28 expression vector through two restriction enzyme sites of NdeⅠ and XhoⅠ. After sequencing to verify the correct sequence, the recombinant expression plasmid of CaPif1 helicase was finally obtained.
将重组质粒通过热激法转化到BL21(DE3)的大肠杆菌表达宿主中。在诱导表达过程中,首先用加有卡纳抗性的LB培养基在37℃过夜培养含有该表达质粒的宿主菌,然后按照1:100的比例在37℃进行放大培养,待其OD(600)值达到0.4-0.6时停止培养并放在4℃进行1小时的降温处理,随后加入终浓度0.5mM的异丙基硫代半乳糖(Isopropylβ-D-Thiogalactoside,IPTG)在16℃诱导表达12-16h。然后,4℃,15000rpm离心收集细菌,通过高压破碎仪在4℃对菌体进行高压破碎,随后4℃离心收集上清液,然后通过镍柱、肝素柱、Q柱以及分子筛等一步一步实现目的蛋白的分离纯化,最终获得大量的高纯度的野生型解旋酶蛋白。The recombinant plasmid was transformed into the BL21(DE3) E. coli expression host by heat shock method. In the process of inducing expression, the host bacteria containing the expression plasmid were first cultured overnight at 37°C in LB medium with kana resistance, and then amplified and cultured at 37°C at a ratio of 1:100, until the OD (600 ) value reaches 0.4-0.6, stop culturing and place it at 4°C for 1 hour of cooling treatment, then add isopropylβ-D-Thiogalactoside (IPTG) at a final concentration of 0.5mM to induce expression at 16°C12 -16h. Then, collect the bacteria by centrifugation at 4°C and 15000rpm, crush the bacteria at 4°C with a high-pressure crusher, and then centrifuge at 4°C to collect the supernatant, and then pass through the nickel column, heparin column, Q column and molecular sieve to achieve the goal step by step. The protein is separated and purified, and a large amount of highly pure wild-type helicase protein is finally obtained.
附图1显示了经纯化后的CaPif1解旋酶的SDS-PAGE凝胶电泳图。Figure 1 shows the SDS-PAGE gel electrophoresis pattern of purified CaPif1 helicase.
实施例2:荧光实验分析CaPif1解旋酶的解旋活性Example 2: Fluorescence experiment to analyze the unwinding activity of CaPif1 helicase
附图2为检测解旋酶酶活性的荧光分析示意图。Figure 2 is a schematic diagram of fluorescence analysis for detecting helicase enzyme activity.
如附图2中(1)所示,荧光底物链(终浓度100nM,d,SEQ ID NO:6)具有5’端20个碱基的单链DNA部分和18个碱基杂交的双链DNA部分,而且其3’端具有荧光基团(Cy3,f)。与其互补配对的上方短链(c,SEQ ID NO:7)在5’端具有荧光淬灭基团(BHQ-1,e)。当两条链发生互补配对时,Cy3(f)的荧光会被BHQ-1(e)淬灭掉,此时底物基本上是无荧光的。As shown in (1) of Figure 2, the fluorescent substrate chain (final concentration 100 nM, d, SEQ ID NO: 6) has a single-stranded DNA portion of 20 bases at the 5' end and a double-stranded DNA portion of 18 bases hybridized, and its 3' end has a fluorescent group (Cy3, f). The upper short chain (c, SEQ ID NO: 7) that is complementary to it has a fluorescent quenching group (BHQ-1, e) at the 5' end. When the two chains are complementary, the fluorescence of Cy3 (f) will be quenched by BHQ-1 (e), and the substrate is basically non-fluorescent.
在实验过程中,如附图2中(2)所示,CaPif1解旋酶会结合到荧光底物的5’端单链DNA部分,在加入ATP(2mM)和MgCl2(2mM)后沿着5’-3’方向进行移位,并解旋所述的双链部分。During the experiment, as shown in (2) in Figure 2, CaPif1 helicase will bind to the 5'-end single-stranded DNA portion of the fluorescent substrate, and after adding ATP (2mM) and MgCl 2 (2mM), Shift in the 5'-3' direction and unwind the double-stranded portion.
随后,如附图2中(3)所示,过量的捕获链(b,SEQ ID NO:8)优先与短链(c)发生互补配对以防止初始底物之间重新退火,释放出来的底物主链(d)发出荧光。Subsequently, as shown in (3) in Figure 2, the excess capture chain (b, SEQ ID NO: 8) is preferentially complementary to the short chain (c) to prevent re-annealing between the initial substrates, and the released substrates The substance backbone (d) emits fluorescence.
附图3显示了在300mM NaCl的缓冲液(25mM Tris-HCl pH 7.5,2mM ATP,2mM MgCl2,300mM NaCl)中CaPif1解旋酶随时间的解旋荧光底物所产生荧光值的变化结果。结果显示,底物由基本上无荧光到发出荧光。Figure 3 shows the change in fluorescence value generated by the unwinding of the fluorescent substrate by CaPif1 helicase over time in a 300mM NaCl buffer (25mM Tris-HCl pH 7.5, 2mM ATP, 2mM MgCl 2 , 300mM NaCl). The results show that the substrate ranges from essentially non-fluorescent to fluorescent.
实施例3:CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A突变组合以及CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A突变组合的制备Example 3: Preparation of CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutation combinations and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A mutation combinations
其中,CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A为具有突变组合R455C、E796C、C426A、C507A、C584A、C592A、和662A的SEQ ID NO:1与双马来酰亚胺基乙烷连接;Among them, CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A are SEQ ID NO: 1 and bismaleimidoethane with mutation combinations R455C, E796C, C426A, C507A, C584A, C592A and 662A. Alkane connection;
CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A为具有突变组合Q443C、L634C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:3与双马来酰亚胺PEG3连接。CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A is SEQ ID NO: 3 with the mutation combination Q443C, L634C, C426A, C507A, C584A, C592A and C662A linked to bismaleimide PEG3.
在如此情况下,通过与双马来酰亚胺基乙烷或者双马来酰亚胺PEG3连接分子反应,在CaPif1初始序列中的455和796或者443和634位半胱氨酸之间形成共价连接。In this case, by reacting with bismaleimidoethane or bismaleimide PEG3 linker molecules, codons were formed between cysteines 455 and 796 or 443 and 634 in the initial sequence of CaPif1. Valence connection.
根据野生型CaPif1蛋白的氨基酸序列(PDB ID:7OTJ,相关网址:RCSB PDB-7OTJ:Crystal structure of Pif1 helicase from Candida albicans),通过体外基因合成的方法获得其核酸序列,再以大肠杆菌为宿主进行相关密码子的优化,更换为大肠杆菌常用的密码子,获得优化的CaPif1蛋白的核酸序列,其序列如SEQ ID NO.5所示。然后通过NdeⅠ和XhoⅠ两个限制性酶切位点将其连接插入到pET28表达载体中,经测序验证序列正确后,最终获得CaPif1解旋酶的重组表达质粒。然后,通过重叠PCR的方法进行定点突变来分别获得编码突变组合R455C、E796C、C426A、C507A、C584A、C592A、和662A的核酸序列(SEQ ID NO.2), 以及编码突变组合Q443C、L634C、C426A、C507A、C584A、C592A和C662A的核酸序列(SEQ ID NO.4)。According to the amino acid sequence of wild-type CaPif1 protein (PDB ID: 7OTJ, related website: RCSB PDB-7OTJ: Crystal structure of Pif1 helicase from Candida albicans), its nucleic acid sequence was obtained through in vitro gene synthesis, and then E. coli was used as the host. The relevant codons were optimized and replaced with codons commonly used in E. coli to obtain the optimized nucleic acid sequence of the CaPif1 protein, whose sequence is shown in SEQ ID NO.5. Then it was ligated and inserted into the pET28 expression vector through two restriction enzyme sites of NdeⅠ and XhoⅠ. After sequencing to verify the correct sequence, the recombinant expression plasmid of CaPif1 helicase was finally obtained. Then, site-directed mutagenesis was performed by overlapping PCR to obtain the nucleic acid sequences encoding mutation combinations R455C, E796C, C426A, C507A, C584A, C592A, and 662A (SEQ ID NO. 2). and the nucleic acid sequence encoding mutation combinations Q443C, L634C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO. 4).
将突变后的重组质粒通过热激法转化到BL21(DE3)的大肠杆菌表达宿主中。在诱导表达过程中,首先用加有卡纳抗性的LB培养基在37℃过夜培养含有该表达质粒的宿主菌,然后按照1:100的比例在37℃进行放大培养,待其OD(600)值达到0.4-0.6时停止培养并放在4℃进行1小时的降温处理,随后加入终浓度0.5mM的异丙基硫代半乳糖(Isopropylβ-D-Thiogalactoside,IPTG)在16℃诱导表达12-16h。然后,4℃,15000rpm离心收集细菌,通过高压破碎仪在4℃对菌体进行高压破碎,随后4℃离心收集上清液,然后通过镍柱、肝素柱、Q柱以及分子筛等一步一步实现目的蛋白的分离纯化,最终获得大量的高纯度的解旋酶突变体蛋白,序列分别如SEQ ID NO.1和SEQ ID NO.3。The mutated recombinant plasmid was transformed into the BL21 (DE3) E. coli expression host by heat shock method. In the process of inducing expression, the host bacteria containing the expression plasmid were first cultured overnight at 37°C in LB medium with kana resistance, and then amplified and cultured at 37°C at a ratio of 1:100, until the OD (600 ) value reaches 0.4-0.6, stop culturing and place it at 4°C for 1 hour of cooling treatment, then add isopropylβ-D-Thiogalactoside (IPTG) at a final concentration of 0.5mM to induce expression at 16°C12 -16h. Then, collect the bacteria by centrifugation at 4°C and 15000rpm, crush the bacteria at 4°C with a high-pressure crusher, and then centrifuge at 4°C to collect the supernatant, and then pass through the nickel column, heparin column, Q column and molecular sieve to achieve the goal step by step. After protein separation and purification, a large number of highly pure helicase mutant proteins were finally obtained, with sequences such as SEQ ID NO.1 and SEQ ID NO.3.
将1μL 1M DTT分别加入到100μl CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A(具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1,储存在25mM Tris-HCl pH 7.5,500mM NaCl,10%甘油,2mM EDTA)和CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A(具有突变组合Q443C、L634C、C426A、C507A、C584A、C592A、C662A的SEQ ID NO:3,储存在25mM Tris-HCl pH 7.5,500mM NaCl,10%甘油,2mM EDTA)中,在室温中孵育30分钟。Add 1μL of 1M DTT to 100μl of CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A, stored at 25 mM Tris-HCl pH 7.5, 500mM NaCl, 10% glycerol, 2mM EDTA) and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A, and C662A (with mutation combinations Q443C, L634C, C426A, C507A, C584A, C592A, C66 2A's SEQ ID NO: 3, stored in 25mM Tris-HCl pH 7.5, 500mM NaCl, 10% glycerol, 2mM EDTA) and incubated at room temperature for 30 minutes.
通过0.5ml Zeba脱盐柱(7k MWCO)将缓冲液置换成PBS缓冲液(pH7.5),得到100μl样品。分别加入0.5μl,10mM的双马来酰亚胺基乙烷和双马来酰亚胺PEG3,室温下20rpm旋转孵育1个小时,之后,加入1μl,1M DTT来终止反应。用4-10%聚丙烯酰胺凝胶对交联结果进行分析。The buffer was replaced with PBS buffer (pH7.5) through a 0.5ml Zeba desalting column (7k MWCO) to obtain a 100μl sample. Add 0.5 μl, 10 mM bismaleimidoethane and bismaleimide PEG3 respectively, and incubate with rotation at 20 rpm for 1 hour at room temperature. After that, add 1 μl, 1M DTT to stop the reaction. Cross-linking results were analyzed on a 4-10% polyacrylamide gel.
附图4的结果显示,CaPif1突变体CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A与连接分子的反应几乎达到了100%的产率。The results in Figure 4 show that the reaction of CaPif1 mutants CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A with the linker molecules reached almost 100% yield.
实施例4:采用凝胶迁移实验来测量经修饰的CaPif1解旋酶结合DNA的能力Example 4: Using gel shift assay to measure the ability of modified CaPif1 helicase to bind DNA
通过退火制备用于凝胶迁移实验所需的DNA底物(SEQ ID NO:9与含5’端Cy3标记的SEQ ID NO:10进行杂交),然后将其按照1:1的摩尔比分别与野生型CaPif1(PDB ID:7OTJ,相关网址:RCSB PDB-7OTJ:Crystal structure of Pif1 helicase from Candida albicans),CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A(具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1)和CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A(具有突变组合Q443C、L634C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:3)在缓冲液(25mM Tris-HCl pH 7.5,300mM NaCl)中室温孵育一个小时,得到最终反应液中CaPif1解旋酶的终浓度为250nM,DNA的终浓度为25nM,总反应体积为20μL。The DNA substrate required for gel shift experiments was prepared by annealing (SEQ ID NO: 9 was hybridized with SEQ ID NO: 10 containing a 5' end Cy3 label), and then it was mixed with SEQ ID NO: 10 in a molar ratio of 1:1. Wild-type CaPif1 (PDB ID: 7OTJ, related website: RCSB PDB-7OTJ: Crystal structure of Pif1 helicase from Candida albicans), CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (with mutation combinations R455C, E796C, SEQ ID NO: 1) for C426A, C507A, C584A, C592A and C662A and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A (with mutation combination Q443C, L634C, C426A, C507A, C584A, C592 A and C662A SEQ ID NO: 3) Incubate in buffer (25mM Tris-HCl pH 7.5, 300mM NaCl) for one hour at room temperature to obtain a final concentration of CaPif1 helicase in the final reaction solution of 250nM and a final concentration of DNA of 25nM. The total reaction The volume is 20 μL.
分别加入双马来酰亚胺基乙烷和双马来酰亚胺PEG3到相应的解旋酶突变体样品中至终浓度5μM,室温下孵育1小时。随后,用4%-10%的TBE凝胶进行检测,160V跑胶1.5小时,随后在550nM激发光下观察DNA条带。Add bismaleimidoethane and bismaleimide PEG3 to the corresponding helicase mutant samples to a final concentration of 5 μM, and incubate at room temperature for 1 hour. Subsequently, 4%-10% TBE gel was used for detection, and the gel was run at 160V for 1.5 hours, and then the DNA bands were observed under 550nM excitation light.
附图5的结果显示,CaPif1解旋酶在修饰前后对DNA结合能力的影响,即修饰后的CaPif1(CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A突变体)与多核苷酸的结合能力明显增强。The results in Figure 5 show the effect of CaPif1 helicase on DNA binding ability before and after modification, that is, the interaction between modified CaPif1 (CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutants) and polynucleotides The binding ability is significantly enhanced.
实施例5:CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷和CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺PEG3具有控制完整DNA构建体X穿过纳米孔 移动的能力Example 5: CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-Bismaleimidoethane and CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A-Bismaleimidoethane Imine PEG3 has the ability to control the passage of intact DNA constructs through nanopores ability to move
其中,CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷为具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1与双马来酰亚胺基乙烷连接;CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺PEG3为具有突变Q443C、L634C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:3与双马来酰亚胺PEG3连接。Among them, CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane are SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A. Linked to bismaleimidoethane; CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A, and C662A-Bismaleimide PEG3 is a protein with mutations Q443C, L634C, C426A, C507A, C584A, C592A, and SEQ ID NO: 3 of C662A is linked to bismaleimide PEG3.
制备如附图6所示的DNA构建体X。DNA construct X as shown in Figure 6 was prepared.
首先设计引物,该引物包含A、C、D、E和F序列,然后用其扩增λDNA上1000个碱基长度的序列(G),获得的PCR产物经纯化后与序列H按照1:1.1摩尔比进行退火杂交,从而得到最终的DNA构建体X。First, design primers, which contain sequences A, C, D, E, and F, and then use them to amplify a 1000-base sequence (G) on lambda DNA. The obtained PCR product is purified and matched with sequence H at a ratio of 1:1.1 Annealing hybridization is performed at the molar ratio to obtain the final DNA construct X.
将制备好的DNA构建体X(终浓度0.1nM)分别与CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷和CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺PEG3(终浓度10nM)在室温的缓冲液(10mM Hepes,pH 8.0,100Mm KCl,10%甘油)中预孵育30分钟。The prepared DNA construct C507A, C584A, C592A, and C662A-bismaleimide PEG3 (final concentration 10 nM) were preincubated in buffer (10 mM Hepes, pH 8.0, 100 Mm KCl, 10% glycerol) for 30 min at room temperature.
在缓冲液(600mM KCl,75mM K3[Fe(CN)6,25mM K4[Fe(CN)6]·3H2O,100mM Hepes,pH8.0)中,由嵌入到DPhPC磷脂双分子层的Csgg纳米孔获得电信号测量值。在实现单孔插入磷脂双分子层后,用2ml缓冲液(600mM KCl,75mM K3[Fe(CN)6,25mM K4[Fe(CN)6]·3H2O,100mM Hepes,pH8.0)流过系统以除去残留的过量纳米孔。然后将预孵育的样品、ATP(终浓度2mM)和MgCl2(终浓度10mM)一起流入到单个纳米孔实验系统(总体积100μL),在+180mV的恒定电压下测量信号6h(包括潜在的2s的-180mV电压反转)。In buffer (600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ]·3H 2 O, 100mM Hepes, pH 8.0), phospholipids embedded in the DPhPC phospholipid bilayer were Electrical signal measurements were obtained from Csgg nanopores. After achieving single-hole insertion into the phospholipid bilayer, 2 ml buffer (600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ]·3H 2 O, 100mM Hepes, pH 8.0 ) is flowed through the system to remove residual excess nanopores. The pre-incubated sample, ATP (final concentration 2mM) and MgCl 2 (final concentration 10mM) were then flowed together into a single nanopore experimental system (total volume 100μL), and the signal was measured at a constant voltage of +180mV for 6h (including potential 2s -180mV voltage inversion).
附图8的结果显示,CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A突变体能够控制完整DNA构建体X穿过纳米孔的移动。The results in Figure 8 show that CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutants are able to control the movement of intact DNA construct X through the nanopore.
附图9显示了CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A突变体控制DNA移动的部分区域的放大图。Figure 9 shows an enlarged view of part of the region where CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutants control DNA movement.
实施例6:CaPif1解旋酶控制整个DNA构建体X穿过单个Csgg纳米孔的移动Example 6: CaPif1 helicase controls movement of entire DNA construct X through a single Csgg nanopore
本实施例以CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A(具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1)为例,验证CaPif1解旋酶如何控制整个DNA构建体X穿过单个Csgg纳米孔的移动。This example takes CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) as an example to verify CaPif1 unwinding How the enzyme controls the movement of the entire DNA construct X through a single Csgg nanopore.
制备如附图6所示的DNA构建体X。DNA construct X as shown in Figure 6 was prepared.
首先设计引物,该引物包含A、C、D、E和F序列,然后用其扩增λDNA上1000个碱基长度的序列(G),获得的PCR产物经纯化后与序列H按照1:1.1摩尔比进行退火杂交,从而得到最终的DNA构建体X。First, design primers, which contain sequences A, C, D, E, and F, and then use them to amplify a 1000-base sequence (G) on lambda DNA. The obtained PCR product is purified and matched with sequence H at a ratio of 1:1.1 Annealing hybridization is performed at the molar ratio to obtain the final DNA construct X.
将制备好的DNA构建体X(终浓度0.1nM)和CaPif1解旋酶(终浓度10nM)在室温的缓冲液(10mMHepes,pH 8.0,100mM KCl,10%甘油,5mM DTT)中预孵育30分钟。Pre-incubate the prepared DNA construct .
在缓冲液(600mM KCl,75mM K3[Fe(CN)6,25mM K4[Fe(CN)6]·3H2O,100mM Hepes,pH8.0)中,由嵌入到DPhPC磷脂双分子层的Csgg纳米孔获得电信号测量值。在实现单孔插入磷脂双分子层后,用2ml缓冲液(600mM KCl,75mM K3[Fe(CN)6,25mM K4[Fe(CN)6]·3H2O,100mM Hepes,pH8.0)流过系统以除去残留的过量纳米孔。然后将预孵育的样品、ATP(终浓度2mM)和MgCl2(终浓度10mM)一起流入到单个纳米孔实验系统(总体积100μL),在+180mV的恒定电压下测量信号6h(包括潜在的2s的-180mV电压反转)。In buffer (600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ]·3H 2 O, 100mM Hepes, pH 8.0), phospholipids embedded in the DPhPC phospholipid bilayer were Electrical signal measurements were obtained from Csgg nanopores. After achieving single-hole insertion into the phospholipid bilayer, 2 ml buffer (600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ]·3H 2 O, 100mM Hepes, pH 8.0 ) is flowed through the system to remove residual excess nanopores. The pre-incubated sample, ATP (final concentration 2mM) and MgCl 2 (final concentration 10mM) were then flowed together into a single nanopore experimental system (total volume 100μL), and the signal was measured at a constant voltage of +180mV for 6h (including potential 2s -180mV voltage inversion).
如附图10所示,观察到CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A(具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1)解旋酶能够控制DNA构建体X穿过纳米孔进行移动。解旋酶控制的DNA移动时长为14秒对应于接近1000bp的DNA构建体穿过Csgg纳米孔的移动。 As shown in Figure 10, CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) was observed to unwind The enzyme controls the movement of DNA construct X through the nanopore. The duration of helicase-controlled DNA movement of 14 seconds corresponds to the movement of nearly 1000 bp of DNA construct through the Csgg nanopore.
其中,附图11显示了CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A(具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1)解旋酶控制的DNA移动的部分区域的放大图。Among them, Figure 11 shows CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A (SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A) helicase control A magnified view of part of the DNA movement.
实施例7:野生型CaPif1和CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷控制完整DNA构建体Y穿过纳米孔移动的能力Example 7: Ability of wild-type CaPifl and CaPifl-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane to control movement of intact DNA construct Y through the nanopore
其中,CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷为具有突变组合R455C、E796C、C426A、C507A、C584A、C592A和C662A的SEQ ID NO:1与双马来酰亚胺基乙烷连接。Among them, CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane are SEQ ID NO: 1 with mutation combination R455C, E796C, C426A, C507A, C584A, C592A and C662A. Connected with bismaleimidoethane.
制备如附图7所示的DNA构建体Y。DNA construct Y as shown in Figure 7 was prepared.
首先设计上游引物和下游引物,其中上游引物包含有A、C、D、E和J序列,其中J序列如SEQ ID NO:16所示,下游引物如SEQ ID NO:17所示。然后用他们扩增λDNA上4115个碱基长度的序列(K),这样使得K序列(如SEQ ID NO:15所示)的3’末端添加上了一段多聚T序列(I),获得的PCR产物经纯化后与序列H按照1:1.1摩尔比进行退火杂交,从而得到最终的DNA构建体Y。First, design the upstream primer and downstream primer. The upstream primer contains A, C, D, E and J sequences. The J sequence is as shown in SEQ ID NO:16, and the downstream primer is as shown in SEQ ID NO:17. They were then used to amplify the 4115 base length sequence (K) on λ DNA, so that a poly T sequence (I) was added to the 3' end of the K sequence (shown as SEQ ID NO: 15), and the obtained After purification, the PCR product is annealed and hybridized with sequence H at a molar ratio of 1:1.1 to obtain the final DNA construct Y.
将制备好的DNA构建体Y(终浓度0.1nM)分别与野生型CaPif1(PDB ID:7OTJ,相关网址:RCSB PDB-7OTJ:Crystal structure of Pif1 helicase from Candida albicans)和CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷(终浓度10nM)在室温的缓冲液(10mM Hepes,pH 8.0,100mM KCl,10%甘油)中预孵育30分钟。The prepared DNA construct Y (final concentration 0.1nM) was combined with wild-type CaPif1 (PDB ID: 7OTJ, related website: RCSB PDB-7OTJ: Crystal structure of Pif1 helicase from Candida albicans) and CaPif1-R455C, E796C, C426A respectively. , C507A, C584A, C592A, and C662A-bismaleimidoethane (final concentration 10 nM) were preincubated in buffer (10mM Hepes, pH 8.0, 100mM KCl, 10% glycerol) at room temperature for 30 minutes.
在缓冲液(600mM KCl,75mM K3[Fe(CN)6,25mM K4[Fe(CN)6]·3H2O,100mM Hepes,pH8.0)中,由嵌入到DPhPC磷脂双分子层的Csgg纳米孔获得电信号测量值。在实现单孔插入磷脂双分子层后,用2ml缓冲液(600mM KCl,75mM K3[Fe(CN)6,25mM K4[Fe(CN)6]·3H2O,100mM Hepes,pH8.0)流过系统以除去残留的过量纳米孔。然后将预孵育的样品、ATP(终浓度2mM)和MgCl2(终浓度10mM)一起流入到单个纳米孔实验系统(总体积100μL),在+180mV的恒定电压下测量信号6h(包括潜在的2s的-180mV电压反转)。In buffer (600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ]·3H 2 O, 100mM Hepes, pH 8.0), phospholipids embedded in the DPhPC phospholipid bilayer were Electrical signal measurements were obtained from Csgg nanopores. After achieving single-hole insertion into the phospholipid bilayer, 2 ml buffer (600mM KCl, 75mM K 3 [Fe(CN) 6 , 25mM K 4 [Fe(CN) 6 ]·3H 2 O, 100mM Hepes, pH 8.0 ) is flowed through the system to remove residual excess nanopores. The pre-incubated sample, ATP (final concentration 2mM) and MgCl 2 (final concentration 10mM) were then flowed together into a single nanopore experimental system (total volume 100μL), and the signal was measured at a constant voltage of +180mV for 6h (including potential 2s -180mV voltage inversion).
结果显示,添加CaPif1解旋酶和DNA构建体形成的复合物到Csgg纳米孔系统中可以产生典型的核酸过孔电流信号。野生型的CaPif1解旋酶单体和CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷均能够控制DNA构建体Y进行穿孔运动。The results show that adding the complex formed by CaPif1 helicase and DNA construct to the Csgg nanopore system can generate a typical nucleic acid through-hole current signal. The wild-type CaPif1 helicase monomer and CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane can control the pore movement of DNA construct Y.
统计比较相同数量的所述解旋酶控制的DNA过孔轨迹发现,对于野生型的CaPif1解旋酶单体来说,只有较少比例的电信号轨迹观察到了典型的多聚T(I区域)过孔电流信号。Statistical comparison of the same number of DNA via trajectories controlled by the helicase found that for the wild-type CaPif1 helicase monomer, only a smaller proportion of electrical signal trajectories observed the typical poly-T (I region) Via current signal.
然而对于CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷来说,绝大多数的电信号轨迹均观察到了多聚T(I区域)的典型过孔信号。这意味着解旋酶到达了DNA构建体Y的3’末端。However, for CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A-bismaleimidoethane, the typical process of polyT (I region) was observed in most of the electrical signal traces. hole signal. This means that the helicase reaches the 3’ end of DNA construct Y.
上述结果表明,CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A-双马来酰亚胺基乙烷相比于野生型CaPif1具有更好的控制DNA穿孔运动的能力。The above results indicate that CaPif1-R455C, E796C, C426A, C507A, C584A, C592A, and C662A-bismaleimidoethane have better ability to control DNA perforation movement than wild-type CaPif1.
本发明中全部序列的详细情况总结如下:The details of all sequences in the present invention are summarized as follows:
1、SEQ ID NO:1,其为CaPif1-R455C、E796C、C426A、C507A、C584A、C592A和C662A突变体的氨基酸序列,具体序列为:
1. SEQ ID NO: 1, which is the amino acid sequence of CaPif1-R455C, E796C, C426A, C507A, C584A, C592A and C662A mutants. The specific sequence is:
2、SEQ ID NO:2,其为SEQ ID NO:1的编码序列,具体序列为:
2. SEQ ID NO: 2, which is the coding sequence of SEQ ID NO: 1. The specific sequence is:
3、SEQ ID NO:3,其为CaPif1-Q443C、L634C、C426A、C507A、C584A、C592A和C662A突变体的氨基酸序列,具体序列为:
3. SEQ ID NO: 3, which is the amino acid sequence of CaPif1-Q443C, L634C, C426A, C507A, C584A, C592A and C662A mutants. The specific sequence is:
4、SEQ ID NO:4,其为SEQ ID NO:3的编码序列,具体序列为:
4. SEQ ID NO: 4, which is the coding sequence of SEQ ID NO: 3. The specific sequence is:
5、SEQ ID NO:5,其为优化的CaPif1蛋白的核酸序列,具体序列为:
5. SEQ ID NO: 5, which is the optimized nucleic acid sequence of CaPif1 protein. The specific sequence is:
6、SEQ ID NO:6,其为荧光底物链序列,具体序列为:
6. SEQ ID NO: 6, which is the sequence of the fluorescent substrate chain. The specific sequence is:
7、SEQ ID NO:7,其为与荧光底物链序列互补配对的短链序列,具体序列为:
7. SEQ ID NO: 7, which is a short chain sequence that is complementary to the fluorescent substrate chain sequence. The specific sequence is:
8、SEQ ID NO:8,其为捕获链序列,具体序列为:
8. SEQ ID NO: 8, which is the capture strand sequence. The specific sequence is:
9、SEQ ID NO:9,其为DNA底物序列,具体序列为:
9. SEQ ID NO: 9, which is the DNA substrate sequence. The specific sequence is:
10、SEQ ID NO:10,其为与SEQ ID NO:9杂交的序列,具体序列为:
10. SEQ ID NO: 10, which is a sequence that hybridizes with SEQ ID NO: 9. The specific sequence is:
11、SEQ ID NO:11,其为DNA构建体X和构建体Y的D序列,具体序列为:
11. SEQ ID NO: 11, which is the D sequence of DNA construct X and construct Y. The specific sequence is:
12、SEQ ID NO:12,其为DNA构建体X和构建体Y的F序列,具体序列为:
12. SEQ ID NO: 12, which is the F sequence of DNA construct X and construct Y. The specific sequence is:
13、SEQ ID NO:13,其为DNA构建体X的G序列,具体序列为:
13. SEQ ID NO: 13, which is the G sequence of DNA construct X. The specific sequence is:
14、SEQ ID NO:14,其为DNA构建体X的H序列,具体序列为:
14. SEQ ID NO: 14, which is the H sequence of DNA construct X. The specific sequence is:
15、SEQ ID NO:15,其为DNA构建体Y的K序列,具体序列为:
15. SEQ ID NO: 15, which is the K sequence of DNA construct Y. The specific sequence is:
16、SEQ ID NO:16,其为DNA构建体Y的J序列,具体序列为:
16. SEQ ID NO: 16, which is the J sequence of DNA construct Y. The specific sequence is:
17、SEQ ID NO:17,其为DNA构建体Y的下游引物序列,具体序列为:
17. SEQ ID NO: 17, which is the downstream primer sequence of DNA construct Y. The specific sequence is:
以上实施例仅仅是本发明的较佳实施例而已,本发明不应该局限于该实施例和附图所公开的内容。凡是不脱离本发明所公开的精神下完成的等效或修改,都落入本发明保护的范围。 The above embodiments are only preferred embodiments of the present invention, and the present invention should not be limited to the contents disclosed in the embodiments and drawings. All equivalents or modifications made without departing from the spirit disclosed in the present invention fall within the scope of protection of the present invention.
Claims (11)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202211132258.8 | 2022-09-16 | ||
| CN202211132258.8A CN117721101B (en) | 2022-09-16 | 2022-09-16 | Modified CaPif1 helicase and application thereof |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2024056038A1 true WO2024056038A1 (en) | 2024-03-21 |
Family
ID=90207474
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/118905 Ceased WO2024056038A1 (en) | 2022-09-16 | 2023-09-14 | Modified capif1 helicase and use thereof |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN117721101B (en) |
| WO (1) | WO2024056038A1 (en) |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150191709A1 (en) * | 2012-07-19 | 2015-07-09 | Oxford Nanopore Technologies Limited | Modified helicases |
| US20160257942A1 (en) * | 2013-10-18 | 2016-09-08 | Oxford Nanopore Technologies Ltd. | Modified helicases |
| CN107109380A (en) * | 2014-10-07 | 2017-08-29 | 牛津纳米孔技术公司 | Modified enzymes |
| WO2021253410A1 (en) * | 2020-06-19 | 2021-12-23 | 北京齐碳科技有限公司 | Pif1-like helicase and application thereof |
| CN113930406A (en) * | 2021-12-17 | 2022-01-14 | 北京齐碳科技有限公司 | Pif1-like helicase and application thereof |
| WO2022126304A1 (en) * | 2020-12-14 | 2022-06-23 | 北京齐碳科技有限公司 | Modified helicase and application thereof |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN100381466C (en) * | 2004-04-06 | 2008-04-16 | 中国科学院上海生命科学研究院 | A kind of human Pif1 gene, its encoded protein and its application |
| WO2020061997A1 (en) * | 2018-09-28 | 2020-04-02 | 北京齐碳科技有限公司 | Helicase and use thereof |
-
2022
- 2022-09-16 CN CN202211132258.8A patent/CN117721101B/en active Active
-
2023
- 2023-09-14 WO PCT/CN2023/118905 patent/WO2024056038A1/en not_active Ceased
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20150191709A1 (en) * | 2012-07-19 | 2015-07-09 | Oxford Nanopore Technologies Limited | Modified helicases |
| US20160257942A1 (en) * | 2013-10-18 | 2016-09-08 | Oxford Nanopore Technologies Ltd. | Modified helicases |
| CN107109380A (en) * | 2014-10-07 | 2017-08-29 | 牛津纳米孔技术公司 | Modified enzymes |
| WO2021253410A1 (en) * | 2020-06-19 | 2021-12-23 | 北京齐碳科技有限公司 | Pif1-like helicase and application thereof |
| WO2022126304A1 (en) * | 2020-12-14 | 2022-06-23 | 北京齐碳科技有限公司 | Modified helicase and application thereof |
| CN113930406A (en) * | 2021-12-17 | 2022-01-14 | 北京齐碳科技有限公司 | Pif1-like helicase and application thereof |
Non-Patent Citations (1)
| Title |
|---|
| LU, K. Y. ET AL.: "Structural study of the function of Candida Albicans Pif1", BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, vol. 567, 21 June 2021 (2021-06-21), XP086694340, DOI: 10.1016/j.bbrc.2021.06.050 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN117721101B (en) | 2024-09-20 |
| CN117721101A (en) | 2024-03-19 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN117384878B (en) | Modified CtPif1 helicase and application thereof | |
| JP6925424B2 (en) | A method of increasing the throughput of a single molecule sequence by ligating short DNA fragments | |
| JP6690005B2 (en) | Polymerase-template complex for nanopore sequencing | |
| KR102592367B1 (en) | Systems and methods for clonal replication and amplification of nucleic acid molecules for genomic and therapeutic applications | |
| JP2023530155A (en) | Methods for Characterizing Polynucleotides Translocating Through Nanopores | |
| CN119677873A (en) | AP50 polymerase and its use | |
| JP2003535587A (en) | Cold-sensitive mutant DNA polymerase | |
| CN112301016B (en) | Application of novel mlCas12a protein in nucleic acid detection | |
| CN115896047B (en) | Recombinant T4DNA ligase mutant, fusion protein and application thereof | |
| JP2023526061A (en) | thermostable terminal deoxynucleotidyl transferase | |
| CN117210535B (en) | A library construction method for direct RNA sequencing | |
| EP3877544A1 (en) | Liquid sample workflow for nanopore sequencing | |
| CN117721101B (en) | Modified CaPif1 helicase and application thereof | |
| CN120210151A (en) | A mutant MuA transposase and its application | |
| CN118256468B (en) | A modified ToPif1 helicase and its application | |
| CN118126983B (en) | A modified MPK2 helicase and its application | |
| US20240209414A1 (en) | Novel nucleic acid template structure for sequencing | |
| WO2023125605A1 (en) | Single molecule nanopore sequencing method | |
| US20250361552A1 (en) | Method and adaptors | |
| CN118126984A (en) | Modified PAE CASDING helicase and application thereof | |
| Airapetyan | Defining the Dynamic Contributions of mRNA 5′–3′ Circularization to eIF4E Cap Recognition | |
| Airapetyan | Defining the Dynamic Contributions of mRNA 5′–3′ Circularization to eIF4E Cap | |
| JP2008099556A (en) | Method of quickly extracting, purifying and cloning target gene by using closed circular double-stranded dna | |
| CN118086286A (en) | A DNA-peptide conjugate, ARCFU-like helicase and its application in detecting peptide segments | |
| JP2000184887A (en) | Preparation method of labeled DNA |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23864768 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 23864768 Country of ref document: EP Kind code of ref document: A1 |