[go: up one dir, main page]

WO2025067293A1 - Helicase and use thereof - Google Patents

Helicase and use thereof Download PDF

Info

Publication number
WO2025067293A1
WO2025067293A1 PCT/CN2024/121291 CN2024121291W WO2025067293A1 WO 2025067293 A1 WO2025067293 A1 WO 2025067293A1 CN 2024121291 W CN2024121291 W CN 2024121291W WO 2025067293 A1 WO2025067293 A1 WO 2025067293A1
Authority
WO
WIPO (PCT)
Prior art keywords
helicase
polynucleotide
target analyte
construct
bond
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/121291
Other languages
French (fr)
Chinese (zh)
Inventor
刘先宇
王玉博
鄢妍
刘凡贵
林建勋
兰静
李阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qitan Technology Ltd Beijing
Original Assignee
Qitan Technology Ltd Beijing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202311266570.0A external-priority patent/CN119709690A/en
Priority claimed from CN202311261708.8A external-priority patent/CN119709689A/en
Application filed by Qitan Technology Ltd Beijing filed Critical Qitan Technology Ltd Beijing
Publication of WO2025067293A1 publication Critical patent/WO2025067293A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the invention belongs to the technical field of sequencing, and particularly relates to a modified helicase and an application thereof in characterizing a target analyte.
  • Nanopore sequencing technology also known as the fourth-generation sequencing technology, uses a nanopore that can provide an ion current channel to allow analytes such as single-stranded nucleic acid molecules to pass through the nanopore under the drive of electrophoresis.
  • analytes such as nucleic acids pass through the nanopore, the current of the nanopore will be reduced, and the gene sequencing technology can read the sequence information of the different signals generated in real time.
  • Helicases are a useful tool for controlling the movement of nucleic acid molecules during nanopore sequencing, but current helicases have some problems that need to be solved.
  • the helicase may fall off the nucleic acid molecule during sequencing, causing the nucleic acid molecule to be pulled through the pore at an uncontrolled speed and in an uncontrolled manner, so that the current signal of the analyte, such as a single nucleotide, is too short to be distinguished.
  • the prior art CN105899678A discloses a modified helicase. Specifically, at least one cysteine/non-natural amino acid is introduced into the tower/pin/1A domain of the helicase, thereby improving the continuous rate control ability of the helicase in nanopore sequencing.
  • the present application provides a new helicase modification method, which can also improve the continuous rate control ability of the helicase in nanopore sequencing, and has better effects in terms of sequencing stability and accuracy.
  • the present application provides a helicase
  • (c) comprising a polynucleotide binding domain, wherein the polynucleotide binding domain comprises an opening in at least one conformational state, through which a polynucleotide can be debound by the helicase, wherein the helicase forms an ectopic bond between two or more amino acids at the opening (forming the ectopic bond can prevent the polynucleotide from being debound by the helicase through the opening), and wherein the helicase retains its ability to control the movement of the target polynucleotide; or
  • (d) comprising a polynucleotide binding domain, wherein the polynucleotide binding domain comprises an opening in at least one conformational state, through which a polynucleotide can be debound from the helicase, wherein the helicase is modified so that it can form an ectopic bond between two or more amino acids at the opening (the formation of the ectopic bond can prevent the polynucleotide from being debound from the helicase through the opening), and wherein the helicase retains its ability to control the movement of the target polynucleotide.
  • the ectopic bond does not include an ectopic bond formed with the participation of cysteine, and the ectopic bond includes an ectopic bond formed with the participation of any natural amino acid except cysteine.
  • the arbitrary natural amino acid except cysteine is selected from the group consisting of the following amino acids: Ile, Val, Leu, Phe, Met, Ala, Gly, Thr, Ser, Trp, Tyr, Pro, His, Glu, Gln, Asp, Asn, Lys, Arg,
  • the ectopic bond comprises Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met ectopic bond or a combination thereof;
  • the helicase consists of one or more monomers
  • the two amino acids forming the isopeptide bond are located in the same domain or in different domains.
  • the helicase is derived from a member of the natural or modified helicase family of the following group: Dda helicase, Pifl-like helicase, Upfl-like helicase, UvrD/Rep helicase, Ski-like helicase, Rad3/XPD helicase, NS3/NPH-II helicase, DEAD helicase, DEAHi RHA helicase, RecG-like helicase, REcQ-like helicase, T1R-like helicase, Swi/Snf-like helicase and Rig-I-like helicase, preferably Dda helicase, more preferably T4-Dda helicase;
  • the helicase is derived from a member of the natural or modified helicase family of the following groups: RecD helicase, Upfl helicase, PcrA helicase, Rep helicase, UvrD helicase, Hel308 helicase, Mtr4 helicase, XPD helicase, NS3 helicase, Mssl 16 Helicase, Prp43 helicase, RecG helicase, RecQ helicase, T1R helicase, RapA helicase and Hef helicase.
  • modified helicases such as helicases with amino acid mutations has a relatively complex production process and high cost.
  • unmodified natural helicases for covalent bonding such as ectopic bond bonding is simpler and easier for those skilled in the art, and can also improve the stability of controlling the movement of nucleic acid molecules through nanopores.
  • the unmodified natural helicase includes a polynucleotide binding domain and two RecA-like domains, and there is a connection between the natural amino acid residues located in different RecA-like domains, which has the ability to control the movement of polynucleotides; and compared with the absence of the connection, the presence of the connection enhances the ability of the modified helicase to control the movement of polynucleotides.
  • the natural amino acid residues that are connected have interactions and/or a spatial distance of less than 50 angstroms. The interaction includes electrostatic interaction, preferably, the interaction is generated after the helicase binds to the target polynucleotide.
  • the natural amino acid residues that are connected are naturally present in the helicase, preferably, the natural amino acid residues that are connected include glutamic acid, lysine, tyrosine, arginine, aspartic acid, cysteine, methionine, and tryptophan. In one embodiment, the natural amino acid residues that are connected are located around the polynucleotide binding domain.
  • the connection is a covalent connection, and preferably, the connection is an ectopic bond connection.
  • the covalent connection includes a connection formed by a chemical crosslinker, or a connection formed by an enzyme.
  • the chemical crosslinker includes EDC (1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride), EDC/NHS (N-hydroxysuccinimide), EDC/s-NHS (N-hydroxysuccinimide sulfonic acid sodium salt), chlorpyrifos oxon, HRP (horseradish peroxidase), laccase, or tyrosinase.
  • the helicase belongs to the SF1B (Superfamily 1B) subfamily helicase (The T4 phage SF1B helicase Dda is structurally optimized to perform DNA strand separation, Structure. 2012 Jul 3; 20 (7): 1189-200).
  • the helicase includes Dda helicase, RecD2 helicase, Pif1 helicase, or DNA helicase B. In one embodiment, the helicase is Dda helicase from T4 phage.
  • the natural amino acid residues present in the connection include E93, E94, K364, Y92, or Y363. In one embodiment, the natural amino acid residues present in the connection are selected from at least one of the following groups: E93 and K364; E94 and K364; or Y92 and Y363.
  • the natural or modified Dda helicase is from the Dda helicases in Table 1 and Table 2 disclosed in CN105899678A.
  • the natural or modified Pif1-like helicase is from the Pif1-like helicases in Table 1 and Table 2 disclosed in CN113930406A.
  • an ectopic bond is introduced between the natural amino acid residues in the 1A domain and the 2A domain of the helicase.
  • the ectopic bond comprises Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met ectopic bond or a combination thereof.
  • the amino acid side chains are directly connected to form a covalent bond or are connected to form an ectopic bond by a connecting molecule.
  • the helicase is derived from a Dda helicase, preferably a T4-Dda helicase.
  • the helicase is derived from T4-Dda helicase, and the helicase comprises:
  • the helicase comprises any mutant of the 1A domain and/or any mutant of the 2A domain, as long as the helicase retains its ability to control the movement of the target analyte, preferably the target polynucleotide.
  • the target analyte is modified by methylation, oxidation, damage with one or more proteins or one or more markers, tags or spacers.
  • the target analyte is a target polynucleotide.
  • the present application provides a polypeptide comprising the 1A domain and the 2A domain that form an ectopic bond from the aforementioned helicase, and a polynucleotide binding domain, and excluding other domains of the helicase.
  • the present application provides a helicase, comprising the polypeptide, wherein the helicase has the ability to control the movement of a target analyte, preferably a target polynucleotide.
  • the present application provides a construct comprising a helicase or a polypeptide, and The target polynucleotide bound by its polynucleotide binding domain, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain are connected by an isopeptide bond, wherein the bound target polynucleotide is surrounded by the structure connected by the isopeptide bond.
  • the construct comprises an adapter, the target polynucleotide is connected to the adapter, and the construct has the ability to control the movement of the target polynucleotide.
  • the construct comprises two or more helicases.
  • the helicase may be any of the helicases described above, or a combination thereof.
  • the present application provides a polynucleotide comprising a sequence encoding the helicase, the polypeptide, the construct, or consisting of a sequence thereof.
  • the present application provides a vector comprising the polynucleotide operably linked to a promoter.
  • the present application provides a host cell, comprising the vector.
  • the present application provides a method for preparing the helicase, the polypeptide, or the construct, comprising expressing the polynucleotide, transfecting cells with the vector, or culturing the host cells.
  • the present application provides a method for controlling the movement of a target analyte, preferably a target polynucleotide, comprising contacting the target polynucleotide with the helicase or polypeptide, or the construct, and thereby controlling the movement of the target analyte, preferably the target polynucleotide.
  • the method is used to control the movement of a target analyte, preferably a target polynucleotide, through a pore.
  • the present application provides a method for characterizing a target analyte, preferably a target polynucleotide, comprising:
  • the target analyte is selected from one or more of polynucleotides, polypeptides, polysaccharides and lipids, preferably polynucleotides or polypeptides, polysaccharides and lipids linked to polynucleotides, more preferably single-stranded polynucleotides, double-stranded polynucleotides or partially double-stranded polynucleotides. Acid; or
  • the target analyte is a target polynucleotide, and the one or more characteristics are selected from (i) the length of the target polynucleotide, (ii) the identity of the target polynucleotide, (iii) the sequence of the target polynucleotide, (iv) the secondary structure of the target polynucleotide; and (v) whether the target polynucleotide is modified.
  • the target polynucleotide is modified by methylation, oxidation, damage with one or more proteins or one or more markers, tags or spacers.
  • one or more characteristics of the target analyte, preferably the target polynucleotide are measured by electrical measurement and/or optical measurement.
  • the electrical measurement is current measurement, voltage measurement, resistance measurement, capacitance measurement, inductance measurement, impedance measurement, tunnel measurement or field effect transistor measurement.
  • the method comprises:
  • the method further comprises the step of applying a voltage across the hole to form a complex between the hole and the helicase or construct.
  • the target analyte is a target polynucleotide, and at least part of the target polynucleotide is double-stranded.
  • the hole is a nanopore or a transmembrane hole, or the hole is selected from a biological hole, a solid-state hole, or a hole hybridized with a biological solid state.
  • the biological hole is derived from hemolysin, leukocidin, CsGG, Mycobacterium smegmatis porin A (MspA), porin B, porin C, porin D, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, Neisseria autotransporter, and WZA;
  • the solid-state hole is derived from a graphene nanopore, a MoS2 nanopore, a BN nanopore, or a PA63 nanopore.
  • the present application provides a method for forming a sensor for characterizing a target analyte, preferably a target polynucleotide, comprising forming a complex between (a) a hole and (b) the helicase or polypeptide, or the construct, and thereby forming a sensor for characterizing the target analyte, preferably a target polynucleotide.
  • the complex is formed by (a) contacting the pore and the helicase or construct in the presence of the target analyte, preferably the target polynucleotide, and (b) applying an electric potential across the pore.
  • the electric potential is a voltage potential or a chemical potential.
  • the complex is formed by covalently linking the pore to the helicase or construct.
  • the present application provides a sensor for characterizing a target analyte, preferably a target polynucleotide, comprising (a) a pore and (b) the helicase or polypeptide, or a complex between the constructs.
  • the present application provides the use of the helicase or polypeptide, or the construct in controlling the movement of a target analyte, preferably a target polynucleotide, through a pore.
  • the present application provides a kit for characterizing a target analyte, preferably a target polynucleotide, comprising (a) a pore and (b) the helicase or polypeptide, or the construct.
  • the kit further comprises a chip comprising an amphiphilic membrane.
  • the present application provides a device for characterizing a target analyte, preferably a target polynucleotide, comprising (a) a plurality of holes and (b) a plurality of the helicases or polypeptides, or the constructs.
  • the device comprises: a sensor device capable of supporting a plurality of pores and operable to use the pores and helicase or construct to characterize a target analyte, preferably a target polynucleotide; and at least one port for delivering a material for characterization.
  • the device comprises: a sensor device capable of supporting a plurality of pores and operable to use the pores and helicase or construct to characterize a target analyte, preferably a target polynucleotide; and at least one memory for storing materials for characterization; a fluidics system configured to controllably provide materials to the sensor device from the at least one memory; and one or more containers for receiving corresponding samples, the fluidics system being configured to selectively provide samples to the sensor device from one or more containers.
  • the present application provides a method for preparing the helicase, comprising:
  • step (b) the step of contacting the helicase with the polynucleotide is also included.
  • the ectopic bond comprises the use of H 2 O 2 , Hemin and H 2 O 2 , HRP enzyme and H 2 O 2 , transglutaminase, EDC, or Ru catalyzes the formation, more preferably, HRP enzyme and H 2 O 2 , or EDC.
  • the method further comprises (c) determining whether the obtained helicase is capable of controlling the movement of a target analyte, preferably a target polynucleotide.
  • the present application provides a method for preparing the construct, comprising connecting the helicase or polypeptide to a linker and thereby preparing the construct.
  • the method further comprises determining whether the obtained construct can control the movement of the target analyte, preferably the target polynucleotide.
  • the present application provides a group of two or more helicases attached to a target analyte, preferably a target polynucleotide, wherein at least one of the two or more helicases is the aforementioned helicase.
  • FIGS 1 to 4 show the enzyme cross-linking effect under the addition of different catalysts.
  • FIG. 5 shows the lock-enzyme cross-linking effect of different enzymes under HRP/H 2 O 2 catalysis according to one embodiment.
  • FIG. 6 shows the column purification elution process of a locked enzyme mutant based on Tyr-Tyr cross-linking according to one embodiment.
  • FIG. 7 shows a PAGE gel image of column purification elution of a nucleic acid-protein complex based on Tyr-Tyr cross-linking according to one embodiment.
  • FIG. 8 shows an on-machine test signal diagram of a nucleic acid-protein complex based on Tyr-Tyr cross-linking according to one embodiment.
  • FIG. 9 shows the gel electrophoresis of the nucleic acid-protein complex based on Tyr-Tyr cross-linking prepared in Example 2.1.
  • FIG. 10 shows the gel electrophoresis of the Lys-Glu cross-linked nucleic acid-protein complex prepared in Example 2.2.
  • FIG. 11 shows the column purification elution profile of the nucleic acid-protein complex purified in Example 2.3.
  • FIG. 12 shows the gel electrophoresis diagram of the nucleic acid-protein complex after elution in Example 2.3, wherein FT, E1, E2, and E3 represent nucleic acid-protein complexes corresponding to different main peaks in the elution process shown in FIG. 11 .
  • FIG. 13 shows the nucleic acid-protein complex after treatment with different reagents in Example 2.3 Gel electrophoresis diagram.
  • FIG. 14 shows the results of ATP consumption detection of nucleic acid-protein complexes in Example 2.4.
  • FIG. 15 shows the sequencing signal diagram of the nucleic acid-protein complex in Example 2.5, wherein FIG. A is a complete sequencing signal diagram of 10 kb, and FIG. B is a local amplification signal step diagram.
  • nucleotide includes two or more nucleotides
  • a helicase includes two or more helicases.
  • nucleotide sequence refers to a polymeric form of nucleotides (ribonucleotides or deoxyribonucleotides) of any length. The term refers only to the primary structure of the molecule. Thus, the term includes double-stranded and single-stranded DNA and RNA.
  • nucleic acid refers to a single-stranded or double-stranded covalently linked nucleotide sequence, wherein the 3' and 5' ends on each nucleotide are linked by a phosphodiester bond.
  • Nucleotides can be composed of deoxyribonucleotide bases or ribonucleotide bases.
  • Nucleic acids can include DNA and RNA, and can be synthesized and prepared in vitro or isolated from natural resources.
  • Nucleic acids can further include modified DNA or RNA, such as methylated DNA or RNA, or RNA modified after translation, such as 5'-capping with 7-methylguanosine, 3'-end processing, such as cleavage and polyadenylation, and splicing.
  • Nucleic acids can also include synthetic nucleic acids (XNA), such as hexitol nucleic acids (HNA), cyclohexene nucleic acids (CeNA), threose nucleic acids (TNA), glycerol nucleic acids (GNA), locked nucleic acids (LNA) and peptide nucleic acids (PNA).
  • HNA hexitol nucleic acids
  • CeNA cyclohexene nucleic acids
  • TAA threose nucleic acids
  • GNA glycerol nucleic acids
  • LNA locked nucleic acids
  • PNA peptide nucleic acids
  • nucleic acids or polynucleotides
  • the size of nucleic acids is usually expressed in terms of the number of base pairs (bp) for double-stranded polynucleotides, or in terms of the number of nucleotides (nt) in the case of single-stranded polynucleotides.
  • bp base pairs
  • nt nucleotides
  • One thousand bp or nt is equal to one kilobase pair (kb).
  • Polynucleotides less than about 40 nucleotides in length are generally referred to as "oligonucleotides" and may contain A primer in DNA manipulations, for example by the polymerase chain reaction (PCR).
  • PCR polymerase chain reaction
  • a polynucleotide such as a nucleic acid
  • the polynucleotide or nucleic acid may comprise any combination of any nucleotides.
  • the nucleotides may be naturally occurring or artificially synthesized.
  • One or more nucleotides in the polynucleotide may be oxidized or methylated.
  • One or more nucleotides in the polynucleotide may be damaged.
  • the polynucleotide may comprise a pyrimidine dimer. Such dimers are generally associated with damage caused by ultraviolet light and are a major cause of melanoma of the skin.
  • One or more nucleotides in the polynucleotide may be modified, for example with a conventional marker or label.
  • the polynucleotide may comprise one or more nucleotides that are abasic (i.e., lack a nucleobase), or lack a nucleobase and a sugar (i.e., C3).
  • the nucleotides in the polynucleotide can be linked to each other in any manner.
  • the nucleotides are usually linked by their sugar and phosphate groups, as in nucleic acids.
  • the nucleotides can be linked by their nucleobases, as in pyrimidine dimers.
  • the polynucleotide may be single-stranded or double-stranded. At least a portion of the polynucleotide is preferably double-stranded.
  • the polynucleotide may be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
  • the polynucleotide may comprise an RNA chain hybridized to a DNA chain.
  • the polynucleotide may be any synthetic nucleic acid known in the prior art, such as a peptide nucleic acid (PNA), a glycerol nucleic acid (GNA), a threose nucleic acid (TNA), a locked nucleic acid (LNA) or other synthetic polymers having nucleotide side chains.
  • PNA peptide nucleic acid
  • GNA glycerol nucleic acid
  • TNA threose nucleic acid
  • LNA locked nucleic acid
  • the PNA backbone is composed of repeated N-(2-aminoethyl)-glycine units linked by peptide bonds.
  • the GNA backbone is composed of repeated ethylene glycol units linked by phosphodiester bonds.
  • the TNA backbone is composed of repeated threose groups linked together by phosphodiester bonds.
  • LNA is formed by the above-mentioned ribonucleic acid, with an additional bridging structure connecting the 2' oxygen and 4' carbon in the ribose moiety.
  • Bridged nucleic acids are modified RNA nucleotides. They can also be referred to as restricted or inaccessible RNA.
  • BNA monomers can contain 5-, 6- or even 7-membered bridges with a "fixed" C3'-endo sugar puckering. The bridges are synthetically introduced into the 2', 4'-position of the ribose to produce 2', 4'-BNA monomers.
  • the polynucleotide is most preferably ribonucleic acid (RNA) or deoxyribonucleic acid (DNA).
  • the polynucleotide can be of any length.
  • the length of the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotides or nucleotide pairs.
  • the length of the polynucleotide can be 1000 or more nucleotides or nucleotide pairs, 5000 or more nucleotides or nucleotide pairs or 100000 nucleotides or nucleotide pairs. or more nucleotides or nucleotide pairs.
  • any number of polynucleotides can be studied.
  • the method of the embodiment can involve characterizing 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 50, 100 or more polynucleotides. If two or more polynucleotides are characterized, they can be different polynucleotides or the same polynucleotide.
  • the polynucleotide may be naturally occurring or artificially synthesized.
  • the method may be used to verify the sequence of the prepared oligonucleotide.
  • the method is typically performed in vitro.
  • protein protein
  • polypeptide and “peptide” are further used interchangeably herein to refer to polymers of amino acid residues as well as variants and synthetic analogs of amino acid residues. Thus, these terms apply to amino acid polymers in which one or more amino acid residues is a synthetic non-naturally occurring amino acid, such as a chemical analog of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
  • Polypeptides may also undergo maturation or post-translational modification processes, which may include, but are not limited to, glycosylation, proteolytic cleavage, lipidation, signal peptide cleavage, propeptide cleavage, phosphorylation, and the like.
  • Conservative substitution replaces amino acid with other amino acid with similar chemical structure, similar chemical property or similar side chain volume.
  • the amino acid introduced can have polarity, hydrophilicity, hydrophobicity, alkalinity, acidity, neutrality or charge similar to the amino acid they replace.
  • conservative substitution can introduce another aromatic or aliphatic amino acid to replace pre-existing aromatic or aliphatic amino acid.
  • Conservative amino acid changes are well known in the art and can be selected according to the properties of the 20 kinds of main amino acids defined in the following table 1. In the case where amino acid has similar polarity, this can also be determined with reference to the hydrophilicity scale of the amino acid side chain in Table 2.
  • Polynucleotide sequences can be derived and replicated using standard methods in the art. Suitable methods for site-directed mutagenesis are known in the art and include, for example, combinatorial chain reactions. Polynucleotides encoding the constructs of the embodiments can be prepared using techniques known in the art, such as those described in Sambrook, J. and Russell, D. (2001). Molecular Cloning A Laboratory Manual, 3rd Edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.
  • the resulting polynucleotide sequence can then be incorporated into a recombinant replicable vector, such as a cloning vector.
  • the vector can be used to replicate the polynucleotide in a compatible host cell.
  • the polynucleotide sequence can be prepared by introducing the polynucleotide into a replicable vector, introducing the vector into a compatible host cell, and growing the host cell under conditions that cause the vector to replicate. The vector can be recovered from the host cell.
  • the nanopore is the only channel for ions to pass through on both sides of the phospholipid membrane.
  • Speed-controlling proteins such as polynucleotide binding proteins (such as the helicase of the present application embodiment) act as motor proteins for nucleic acid molecules such as DNA, pulling the DNA chain so that it passes through the nanopore/protein pore/transmembrane pore in sequence with a step length of a single nucleotide.
  • the corresponding pore blocking signal will be recorded.
  • the present application provides a modified helicase, preferably a modified DNA-dependent ATP
  • the helicase of the modified helicase (Dda helicase), more preferably T4-Dda helicase.
  • the modification allows the modified helicase to remain bound to the analyte, preferably polynucleotide, for a longer period of time.
  • the modified helicase retains its ability to control the movement of the analyte, preferably polynucleotide. In other words, the modified helicase can still control the movement of the analyte, preferably polynucleotide.
  • the degree to which the helicase controls the movement of the analyte, preferably polynucleotide is generally changed by the modification, as described in detail below.
  • the helicase preferably the Dda helicase, more preferably the T4-Dda helicase is modified.
  • the modified helicase is usually modified compared to the corresponding wild-type helicase or natural helicase.
  • the helicase of the present application embodiment is artificial or non-natural.
  • the ability of a helicase to bind or not bind to a polynucleotide can be determined using any method known in the art. Suitable binding/not binding assays include, but are not limited to, native polyacrylamide gel electrophoresis (PAGE), fluorescence anisotropy, calorimetry, and surface plasmon resonance (SPR, such as Biacore TM ).
  • PAGE native polyacrylamide gel electrophoresis
  • SPR surface plasmon resonance
  • the ability of a helicase to unwind from a polynucleotide can be determined by measuring the time it takes for the helicase to control the movement of the polynucleotide. This can also be determined using any method known in the art.
  • the ability of a helicase to control the movement of a polynucleotide is typically analyzed in a nanopore system.
  • Modified helicase is a useful tool for controlling the movement of polynucleotides in the chain sequencing process.
  • the Dda helicase can control the movement of DNA with at least two active operation modes (when all necessary components for promoting movement are provided to the helicase, such as ATP and Mg 2+ ) and an inactive operation mode (when the necessary components for promoting movement are not provided to the helicase).
  • the Dda helicase moves along the DNA in the direction of 5'-3', but the orientation of the DNA in the nanopore (depending on which end of the DNA is captured) means that the enzyme can be used to move the DNA out of the nanopore against the direction of the applied field, or move the DNA into the nanopore along the direction of the applied field.
  • the helicase works against the direction of the field applied by the potential, and the spiral DNA is pulled out of the nanopore and pulled into the cis compartment.
  • the helicase works along the direction of the field applied by the potential, and the spiral DNA is pushed into the nanopore and enters the trans compartment.
  • the Dda helicase When the Dda helicase is not provided with the necessary components to facilitate movement, it can bind to the DNA and act as a brake to slow down the movement of the DNA as it is pulled into the pore by the applied field.
  • the inactive mode it does not matter whether the DNA is captured with the 3' or 5' end facing down, it is the applied field that pulls the DNA into the nanopore towards the trans side.
  • the pore is located in the helicase, while the enzyme acts as a brake.
  • the helicase's control over the movement of DNA can be described in a variety of ways, including ratcheting, sliding, and braking.
  • the problem in the sequencing process of polynucleotides is that the molecular motor controlling the movement of the polynucleotides may be released from the polynucleotides. This allows the polynucleotides to be rapidly pulled through the hole in an uncontrolled manner in the direction of the applied field.
  • the modified helicase of the present application embodiment does not unwind or release from the polynucleotides being sequenced. When the modified helicase controls the polynucleotides to move through the nanopore, the modified helicase can provide the increased polynucleotide read length.
  • the ability of the modified helicase of the present application embodiment to move the entire polynucleotide through the nanopore allows the polynucleotides to be evaluated to be characterized with the accuracy and speed improved than the known methods, such as its sequence. When the chain length increases and it is necessary to have a molecular motor with improved progress, this becomes more important.
  • the modified helicase of the present application embodiment is particularly effective for controlling the movement of 500 nucleotides or more nucleotides, for example 1000 nucleotides, 5000, 10000, 20000, 50000, 100000 or more nucleotides.
  • modified helicase in addition, the use of the modified helicase according to the embodiments of the present application means that lower helicase concentrations can be used.
  • the helicase of the modification of the present application embodiment is also a useful tool for isothermal polymerase chain reaction (PCR).
  • PCR isothermal polymerase chain reaction
  • the chain of double-stranded DNA is first separated by helicase and covered by single-stranded DNA (ssDNA)-binding protein.
  • ssDNA single-stranded DNA
  • two sequence-specific primers are usually hybridized to each edge of the DNA template.
  • the primer annealed to the template can be extended using DNA polymerase to prepare double-stranded DNA, and then two newly synthesized DNA products can be used as substrates by the helicase of the present application embodiment, enter the next round of reaction. Therefore, synchronous chain reaction occurs, causing the exponential amplification of the target sequence selected.
  • the modified helicase has the ability to control the movement of a polynucleotide.
  • the ability of the helicase to control the movement of a polynucleotide can be analyzed by any method known in the art.
  • the helicase can be contacted with a polynucleotide, and the position of the polynucleotide can be determined using standard methods.
  • the ability of the modified helicase to control the movement of a polynucleotide is analyzed in a nanopore system.
  • the modified helicases of the embodiments of the present application can be isolated, substantially isolated, purified or substantially purified. If the helicase is completely free of any other components such as lipids, polynucleotides, pore monomers or other proteins, the helicase is isolated or purified. If the helicase is mixed with a carrier or diluent that does not interfere with its intended use, the helicase is substantially isolated. For example, if the helicase contains less than 10%, less than 5%, less than 2% or less than 1% The helicase is substantially isolated or substantially purified if the helicase is present without other components of the pore such as lipids, polynucleotides, pore monomers or other proteins.
  • any helicase preferably Dda helicase, more preferably T4-Dda helicase can be modified according to the embodiments of the present application.
  • the Dda helicase is T4-Dda helicase.
  • Dda helicase usually contains the following five domains: 1A (RecA-type motor) domain, 2A (RecA-type motor) domain, tower domain, pin domain and hook domain (Xiaoping He et al., 2012, Structure; 20: 1189-1200). These domains can be identified using protein modeling, X-ray diffraction measurements of proteins in crystal form (Rupp B (2009). Biomolecular Crystallograph: Principles, Practice and Application to Structural Biology. New York: Gar 1and Science.), nuclear magnetic resonance (NMR) spectroscopy of protein solutions (Mark Rance; Cavanagh, John; Wayne J. Fairbrother; Arthur W. Hunt III; Skelton, N. Nicholas J. (2007).
  • the modified helicase is further modified to reduce its surface negative charge.
  • Surface residues can be identified in the same manner as the Dda domain described above.
  • Surface negative charge is typically surface negatively charged amino acids, such as aspartic acid (D) and glutamic acid (E).
  • the helicase is preferably modified to neutralize one or more surface negative charges by replacing one or more negatively charged amino acids with one or more positively charged amino acids, uncharged amino acids, non-polar amino acids and/or aromatic amino acids, or by introducing one or more positively charged amino acids, preferably adjacent to one or more negatively charged amino acids.
  • Suitable positively charged amino acids include, but are not limited to, histidine (H), lysine (K) and arginine (R). Uncharged amino acids have no net charge.
  • Suitable uncharged amino acids include, but are not limited to, cysteine (C), serine (S), threonine (T), methionine (M), asparagine (N) and glutamine (Q).
  • Non-polar amino acids have non-polar side chains.
  • Suitable non-polar amino acids include, but are not limited to, glycine (G), alanine (A), proline (P), isoleucine (I), leucine (L) and valine (V).
  • Aromatic amino acids have Aromatic Side Chains.
  • Suitable aromatic amino acids include, but are not limited to, histidine (H), phenylalanine (F), tryptophan (W) and tyrosine (Y).
  • Preferred substitutions include, but are not limited to, R for E, K for E, N for E, K for D, and R for D.
  • the embodiment provides a modified helicase, forming an ectopic bond between the 1A domain and the 2A domain of the helicase.
  • the operation of introducing non-natural amino acids is relatively cumbersome and costly, while the introduction of disulfide bonds is less stable and easily opened by reducing agents.
  • the introduction of natural side chains into the 1A domain and the 2A domain can form ectopic bonds.
  • Heteropeptide bonds are a type of covalent bond formed by the reaction of amino acid R chains in addition to backbone peptide bonds in proteins. Heteropeptide bonds can be formed within protein molecules or between protein molecules in the natural environment. Generally, isopeptide bonds formed within molecules help proteins respond to environmental stress and improve protein stability (DOI:10.1016/j.tibs.2010.09.007), while isopeptide bonds formed between protein molecules can be used in the food industry (doi:10.1007/s12223-013-0287-x) or in protein-protein interaction research. The formation of heteropeptide bonds can be divided into three ways: self-catalytic formation within proteins, enzymatic catalysis, and chemical catalysis. The following lists specific examples of these three types of formation methods.
  • Lys-Asp ectopic bonds also known as isopeptide bonds
  • CnaA/CnaB proteins common examples include CnaA/CnaB proteins. Therefore, by forming a hydrophobic environment and introducing specific reactive and catalytic amino acids, ectopic bonds can be spontaneously formed within proteins.
  • This type of ectopic bond can also be transformed into a polypeptide ligase/bisubstrate form, that is, the protein that can spontaneously form an isopeptide bond is divided into a catalytic protein and two polypeptides with side chains that can react to form an ectopic bond.
  • the polypeptides with side chains that can react to form an ectopic bond can be integrated into the site where a covalent bond needs to be formed in the form of a recombinant protein.
  • Examples include the SpyTag/KTag ectopic bond connection catalyzed by SpyLiagse based on CnaB modification (doi/10.1073/pnas.ss11113) and the SnoopTagJr/DogTag connection catalyzed by SnoopLigase based on RrgA C-terminal domain modification.
  • Enzymatic catalysis This type of method involves many reactions, such as peroxidase (HRP, The dityrosine or trityrosine covalent bonds formed by myeloperoxidase Tyrosinas, laccase, etc. (DOI: 10.1021/acsbiomaterials.6b00454), this cross-linking reaction can also be catalyzed by G4-hemin nuclease (DOI: 10.1002/chem.201101941); Lys-Gln ectopic bonds (also known as isopeptide bonds https://doi.org/10.1016/j.ijbiomac.2017.10.115) catalyzed by transglutaminase; Lys-Lys cross-links catalyzed by lysyloxidases (doi: 10.1021/Bm1010195).
  • HRP peroxidase
  • Chemical cross-linking method produces many ectopic bonds, such as the isopeptide bonds formed between the side chains of D/E acidic amino acids and K/R basic amino acids catalyzed by EDC/NHS/S-NHS, Tyr-Tyr covalent cross-linking catalyzed by H 2 O 2 /Hemin, ruthenium, and cross-linking between D/E and D/E mediated by diazo-containing cross-linkers (doi.org/10.1021/acs.analchem.7b03789).
  • ectopic bonds in which non-Cys participate include, but are not limited to, Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met, and these amino acid side chains can directly form covalent bonds or can be connected through other linker molecules.
  • the ectopic bond can be Tyr-Tyr, Lys-Lys, Gln-Lys, Glu-Lys, etc.
  • an ectopic bond is formed between the 1A domain and the 2A domain of a helicase, preferably a Dda helicase, more preferably a T4-Dda helicase.
  • an ectopic bond is introduced between the natural amino acid residues of the 1A domain and the 2A domain of the helicase.
  • the ectopic bond comprises a Tyr-Tyr ectopic bond, a Lys-Lys ectopic bond, a Gln-Lys ectopic bond, a Glu-Lys ectopic bond or a combination thereof.
  • the helicase is derived from a T4-Dda helicase, and the helicase comprises: (1) a Tyr-Tyr ectopic bond formed between x1 and x2; (2) a Lys-Gln ectopic bond formed between x3 and x4; (3) a Tyr-Lys ectopic bond formed between x1 and x3; (4) a Tyr-Gln ectopic bond formed between x1 and x4; (5) a Tyr-Lys ectopic bond formed between x2 and x3; (6) a Tyr-Gln ectopic bond formed between x2 and x4; (7) a Lys-Glu ectopic bond formed between x3 and x5; or (8) any combination thereof.
  • the helicase is further modified to reduce its surface negative charge. In one embodiment, the helicase is further modified to reduce its surface negative charge by replacing one or more negatively charged amino acids with one or more positively charged amino acids, uncharged amino acids, non-polar amino acids and/or aromatic amino acids, or by introducing one or more negatively charged amino acids near one or more negatively charged amino acids. positively charged amino acids to neutralize one or more surface negative charges.
  • the helicase includes any mutant of the 1A domain or any mutant of the 2A domain, as long as the helicase retains its ability to control the movement of the target analyte, preferably the target polynucleotide.
  • the target analyte comprises a polynucleotide, a polypeptide, a polysaccharide, or a lipid, preferably a polynucleotide.
  • the polynucleotide comprises DNA and/or RNA.
  • a variant of a helicase such as a variant of a Dda helicase, is an enzyme having an amino acid sequence that is varied from the amino acid sequence of a wild-type helicase and retains polynucleotide binding activity.
  • Polynucleotide binding activity can be determined using methods known in the art. Suitable methods include, but are not limited to, fluorescence anisotropy, tryptophan fluorescence, and electrophoretic mobility shift assay (EMSA).
  • the variant retains helicase activity. This can be determined using a variety of methods. For example, the ability of the variant to translocate along a polynucleotide can be determined using electrophysiological methods, fluorescence analysis, or ATP hydrolysis.
  • the variants may include modifications that facilitate processing of the helicase encoding polynucleotide or facilitate activity of the polynucleotide at high salt concentrations and/or room temperature.
  • Embodiment provides the fragment of helicase, for example the fragment of Dda helicase (more preferably T4-Dda helicase), it can be used to prepare the helicase of the embodiment of the present application.
  • polypeptide comprises 1A domain and 2A domain from Dda helicase (preferably T4-Dda helicase) and does not comprise any other domain from Dda helicase, wherein ectopic bond is formed between 1A domain and 2A domain.
  • Described polypeptide can comprise the variant of arbitrarily 1A domain and 2A domain defined above.
  • the present application embodiment also provides a construct comprising a Dda helicase or a modified Dda helicase (preferably a modified T4-Dda helicase) of the present application embodiment and an adapter, wherein the helicase is connected to the adapter and the construct has the ability to control the movement of a target analyte, preferably a target polynucleotide.
  • the construct is artificial or non-natural.
  • the construct is a useful tool for controlling the movement of target analytes, preferably target polynucleotides, during strand sequencing.
  • the helicase is uncoupled from the target analyte, preferably target polynucleotide, being sequenced.
  • the construct may even provide longer read lengths of the target analyte, preferably target polynucleotide.
  • Targeting constructs can also be designed to bind to specific polynucleotide sequences.
  • the adaptor can bind to a specific polynucleotide sequence and thereby target the helicase portion of the construct to that specific sequence.
  • the construct has the ability to control the movement of a target analyte, preferably a target polynucleotide. This can be determined as described above.
  • the constructs of the present application embodiments can be separated, substantially separated, purified or substantially purified. If the construct is completely free of any other components such as lipids, polynucleotides or pore monomers, the construct is separated or purified. If the construct is mixed with a carrier or diluent that does not interfere with its intended use, the construct is substantially separated. For example, if the construct exists in the form of containing less than 10%, less than 5%, less than 2% or less than 1% of other components such as lipids, polynucleotides or pore monomers, the construct is substantially separated or substantially purified.
  • the Dda helicase can be any Dda helicase.
  • Preferred Dda helicases include, but are not limited to, T4-Dda helicase and variants thereof. The variants are as described above.
  • the helicase is preferably a modified Dda helicase, more preferably a T-Dda helicase.
  • the helicase of any embodiment may be present in the construct of the embodiment of the present application.
  • the helicase is preferably covalently linked to the adapter.
  • the helicase can be linked to the adapter at more than one site, such as two or a single site.
  • the helicase can be linked to the adapter using any method known in the art.
  • the helicase and the adapter can be prepared separately and then connected together.
  • the two components can be connected in any configuration. For example, they can be connected by their terminal (i.e., amino terminal or carboxyl terminal) amino acids. Suitable configurations include, but are not limited to, the amino terminal of the adapter is connected to the carboxyl terminal of the helicase and vice versa.
  • the two components can be connected by the amino acids in their sequences.
  • the polynucleotide binding portion can be connected to one or more amino acids in the annular region of the helicase. In a preferred embodiment, the terminal amino acid of the polynucleotide binding portion is connected to one or more amino acids in the annular region of the helicase.
  • the helicase is chemically linked to the adaptor.
  • the adaptor For example, connected by one or more connector molecules.
  • Each connector can have two or more functional ends, such as two, three or four functional ends. Suitable terminal configurations in connectors are known in the art.
  • the helicase is fused to the adapter gene. If the entire construct is expressed from a single polynucleotide sequence, the helicase is fused to the adapter gene.
  • the coding sequences of the helicase and the multi-adapter can be combined in any manner to form a single polynucleotide sequence encoding the construct.
  • the helicase and the adapter can be genetically fused in any configuration.
  • the helicase and the polynucleotide binding portion can be fused through their terminal amino acids.
  • the amino terminus of the polynucleotide binding portion can be fused with the carboxyl terminus of the helicase and vice versa.
  • the amino acid sequence of the polynucleotide binding portion is preferably added to the amino acid sequence of the helicase in frame.
  • the polynucleotide binding portion is preferably inserted into the sequence of the helicase.
  • the helicase and the distribution are usually connected at two sites, i.e., by the amino terminus and carboxyl terminus of the adapter.
  • the adapter is inserted into the sequence of the helicase, the amino terminus and carboxyl terminus of the preferred part are very close and are each connected to adjacent amino acids in the sequence of the helicase or its variant.
  • the adapter is inserted into the annular region of the helicase.
  • the helicase can be directly connected to the adapter.
  • the helicase is preferably connected to the part using one or more, such as two or three, connectors as described above.
  • One or more connectors can be designed to limit the mobility of the part.
  • the helicase and/or the adapter can be modified to facilitate the connection of one or more connectors as described above.
  • the shearable connector can be used to help separate the construct from the non-connected components and can be used to further control the synthesis reaction.
  • a heterogeneous-bifunctional connector can react with a helicase, but not with the partial reaction. If the free end of the connector is used to connect the helicase protein to a surface, the unreacted helicase from the first reaction can be removed from the mixture. Subsequently, the connector can be sheared to expose the group reacted with the adapter.
  • the reaction conditions with the helicase can be optimized first, and then the reaction conditions with the adapter can be optimized after the connector is sheared off. The second reaction can be carried out more directly towards the correct site for reacting with the adapter, because the connector is confined to the region where it has been connected.
  • the The enzyme may be covalently linked to the bifunctional cross-linker.
  • the moiety may be covalently linked to the bifunctional cross-linker before the bifunctional cross-linker/moiety complex is linked to the helicase.
  • the helicase and the adaptor may be covalently linked to the chemical cross-linker simultaneously.
  • Cross-linking of the helicase or moiety itself can be prevented by maintaining a large excess of the connector concentration to the helicase and/or moiety.
  • a "lock and key" setup can be used where two connectors are used. Only one end of each connector can be reacted together to form a longer connector, and the other ends of the connectors are reacted separately with different parts of the construct (i.e., the helicase or moiety).
  • the attachment site is selected so that, when the construct is contacted with the polynucleotide, both the helicase and the adaptor can bind to the polynucleotide and control its movement.
  • the helicase and the adapter can be used to promote connection.
  • complementary polynucleotides can be used to combine the helicase and the adapter together when the helicase and the adapter are hybridized.
  • the helicase can be bound to a polynucleotide and the adapter can be bound to a complementary polynucleotide.
  • the two polynucleotides can then hybridize with each other. This allows the helicase to be in close contact with the adapter, so that the ligation reaction is more effectively carried out. This is particularly conducive to connecting two or more helicases in the correct direction to control the movement of the target polynucleotide.
  • Tags can be added to the construct to facilitate purification of the construct. These tags can then be chemically or enzymatically cleaved if they need to be removed. Fluorophores or chromophores can also be included, which can also be cleaved.
  • a simple way to purify the construct is to include different purification tags on each protein (i.e. the helicase and the polynucleotide binding portion), such as a 6His tag and a Strep tag. This method is particularly useful if the two proteins are different from each other. Using two tags only allows substances with both tags to be easily purified. If the two proteins do not have two different tags, other methods can be used.
  • the constructs can be purified from unreacted proteins based on different DNA processivity properties.
  • the constructs can be purified from unreacted proteins based on: increased affinity for the target analyte, preferably the target polynucleotide, reduced likelihood of dissociation from the analyte, preferably the polynucleotide after binding, and/or increased read length of the analyte, preferably the polynucleotide when it controls translocation of the analyte, preferably the polynucleotide through the nanopore.
  • Targeting constructs can also be designed to bind to specific polynucleotide sequences.
  • the adapter can bind to a specific polynucleotide sequence and thereby target the helicase portion of the construct. to that specific sequence.
  • the construct includes an adapter.
  • the adapter is a polypeptide that can bind to an analyte, preferably a polynucleotide.
  • the adapter preferably can specifically bind to a defined polynucleotide sequence.
  • the multi-adapter binds to a specific polynucleotide sequence, while binding to a different sequence cannot be detected. The portion that binds to a specific sequence can be used to design a construct that targets the sequence.
  • the adapter has at least one property of interacting with or modifying a polynucleotide.
  • the adapter can modify the polynucleotide by shearing it to form a single nucleotide or a shorter nucleotide chain such as two or three nucleotides.
  • the adapter can modify the polynucleotide by positioning it or moving it to a specific position, i.e. controlling its movement.
  • the adapter is derived from a polynucleotide binding enzyme.
  • a polynucleotide binding enzyme is a polypeptide that can bind to a polynucleotide and can interact with the polynucleotide and modify at least one of its properties.
  • the enzyme can modify the polynucleotide by shearing the polynucleotide into a single nucleotide or a shorter nucleotide chain, such as a dinucleotide or a trinucleotide.
  • the enzyme can modify the polynucleotide by directing or moving it to a specific position.
  • the polynucleotide binding portion does not need to show enzymatic activity, as long as it can bind to the polynucleotide and control its movement.
  • the portion can be derived from an enzyme that has been modified to remove its enzymatic activity or can be used under conditions that prevent it from acting as an enzyme.
  • the adaptor is derived from a nucleolytic enzyme.
  • preferred enzymes are exonucleases, polymerases, helicases and topoisomerases, such as gyrase.
  • the adapter is connected to the helicase and the target analyte (preferably the target polynucleotide).
  • the adapter is a nucleic acid.
  • the adapter is a polynucleotide.
  • the adapter includes a double-stranded nucleic acid region that can connect to other double-stranded nucleic acids, such as target polynucleotides.
  • the adapter is a Y-shaped adapter.
  • the adapter is composed of three DNA chains.
  • the adapter includes a spacer, preferably the spacer is inserted in the adapter, or the spacer is located at the 5' end or 3' end of one chain.
  • the adapter is a polynucleotide comprising a spacer. In one embodiment, the adapter is a Y-shaped polynucleotide comprising a spacer. In one embodiment, the adapter is a polynucleotide, polypeptide, polysaccharide or lipid containing a spacer.
  • the helicase can be arrested on the spacer of the adapter. By connecting the helicase and the adapter The polynucleotide (or polypeptide or polysaccharide or lipid) on the adapter contacts the transmembrane pore and applies an electric potential, which can move the stalled helicase through the spacer on the adapter.
  • the polynucleotide (or polypeptide or polysaccharide or lipid) of the adapter moves through the transmembrane pore along the electric potential, and the force generated moves the helicase through the spacer.
  • a plurality of adapters are included.
  • the adapter is part of the target analyte, or is not part of the target analyte.
  • other adapters may be interspersed in the target analyte (preferably the target polynucleotide).
  • the present application embodiment provides a polynucleotide containing a sequence encoding a helicase of the present application embodiment, a polypeptide of the present application embodiment, or a construct of the present application embodiment.
  • the polynucleotide may contain such a sequence.
  • the polynucleotide may be any polynucleotide discussed above.
  • Polynucleotide sequences can be isolated and replicated using standard methods in the art. Chromosomal DNA can be extracted from organisms that produce helicases, such as Methanococcus burtonii and/or organisms that produce SSB, such as Escherichia coli. PCR with specific primers can be used to amplify the gene coding sequence of interest. The amplified sequence can then be inserted into a recombinant replication vector such as a cloning vector. The vector can be used to replicate the polynucleotide in a compatible host cell.
  • a polynucleotide sequence can be prepared by introducing a polynucleotide encoding a sequence of interest into a replicable vector, including introducing the vector into a compatible host cell, and growing the host cell under conditions that cause the vector to replicate.
  • the vector can be recovered from the host cell.
  • Suitable host cells for cloning polynucleotides are known in the art and are described in more detail below.
  • the polynucleotide sequence can be cloned into a suitable expression vector.
  • the polynucleotide sequence is usually operably linked to a control sequence that can provide expression of the coding sequence by a host cell.
  • the expression vector can be used to express a construct.
  • operably linked refers to a juxtaposition wherein the various components described are in a relationship permitting them to function in their intended manner.
  • a control sequence "operably linked" to a coding sequence is linked under conditions compatible with the control sequences in a manner that achieves expression of the coding sequence. Multiple copies of the same or different polynucleotides may be introduced into a vector.
  • the expression vector can then be introduced into a suitable host cell.
  • the polynucleotide sequence encoding the construct can be introduced into an expression vector, including introducing the vector into The construct is prepared by introducing the polynucleotide sequence into a compatible bacterial host cell and growing the host cell under conditions that result in expression of the polynucleotide sequence.
  • the vector can be, for example, a plasmid, virus or phage vector with a replication origin, optionally a promoter for expressing the polynucleotide sequence, and optionally a regulatory factor for the promoter.
  • the vector can contain one or more selectable marker genes, such as an ampicillin resistance gene. Promoters and other expression control signals compatible with the host cell (for which the expression vector is designed) can be selected. T7, trc, lac, ara or k promoters are usually used.
  • the host cell typically expresses the construct at a high level.
  • a host cell compatible with the expression vector used to transform the host cell is selected for polynucleotide sequence transformation.
  • the host cell is typically bacterial and preferably E. coli. Any cell with a ⁇ DE3 lysogen, such as Rosetta2(DE3)pLys, C41(DE3), BL21(DE3), JM109(DE3), B834(DE3), TUNER, Origami and Origami B, can express vectors containing a T7 promoter.
  • the present application embodiment also provides the group (series) of two or more helicases that have been connected (or combined) polynucleotide, wherein at least one of the two or more helicases is the helicase of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase.
  • the group can include any number of helicases, such as 2,3,4,5,6,7,8,9,10 or more helicases. Any number of helicases can be the helicase of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase.All of two or more helicases are preferably the helicase of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase.
  • One or more helicases of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase can be any helicase discussed above, preferably Dda heli
  • the two or more helicases can be the same or different helicases.
  • the group includes two or more helicases of the present application embodiments, preferably Dda helicases, more preferably T4-Dda helicases
  • the helicases of the present application embodiments, preferably Dda helicases, more preferably T4-Dda helicases can be the same or different.
  • the group may include any number and combination of helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases.
  • the group of two or more helicases preferably includes at least two helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicase.
  • the group may include two or more helicases, preferably Dda helicases, more preferably T4-Dda helicases.
  • Each helicase, preferably Dda helicase, more preferably T4-Dda helicase comprises the modification described above, including that the 1A domain and the 2A domain form an ectopic bond.
  • the group may include one or more helicases that are not part of the embodiments of the present application.
  • the one or more helicases may be or are derived from Hel308 helicases, RecD helicases, such as Tral helicases or TrwC helicases, XPD helicases or Dda helicases.
  • the one or more helicases may be any helicase in a helicase, modified helicase or helicase construct disclosed in the following international applications: International Application No. PCT/GB2012/052579 (Publication No.
  • WO 2013/057495 PCT/GB2012/053274 (Publication No. WO 2013/098562); PCT/GB2012/053273 (Publication No. WO 2013/098564); 13/098561); PCT/GB2013/051925 (published as WO 2014/013260); PCT/GB2013/051924 (published as WO 2014/013259) and PCT/GB2013/051928 (published as WO 2014/013262); and UK application No. 1318464.3 filed on 18 October 2013.
  • the one or more helicases are preferably modified to reduce the size of the opening in the polynucleotide binding domain through which the polynucleotide can be released from the helicase in at least one conformational state. This is disclosed in WO 2014/013260.
  • the two or more helicases in the group may be separated from each other.
  • the two or more helicases in the group may be bound together by the transmembrane pore as the polynucleotide moves through the pore.
  • the two or more helicases in the group may contact each other.
  • the two or more helicases are not linked to each other except by the polynucleotide.
  • the two or more helicases are preferably not covalently linked to each other.
  • the two or more helicases can be connected to each other or covalently connected to each other.
  • the helicases can be connected in any order and using any method.
  • a group of connected helicases can be called a team.
  • the present invention provides a method for controlling the movement of a target analyte, preferably a target polynucleotide.
  • the method comprises allowing the target analyte, preferably a target polynucleotide, to move with a helicase of the embodiment, preferably a Dda helicase, more preferably a T4-Dda helicase or a construct of the present invention.
  • the method is preferably carried out under a potential applied across the pore.
  • the applied potential generally results in the formation of a complex between the pore and the helicase or construct.
  • the applied potential may be a voltage potential.
  • the applied potential may be a chemical potential.
  • An example of this is the use of a salt gradient across the amphiphilic layer. Salt gradients are disclosed in Holden et al., J Am Chem Soc. 2007 Jul 11; 129(27): 8650-5.
  • the present application embodiment also provides a method for characterizing the preferred target polynucleotide of target analyte.
  • the method includes (a) contacting the preferred target polynucleotide of target analyte with the helicase of hole and embodiment, preferably Dda helicase, more preferably T4-Dda helicase or the construct of the present application embodiment, so that the helicase or construct control the preferred target polynucleotide of target analyte through the movement of the hole.
  • the method also includes (b) as the preferred polynucleotide of target analyte moves relative to the hole, obtaining one or more measured values, wherein the measured value represents one or more features of the preferred target nucleotide of the target analyte, and thus characterizing the preferred target polynucleotide of target analyte.
  • the helicase can be any helicase as described above for the constructs of the embodiments of the present application, including the modified helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases, and helicases not modified according to the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases.
  • helicases of the present application embodiment can use any number of helicases of the present application embodiment in these methods, preferably Dda helicase, more preferably T4-Dda helicase.
  • 1,2,3,4,5,6,7,8,9,10 or more helicases can be used.
  • two or more helicases of the present application embodiment preferably Dda helicase, more preferably T4-Dda helicase, they can be identical or different.Suitable number and combination are as described in the group for the present application embodiment above.They can be equally applied to the method for the present application embodiment.
  • two or more helicases may be linked to each other.
  • the two or more helicases may be covalently linked to each other.
  • the helicases may be linked in any order and using any method.
  • two or more helicases are used, they are preferably not linked to each other except by the polynucleotide.
  • the two or more helicases are more preferably not covalently linked to each other.
  • Steps (a) and (b) are preferably performed under conditions where a potential is applied across the pore as described above.
  • the current passing through the pore is used to determine the amount of polynucleotide that moves relative to the pore.
  • the sequence of the target polynucleotide is determined by the DNA sequencing. This is called strand sequencing.
  • the method of the embodiment of the present application is used to characterize the target analyte, preferably the target polynucleotide.
  • the polynucleotide is as defined above.
  • the whole or only part of the target polynucleotide can be characterized using this method.
  • the target polynucleotide can be any length.
  • the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length.
  • the polynucleotide can be 1000 or more nucleotide pairs, 5000 or more nucleotide pairs in length or 100000 or more nucleotide pairs in length.
  • the target polynucleotide is usually present in any suitable sample.
  • the present application embodiment is usually implemented on a sample known to contain or suspected to contain the target polynucleotide.
  • the present application embodiment can be implemented to a sample to confirm the identity of one or more target polynucleotides known or expected to be present in the sample.
  • the sample can be a biological sample.
  • the present application embodiment can be implemented in vitro for samples obtained or extracted from any organism or microorganism.
  • the organism or microorganism is usually ancient nuclear, prokaryotic or eukaryotic, and usually belongs to one of the following five kingdoms: plant kingdom, animal kingdom, fungi, ankaryotic protozoa and protists.
  • the present application embodiment is implemented in vitro for samples obtained or extracted from any virus.
  • the sample is preferably a liquid sample.
  • the sample usually includes the patient's body fluid.
  • the sample can be urine, lymph, saliva, mucus or amniotic fluid, but preferably blood, plasma or serum.
  • the sample is derived from humans, but can alternatively be from other mammals, such as from commercially raised animals such as horses, cattle, sheep or pigs, or can be pets such as cats or dogs.
  • samples of plant origin are usually obtained from commercial crops, such as cereals, beans, fruits or vegetables, such as wheat, barley, oats, Brassica, corn, soybeans, rice, bananas, apples, tomatoes, potatoes, grapes, tobacco, beans, lentils, sugarcane, cocoa, cotton.
  • the sample may be a non-biological sample.
  • the non-biological sample is preferably a liquid sample.
  • Examples of non-biological samples include surgical fluids, water such as drinking water, sea water or river water, and reagents for laboratory tests.
  • the sample is usually processed before being used in the present application embodiment, for example, by centrifugation or filtering through a membrane to remove unwanted molecules or cells, such as red blood cells.
  • the test can be performed immediately after obtaining the sample.
  • the sample can also be stored before analysis, preferably below -70°C.
  • a pore is a structure that spans the membrane to some extent. It allows hydrated ions to flow across or within the membrane by driving them through an applied potential. A pore usually spans the entire membrane to allow hydrated ions to flow through the membrane.
  • the pores may be formed by a membrane having a plurality of pores, each of which is a well in the membrane, along which hydrated ions may flow or flow.
  • Suitable pores include, but are not limited to, protein pores, polynucleotide pores, and solid-state pores.
  • Suitable membranes are preferably amphiphilic layers.
  • An amphiphilic layer is a layer formed by amphiphilic molecules, such as phospholipids, which have hydrophilicity and lipophilicity. Amphiphilic molecules can be synthetic or naturally occurring.
  • An amphiphilic layer can be a monolayer or a bilayer.
  • An amphiphilic layer is typically planar.
  • An amphiphilic layer can be curved.
  • the amphiphilic layer can be supported.
  • the membrane can be a lipid bilayer.
  • a lipid bilayer is formed by two opposing layers of lipids. The two layers of lipids are arranged so that their hydrophobic tail groups face each other to form a hydrophobic interior.
  • the hydrophilic head groups of the lipids face outward toward the aqueous environment on each side of the bilayer.
  • the membrane includes a solid layer.
  • the solid layer can be formed by organic and inorganic materials. If the membrane includes a solid layer, holes are typically present in the amphiphilic membrane or in a layer included in the solid layer, such as holes, wells, gaps, channels, grooves or slits in the solid layer.
  • the polynucleotide may be attached to the membrane. This may be accomplished using any known method. If the membrane is an amphiphilic layer, such as a lipid bilayer, the polynucleotide is preferably attached to the membrane via a polypeptide present in the membrane or via a hydrophobic anchor present in the membrane.
  • the hydrophobic anchor is preferably a lipid, a fatty acid, a sterol, a carbon nanotube or an amino acid.
  • the polynucleotide can be directly connected to the membrane.
  • the polynucleotide is preferably connected to the membrane through a connector.
  • Preferred connectors include, but are not limited to, polymers, such as polynucleotides, polyethylene glycol (PEG) and polypeptides. If the polynucleotide is directly connected to the membrane, due to the distance between the membrane and the helicase, the characterization cannot be carried out to the end of the polynucleotide, and some data will be lost. If a connector is used, the polynucleotide can be fully characterized. If a connector is used, the connector can be connected to any position of the polynucleotide. The connector is usually connected to the tail polymer of the polynucleotide.
  • connection can be stable or temporary. For some applications, the temporary nature of the connection is preferred. If the stable linker is directly connected to the 5' end or 3' end of the polynucleotide, then due to the distance between the active site of the membrane and the helicase, the characterization can not proceed to the end of the polynucleotide, which will cause the loss of data. If the connection is temporary, then when the end of the connection randomly becomes without membrane, the polynucleotide can be characterized completely. In a preferred embodiment, the polynucleotide is connected to an amphiphilic layer.
  • Transmembrane pores or nanopores are preferably transmembrane protein pores, which define channels or holes that allow molecules and ions to be translocated from one side of the membrane to the other side.
  • the translocation of ionic substances through the hole can be driven by a potential difference applied to either side of the hole.”
  • Nanopore is a protein pore or transmembrane pore, in which the minimum diameter of the channel through which molecules or ions pass is nanometer-scale ( 10-9 meters).
  • the protein pore can be a transmembrane protein pore or a transmembrane pore or a nanopore.
  • the transmembrane protein structure of the transmembrane pore can be a monomer or an oligomer in nature.
  • the hole comprises a plurality of polypeptide subunits arranged around a central axis, thereby forming a protein-lined channel extending substantially perpendicular to the membrane where the hole resides.
  • the number of polypeptide subunits is not limited. Typically, the number of subunits is 5 to 30, and the number of subunits is suitably 6 to 10. Alternatively, the number of subunits is not defined as in the case of perfringolysin or related large membrane pores.
  • the portion of the protein subunit that forms the protein-lined channel within the pore typically comprises secondary structural motifs that may include one or more transmembrane ⁇ -barrels and/or ⁇ -helical portions.
  • the method of the embodiment can measure two, three, four or five or more characteristics of the polynucleotide.
  • the one or more characteristics are preferably selected from (i) the length of the polynucleotide, (ii) the identity of the polynucleotide, (iii) the sequence of the polynucleotide, (iv) the secondary structure of the polynucleotide, and (v) whether the polynucleotide is modified. In one embodiment, any combination of (i) to (v) can be measured.
  • the length of the polynucleotide may be measured, for example, by determining the number of interactions between the polynucleotide and the pore or the duration of interactions between the polynucleotide and the pore.
  • the identity of a polynucleotide can be measured in a variety of ways, and the identity of a polynucleotide can be measured in conjunction with or without measurement of the polynucleotide sequence.
  • the former is relatively simple; the polynucleotide is sequenced and identified.
  • the latter can be accomplished in several different ways. For example, the presence of a particular motif in a polynucleotide can be measured (without measuring the rest of the sequence of the polynucleotide).
  • the measurement of a specific electrical and/or optical signal in the method can identify that the polynucleotide is from a particular source.
  • sequence of the polynucleotide can be determined as previously described.
  • secondary structure can be measured using a variety of methods. For example, if the method involves an electrical measurement method, the secondary structure can be measured using a change in residence time or a change in the current flowing through the pore. This allows for the differentiation of regions of single-stranded and double-stranded polynucleotides.
  • the presence or absence of any modification may be measured.
  • the method preferably comprises determining whether the polynucleotide is modified by methylation, oxidation, damage, treatment with one or more proteins or with one or more markers, tags or is abasic or lacking nucleobases and sugars.
  • the decoration will result in a specific interaction with the pore, which can be measured using the methods described below.
  • methylcytosine can be distinguished from cytosine based on the current flowing through the pore during its interaction with each nucleotide.
  • Electrical measurements include voltage measurements, capacitance measurements, current measurements, impedance measurements, tunneling measurements (Ivanov AP et al., Nano Lett. 2011 Jan 12; 11(I): 279-85) and FET measurements (International Application WO 2005/124888).
  • Optical measurements can be combined with electrical measurements (Soni GV et al., Rev Sci Instrum. 2010 Jan; 81(1) 014301).
  • the measurement can be a transmembrane current measurement, such as a measurement of the ionic current flowing through the pore.
  • the electrical measurement or optical measurement can use conventional electrical or optical measurements.
  • Electrical measurements can be made using standard single channel recording equipment as described in Stoddart D et al., Proc Natl Acad Sci, 12; 106 (19) 7702-7, Lieberman KR et al., J Am Chem Soc. 2010; 132 (50) 17961-72 and international application WO 2000/28312.
  • electrical measurements can be made using a multi-channel system, such as described in international application WO 2009/077734 and international application WO 2011/067559.
  • the method is preferably performed using an applied potential across the membrane.
  • the applied potential may be a voltage potential.
  • the applied potential may be a chemical potential.
  • An example of this is the use of a salt gradient across a membrane, such as a layer of amphiphilic molecules. Salt gradients are disclosed in Holden et al., J Am Chem SOC. 2007 Jul 11; 129(27): 8650-5.
  • the current flowing through the pore as the polynucleotide moves relative to the pore is used to estimate or determine the sequence of the polynucleotide. This is strand sequencing.
  • the method may include measuring the current flowing through the hole when the polynucleotide moves relative to the hole. Therefore, the equipment used for the method may also include a circuit capable of applying an electric potential and measuring the electrical signal passing through the membrane and the hole. The method may be performed using a patch clamp or a voltage clamp.
  • the method is usually carried out by applying a voltage to the membrane and the hole.
  • the voltage used is usually from +5V to -5V, for example from +4V to -4V, from +3V to -3V or from +2V to -2V.
  • the voltage used is usually from -600mV to +600V or -400mV to +400mV.
  • the voltage used is preferably selected from -400mV, -300mV, -200mV, -150mV, -100mV, -50mV, -20mV and 0mV lower limit and independently selected from +10mV,
  • the voltage used is more preferably in the range of 100mV to 240mV and most preferably in the range of 120mV to 220mV.
  • the method is generally carried out in the presence of any charge carrier, such as a metal salt such as an alkali metal salt, a halide salt such as a chloride salt, such as an alkali metal chloride salt.
  • the charge carrier may include an ionic liquid or an organic salt, such as tetramethylammonium chloride, trimethylphenylammonium chloride, phenyltrimethylammonium chloride or 1-ethyl-3-methylimidazolium chloride.
  • the salt is present in an aqueous solution in the chamber.
  • Potassium chloride (KCl), sodium chloride (NaCl), cesium chloride (CsCl) or a mixture of potassium ferrocyanide and potassium ferrocyanide are generally used.
  • KCl, NaCl and a mixture of potassium ferrocyanide and potassium ferrocyanide are preferred.
  • the charge carrier may be asymmetric on the membrane. For example, the type and/or concentration of the charge carrier may be different on each side of the membrane
  • the concentration of the salt may be saturated.
  • the concentration of the salt may be 3 M or less, and is typically 0.1 to 2.5 M, 0.3 to 1.9 M, 0.5 to 1.8 M, 0.7 to 1.7 M, 0.9 to 1.6 M or 1 M to 1.4 M.
  • the concentration of the salt is preferably 150 mM to 1 M.
  • the method is preferably performed using a salt concentration of at least 0.3 M, such as at least 0.4 M, at least 0.5 M, at least 0.6 M, at least 0.8 M, at least 1.0 M, at least 1.5 M, at least 2.0 M, at least 2.5 M or at least 3.0 M.
  • High salt concentrations provide a high signal-to-noise ratio and allow the presence of nucleotides to be identified in the context of normal current fluctuations to be indicated by current.
  • the method is usually carried out in the presence of a buffer.
  • the buffer is present in an aqueous solution in the chamber. Any buffer can be used in the method of the present application embodiment.
  • the buffer is a phosphate buffer.
  • Other suitable buffers are HEPES or Tris-HCl buffer.
  • the method is usually carried out at a pH of 4.0 to 12.0, 4.5 to 10.0, 5.0 to 9.0, 5.5 to 8.8, 6.0 to 8.7, 7.0 to 8.8 or 7.5 to 8.5.
  • the pH used is preferably about 7.5.
  • the method can be carried out at a temperature of 0°C to 100°C, 15°C to 95°C, 16°C to 90°C, 17°C to 85°C, 18°C to 80°C, 19°C to 70°C or 20°C to 60°C.
  • the method is typically carried out at room temperature.
  • the method is optionally carried out at a temperature that supports enzyme function, such as about 37°C.
  • the method includes:
  • the target analyte preferably the target polynucleotide
  • the solution of the embodiment of the present application contacting the pore with a helicase or a construct of an embodiment of the present application such that the target analyte, preferably a target polynucleotide, moves through the pore and the helicase or construct controls the movement of the target analyte, preferably a target polynucleotide, through the pore;
  • the method can be implemented using any device suitable for studying membrane/pore systems (wherein the pores are embedded in the membrane).
  • the method can be implemented using a device suitable for sensing the pores.
  • the device comprises a chamber comprising an aqueous solution and a barrier that divides the chamber into two parts.
  • the barrier typically has a gap in which a membrane comprising the pores is formed.
  • the barrier forms a membrane in which the pores are present.
  • the method can be implemented using the device described in International Application No. PCT/GB08/000562 (WO 2008/102120).
  • the method may include measuring the current through the hole as the target analyte, preferably the polynucleotide, moves relative to the hole. Therefore, the device may also include a circuit capable of applying a potential across the membrane and the hole and measuring the current signal.
  • the method may be implemented using a patch clamp or a voltage clamp.
  • the method preferably comprises the use of a voltage clamp.
  • the target polynucleotide may be contacted with a helicase or construct and the hole in any order.
  • the target analyte preferably the target polynucleotide
  • the target analyte preferably the target polynucleotide, first forms a complex with the helicase or construct.
  • the target analyte preferably the target polynucleotide/helicase or construct complex then forms a complex with the hole and controls the target analyte, preferably the polynucleotide, to move through the hole.
  • the present application embodiment also provides the sensor and method thereof formed for characterizing the preferred target polynucleotide of target analyte.Described method is included in the helicase of hole and embodiment, preferably Dda helicase, more preferably forms complex between the construct of T4-Dda helicase, enzyme or the present application embodiment.
  • helicase can be any helicase described above for the construct of the present application embodiment, comprises the helicase of the present application embodiment and the helicase not modified according to the present application embodiment.
  • the Dda helicase of the present application embodiment discussed for the group and method of the present application embodiment of any number and combination can be used.
  • the complex can be formed by contacting the pore and the helicase or construct in the presence of a target analyte, preferably a target polynucleotide, and then applying an electric potential to the pore.
  • the applied electric potential can be a chemical potential or a voltage potential as described above.
  • the complex can be formed by The via hole is covalently connected to the helicase or construct to form.
  • the method of covalent connection is known in the art and, for example, disclosed in International Application No. PCT/GB09/001679 (publication number is WO 2010/004265) and PCT/GB10/000133 (publication number is WO 2010/086603).
  • the complex is a sensor for characterizing a target analyte, preferably a target polynucleotide.
  • the method preferably includes forming a complex between a hole derived from Msp and a helicase of the present application embodiment or a construct of the present application embodiment. Any embodiment of the present application embodiment method discussed above is equally applicable to the method.
  • the present application embodiment also provides a sensor prepared using the method of the present application embodiment.
  • the present application embodiment also provides a test kit for characterizing the preferred target polynucleotide of target analyte.
  • the test kit includes the helicase of (a) hole and (b) embodiment, preferably Dda helicase, more preferably T4-Dda helicase, or the construct of the present application embodiment. Any embodiment discussed above for the present application embodiment method is equally applicable to the test kit.
  • the helicase can be any helicase described for the construct of the present application embodiment above, including the helicase of the present application embodiment and the helicase not modified according to the present application embodiment.
  • the test kit can include the helicase of the present application embodiment discussed for the group and method of the present application embodiment using any number and combination, preferably Dda helicase, more preferably T4-Dda helicase.
  • the kit may further include components of the membrane, such as phospholipids required to form an amphiphilic layer such as a lipid bilayer.
  • the kit of the present application embodiment may additionally include one or more other reagents or instruments that enable any of the embodiments described above to be implemented.
  • the reagents or instruments include one or more of the following: a suitable buffer (aqueous solution), an instrument for obtaining a sample from a receptor (such as a catheter or an instrument containing a needle), an instrument for amplifying and/or expressing polynucleotides, a membrane as defined above or a voltage clamp or patch clamp device.
  • the reagent present in the kit may be dry so that the liquid sample resuspends the reagent.
  • the kit may also optionally include an instrument that enables the kit to be used in the present application embodiment method or instructions for which patients the method may be used for.
  • the kit may optionally include nucleotides.
  • the present application also provides a device for characterizing a target analyte, preferably a target polynucleotide.
  • the device comprises a plurality of holes and a plurality of helicases, preferably a Dda helicase, more preferably a T4-Dda helicase, a plurality of helicases of the present application or a plurality of constructs of the present application. Construct.
  • the device preferably further includes instructions for implementing the method of the embodiment of the present application.
  • the device can be any conventional device for target analyte, preferably polynucleotide analysis, such as an array or a chip. Any embodiment of the method of the embodiment of the present application discussed above can be equally applicable to the device of the embodiment of the present application.
  • the helicase can be any helicase described above for the construct of the embodiment of the present application, including the helicase of the embodiment of the present application and the helicase that is not modified according to the embodiment of the present application.
  • the device may include the helicase of the embodiment of the present application discussed above for the group and method of the embodiment of the present application using any number and combination, preferably Dda helicase, more preferably T4-Dda helicase.
  • the device is preferably assembled to implement the method of the embodiment of the present application.
  • the device preferably comprises:
  • a sensor device capable of supporting a plurality of pores and operable to use the pores and the helicase or construct to characterize a target analyte, preferably a polynucleotide;
  • the device preferably comprises:
  • a sensor device capable of supporting a plurality of pores and operable to use the pores and the helicase or construct to characterize a target analyte, preferably a polynucleotide;
  • At least one memory for storing materials for characterization.
  • the device more preferably comprises:
  • a sensor device capable of supporting the membrane and the plurality of pores and operable to characterize a target analyte, preferably a polynucleotide, using the pores and the helicase or construct;
  • At least one memory storing material for characterization
  • a fluidics system configured to controllably supply material to the sensor device from at least one reservoir; and one or more containers for receiving each sample, the fluidics system being configured to selectively supply samples from the one or more containers to the sensor device.
  • the device may be any of those described in International Application No. PCT/GB08/004127 (published as WO 2009/077734), PCT/GB10/000789 (published as WO 2010/122293), International Application No. PCT/GB10/002206 (published as WO 2011/067559) or International Application No. PCT/US99/25679 (published as WO 00/28312).
  • the present application also provides a method for preparing the modified helicase of the present application.
  • the method includes providing a helicase, preferably a Dda helicase, more preferably a T4-Dda helicase to form the modified helicase of the present application.
  • the method preferably further comprises determining whether the helicase can control the movement of the target analyte, preferably a polynucleotide.
  • the analysis performed on it is described above. If the movement of the target analyte, preferably a polynucleotide, can be controlled, the helicase is correctly modified and the helicase of the present application embodiment is obtained. If the movement of the target analyte, preferably a polynucleotide, cannot be controlled, the helicase of the present application embodiment is not obtained.
  • the present application also provides a method for preparing the construct.
  • the method comprises connecting, preferably covalently connecting, a helicase of the present application, preferably a Dda helicase, more preferably a T4-Dda helicase to an adapter. Any helicase and adapter as described above can be used in the method.
  • the method preferably further comprises determining whether the construct can control the movement of the target analyte, preferably polynucleotide.
  • the analysis performed on it is described above. If the movement of the target analyte, preferably polynucleotide, can be controlled, the helicase and adapter are correctly connected, and the construct of the present application embodiment is obtained. If the movement of the target analyte, preferably polynucleotide, cannot be controlled, the construct of the present application embodiment is not obtained.
  • Example 1.1 Enzyme cross-linking mediated by different catalytic systems
  • Antisense strand S2 5'- CGACTTCTACCGTTTGACTCCGC (SEQ ID NO: 3)-3'
  • Connection chain YB (SEQ ID NO: 4)/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC/-3' respectively synthesize the main strand S1, the antisense strand S2 and the connecting strand YB, and mix and anneal the three at a ratio of 1:1.1:1.1 to form an adapter.
  • the annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.
  • Two reaction systems were prepared, each with 1uM adapter, 5 times the amount of Q1 (mutant T4DDA-E93Q) protein, and no catalyst, 0.1U microbial transglutaminase as a catalyst, mixed and incubated at 37°C for 36h, then run TBE PAGE gel at 160V for 40min and detect the enzyme cross-linking.
  • the results are shown in Figure 2.
  • the nucleic acid protein complex is a complex of the adapter and the helicase. After adding microbial transglutaminase, an effective cross-linking product can be formed.
  • Two reaction systems were prepared, each with 1uM adapter, 5 times the amount of Q11 (mutant T4DDA-E94G) protein, and no catalyst, 5mM EDC as a catalyst, mixed and incubated at 30°C for 4h, then run TBE PAGE gel at 160V for 40min and detect the enzyme cross-linking.
  • the results are shown in Figure 3, the nucleic acid protein complex is a complex of the adapter and the helicase. After adding EDC, an effective cross-linking product can be formed.
  • Antisense strand S2 5’-Ru-CGACTTCTACCGTTTGACTCCGC (SEQ ID NO: 3)-3’
  • Connection chain YB 5’P-GTCAGCAGGACCGAATGA(SEQ ID NO:4)/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OM eU//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’
  • the main strand S1, antisense strand S2 and connecting strand YB are synthesized separately, and the three are mixed and annealed at a ratio of 1:1.1:1.1 to form a linker.
  • the annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.
  • reaction systems were prepared, each with 2uM adapter, 4 times the amount of T49 (mutant T4 DDA-E94Y) or 4 times the amount of T53 protein/1mM 1,4-diamine were added, mixed and placed at 30°C for incubation for 30min, then 1.25mM sodium persulfate was added and placed under LED 450nM light for crosslinking (light power of 50mw/ cm2 ; distance 15cm), then TBE PAGE gel was run at 160V for 40min and the enzyme crosslinking was detected, the results are shown in Figure 4.
  • the nucleic acid protein complex is a complex of the adapter and the helicase. Under Ru light catalysis, it can be clearly found that the enzyme has been crosslinked (Y94-K364 in T49; Y94 and N365Y in T53 are crosslinked with 1,4-diamine).
  • Example 1.4 Preparation of nucleic acid-protein complexes based on Tyr-Tyr cross-linking
  • the enzyme cross-linking product catalyzed by HRP/H 2 O 2 in Example 1.1 was added to a DNAPac PA200 column and purified using an elution buffer to remove impurities such as enzymes that were not bound to the adapter and incompletely reacted EDC from the column.
  • the nucleic acid-protein complex was eluted with a mixture of buffer A and buffer B with 10 column volumes.
  • the main elution peaks were then collected, their concentrations were measured, and TBE PAGE gel was run at 160V for 40 minutes.
  • buffer A 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6
  • buffer B 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.
  • the elution process is shown in Figure 6.
  • the peaks in the elution process were collected and detected by TBE PAGE gel running at 160V for 40 minutes.
  • the results are shown in Figure 7.
  • E3 is the desired nucleic acid-protein complex.
  • Example 1.5 On-machine testing of nucleic acid-protein complexes based on Tyr-Tyr cross-linking
  • a 10 kb library was prepared by end repair, and the nucleic acid protein complex in Example 1.4 was connected to the library. Finally, sequencing was performed on the QNome-3841 of Qi Carbon Technology Co., Ltd. The actual sequencing signal is shown in Figure 8.
  • the nucleic acid protein complex can generate sequencing signals normally, and the signal steps are clear for subsequent base inference.
  • the complete library pass-through signal can be seen. This shows that the formation of the 1A domain and the 2A domain
  • the helicase that forms the Tyr-Tyr ectopic bond between the two molecules can control the movement of the target polynucleotide and can be used for nanopore sequencing of polynucleotides.
  • Example 1.6 Analysis of sequencing accuracy based on Tyr-Tyr cross-linking
  • a randomly interrupted Human genome library was prepared by end repair, and the nucleic acid-protein complex or disulfide-crosslinked SAC-M1 (nucleic acid-protein complex formed by T4Dda-E94C/A360C) in Example 1.4 was connected to the library. Finally, sequencing was performed on the QNome-3841 of Qi Carbon Technology Co., Ltd. Under the same small data set training scale, the accuracy based on Tyr-Tyr cross-linking was higher than the traditional disulfide cross-linking form, and the analysis results are shown in Table 4.
  • Normalized noise The signal value of each read is standardized by the Median Absolute Deviation (MAD), the signal and noise are separated by median filtering, and the root mean square (RMS) value of the noise is calculated as an indicator of normalized noise.
  • MAD Median Absolute Deviation
  • RMS root mean square
  • Noise Perform median filtering on the signal value of each read to obtain the filtered noise signal. Calculate the root mean square (RMS) value of the noise signal as an indicator of noise, reflecting the relative intensity of noise. The smaller the noise value, the smaller the noise interference.
  • RMS root mean square
  • Signal-to-noise ratio The signal value of each read is normalized by the Median Absolute Deviation (MAD), and the signal and noise are separated by median filtering.
  • the signal-to-noise ratio is expressed as the ratio of signal power to noise power. The larger the signal-to-noise value, the better the signal quality and the smaller the noise interference.
  • val loss It is the error between the predicted result of the validation data set through the neural network model and the actual result.
  • ACC It is the accuracy of the prediction results of the validation data set through the neural network model compared with the actual results.
  • the main strand S1, antisense strand S2 and connecting strand YB were synthesized respectively, and their specific sequences are shown below:
  • Antisense strand S2 5’-Ru-CGACTTCTACCGTTTGACTCCGC-3’
  • Connection chain YB 5’-GTCAGCAGGACCGAATGA/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2O MeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeC//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’
  • the main strand S1 sequentially comprises the leading sequence iSpC3 (i.e., a nucleotide lacking sugar and base, indicated as 3), which is connected to the 5' end of SEQ ID NO: 1, and the 3' end of SEQ ID NO: 1 is sequentially connected to the blocking strand iSpC18 (indicated as 8888) and the 5' end of SEQ ID NO: 2.
  • iSpC3 and iSpC18 were purchased from Integrated DNA Technologies.
  • the sequence of the antisense strand S2 is shown in SEQ ID NO:3.
  • the connecting chain YB comprises the sequence shown in SEQ ID NO:4, and its 3’ end is connected to multiple methoxy modifications, including i2OMeA, i2OMeC, i2OMeG, and i2OMeU.
  • the main strand S1, the antisense strand S2 and the connecting strand YB are mixed and annealed at a ratio of 1:1.1:1.1 to form an adapter.
  • the annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.
  • Example 2.2 Preparation of nucleic acid-protein complex based on Lys-Glu linkage
  • the main strand S1, antisense strand S2 and connecting strand YB were synthesized respectively, and their specific sequences are shown below:
  • Antisense strand S2 5’-CGACTTCTACCGTTTGACTCCGC-3’
  • Connection chain YB 5’-GTCAGCAGGACCGAATGA/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2O MeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeC//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’
  • the main strand S1 sequentially comprises the leading sequence iSpC3 (i.e., a nucleotide lacking sugar and base, indicated as 3), which is connected to the 5' end of SEQ ID NO: 1, and the 3' end of SEQ ID NO: 1 is sequentially connected to the blocking strand iSpC18 (indicated as 8888) and the 5' end of SEQ ID NO: 2.
  • iSpC3 and iSpC18 were purchased from Integrated DNA Technologies.
  • the sequence of the antisense strand S2 is shown in SEQ ID NO:3.
  • the connecting chain YB comprises the sequence shown in SEQ ID NO:4, and its 3’ end is connected to multiple methoxy modifications, including i2OMeA, i2OMeC, i2OMeG, and i2OMeU.
  • the main strand S1, the antisense strand S2 and the connecting strand YB are mixed and annealed at a ratio of 1:1.1:1.1 to form an adapter.
  • the annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.
  • Example 2.2 Add 5 times the amount of substance added in Example 2.2 to the DNAPac PA200 column and purify it with buffer A and buffer B to remove impurities such as enzymes that are not bound to the linker and incompletely reacted EDC from the column.
  • the nucleic acid-protein complex was then eluted with a mixture of 10 column volumes of buffer A and buffer B.
  • the main elution peaks were then pooled, their concentrations measured, and run on a TBE PAGE gel at 160V for 40 min.
  • the composition of buffer A is: 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6;
  • the composition of buffer B is: 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.
  • the eluted main peak was quantified by Qubit, and 10 ng was taken for quality inspection.
  • Example 2.4 ATP consumption detection of nucleic acid-protein complexes
  • Example 2.3 About 1 ng of the purified nucleic acid-protein complex in Example 2.3 was taken and added to 200 ⁇ L 1*Seq Buffer (10 mM HEPES7.0, 600 mM KCl, 25 mM ATP, 25 mM MgCl 2 ), and enzyme-free water was added without adding nucleic acid-protein complex as a control. After that, 0 h, 4 h, and 8 h were taken and the ADP content was detected by HPLC according to the peak time of elution on the C18 column, and the results are shown in Figure 14. The results show that under this enzyme locking mode, the nucleic acid-protein complex still has ATPase activity and the protein can still work normally.
  • 1*Seq Buffer 10 mM HEPES7.0, 600 mM KCl, 25 mM ATP, 25 mM MgCl 2
  • enzyme-free water was added without adding nucleic acid-protein complex as a control.
  • a 10 kb library was prepared by end repair, and the nucleic acid protein complex prepared in Example 2.2 was connected to the library for library construction according to the library construction kit QLK-V1.1.1 of Qi Carbon Technology. Finally, sequencing was performed on the QNome-3841 sequencer of Qi Carbon Technology Co., Ltd., and the actual sequencing signal is shown in Figure 15. The results show that the nucleic acid protein complex can generate sequencing signals normally, and the signal steps are clear for subsequent base inference.
  • the method for preparing the nucleic acid-protein complex without forming cross-links is similar to that in Example 2.2, except that no EDC is added during the preparation process, that is, the enzyme does not form cross-links.
  • Example 5 The library construction and sequencing were performed in the same manner as in Example 2.5, and the results are shown in Table 5. The results showed that compared with Example 2.5, the enzyme in the complex of the non-cross-linked enzyme was easily detached during sequencing, and no effective sequencing signal could be obtained, let alone a complete sequencing signal of 10 kb.
  • a valid signal is a signal that can be subsequently identified by basecalling and converted into a sequence.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Plant Pathology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

Provided is a helicase, in which an ectopic bond is formed, wherein the helicase retains the ability to control the movement of a target analyte, and the ectopic bond does not include an ectopic bond where the formation thereof involves cysteine. Further provided are a polypeptide, a construct, a polynucleotide, a vector and a host cell; a method for preparing the helicase, polypeptide or construct; a method for controlling the movement of a target analyte; a method for characterizing a target analyte; a method for forming a sensor for characterizing a target analyte; a sensor for characterizing a target analyte; the use of the helicase or construct in controlling the movement of a target analyte through a pore; a kit or device for characterizing a target analyte; and a set of two or more helicases linking to a target analyte.

Description

解旋酶及其应用Helicase and its application 技术领域Technical Field

本发明属于测序的技术领域,特别涉及一种经修饰的解旋酶及其表征目标分析物的应用。The invention belongs to the technical field of sequencing, and particularly relates to a modified helicase and an application thereof in characterizing a target analyte.

背景技术Background Art

纳米孔(Nanopore)测序技术,又被称为第四代测序技术,利用一个能够提供离子电流通道的纳米孔,使得分析物如单链核酸分子在电泳驱动下通过该纳米孔,当分析物如核酸通过纳米孔时,会减少纳米孔的电流,对产生的不同信号实时读取序列信息的基因测序技术。Nanopore sequencing technology, also known as the fourth-generation sequencing technology, uses a nanopore that can provide an ion current channel to allow analytes such as single-stranded nucleic acid molecules to pass through the nanopore under the drive of electrophoresis. When analytes such as nucleic acids pass through the nanopore, the current of the nanopore will be reduced, and the gene sequencing technology can read the sequence information of the different signals generated in real time.

解旋酶是一种在纳米孔测序过程中控制核酸分子移动的有用工具,但目前的解旋酶存在一些问题需要解决。例如,测序过程中解旋酶可能会从核酸分子上脱落,导致核酸分子以不受控的速度和方式被迅速拉动穿过所述孔,以至于分析物如单个核苷酸的电流信号过短而无法分辨。Helicases are a useful tool for controlling the movement of nucleic acid molecules during nanopore sequencing, but current helicases have some problems that need to be solved. For example, the helicase may fall off the nucleic acid molecule during sequencing, causing the nucleic acid molecule to be pulled through the pore at an uncontrolled speed and in an uncontrolled manner, so that the current signal of the analyte, such as a single nucleotide, is too short to be distinguished.

针对这一问题,现有技术CN105899678A中公开了一种经修饰的解旋酶,具体而言,在解旋酶的塔/销/1A结构域中引入至少一个半胱氨酸/非天然氨基酸,从而提高解旋酶在纳米孔测序中的持续控速能力。To address this problem, the prior art CN105899678A discloses a modified helicase. Specifically, at least one cysteine/non-natural amino acid is introduced into the tower/pin/1A domain of the helicase, thereby improving the continuous rate control ability of the helicase in nanopore sequencing.

发明内容Summary of the invention

本申请提供了一种新的解旋酶修饰方法,也可以提高解旋酶在纳米孔测序中的持续控速能力,且在测序稳定性和准确率等方面的效果更好。The present application provides a new helicase modification method, which can also improve the continuous rate control ability of the helicase in nanopore sequencing, and has better effects in terms of sequencing stability and accuracy.

第一方面,本申请提供了一种解旋酶,其In a first aspect, the present application provides a helicase,

(a)包含一个多核苷酸结合域和两个RecA样结构域,其中位于围绕所述多核苷酸结合域的所述解旋酶表面上的两个或多个氨基酸残基之间通过异位键连接,使得所述多核苷酸结合域中结合的多核苷酸被所述连接后形成的结构包围;或(a) comprising a polynucleotide binding domain and two RecA-like domains, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain are connected by an ectopic bond, so that the polynucleotide bound to the polynucleotide binding domain is surrounded by the structure formed after the connection; or

(b)包含一个多核苷酸结合域和两个RecA样结构域,其中位于围绕所述多核苷酸结合域的所述解旋酶表面上的两个或多个氨基酸残基之间能够通过异位键连接,使得所述多核苷酸结合域中结合的多核苷酸被所述连接后的结构包围,其中所述两个或多个氨基酸残基中的至少一个为突变氨基酸残基;或 (b) comprising a polynucleotide binding domain and two RecA-like domains, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain can be connected by an ectopic bond, so that the polynucleotide bound in the polynucleotide binding domain is surrounded by the connected structure, and at least one of the two or more amino acid residues is a mutant amino acid residue; or

(c)包含多核苷酸结合结构域,所述多核苷酸结合结构域在至少一个构象状态下包含开口,多核苷酸可通过所述开口与所述解旋酶解结合,其中所述解旋酶在所述开口处的两个或多个氨基酸之间形成异位键(形成所述异位键可以阻止多核苷酸通过所述开口与所述解旋酶解结合),并且其中所述解旋酶保留其控制目标多核苷酸移动的能力;或(c) comprising a polynucleotide binding domain, wherein the polynucleotide binding domain comprises an opening in at least one conformational state, through which a polynucleotide can be debound by the helicase, wherein the helicase forms an ectopic bond between two or more amino acids at the opening (forming the ectopic bond can prevent the polynucleotide from being debound by the helicase through the opening), and wherein the helicase retains its ability to control the movement of the target polynucleotide; or

(d)包含多核苷酸结合结构域,所述多核苷酸结合结构域在至少一个构象状态下包含开口,多核苷酸可通过所述开口与所述解旋酶解结合,其中所述解旋酶被修饰,使得其在所述开口处的两个或多个氨基酸之间能够形成异位键(形成所述异位键可以阻止多核苷酸通过所述开口与所述解旋酶解结合),并且其中所述解旋酶保留其控制目标多核苷酸移动的能力。(d) comprising a polynucleotide binding domain, wherein the polynucleotide binding domain comprises an opening in at least one conformational state, through which a polynucleotide can be debound from the helicase, wherein the helicase is modified so that it can form an ectopic bond between two or more amino acids at the opening (the formation of the ectopic bond can prevent the polynucleotide from being debound from the helicase through the opening), and wherein the helicase retains its ability to control the movement of the target polynucleotide.

在一个实施方式中,所述异位键不包括半胱氨酸参与形成的异位键,所述异位键包括除半胱氨酸外的任意天然氨基酸参与形成的异位键,In one embodiment, the ectopic bond does not include an ectopic bond formed with the participation of cysteine, and the ectopic bond includes an ectopic bond formed with the participation of any natural amino acid except cysteine.

所述除半胱氨酸外的任意天然氨基酸选自包括如下氨基酸的组:Ile、Val、Leu、Phe、Met、Ala、Gly、Thr、Ser、Trp、Tyr、Pro、His、Glu、Gln、Asp、Asn、Lys、Arg,The arbitrary natural amino acid except cysteine is selected from the group consisting of the following amino acids: Ile, Val, Leu, Phe, Met, Ala, Gly, Thr, Ser, Trp, Tyr, Pro, His, Glu, Gln, Asp, Asn, Lys, Arg,

优选的,其中所述异位键包括Lys-His、Lys-Ser、Lys-Thr、Lys-Tyr、Lys-Lys、Lys-Glu、Lys-Asp、Lys-Gln、Lys-Arg、Arg-Glu、Lys-Met、Arg-Asp、Arg-Arg、Tyr-Tyr、Tyr-Trp、Met-Met异位键或其组合;Preferably, the ectopic bond comprises Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met ectopic bond or a combination thereof;

和/或,所述解旋酶由一个或多个单体组成,and/or, the helicase consists of one or more monomers,

和/或,形成所述异肽键的两个氨基酸位于同一结构域或不同结构域。And/or, the two amino acids forming the isopeptide bond are located in the same domain or in different domains.

在一个实施方式中,所述解旋酶源自天然或经修饰的下组的解旋酶家族的成员:Dda解旋酶、Pifl样解旋酶、Upfl样解旋酶、UvrD/Rep解旋酶、Ski样解旋酶、Rad3/XPD解旋酶、NS3/NPH-II解旋酶、DEAD解旋酶、DEAHi RHA解旋酶、RecG样解旋酶、REcQ样解旋酶、T1R样解旋酶、Swi/Snf样解旋酶和Rig-I样解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶;In one embodiment, the helicase is derived from a member of the natural or modified helicase family of the following group: Dda helicase, Pifl-like helicase, Upfl-like helicase, UvrD/Rep helicase, Ski-like helicase, Rad3/XPD helicase, NS3/NPH-II helicase, DEAD helicase, DEAHi RHA helicase, RecG-like helicase, REcQ-like helicase, T1R-like helicase, Swi/Snf-like helicase and Rig-I-like helicase, preferably Dda helicase, more preferably T4-Dda helicase;

和/或,所述解旋酶源自天然或经修饰的下组的解旋酶家族的成员:RecD解旋酶、Upfl解旋酶、PcrA解旋酶、Rep解旋酶、UvrD解旋酶、Hel308解旋酶、Mtr4解旋酶、XPD解旋酶、NS3解旋酶、Mssl 16 解旋酶、Prp43解旋酶,RecG解旋酶、RecQ解旋酶、T1R解旋酶、RapA解旋酶和Hef解旋酶。and/or, the helicase is derived from a member of the natural or modified helicase family of the following groups: RecD helicase, Upfl helicase, PcrA helicase, Rep helicase, UvrD helicase, Hel308 helicase, Mtr4 helicase, XPD helicase, NS3 helicase, Mssl 16 Helicase, Prp43 helicase, RecG helicase, RecQ helicase, T1R helicase, RapA helicase and Hef helicase.

在一个实施方式中,使用经修饰的解旋酶如引入氨基酸突变的解旋酶,其生产工艺较为复杂、且成本较高,使用未经修饰的天然的解旋酶,进行共价连接如异位键连接时,对本领域人员更为简单易行,也可以改善其控制核酸分子移动通过纳米孔的稳定性。在一个实施方式中,所述未经修饰的天然的解旋酶包括多核苷酸结合域和两个RecA样结构域,位于不同的RecA样结构域的天然氨基酸残基之间存在连接,其具有控制多核苷酸移动的能力;且与不存在所述连接相比,存在所述连接使所述经修饰的解旋酶控制多核苷酸移动的能力增强。在一个实施方式中,所述存在连接的天然氨基酸残基之间具有相互作用和/或空间距离小于50埃。所述相互作用包括静电相互作用,优选地,所述相互作用为所述解旋酶与靶多核苷酸结合后产生。在一个实施方式中,所述存在连接的天然氨基酸残基在所述解旋酶中是天然存在的,优选地,所述存在连接的天然氨基酸残基包括谷氨酸、赖氨酸、酪氨酸、精氨酸、天冬氨酸、半胱氨酸、蛋氨酸、色氨酸。在一个实施方式中,所述存在连接的天然氨基酸残基位于所述多核苷酸结合域周围。在一个实施方式中,所述连接为共价连接,优选地,所述连接为异位键连接。在一个实施方式中,所述共价连接包括通过化学交联剂形成的连接、或通过酶形成的连接。在一个实施方式中,所述化学交联剂包括EDC(1-乙基-3-(3-二甲基氨基丙基)碳二亚胺盐酸盐)、EDC/NHS(N-羟基丁二酰亚胺)、EDC/s-NHS(N-羟基琥珀酰亚胺磺酸钠盐)、氯吡硫磷一氧(Chlorpyrifos oxon)、HRP(辣根过氧化物酶)、漆酶(Laccase)、或酪氨酸酶(Tyrosinase)。在一个实施方式中,所述解旋酶属于SF1B(Superfamily 1B)亚家族解旋酶(The T4 phage SF1Bhelicase Dda is structurally optimized to perform DNA strand separation,Structure.2012Jul 3;20(7):1189-200)。在一个实施方式中,所述解旋酶包括Dda解旋酶,RecD2解旋酶,Pif1解旋酶,或DNA解旋酶B。在一个实施方式中,所述解旋酶为来自T4噬菌体的Dda解旋酶。在一个实施方式中,所述存在连接的天然氨基酸残基包括E93、E94、K364、Y92、或Y363。在一个实施方式中,所述存在连接的天然氨基酸残基选自下述组中的至少一种:E93和K364;E94和K364;或Y92和Y363。 In one embodiment, the use of modified helicases such as helicases with amino acid mutations has a relatively complex production process and high cost. The use of unmodified natural helicases for covalent bonding such as ectopic bond bonding is simpler and easier for those skilled in the art, and can also improve the stability of controlling the movement of nucleic acid molecules through nanopores. In one embodiment, the unmodified natural helicase includes a polynucleotide binding domain and two RecA-like domains, and there is a connection between the natural amino acid residues located in different RecA-like domains, which has the ability to control the movement of polynucleotides; and compared with the absence of the connection, the presence of the connection enhances the ability of the modified helicase to control the movement of polynucleotides. In one embodiment, the natural amino acid residues that are connected have interactions and/or a spatial distance of less than 50 angstroms. The interaction includes electrostatic interaction, preferably, the interaction is generated after the helicase binds to the target polynucleotide. In one embodiment, the natural amino acid residues that are connected are naturally present in the helicase, preferably, the natural amino acid residues that are connected include glutamic acid, lysine, tyrosine, arginine, aspartic acid, cysteine, methionine, and tryptophan. In one embodiment, the natural amino acid residues that are connected are located around the polynucleotide binding domain. In one embodiment, the connection is a covalent connection, and preferably, the connection is an ectopic bond connection. In one embodiment, the covalent connection includes a connection formed by a chemical crosslinker, or a connection formed by an enzyme. In one embodiment, the chemical crosslinker includes EDC (1-ethyl-3-(3-dimethylaminopropyl) carbodiimide hydrochloride), EDC/NHS (N-hydroxysuccinimide), EDC/s-NHS (N-hydroxysuccinimide sulfonic acid sodium salt), chlorpyrifos oxon, HRP (horseradish peroxidase), laccase, or tyrosinase. In one embodiment, the helicase belongs to the SF1B (Superfamily 1B) subfamily helicase (The T4 phage SF1B helicase Dda is structurally optimized to perform DNA strand separation, Structure. 2012 Jul 3; 20 (7): 1189-200). In one embodiment, the helicase includes Dda helicase, RecD2 helicase, Pif1 helicase, or DNA helicase B. In one embodiment, the helicase is Dda helicase from T4 phage. In one embodiment, the natural amino acid residues present in the connection include E93, E94, K364, Y92, or Y363. In one embodiment, the natural amino acid residues present in the connection are selected from at least one of the following groups: E93 and K364; E94 and K364; or Y92 and Y363.

在一个实施方式中,天然或经修饰的Dda解旋酶来自于CN105899678A中公开的表1和表2中的Dda解旋酶。In one embodiment, the natural or modified Dda helicase is from the Dda helicases in Table 1 and Table 2 disclosed in CN105899678A.

在一个实施方式中,天然或经修饰的Pifl样解旋酶来自于CN113930406A中公开的表1和表2中的Pifl样解旋酶。In one embodiment, the natural or modified Pif1-like helicase is from the Pif1-like helicases in Table 1 and Table 2 disclosed in CN113930406A.

在一个实施例中,在所述解旋酶的1A结构域和2A结构域的天然氨基酸残基间引入异位键。在一个实施例中,异位键包括Lys-His、Lys-Ser、Lys-Thr、Lys-Tyr、Lys-Lys、Lys-Glu、Lys-Asp、Lys-Gln、Lys-Arg、Arg-Glu、Lys-Met、Arg-Asp、Arg-Arg、Tyr-Tyr、Tyr-Trp、Met-Met异位键或其组合。在一个实施例中,氨基酸侧链直接连接形成共价键或通过连接分子进行连接形成异位键。在一个实施例中,所述解旋酶源自Dda解旋酶,优选T4-Dda解旋酶。In one embodiment, an ectopic bond is introduced between the natural amino acid residues in the 1A domain and the 2A domain of the helicase. In one embodiment, the ectopic bond comprises Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met ectopic bond or a combination thereof. In one embodiment, the amino acid side chains are directly connected to form a covalent bond or are connected to form an ectopic bond by a connecting molecule. In one embodiment, the helicase is derived from a Dda helicase, preferably a T4-Dda helicase.

在一个实施例中,所述解旋酶源自T4-Dda解旋酶,并且所述解旋酶包括:In one embodiment, the helicase is derived from T4-Dda helicase, and the helicase comprises:

(1)x1与x2形成的Tyr-Tyr(Y-Y)异位键;(1) Tyr-Tyr (Y-Y) heterotopic bond formed by x1 and x2;

(2)x3与x4形成的Lys-Gln(K-Q)异位键;(2) Lys-Gln (K-Q) heterotopic bond formed by x3 and x4;

(3)x1与x3形成的Tyr-Lys(Y-K)异位键;(3) Tyr-Lys (Y-K) heterotopic bond formed by x1 and x3;

(4)x1与x4形成的Tyr-Gln(Y-Q)异位键;(4) Tyr-Gln (Y-Q) heterotopic bond formed by x1 and x4;

(5)x2与x3形成的Tyr-Lys(Y-K)异位键;(5) Tyr-Lys (Y-K) heterotopic bond formed by x2 and x3;

(6)x2与x4形成的Tyr-Gln(Y-Q)异位键;(6) Tyr-Gln (Y-Q) heterotopic bond formed by x2 and x4;

(7)x3与x5形成的Lys-Glu(K-E)异位键;或(7) Lys-Glu (K-E) heterotopic bond formed by x3 and x5; or

(8)以上任何组合。(8)Any combination of the above.

在一个实施例中,所述解旋酶包括1A结构域的任意突变体和/或2A结构域的任意突变体,只要所述解旋酶保留其控制目标分析物优选目标多核苷酸移动的能力。在一个实施例中,所述目标分析物用一个或多个蛋白质或一个或多个标记物、标签或间隔区通过甲基化作用、氧化、损伤进行修饰。在一个实施例中,目标分析物是目标多核苷酸。In one embodiment, the helicase comprises any mutant of the 1A domain and/or any mutant of the 2A domain, as long as the helicase retains its ability to control the movement of the target analyte, preferably the target polynucleotide. In one embodiment, the target analyte is modified by methylation, oxidation, damage with one or more proteins or one or more markers, tags or spacers. In one embodiment, the target analyte is a target polynucleotide.

第二方面,本申请提供了一种多肽,包括来自前述解旋酶的形成异位键的1A结构域和2A结构域,和多核苷酸结合域,并且不包括所述解旋酶的其他结构域。In a second aspect, the present application provides a polypeptide comprising the 1A domain and the 2A domain that form an ectopic bond from the aforementioned helicase, and a polynucleotide binding domain, and excluding other domains of the helicase.

第三方面,本申请提供了一种解旋酶,包括所述多肽,其中所述解旋酶具有控制目标分析物优选目标多核苷酸移动的能力。In a third aspect, the present application provides a helicase, comprising the polypeptide, wherein the helicase has the ability to control the movement of a target analyte, preferably a target polynucleotide.

第四方面,本申请提供了一种构建体,包括解旋酶或多肽,和与 其多核苷酸结合域结合的目标多核苷酸,其中位于围绕多核苷酸结合域的解旋酶表面上两个或多个氨基酸残基之间通过异肽键连接,其中结合的目标多核苷酸被所述异肽键连接后的结构包围。In a fourth aspect, the present application provides a construct comprising a helicase or a polypeptide, and The target polynucleotide bound by its polynucleotide binding domain, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain are connected by an isopeptide bond, wherein the bound target polynucleotide is surrounded by the structure connected by the isopeptide bond.

在一个实施例中,所述构建体包括衔接体,所述目标多核苷酸与衔接体连接,所述构建体具有控制所述目标多核苷酸移动的能力。In one embodiment, the construct comprises an adapter, the target polynucleotide is connected to the adapter, and the construct has the ability to control the movement of the target polynucleotide.

在一个实施例中,所述构建体包括两个以上所述的解旋酶。在一个实施例中,所述解旋酶可以是上述任何一种解旋酶,或其组合。In one embodiment, the construct comprises two or more helicases. In one embodiment, the helicase may be any of the helicases described above, or a combination thereof.

第五方面,本申请提供了一种多核苷酸,包括编码所述的解旋酶、所述的多肽、所述的构建体的序列,或由其序列组成。In a fifth aspect, the present application provides a polynucleotide comprising a sequence encoding the helicase, the polypeptide, the construct, or consisting of a sequence thereof.

第六方面,本申请提供了一种载体,包括可操作性连接到启动子的所述的多核苷酸。In a sixth aspect, the present application provides a vector comprising the polynucleotide operably linked to a promoter.

第七方面,本申请提供了一种宿主细胞,包括所述的载体。In a seventh aspect, the present application provides a host cell, comprising the vector.

第八方面,本申请提供了一种制备所述的解旋酶、所述的多肽、或所述的构建体的方法,包括表达所述的多核苷酸、用所述的载体转染细胞,或培养所述的宿主细胞。In an eighth aspect, the present application provides a method for preparing the helicase, the polypeptide, or the construct, comprising expressing the polynucleotide, transfecting cells with the vector, or culturing the host cells.

第九方面,本申请提供了一种控制目标分析物优选目标多核苷酸移动的方法,包括将所述目标多核苷酸与所述的解旋酶或多肽、或所述的构建体接触,并由此控制所述目标分析物优选目标多核苷酸的移动。In the ninth aspect, the present application provides a method for controlling the movement of a target analyte, preferably a target polynucleotide, comprising contacting the target polynucleotide with the helicase or polypeptide, or the construct, and thereby controlling the movement of the target analyte, preferably the target polynucleotide.

在一个实施例中,所述方法用于控制目标分析物优选目标多核苷酸穿过孔的移动。In one embodiment, the method is used to control the movement of a target analyte, preferably a target polynucleotide, through a pore.

第十方面,本申请提供了一种表征目标分析物优选目标多核苷酸的方法,包括:In a tenth aspect, the present application provides a method for characterizing a target analyte, preferably a target polynucleotide, comprising:

(a)将目标分析物优选目标多核苷酸与孔和所述的解旋酶或多肽、或所述的构建体接触,使得所述解旋酶控制所述目标分析物优选目标多核苷酸穿过所述孔移动;以及(a) contacting a target analyte, preferably a target polynucleotide, with a pore and said helicase or polypeptide, or said construct, such that said helicase controls movement of said target analyte, preferably a target polynucleotide, through said pore; and

(b)随着所述目标分析物优选目标多核苷酸相对于所述孔移动,获取一个或多个测量值,其中所述测量值代表所述目标多核苷酸的一个或多个特征并由此表征所述目标分析物优选目标多核苷酸。(b) obtaining one or more measurements as the target analyte, preferably target polynucleotide, moves relative to the pore, wherein the measurements represent one or more characteristics of the target polynucleotide and thereby characterize the target analyte, preferably target polynucleotide.

在一个实施例中,所述目标分析物选自多核苷酸、多肽、多糖和脂质中的一种或多种,优选为多核苷酸或与多核苷酸连接的多肽、多糖和脂质,更优选为单链多核苷酸、双链多核苷酸或部分双链多核苷 酸;或者In one embodiment, the target analyte is selected from one or more of polynucleotides, polypeptides, polysaccharides and lipids, preferably polynucleotides or polypeptides, polysaccharides and lipids linked to polynucleotides, more preferably single-stranded polynucleotides, double-stranded polynucleotides or partially double-stranded polynucleotides. Acid; or

所述目标分析物是目标多核苷酸,并且所述一个或多个特征选自(i)所述目标多核苷酸的长度,(ii)所述目标多核苷酸的同一性,(iii)所述目标多核苷酸的序列,(iv)所述目标多核苷酸的二级结构;以及(v)所述目标多核苷酸是否是经修饰的。The target analyte is a target polynucleotide, and the one or more characteristics are selected from (i) the length of the target polynucleotide, (ii) the identity of the target polynucleotide, (iii) the sequence of the target polynucleotide, (iv) the secondary structure of the target polynucleotide; and (v) whether the target polynucleotide is modified.

在一个实施例中,所述目标多核苷酸用一个或多个蛋白质或一个或多个标记物、标签或间隔区通过甲基化作用、氧化、损伤进行修饰。在一个实施例中,所述目标分析物优选目标多核苷酸的一个或多个特征通过电测量和/或光测量进行测量。在一个实施例中,所述电测量为电流测量、电压测量、电阻测量、电容测量、电感测量、阻抗测量、隧道测量或场效应晶体管测量。In one embodiment, the target polynucleotide is modified by methylation, oxidation, damage with one or more proteins or one or more markers, tags or spacers. In one embodiment, one or more characteristics of the target analyte, preferably the target polynucleotide, are measured by electrical measurement and/or optical measurement. In one embodiment, the electrical measurement is current measurement, voltage measurement, resistance measurement, capacitance measurement, inductance measurement, impedance measurement, tunnel measurement or field effect transistor measurement.

在一个实施例中,所述方法包括:In one embodiment, the method comprises:

(a)将所述目标分析物优选目标多核苷酸与孔和所述的解旋酶或所述的构建体接触,使得所述解旋酶或构建体控制所述目标分析物优选目标多核苷酸穿过所述孔移动;以及(a) contacting the target analyte, preferably a target polynucleotide, with a pore and the helicase or the construct, such that the helicase or the construct controls the movement of the target analyte, preferably a target polynucleotide, through the pore; and

(b)随着所述目标分析物优选目标多核苷酸相对于所述孔移动,获取穿过所述孔的电流,其中所述电流代表所述目标分析物优选目标多核苷酸的一个或多特特征,并由此表征所述目标分析物优选目标多核苷酸。(b) acquiring a current across the pore as the target analyte, preferably target polynucleotide, moves relative to the pore, wherein the current represents one or more characteristics of the target analyte, preferably target polynucleotide, and thereby characterizes the target analyte, preferably target polynucleotide.

在一个实施例中,所述方法进一步包括跨所述孔施加电压以在所述孔和所述解旋酶或构建体之间形成复合体的步骤。在一个实施例中,所述目标分析物是目标多核苷酸,并且至少部分的所述目标多核苷酸是双链的。在一个实施例中,所述孔为纳米孔或跨膜孔,或所述孔选自生物孔、固态孔或生物与固态杂交的孔。在一个实施例中,所述生物孔衍生自溶血素、杀白细胞素、CsGG、耻垢分枝杆菌孔蛋白A(MspA)、孔蛋白B、孔蛋白C、孔蛋白D、外膜孔蛋白F、外膜孔蛋白G、外膜磷脂酶A、奈瑟氏菌属自转运脂蛋白、和WZA;所述固态孔衍生自石墨烯纳米孔、MoS2纳米孔、BN纳米孔或PA63纳米孔。In one embodiment, the method further comprises the step of applying a voltage across the hole to form a complex between the hole and the helicase or construct. In one embodiment, the target analyte is a target polynucleotide, and at least part of the target polynucleotide is double-stranded. In one embodiment, the hole is a nanopore or a transmembrane hole, or the hole is selected from a biological hole, a solid-state hole, or a hole hybridized with a biological solid state. In one embodiment, the biological hole is derived from hemolysin, leukocidin, CsGG, Mycobacterium smegmatis porin A (MspA), porin B, porin C, porin D, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, Neisseria autotransporter, and WZA; the solid-state hole is derived from a graphene nanopore, a MoS2 nanopore, a BN nanopore, or a PA63 nanopore.

第十一方面,本申请提供了一种形成用于表征目标分析物优选目标多核苷酸的传感器的方法,包括在(a)孔和(b)所述的解旋酶或多肽、或所述的构建体之间形成复合体并由此形成用于表征所述目标分析物优选目标多核苷酸的传感器。 In the eleventh aspect, the present application provides a method for forming a sensor for characterizing a target analyte, preferably a target polynucleotide, comprising forming a complex between (a) a hole and (b) the helicase or polypeptide, or the construct, and thereby forming a sensor for characterizing the target analyte, preferably a target polynucleotide.

在一个实施例中,所述复合体通过(a)使所述孔和所述解旋酶或构建体在所述目标分析物优选目标多核苷酸的存在下接触以及(b)跨所述孔施加电势而形成。在一个实施例中,所述电势是电压电势或化学电势。在一个实施例中,所述复合体通过将所述孔共价连接到所述解旋酶或构建体而形成。In one embodiment, the complex is formed by (a) contacting the pore and the helicase or construct in the presence of the target analyte, preferably the target polynucleotide, and (b) applying an electric potential across the pore. In one embodiment, the electric potential is a voltage potential or a chemical potential. In one embodiment, the complex is formed by covalently linking the pore to the helicase or construct.

第十二方面,本申请提供了一种用于表征目标分析物优选目标多核苷酸的传感器,包括(a)孔和(b)所述的解旋酶或多肽、或所述的构建体之间的复合体。In a twelfth aspect, the present application provides a sensor for characterizing a target analyte, preferably a target polynucleotide, comprising (a) a pore and (b) the helicase or polypeptide, or a complex between the constructs.

第十三方面,本申请提供了所述的解旋酶或多肽、或所述的构建体在控制目标分析物优选目标多核苷酸穿过孔的移动中的应用。In a thirteenth aspect, the present application provides the use of the helicase or polypeptide, or the construct in controlling the movement of a target analyte, preferably a target polynucleotide, through a pore.

第十四方面,本申请提供了一种用于表征目标分析物优选目标多核苷酸的试剂盒,包括(a)孔和(b)所述的解旋酶或多肽、或所述的构建体。In a fourteenth aspect, the present application provides a kit for characterizing a target analyte, preferably a target polynucleotide, comprising (a) a pore and (b) the helicase or polypeptide, or the construct.

在一个实施例中,所述试剂盒进一步包括含有两亲膜的芯片。In one embodiment, the kit further comprises a chip comprising an amphiphilic membrane.

第十五方面,本申请提供了一种用于表征目标分析物优选目标多核苷酸的装置,包括(a)多个孔和(b)多个所述的解旋酶或多肽、或所述的构建体。In a fifteenth aspect, the present application provides a device for characterizing a target analyte, preferably a target polynucleotide, comprising (a) a plurality of holes and (b) a plurality of the helicases or polypeptides, or the constructs.

在一个实施例中,所述装置包括:传感器设备,能支撑多个孔,并能可操作的使用所述孔和解旋酶或构建体进行目标分析物优选目标多核苷酸的表征;和至少一个口,用于传递进行表征的材料。在一个实施例中,所述装置包括:传感器设备,能支撑多个孔,并能可操作的使用所述孔和解旋酶或构建体进行目标分析物优选目标多核苷酸的表征;以及至少一个存储器,存储用于进行表征的材料;流控系统,配置为从所述至少一个存储器可控地向所述传感器设备提供材料;以及一个或多个容器,用于接收相应的样本,所述流控系统被配置为选择性地从一个或多个容器向所述传感器设备提供样本。In one embodiment, the device comprises: a sensor device capable of supporting a plurality of pores and operable to use the pores and helicase or construct to characterize a target analyte, preferably a target polynucleotide; and at least one port for delivering a material for characterization. In one embodiment, the device comprises: a sensor device capable of supporting a plurality of pores and operable to use the pores and helicase or construct to characterize a target analyte, preferably a target polynucleotide; and at least one memory for storing materials for characterization; a fluidics system configured to controllably provide materials to the sensor device from the at least one memory; and one or more containers for receiving corresponding samples, the fluidics system being configured to selectively provide samples to the sensor device from one or more containers.

第十六方面,本申请提供了一种制备所述的解旋酶的方法,包括:In a sixteenth aspect, the present application provides a method for preparing the helicase, comprising:

(a)提供解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶;和(a) providing a helicase, preferably a Dda helicase, more preferably a T4-Dda helicase; and

(b)在所述解旋酶的蛋白内形成异位键以制备所述的解旋酶;(b) forming an ectopic bond within the helicase protein to prepare the helicase;

优选的,在步骤(b)中,还包括将所述解旋酶与多核苷酸接触的步骤,Preferably, in step (b), the step of contacting the helicase with the polynucleotide is also included.

优选的,所述异位键包括使用H2O2、Hemin和H2O2、HRP酶和 H2O2、转谷氨酰胺酶、EDC、或Ru催化形成,更优选的,HRP酶和H2O2、或EDC。Preferably, the ectopic bond comprises the use of H 2 O 2 , Hemin and H 2 O 2 , HRP enzyme and H 2 O 2 , transglutaminase, EDC, or Ru catalyzes the formation, more preferably, HRP enzyme and H 2 O 2 , or EDC.

在一个实施例中,所述方法进一步包括(c)确定所获得的解旋酶是否能够控制目标分析物优选目标多核苷酸的移动。In one embodiment, the method further comprises (c) determining whether the obtained helicase is capable of controlling the movement of a target analyte, preferably a target polynucleotide.

第十七方面,本申请提供了一种制备所述的构建体的方法,包括将所述的解旋酶或多肽与衔接体连接并由此制备所述的构建体。In the seventeenth aspect, the present application provides a method for preparing the construct, comprising connecting the helicase or polypeptide to a linker and thereby preparing the construct.

在一个实施例中,所述方法进一步包括,确定所获得构建体能否控制目标分析物优选目标多核苷酸的移动。In one embodiment, the method further comprises determining whether the obtained construct can control the movement of the target analyte, preferably the target polynucleotide.

第十七方面,本申请提供了连接到目标分析物优选目标多核苷酸的两个或更多个解旋酶的组,其中所述两个或更多个解旋酶中至少一个为前述的解旋酶。In a seventeenth aspect, the present application provides a group of two or more helicases attached to a target analyte, preferably a target polynucleotide, wherein at least one of the two or more helicases is the aforementioned helicase.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

所描述的附图仅是示意性的而非限制性的。The drawings described are only schematic and are non-limiting.

图1至图4示出了不同催化剂加入下的酶交联效果。Figures 1 to 4 show the enzyme cross-linking effect under the addition of different catalysts.

图5示出了根据一个实施例的不同酶在HRP/H2O2催化下的锁酶交联效果。FIG. 5 shows the lock-enzyme cross-linking effect of different enzymes under HRP/H 2 O 2 catalysis according to one embodiment.

图6示出了根据一个实施例的基于Tyr-Tyr交联的锁酶突变体的柱纯化洗脱过程。FIG. 6 shows the column purification elution process of a locked enzyme mutant based on Tyr-Tyr cross-linking according to one embodiment.

图7示出了根据一个实施例的基于Tyr-Tyr交联的核酸蛋白复合物的柱纯化洗脱PAGE胶图。FIG. 7 shows a PAGE gel image of column purification elution of a nucleic acid-protein complex based on Tyr-Tyr cross-linking according to one embodiment.

图8示出了根据一个实施例的基于Tyr-Tyr交联的核酸蛋白复合物的上机测试信号图。FIG. 8 shows an on-machine test signal diagram of a nucleic acid-protein complex based on Tyr-Tyr cross-linking according to one embodiment.

图9示出了实施例2.1中制备的基于Tyr-Tyr交联的核酸蛋白复合物的凝胶电泳图。FIG. 9 shows the gel electrophoresis of the nucleic acid-protein complex based on Tyr-Tyr cross-linking prepared in Example 2.1.

图10示出了实施例2.2中制备的基于Lys-Glu交联的核酸蛋白复合物的凝胶电泳图。FIG. 10 shows the gel electrophoresis of the Lys-Glu cross-linked nucleic acid-protein complex prepared in Example 2.2.

图11示出了实施例2.3中纯化得到的核酸蛋白复合物的柱纯化洗脱图。FIG. 11 shows the column purification elution profile of the nucleic acid-protein complex purified in Example 2.3.

图12示出了实施例2.3中经洗脱后的核酸蛋白复合物的凝胶电泳图,其中FT、E1、E2、E3代表图11中所示洗脱过程中的不同主峰所对应的核酸蛋白复合物。FIG. 12 shows the gel electrophoresis diagram of the nucleic acid-protein complex after elution in Example 2.3, wherein FT, E1, E2, and E3 represent nucleic acid-protein complexes corresponding to different main peaks in the elution process shown in FIG. 11 .

图13示出了实施例2.3中加入不同试剂处理后的核酸蛋白复合物 的凝胶电泳图。FIG. 13 shows the nucleic acid-protein complex after treatment with different reagents in Example 2.3 Gel electrophoresis diagram.

图14示出了实施例2.4中核酸蛋白复合物的ATP消耗检测结果图。FIG. 14 shows the results of ATP consumption detection of nucleic acid-protein complexes in Example 2.4.

图15示出了实施例2.5中核酸蛋白复合物的测序信号图,其中A图为10kb的完整测序信号图,B图为局部放大信号台阶图。FIG. 15 shows the sequencing signal diagram of the nucleic acid-protein complex in Example 2.5, wherein FIG. A is a complete sequencing signal diagram of 10 kb, and FIG. B is a local amplification signal step diagram.

具体实施方式DETAILED DESCRIPTION

应理解,所公开的产品和方法的不同应用可根据所属领域的特定需要来调适。还应理解,本文所用的术语仅出于描述本申请实施例的特定实施例的目的,并且不打算是限制性的。It should be understood that different applications of the disclosed products and methods can be adapted according to the specific needs of the field. It should also be understood that the terminology used herein is only for the purpose of describing specific embodiments of the present application and is not intended to be limiting.

另外除非上下文另外明确规定,否则如本说明书和权利要求书中所使用,单数形式“一”和“所述”包括多个。举例来说,提及“核苷酸”包括两个或更多个核苷酸,提及“一个解旋酶”包括两个或更多解旋酶。Additionally, as used in the specification and claims, the singular forms "a", "an" and "the" include pluralities unless the context clearly dictates otherwise. For example, reference to "a nucleotide" includes two or more nucleotides, and reference to "a helicase" includes two or more helicases.

如本文所使用的,术语“包括”是指必须包括任何所列举的要素,并且也可以任选地包括其他元素。“由...组成”是指不包括所有未列举的元素。由这些术语中的每一个定义的实施例在本申请实施例的范围内。As used herein, the term "comprising" means that any listed elements must be included, and other elements may also be optionally included. "Composed of" means that all unlisted elements are not included. Embodiments defined by each of these terms are within the scope of the present application embodiments.

如本文所用的“核苷酸序列”、“DNA序列”或“核酸分子”是指任何长度的核苷酸(核糖核苷酸或脱氧核糖核苷酸)的聚合形式。该术语仅指分子的一级结构。因此,该术语包括双链和单链DNA和RNA。As used herein, "nucleotide sequence", "DNA sequence" or "nucleic acid molecule" refers to a polymeric form of nucleotides (ribonucleotides or deoxyribonucleotides) of any length. The term refers only to the primary structure of the molecule. Thus, the term includes double-stranded and single-stranded DNA and RNA.

本文所用的术语“核酸”是指单链或双链共价连接的核苷酸序列,其中每个核苷酸上的3'和5'末端通过磷酸二酯键连接。核苷酸可以由脱氧核糖核苷酸碱基或核糖核苷酸碱基组成。核酸可以包括DNA和RNA,并可以在体外合成制备或从自然资源中分离。核酸可以进一步包括修饰的DNA或RNA,例如甲基化的DNA或RNA,或经过翻译后修饰的RNA,例如用7-甲基鸟苷进行的5'-盖帽,3'-端加工,例如裂解和多腺苷化,以及拼接。核酸还可以包括合成核酸(XNA),例如己糖醇核酸(HNA),环己烯核酸(CeNA),苏糖核酸(TNA),甘油核酸(GNA),锁核酸(LNA)和肽核酸(PNA)。核酸(或多核苷酸)的大小通常用双链多核苷酸的碱基对(bp)数目表示,或在单链多核苷酸的情况下用核苷酸的数目(nt)表示。1千个bp或nt等于一个千碱基对(kb)。长度小于约40个核苷酸的多核苷酸通常称为“寡核苷酸”,并且可以包含用于 DNA操作(例如通过聚合酶链式反应(PCR))中的引物。The term "nucleic acid" as used herein refers to a single-stranded or double-stranded covalently linked nucleotide sequence, wherein the 3' and 5' ends on each nucleotide are linked by a phosphodiester bond. Nucleotides can be composed of deoxyribonucleotide bases or ribonucleotide bases. Nucleic acids can include DNA and RNA, and can be synthesized and prepared in vitro or isolated from natural resources. Nucleic acids can further include modified DNA or RNA, such as methylated DNA or RNA, or RNA modified after translation, such as 5'-capping with 7-methylguanosine, 3'-end processing, such as cleavage and polyadenylation, and splicing. Nucleic acids can also include synthetic nucleic acids (XNA), such as hexitol nucleic acids (HNA), cyclohexene nucleic acids (CeNA), threose nucleic acids (TNA), glycerol nucleic acids (GNA), locked nucleic acids (LNA) and peptide nucleic acids (PNA). The size of nucleic acids (or polynucleotides) is usually expressed in terms of the number of base pairs (bp) for double-stranded polynucleotides, or in terms of the number of nucleotides (nt) in the case of single-stranded polynucleotides. One thousand bp or nt is equal to one kilobase pair (kb). Polynucleotides less than about 40 nucleotides in length are generally referred to as "oligonucleotides" and may contain A primer in DNA manipulations, for example by the polymerase chain reaction (PCR).

多核苷酸,例如核酸,是包含两个或多个核苷酸的大分子。所述多核苷酸或核酸可以包含任意核苷酸的任意组合。所述核苷酸可以是天然存在的或人工合成的。所述多核苷酸中的一个或多个核苷酸可以被氧化或甲基化。所述多核苷酸中的一个或多个核苷酸可以被损伤。例如,所述多核苷酸可以包含嘧啶二聚体。这种二聚体通常与由紫外线造成的损伤有关并且是皮肤黑色素瘤的主要成因。所述多核苷酸中的一个或多个核苷酸可以被修饰,例如用常规的标记或标签。所述多核苷酸可以包含一个或多个无碱基的(即缺少核碱基)、或缺少核碱基和糖(即为C3)的核苷酸。A polynucleotide, such as a nucleic acid, is a macromolecule comprising two or more nucleotides. The polynucleotide or nucleic acid may comprise any combination of any nucleotides. The nucleotides may be naturally occurring or artificially synthesized. One or more nucleotides in the polynucleotide may be oxidized or methylated. One or more nucleotides in the polynucleotide may be damaged. For example, the polynucleotide may comprise a pyrimidine dimer. Such dimers are generally associated with damage caused by ultraviolet light and are a major cause of melanoma of the skin. One or more nucleotides in the polynucleotide may be modified, for example with a conventional marker or label. The polynucleotide may comprise one or more nucleotides that are abasic (i.e., lack a nucleobase), or lack a nucleobase and a sugar (i.e., C3).

所述多核苷酸中的核苷酸可以任意方式相互连接。所述核苷酸通常通过其糖基和磷酸基团连接,如在核酸中一样。所述核苷酸可以通过其核碱基连接,如在嘧啶二聚体中一样。The nucleotides in the polynucleotide can be linked to each other in any manner. The nucleotides are usually linked by their sugar and phosphate groups, as in nucleic acids. The nucleotides can be linked by their nucleobases, as in pyrimidine dimers.

多核苷酸可以是单链或双链的。多核苷酸的至少一部分优选是双链的。多核苷酸可以是核酸,例如脱氧核糖核酸(DNA)或核糖核酸(RNA)。多核苷酸可以包含一条RNA链,所述RNA链杂合到一条DNA链。多核苷酸可以是任意现有技术已知的合成核酸,例如肽核酸(PNA),甘油核酸(GNA),苏糖核酸(TNA),锁核酸(LNA)或具有核苷酸侧链的其他合成聚合物。所述PNA骨架是由通过肽键连接的重复的N-(2-氨基乙基)-甘氨酸单元组成。所述GNA骨架是由通过磷酸二酯键连接的重复乙二醇单元组成。所述TNA骨架是由通过磷酸二酯键连接在一起的重复苏糖基组成。LNA由上述核糖核酸形成,具有连接核糖部分中2’氧和4’碳的额外桥连结构。桥连的核酸(BNA)是修饰的RNA核苷酸。它们也可以称为限制的或不可接近的RNA13BNA单体可以含有5元,6元或甚至7元桥连结构并带有“固定的”C3’-内糖折叠结构(C3’-endo sugar puckering)。所述桥连结构被合成引入核糖的2’,4’-位,以产生2’,4’-BNA单体。The polynucleotide may be single-stranded or double-stranded. At least a portion of the polynucleotide is preferably double-stranded. The polynucleotide may be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). The polynucleotide may comprise an RNA chain hybridized to a DNA chain. The polynucleotide may be any synthetic nucleic acid known in the prior art, such as a peptide nucleic acid (PNA), a glycerol nucleic acid (GNA), a threose nucleic acid (TNA), a locked nucleic acid (LNA) or other synthetic polymers having nucleotide side chains. The PNA backbone is composed of repeated N-(2-aminoethyl)-glycine units linked by peptide bonds. The GNA backbone is composed of repeated ethylene glycol units linked by phosphodiester bonds. The TNA backbone is composed of repeated threose groups linked together by phosphodiester bonds. LNA is formed by the above-mentioned ribonucleic acid, with an additional bridging structure connecting the 2' oxygen and 4' carbon in the ribose moiety. Bridged nucleic acids (BNA) are modified RNA nucleotides. They can also be referred to as restricted or inaccessible RNA. BNA monomers can contain 5-, 6- or even 7-membered bridges with a "fixed" C3'-endo sugar puckering. The bridges are synthetically introduced into the 2', 4'-position of the ribose to produce 2', 4'-BNA monomers.

多核苷酸最优选核糖核酸(RNA)或脱氧核糖核酸(DNA)。多核苷酸可以为任意长度。例如,多核苷酸的长度可以是至少10,至少50,至少100,至少150,至少200,至少250,至少300,至少400或至少500个核苷酸或核苷酸对。所述多核苷酸的长度可以为1000个或更多个核苷酸或核苷酸对,5000个或更多个核苷酸或核苷酸对或100000个 或更多个核苷酸或核苷酸对。The polynucleotide is most preferably ribonucleic acid (RNA) or deoxyribonucleic acid (DNA). The polynucleotide can be of any length. For example, the length of the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotides or nucleotide pairs. The length of the polynucleotide can be 1000 or more nucleotides or nucleotide pairs, 5000 or more nucleotides or nucleotide pairs or 100000 nucleotides or nucleotide pairs. or more nucleotides or nucleotide pairs.

任意数量的多核苷酸可以被研究。例如实施例的方法可以涉及表征2、3、4、5、6、7、8、9、10、20、30、50、100个或更多个多核苷酸。如果两个或更多个多核苷酸被表征,它们可以是不同的多核苷酸或相同多核苷酸的情形。Any number of polynucleotides can be studied. For example, the method of the embodiment can involve characterizing 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 50, 100 or more polynucleotides. If two or more polynucleotides are characterized, they can be different polynucleotides or the same polynucleotide.

多核苷酸可以是天然存在的或人工合成的。例如,所述方法可用于验证所制备的寡核苷酸的序列。所述方法通常在体外进行。The polynucleotide may be naturally occurring or artificially synthesized. For example, the method may be used to verify the sequence of the prepared oligonucleotide. The method is typically performed in vitro.

术语“蛋白质”、“多肽”和“肽”在本文中进一步可互换使用,是指氨基酸残基的聚合物以及氨基酸残基的变体和合成类似物。因此,这些术语适用于其中一个或多个氨基酸残基是合成的非天然存在的氨基酸,诸如相应天然存在的氨基酸的化学类似物的氨基酸聚合物,以及适用于天然存在的氨基酸聚合物。多肽还可经历成熟或翻译后修饰过程,这些过程可以包括但不限于:糖基化、蛋白水解裂解、脂化、信号肽裂解、前肽裂解、磷酸化等。The terms "protein", "polypeptide" and "peptide" are further used interchangeably herein to refer to polymers of amino acid residues as well as variants and synthetic analogs of amino acid residues. Thus, these terms apply to amino acid polymers in which one or more amino acid residues is a synthetic non-naturally occurring amino acid, such as a chemical analog of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. Polypeptides may also undergo maturation or post-translational modification processes, which may include, but are not limited to, glycosylation, proteolytic cleavage, lipidation, signal peptide cleavage, propeptide cleavage, phosphorylation, and the like.

保守性取代将氨基酸置换为具有相似化学结构、相似化学性质或相似侧链体积的其他氨基酸。引入的氨基酸可以具有与它们所置换的氨基酸相似的极性、亲水性、疏水性、碱性、酸性、中性或电荷。可替代地,保守性取代可以引入另一种芳族或脂肪族氨基酸代替预先存在的芳族或脂肪族氨基酸。保守性氨基酸变化是本领域众所周知的,并且可以根据下表1中定义的20种主要氨基酸的性质进行选择。在氨基酸具有相似极性的情况下,这也可以参考表2中氨基酸侧链的亲水性量表来确定。Conservative substitution replaces amino acid with other amino acid with similar chemical structure, similar chemical property or similar side chain volume. The amino acid introduced can have polarity, hydrophilicity, hydrophobicity, alkalinity, acidity, neutrality or charge similar to the amino acid they replace. Alternatively, conservative substitution can introduce another aromatic or aliphatic amino acid to replace pre-existing aromatic or aliphatic amino acid. Conservative amino acid changes are well known in the art and can be selected according to the properties of the 20 kinds of main amino acids defined in the following table 1. In the case where amino acid has similar polarity, this can also be determined with reference to the hydrophilicity scale of the amino acid side chain in Table 2.

表1-氨基酸的化学性质

Table 1 - Chemical properties of amino acids

表2-亲水性量表
Table 2 - Hydrophilicity Scale

众所周知,性质相似氨基酸彼此之间保守性替换通常不会影响肽序列的活性,保守性替换如表3。It is well known that conservative substitutions between amino acids with similar properties usually do not affect the activity of the peptide sequence. Conservative substitutions are shown in Table 3.

表3保守氨基酸替换

Table 3 Conservative amino acid substitutions

多核苷酸序列可以采用本领域的标准方法进行衍生和复制。定点突变的合适方法是本领域已知的并且包括,例如,组合链式反应。编码实施例的构建的多核苷酸可以采用本领域公知的技术制备,例如在Sambrook,J.and Russell,D.(2001).Molecular Cloning A Laboratory Manual,3rd Edition.Cold Spring Harbor Laboratory Press,Cold Spring Harbor,NY中描述的那些。Polynucleotide sequences can be derived and replicated using standard methods in the art. Suitable methods for site-directed mutagenesis are known in the art and include, for example, combinatorial chain reactions. Polynucleotides encoding the constructs of the embodiments can be prepared using techniques known in the art, such as those described in Sambrook, J. and Russell, D. (2001). Molecular Cloning A Laboratory Manual, 3rd Edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY.

所得到的多核苷酸序列随后可以被整合到重组可复制载体上,例如克隆载体。所述载体可以用于在相容的宿主细胞中复制所述多核苷酸。因此多核苷酸序列可以通过将多核苷酸引入到可复制载体中,将载体引入相容的宿主细胞中,并在引起载体复制的条件下使宿主细胞生长而进行制备。所述载体可以从所述宿主细胞中回收。The resulting polynucleotide sequence can then be incorporated into a recombinant replicable vector, such as a cloning vector. The vector can be used to replicate the polynucleotide in a compatible host cell. Thus, the polynucleotide sequence can be prepared by introducing the polynucleotide into a replicable vector, introducing the vector into a compatible host cell, and growing the host cell under conditions that cause the vector to replicate. The vector can be recovered from the host cell.

由于组成DNA的四种碱基腺嘌呤(A)、鸟嘌呤(G)、胞嘧啶(C)和胸腺嘧啶(T)的分子结构及体积大小均不同,单链DNA(ssDNA)在控速酶(例如本申请实施例的解旋酶)和电场驱使下通过纳米级的小孔时,不同碱基的化学性质差异导致穿越纳米孔或蛋白孔或跨膜孔时引起的电流的变化幅度不同,从而得到所测核酸例如DNA的序列信息。Since the molecular structures and volumes of the four bases adenine (A), guanine (G), cytosine (C) and thymine (T) that make up DNA are different, when single-stranded DNA (ssDNA) passes through a nanoscale pore driven by a rate-controlling enzyme (such as the helicase in the embodiment of the present application) and an electric field, the differences in the chemical properties of different bases result in different amplitudes of changes in the current caused when passing through the nanopore, protein pore or transmembrane pore, thereby obtaining the sequence information of the measured nucleic acid, such as DNA.

在一个纳米孔/蛋白孔/跨膜孔测序实验的实施例中,纳米孔是磷脂膜两侧离子通过的唯一通道。控速蛋白例如多核苷酸结合蛋白(再例如本申请实施例的解旋酶)充当核酸分子例如DNA的马达蛋白,拉动DNA链使其以单个核苷酸的步长依次通过纳米孔/蛋白孔/跨膜孔。每当一个核苷酸穿过纳米孔/蛋白孔/跨膜孔,相应的堵孔信号会被记录下来。通过相应算法分析这些序列相关的电流信号,可以反推出核酸分子例如DNA的序列信息。In an embodiment of a nanopore/protein pore/transmembrane pore sequencing experiment, the nanopore is the only channel for ions to pass through on both sides of the phospholipid membrane. Speed-controlling proteins such as polynucleotide binding proteins (such as the helicase of the present application embodiment) act as motor proteins for nucleic acid molecules such as DNA, pulling the DNA chain so that it passes through the nanopore/protein pore/transmembrane pore in sequence with a step length of a single nucleotide. Whenever a nucleotide passes through the nanopore/protein pore/transmembrane pore, the corresponding pore blocking signal will be recorded. By analyzing these sequence-related current signals through corresponding algorithms, the sequence information of nucleic acid molecules such as DNA can be inferred.

解旋酶Helicase

本申请实施例提供了修饰的解旋酶,优选修饰的DNA依赖性ATP 酶的解旋酶(Dda解旋酶),更优选T4-Dda解旋酶。所述修饰允许修饰的解旋酶更长时间保持与分析物优选多核苷酸的结合。所述修饰的解旋酶保留其控制分析物优选多核苷酸移动的能力。换句话,所述修饰的解旋酶仍能控制分析物优选多核苷酸的移动。所述解旋酶控制分析物优选多核苷酸的移动的程度通常由所述修饰改变,如下文所详述的。The present application provides a modified helicase, preferably a modified DNA-dependent ATP The helicase of the modified helicase (Dda helicase), more preferably T4-Dda helicase. The modification allows the modified helicase to remain bound to the analyte, preferably polynucleotide, for a longer period of time. The modified helicase retains its ability to control the movement of the analyte, preferably polynucleotide. In other words, the modified helicase can still control the movement of the analyte, preferably polynucleotide. The degree to which the helicase controls the movement of the analyte, preferably polynucleotide, is generally changed by the modification, as described in detail below.

在一个实施例中,解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶是经修饰的。所述修饰的解旋酶通常是相比于相应的野生型解旋酶或天然解旋酶是修饰的。本申请实施例的解旋酶是人工的或非天然的。In one embodiment, the helicase, preferably the Dda helicase, more preferably the T4-Dda helicase is modified. The modified helicase is usually modified compared to the corresponding wild-type helicase or natural helicase. The helicase of the present application embodiment is artificial or non-natural.

解旋酶与多核苷酸结合或不结合的能力可使用任何本领域已知的方法确定。合适的结合/不结合分析法包括,但不限于,天然聚丙烯酰胺凝胶电泳(PAGE)、荧光各向异性法、量热法和表面等离子体共振法(SPR,诸如BiacoreTM)。当然,解旋酶从多核苷酸上解旋的能力可通过测量解旋酶控制多核苷酸移动的时间来确定。这也可以使用本领域任何已知的方法确定。解旋酶控制多核苷酸移动的能力通常在纳米孔系统中分析。The ability of a helicase to bind or not bind to a polynucleotide can be determined using any method known in the art. Suitable binding/not binding assays include, but are not limited to, native polyacrylamide gel electrophoresis (PAGE), fluorescence anisotropy, calorimetry, and surface plasmon resonance (SPR, such as Biacore ). Of course, the ability of a helicase to unwind from a polynucleotide can be determined by measuring the time it takes for the helicase to control the movement of the polynucleotide. This can also be determined using any method known in the art. The ability of a helicase to control the movement of a polynucleotide is typically analyzed in a nanopore system.

修饰的解旋酶是一种在链测序过程中控制多核苷酸移动的有用工具。在一个实施例中,Dda解旋酶能以至少两种活动操作模式(当给解旋酶提供促进移动的所有必要组分,如ATP和Mg2+时)以及一种非活动操作模式(当没有给解旋酶提供促进移动的必要组分时)控制DNA的移动。当提供了促进移动的所有必要组分时,Dda解旋酶沿着DNA以5'-3'的方向移动,但DNA在纳米孔中的定向(取决于DNA的哪个末端被捕获)意味着酶可以用于逆着所施加的场的方向将DNA移出纳米孔,或顺着施加的场的方向将DNA移进纳米孔。当DNA的3'末端被捕获时,解旋酶逆着由电势所施加的场的方向工作,将螺旋状的DNA拉出纳米孔并拉入顺式隔间。然而,当DNA以5'向下被捕获进纳米孔时,解旋酶顺着由电势所施加的场的方向工作,将螺旋状的DNA推入纳米孔并进入反式隔间。当没有给Dda解旋酶提供促进移动的必要组分时,当DNA被施加的场拉进所述孔时,Dda解旋酶能结合到DNA并作为减缓DNA移动的制动器。在非活动模式下,DNA是以3'还是5'端向下被捕获不重要,是施加的场将DNA朝着反侧拉进纳米 孔,而所述酶用作制动器。当在非活动模式下,解旋酶对DNA的移动的控制可以多种方式描述,包括棘轮、滑动和制动。Modified helicase is a useful tool for controlling the movement of polynucleotides in the chain sequencing process. In one embodiment, the Dda helicase can control the movement of DNA with at least two active operation modes (when all necessary components for promoting movement are provided to the helicase, such as ATP and Mg 2+ ) and an inactive operation mode (when the necessary components for promoting movement are not provided to the helicase). When all necessary components for promoting movement are provided, the Dda helicase moves along the DNA in the direction of 5'-3', but the orientation of the DNA in the nanopore (depending on which end of the DNA is captured) means that the enzyme can be used to move the DNA out of the nanopore against the direction of the applied field, or move the DNA into the nanopore along the direction of the applied field. When the 3' end of the DNA is captured, the helicase works against the direction of the field applied by the potential, and the spiral DNA is pulled out of the nanopore and pulled into the cis compartment. However, when the DNA is captured into the nanopore with 5' downwards, the helicase works along the direction of the field applied by the potential, and the spiral DNA is pushed into the nanopore and enters the trans compartment. When the Dda helicase is not provided with the necessary components to facilitate movement, it can bind to the DNA and act as a brake to slow down the movement of the DNA as it is pulled into the pore by the applied field. In the inactive mode, it does not matter whether the DNA is captured with the 3' or 5' end facing down, it is the applied field that pulls the DNA into the nanopore towards the trans side. The pore is located in the helicase, while the enzyme acts as a brake. When in the inactive mode, the helicase's control over the movement of DNA can be described in a variety of ways, including ratcheting, sliding, and braking.

在对多核苷酸,尤其是500个核苷酸或更多个核苷酸的测序过程中存在的问题是控制多核苷酸移动的分子马达可能会从多核苷酸上解脱。这允许多核苷酸在施加的场的方向上以不受控的方式被迅速拉动穿过所述孔。本申请实施例的修饰的解旋酶不从被测序的多核苷酸上解旋或解脱。当修饰的解旋酶控制所述多核苷酸穿过纳米孔移动时,所述修饰的解旋酶能提供增加的多核苷酸读取长度。在本申请实施例的修饰的解旋酶的控制下将整个多核苷酸移动穿过纳米孔的能力允许对待评估的多核苷酸以比已知方法提高的精度和速度进行表征,诸如其序列。当链长度增加以及需要具有提高的进行性的分子马达时,这变得更加重要。本申请实施例的修饰的解旋酶对于控制500个核苷酸或更多核苷酸,例如1000个核苷酸,5000,10000,20000,50000,100000或更多个核苷酸的移动特别有效。The problem in the sequencing process of polynucleotides, especially 500 nucleotides or more nucleotides, is that the molecular motor controlling the movement of the polynucleotides may be released from the polynucleotides. This allows the polynucleotides to be rapidly pulled through the hole in an uncontrolled manner in the direction of the applied field. The modified helicase of the present application embodiment does not unwind or release from the polynucleotides being sequenced. When the modified helicase controls the polynucleotides to move through the nanopore, the modified helicase can provide the increased polynucleotide read length. The ability of the modified helicase of the present application embodiment to move the entire polynucleotide through the nanopore allows the polynucleotides to be evaluated to be characterized with the accuracy and speed improved than the known methods, such as its sequence. When the chain length increases and it is necessary to have a molecular motor with improved progress, this becomes more important. The modified helicase of the present application embodiment is particularly effective for controlling the movement of 500 nucleotides or more nucleotides, for example 1000 nucleotides, 5000, 10000, 20000, 50000, 100000 or more nucleotides.

另外,使用根据本申请实施例的修饰的解旋酶意味着可以使用更低的解旋酶浓度。In addition, the use of the modified helicase according to the embodiments of the present application means that lower helicase concentrations can be used.

本申请实施例的修饰的解旋酶也是等温聚合酶链反应(PCR)的有用工具。在这种方法中,双链DNA的链首先被解旋酶分开并被单链DNA(ssDNA)-结合蛋白覆盖。在第二步中,两个序列特异性的引物通常杂交到DNA模板的每个边缘。然后可使用DNA聚合酶延伸退火到模板的引物以制备双链DNA,然后两个新合成的DNA产物可以通过本申请实施例的解旋酶用作底物,进入下一轮反应。因此,同步链反应发生,导致选择的目标序列的指数式扩增。The helicase of the modification of the present application embodiment is also a useful tool for isothermal polymerase chain reaction (PCR). In this method, the chain of double-stranded DNA is first separated by helicase and covered by single-stranded DNA (ssDNA)-binding protein. In the second step, two sequence-specific primers are usually hybridized to each edge of the DNA template. Then the primer annealed to the template can be extended using DNA polymerase to prepare double-stranded DNA, and then two newly synthesized DNA products can be used as substrates by the helicase of the present application embodiment, enter the next round of reaction. Therefore, synchronous chain reaction occurs, causing the exponential amplification of the target sequence selected.

修饰的解旋酶具有控制多核苷酸移动的能力。所述解旋酶控制多核苷酸移动的能力可以通过本领域任何已知方法分析。例如可将所述解旋酶与多核苷酸接触,可以使用标准方法测定多核苷酸的位置。修饰的解旋酶控制多核苷酸移动的能力在纳米孔系统中进行分析。The modified helicase has the ability to control the movement of a polynucleotide. The ability of the helicase to control the movement of a polynucleotide can be analyzed by any method known in the art. For example, the helicase can be contacted with a polynucleotide, and the position of the polynucleotide can be determined using standard methods. The ability of the modified helicase to control the movement of a polynucleotide is analyzed in a nanopore system.

本申请实施例的修饰的解旋酶可以被分离,被基本上分离,纯化或基本上纯化。如果解旋酶完全不含任何其他组分诸如脂质、多核苷酸、孔单体或其他蛋白质,该解旋酶是被分离或纯化的。如果解旋酶与不干扰其预期用途的载体或稀释剂混合,该解旋酶是基本上被分离的。例如,如果解旋酶以含有小于10%,小于5%,小于2%或小于1% 的其他组分诸如脂质、多核苷酸、孔单体或其他蛋白质的形式存在,该解旋酶是基本上分离的或基本上纯化的。The modified helicases of the embodiments of the present application can be isolated, substantially isolated, purified or substantially purified. If the helicase is completely free of any other components such as lipids, polynucleotides, pore monomers or other proteins, the helicase is isolated or purified. If the helicase is mixed with a carrier or diluent that does not interfere with its intended use, the helicase is substantially isolated. For example, if the helicase contains less than 10%, less than 5%, less than 2% or less than 1% The helicase is substantially isolated or substantially purified if the helicase is present without other components of the pore such as lipids, polynucleotides, pore monomers or other proteins.

任何解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶可以根据本申请实施例进行修饰。在一个实施例中,Dda解旋酶是T4-Dda解旋酶。Any helicase, preferably Dda helicase, more preferably T4-Dda helicase can be modified according to the embodiments of the present application. In one embodiment, the Dda helicase is T4-Dda helicase.

Dda解旋酶通常含有以下5个结构域,lA(RecA型马达)结构域,2A(RecA型马达)结构域,塔结构域,销结构域和钩结构域(Xiaoping He等,2012,Structure;20:1189-1200)。这些结构域可以使用以下方法鉴定:蛋白质模型法,晶体态的蛋白质的X-射线衍射测量法(Rupp B(2009).Biomolecular Crystallograph:Principles,Practice and Application to Structural Biology.New York:Gar 1and Science.),蛋白质溶液的核磁共振(NMR)光谱法(Mark Rance;Cavanagh,John;Wayne J.Fairbrother;Arthur W.Hunt III;Skelton,NNicholas J.(2007).Protein NMR spectroscopy:principles and practice(2nd ed.).Boston:Academic Press.)或冷动水合态蛋白质的冷冻电子显微镜法(van Heel M,Gowen B,Matadeen R,Orlova EV,Finn R,Pape T,Cohen D,Stark H,Schmidt R,Schatz M,Patwardhan A(2000).“Single-particle electron cryo-microscopy:towards atomic resolution.”Q Rev Biophys.33:307-69)。通过上述方法确定的蛋白质的结构信息在蛋白数据库(PDB)是公开可获得的。Dda helicase usually contains the following five domains: 1A (RecA-type motor) domain, 2A (RecA-type motor) domain, tower domain, pin domain and hook domain (Xiaoping He et al., 2012, Structure; 20: 1189-1200). These domains can be identified using protein modeling, X-ray diffraction measurements of proteins in crystal form (Rupp B (2009). Biomolecular Crystallograph: Principles, Practice and Application to Structural Biology. New York: Gar 1and Science.), nuclear magnetic resonance (NMR) spectroscopy of protein solutions (Mark Rance; Cavanagh, John; Wayne J. Fairbrother; Arthur W. Hunt III; Skelton, N. Nicholas J. (2007). Protein NMR spectroscopy: principles and practice ( 2nd ed.). Boston: Academic Press.) or cryo-electron microscopy of proteins in a cold hydrated state (van Heel M, Gowen B, Matadeen R, Orlova EV, Finn R, Pape T, Cohen D, Stark H, Schmidt R, Schatz M, Patwardhan A (2000). "Single-particle electron microscopy". cryo-microscopy: toward atomic resolution.” Q Rev Biophys. 33: 307-69). The structural information of proteins determined by the above method is publicly available in the Protein Data Bank (PDB).

在一个实施例中,修饰的解旋酶进一步被修饰以降低其表面负电荷。可以与上述Dda结构域相同的方式鉴定表面残基。表面负电荷通常是表面带负电的氨基酸,诸如天冬氨酸(D)和谷氨酸(E)。In one embodiment, the modified helicase is further modified to reduce its surface negative charge. Surface residues can be identified in the same manner as the Dda domain described above. Surface negative charge is typically surface negatively charged amino acids, such as aspartic acid (D) and glutamic acid (E).

所述解旋酶优选被修饰以通过下述中和一个或多个表面负电荷:通过用一个或多个带正电荷的氨基酸、不带电荷的氨基酸、非极性氨基酸和/或芳香族氨基酸取代一个或多个带负电荷的氨基酸,或通过将一个或多个的带正电荷的氨基酸引入,优选临近一个或多个带负电荷的氨基酸引入。适合的带正电荷的氨基酸包括,但不限于,组氨酸(H),赖氨酸(K)和精氨酸(R)。不带电荷的氨基酸没有净电荷。适合的不带电荷的氨基酸包括,但不限于,半胱氨酸(C),丝氨酸(S),苏氨酸(T),蛋氨酸(M),天冬酰胺(N)和谷氨酰胺(Q)。非极性氨基酸具有非极性侧链。适合的非极性氨基酸包括,但不限于,甘氨酸(G),丙氨酸(A),脯氨酸(P),异亮氨酸(I),亮氨酸(L)和缬氨酸(V)。芳香族氨基酸具有 芳香族侧链。适合的芳香族氨基酸包括,但不限于,组氨酸(H),苯丙氨酸(F),色氨酸(W)和酪氨酸(Y)。The helicase is preferably modified to neutralize one or more surface negative charges by replacing one or more negatively charged amino acids with one or more positively charged amino acids, uncharged amino acids, non-polar amino acids and/or aromatic amino acids, or by introducing one or more positively charged amino acids, preferably adjacent to one or more negatively charged amino acids. Suitable positively charged amino acids include, but are not limited to, histidine (H), lysine (K) and arginine (R). Uncharged amino acids have no net charge. Suitable uncharged amino acids include, but are not limited to, cysteine (C), serine (S), threonine (T), methionine (M), asparagine (N) and glutamine (Q). Non-polar amino acids have non-polar side chains. Suitable non-polar amino acids include, but are not limited to, glycine (G), alanine (A), proline (P), isoleucine (I), leucine (L) and valine (V). Aromatic amino acids have Aromatic Side Chains. Suitable aromatic amino acids include, but are not limited to, histidine (H), phenylalanine (F), tryptophan (W) and tyrosine (Y).

优选的取代包括,但不限于,用R取代E,用K取代E,用N取代E,用K取代D,以及用R取代D。Preferred substitutions include, but are not limited to, R for E, K for E, N for E, K for D, and R for D.

1A结构域和2A结构域的修饰Modification of 1A and 2A domains

实施例提供了一种修饰的解旋酶,在解旋酶的1A结构域和2A结构域之间形成异位键。引入非天然氨基酸的操作较为繁琐,成本高,而引入二硫键形式则稳定性较差,容易被还原剂打开。相反,在1A结构域和2A结构域引入天然的侧链可形成异位键的氨基酸残基,从而介导形成稳定共价键异位键,提升测序过程中酶的可持续控速能力。The embodiment provides a modified helicase, forming an ectopic bond between the 1A domain and the 2A domain of the helicase. The operation of introducing non-natural amino acids is relatively cumbersome and costly, while the introduction of disulfide bonds is less stable and easily opened by reducing agents. On the contrary, the introduction of natural side chains into the 1A domain and the 2A domain can form ectopic bonds. The amino acid residues, thereby mediating the formation of stable covalent ectopic bonds, improve the sustainable rate control ability of the enzyme during the sequencing process.

异位键Eccentric bond

异位键是蛋白内部存在的除了骨架肽键外的氨基酸R链互相反应形成的一类共价键,异位键在自然环境中可以是蛋白分子内形成,也可以在蛋白分子间形成。一般分子内形成的异肽键会帮助蛋白响应环境应力,提高蛋白的稳定性(DOI:10.1016/j.tibs.2010.09.007),而蛋白分子间形成异肽键则可以被用于食品行业(doi:10.1007/s12223-013-0287-x)或者蛋白-蛋白互作研究中。异位键的形成可分为三种方式:蛋白内部自催化形成,酶法催化以及化学催化。下面列举了这三类形成方法的具体方法案例。Heteropeptide bonds are a type of covalent bond formed by the reaction of amino acid R chains in addition to backbone peptide bonds in proteins. Heteropeptide bonds can be formed within protein molecules or between protein molecules in the natural environment. Generally, isopeptide bonds formed within molecules help proteins respond to environmental stress and improve protein stability (DOI:10.1016/j.tibs.2010.09.007), while isopeptide bonds formed between protein molecules can be used in the food industry (doi:10.1007/s12223-013-0287-x) or in protein-protein interaction research. The formation of heteropeptide bonds can be divided into three ways: self-catalytic formation within proteins, enzymatic catalysis, and chemical catalysis. The following lists specific examples of these three types of formation methods.

蛋白内部自发形成共价键:例如有研究发现,蛋白内部可以在疏水环境以及存在羧基作为催化剂的情况下自发形成的Lys-Asp异位键(该类异位键也被称为异肽键)(DOI:10.1002/anie.201004340),常见的有CnaA/CnaB类蛋白。因此通过形成一个疏水环境,以及引入特定的可反应以及催化氨基酸可让蛋白内部自发形成异位键。而这一类异位键还可以被改造成多肽连接酶/双底物形式,即将可以自发形成异肽键的蛋白分为催化的蛋白,以及两个拥有可反应形成异位键侧链的多肽三部分,拥有可反应形成异位键侧链的多肽则可通过重组蛋白形式被整合到需要形成共价键的部位,这一例子有例如基于CnaB改造的SpyLiagse催化的SpyTag/KTag异位键连接(doi/10.1073/pnas.ss11113)和基于RrgA C-terminal domain改造的SnoopLigase催化的SnoopTagJr/DogTag连接等等。Spontaneous formation of covalent bonds within proteins: For example, studies have found that Lys-Asp ectopic bonds (also known as isopeptide bonds) can be spontaneously formed within proteins in a hydrophobic environment and with carboxyl groups as catalysts (DOI: 10.1002/anie.201004340), and common examples include CnaA/CnaB proteins. Therefore, by forming a hydrophobic environment and introducing specific reactive and catalytic amino acids, ectopic bonds can be spontaneously formed within proteins. This type of ectopic bond can also be transformed into a polypeptide ligase/bisubstrate form, that is, the protein that can spontaneously form an isopeptide bond is divided into a catalytic protein and two polypeptides with side chains that can react to form an ectopic bond. The polypeptides with side chains that can react to form an ectopic bond can be integrated into the site where a covalent bond needs to be formed in the form of a recombinant protein. Examples include the SpyTag/KTag ectopic bond connection catalyzed by SpyLiagse based on CnaB modification (doi/10.1073/pnas.ss11113) and the SnoopTagJr/DogTag connection catalyzed by SnoopLigase based on RrgA C-terminal domain modification.

酶法催化:该类方法涉及的反应较多,比如过氧化物酶(HRP, myeloperoxidaseTyrosinas,laccase等)催化形成的二酪氨酸(dityrosine)或者三酪氨酸(Trityrosine)共价键(DOI:10.1021/acsbiomaterials.6b00454),这一交联反应也可用G4-hemin核酸酶来进行催化(DOI:10.1002/chem.201101941);转谷氨酰胺酶(transglutaminase)催化形成的Lys-Gln异位键(也称之为异肽键https://doi.org/10.1016/j.ijbiomac.2017.10.115);赖氨酰氧化酶类(Lysyloxidases)催化的Lys-Lys交联(doi:10.1021/Bm1010195)。Enzymatic catalysis: This type of method involves many reactions, such as peroxidase (HRP, The dityrosine or trityrosine covalent bonds formed by myeloperoxidase Tyrosinas, laccase, etc. (DOI: 10.1021/acsbiomaterials.6b00454), this cross-linking reaction can also be catalyzed by G4-hemin nuclease (DOI: 10.1002/chem.201101941); Lys-Gln ectopic bonds (also known as isopeptide bonds https://doi.org/10.1016/j.ijbiomac.2017.10.115) catalyzed by transglutaminase; Lys-Lys cross-links catalyzed by lysyloxidases (doi: 10.1021/Bm1010195).

化学交联法:化学交联产生的异位键较多,例如EDC/NHS/S-NHS催化的D/E酸性氨基酸与K/R碱性氨基酸的侧链形成异肽键这一类异位键,H2O2/Hemin,ruthenium催化的Tyr-Tyr共价交联,diazo-containingcross-linkers介导的D/E与D/E间的交联(doi.org/10.1021/acs.analchem.7b03789)。Chemical cross-linking method: Chemical cross-linking produces many ectopic bonds, such as the isopeptide bonds formed between the side chains of D/E acidic amino acids and K/R basic amino acids catalyzed by EDC/NHS/S-NHS, Tyr-Tyr covalent cross-linking catalyzed by H 2 O 2 /Hemin, ruthenium, and cross-linking between D/E and D/E mediated by diazo-containing cross-linkers (doi.org/10.1021/acs.analchem.7b03789).

非Cys参与的异位键的类型包括但不限于:Lys-His、Lys-Ser、Lys-Thr、Lys-Tyr、Lys-Lys、Lys-Glu、Lys-Asp、Lys-Gln、Lys-Arg、Arg-Glu、Lys-Met、Arg-Asp、Arg-Arg、Tyr-Tyr、Tyr-Trp、Met-Met,这些氨基酸侧链可以直接形成共价键,也可以通过其它连接分子Linker进行连接。在一个实施例中,异位键可为Tyr-Tyr,Lys-Lys,Gln-Lys,Glu-Lys等。The types of ectopic bonds in which non-Cys participate include, but are not limited to, Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met, and these amino acid side chains can directly form covalent bonds or can be connected through other linker molecules. In one embodiment, the ectopic bond can be Tyr-Tyr, Lys-Lys, Gln-Lys, Glu-Lys, etc.

在一个实施例中,解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶的1A结构域和2A结构域之间形成异位键。在另一个实施例中,所述解旋酶的1A结构域和2A结构域的天然氨基酸残基间引入异位键。In one embodiment, an ectopic bond is formed between the 1A domain and the 2A domain of a helicase, preferably a Dda helicase, more preferably a T4-Dda helicase. In another embodiment, an ectopic bond is introduced between the natural amino acid residues of the 1A domain and the 2A domain of the helicase.

在一个实施例中,所述异位键包括Tyr-Tyr异位键、Lys-Lys异位键、Gln-Lys异位键、Glu-Lys异位键或其结合。在一个实施例中,所述解旋酶源自T4-Dda解旋酶,并且所述解旋酶包括:(1)x1与x2形成的Tyr-Tyr异位键;(2)x3与x4形成的Lys-Gln异位键;(3)x1与x3形成的Tyr-Lys异位键;(4)x1与x4形成的Tyr-Gln异位键;(5)x2与x3形成的Tyr-Lys异位键;(6)x2与x4形成的Tyr-Gln异位键;(7)x3与x5形成的Lys-Glu异位键;或(8)以上任何组合。In one embodiment, the ectopic bond comprises a Tyr-Tyr ectopic bond, a Lys-Lys ectopic bond, a Gln-Lys ectopic bond, a Glu-Lys ectopic bond or a combination thereof. In one embodiment, the helicase is derived from a T4-Dda helicase, and the helicase comprises: (1) a Tyr-Tyr ectopic bond formed between x1 and x2; (2) a Lys-Gln ectopic bond formed between x3 and x4; (3) a Tyr-Lys ectopic bond formed between x1 and x3; (4) a Tyr-Gln ectopic bond formed between x1 and x4; (5) a Tyr-Lys ectopic bond formed between x2 and x3; (6) a Tyr-Gln ectopic bond formed between x2 and x4; (7) a Lys-Glu ectopic bond formed between x3 and x5; or (8) any combination thereof.

在一个实施例中,所述解旋酶被进一步修饰以减少其表面负电荷。在一个实施例中,通过用一个或多个带正电荷的氨基酸、不带电荷的氨基酸、非极性氨基酸和/或芳香族氨基酸取代一个或多个带负电荷的氨基酸,或者通过在一个或多个带负电荷的氨基酸附近引入一个或多 个带正电荷的氨基酸,来中和一个或多个表面负电荷。In one embodiment, the helicase is further modified to reduce its surface negative charge. In one embodiment, the helicase is further modified to reduce its surface negative charge by replacing one or more negatively charged amino acids with one or more positively charged amino acids, uncharged amino acids, non-polar amino acids and/or aromatic amino acids, or by introducing one or more negatively charged amino acids near one or more negatively charged amino acids. positively charged amino acids to neutralize one or more surface negative charges.

在一个实施例中,所述解旋酶包括1A结构域的任意突变体或2A结构域的任意突变体,只要所述解旋酶保留其控制目标分析物优选目标多核苷酸移动的能力。In one embodiment, the helicase includes any mutant of the 1A domain or any mutant of the 2A domain, as long as the helicase retains its ability to control the movement of the target analyte, preferably the target polynucleotide.

在一个实施例中,目标分析物包括多核苷酸、多肽、多糖、或脂质,优选多核苷酸。在一个实施例中,所述多核苷酸包括DNA和/或RNA。In one embodiment, the target analyte comprises a polynucleotide, a polypeptide, a polysaccharide, or a lipid, preferably a polynucleotide. In one embodiment, the polynucleotide comprises DNA and/or RNA.

变体Variants

解旋酶的变体,例如Dda解旋酶的变体是具有下述氨基酸序列的酶:所述氨基酸序列从野生型解旋酶的氨基酸序列变化而来并保留多核苷酸结合活性。多核苷酸结合活性可使用本领域已知方法确定。适合的方法包括,但不限于荧光各向异性法,色氨酸荧光法和电泳迀移位移试验法(EMSA)。A variant of a helicase, such as a variant of a Dda helicase, is an enzyme having an amino acid sequence that is varied from the amino acid sequence of a wild-type helicase and retains polynucleotide binding activity. Polynucleotide binding activity can be determined using methods known in the art. Suitable methods include, but are not limited to, fluorescence anisotropy, tryptophan fluorescence, and electrophoretic mobility shift assay (EMSA).

变体保留解旋酶活性。这可以使用多种方式测定。例如,变体沿多核苷酸移位的能力可使用电生理学法,荧光分析法或ATP水解法测定。The variant retains helicase activity. This can be determined using a variety of methods. For example, the ability of the variant to translocate along a polynucleotide can be determined using electrophysiological methods, fluorescence analysis, or ATP hydrolysis.

所述变体可以包括促进对编码解旋酶的多核苷酸的处理或促进该多核苷酸在高盐浓度和/或室温下的活性的多种修饰。The variants may include modifications that facilitate processing of the helicase encoding polynucleotide or facilitate activity of the polynucleotide at high salt concentrations and/or room temperature.

解旋酶片段Helicase fragment

实施例提供了解旋酶的片段,例如Dda解旋酶的片段(更优选T4-Dda解旋酶),其可以用于制备本申请实施例的解旋酶。在一个实施例中,多肽包括来自Dda解旋酶(优选T4-Dda解旋酶)的1A结构域和2A结构域并且不包括来自Dda解旋酶的任何其他结构域,其中在1A结构域和2A结构域之间形成异位键。所述多肽可以包括以上所限定的任意的1A结构域和2A结构域的变体。Embodiment provides the fragment of helicase, for example the fragment of Dda helicase (more preferably T4-Dda helicase), it can be used to prepare the helicase of the embodiment of the present application.In one embodiment, polypeptide comprises 1A domain and 2A domain from Dda helicase (preferably T4-Dda helicase) and does not comprise any other domain from Dda helicase, wherein ectopic bond is formed between 1A domain and 2A domain.Described polypeptide can comprise the variant of arbitrarily 1A domain and 2A domain defined above.

构建体Constructs

本申请实施例还提供了包括本申请实施例的Dda解旋酶或修饰的Dda解旋酶(优选为修饰的T4-Dda解旋酶)以及衔接体的构建体,其中所述解旋酶连接到所述衔接体并且所述构建体具有控制目标分析物优选目标多核苷酸移动的能力。所述构建体是人工的或非天然的。The present application embodiment also provides a construct comprising a Dda helicase or a modified Dda helicase (preferably a modified T4-Dda helicase) of the present application embodiment and an adapter, wherein the helicase is connected to the adapter and the construct has the ability to control the movement of a target analyte, preferably a target polynucleotide. The construct is artificial or non-natural.

在一个实施例中,构建体是在链测序过程中控制目标分析物优选目标多核苷酸移动的有用工具。实施例的构建体不使实施例的修饰的 解旋酶从被测序的目标分析物优选目标多核苷酸解脱。随着所述构建体控制所述目标分析物优选目标多核苷酸穿过所述纳米孔的移位,所述构建体甚至可提供所述目标分析物优选目标多核苷酸的更长读取长度。In one embodiment, the construct is a useful tool for controlling the movement of target analytes, preferably target polynucleotides, during strand sequencing. The helicase is uncoupled from the target analyte, preferably target polynucleotide, being sequenced. As the construct controls the translocation of the target analyte, preferably target polynucleotide through the nanopore, the construct may even provide longer read lengths of the target analyte, preferably target polynucleotide.

也可以设计结合特异性多核苷酸序列的目标构建体。如下面详细讨论的,所述衔接体可以结合到特异性的多核苷酸序列并由此将所述构建体的解旋酶部分靶向到该特异性的序列。Targeting constructs can also be designed to bind to specific polynucleotide sequences. As discussed in detail below, the adaptor can bind to a specific polynucleotide sequence and thereby target the helicase portion of the construct to that specific sequence.

所述构建体具有控制目标分析物优选目标多核苷酸移动的能力。这可以如上所述进行确定。The construct has the ability to control the movement of a target analyte, preferably a target polynucleotide. This can be determined as described above.

本申请实施例的构建体可以被分离,被基本上分离,纯化或基本上纯化。如果构建体完全不含任何其他组分诸如脂质,多核苷酸或孔单体,该构建体是被分离或纯化的。如果构建体与不干扰其预期用途的载体或稀释剂混合,该构建体是基本上被分离的。例如,如果构建体以含有小于10%,小于5%,小于2%或小于1%的其他组分诸如脂质,多核苷酸或孔单体的形式存在,该构建体是基本上分离的或基本上纯化的。The constructs of the present application embodiments can be separated, substantially separated, purified or substantially purified. If the construct is completely free of any other components such as lipids, polynucleotides or pore monomers, the construct is separated or purified. If the construct is mixed with a carrier or diluent that does not interfere with its intended use, the construct is substantially separated. For example, if the construct exists in the form of containing less than 10%, less than 5%, less than 2% or less than 1% of other components such as lipids, polynucleotides or pore monomers, the construct is substantially separated or substantially purified.

Dda解旋酶可以是任意的Dda解旋酶。优选的Dda解旋酶包括,但不限于T4-Dda解旋酶及其变体。变体如上所述。The Dda helicase can be any Dda helicase. Preferred Dda helicases include, but are not limited to, T4-Dda helicase and variants thereof. The variants are as described above.

所述解旋酶优选为修饰的Dda解旋酶,更优选为T-Dda解旋酶。任意的实施例的解旋酶可以存在于本申请实施例的构建体中。The helicase is preferably a modified Dda helicase, more preferably a T-Dda helicase. The helicase of any embodiment may be present in the construct of the embodiment of the present application.

所述解旋酶优选与衔接体共价连接。所述解旋酶可以在多于一个位点诸如两个或单个位点连接到所述衔接体。所述解旋酶可以使用本领域已知的任何方法连接到所述衔接体。The helicase is preferably covalently linked to the adapter. The helicase can be linked to the adapter at more than one site, such as two or a single site. The helicase can be linked to the adapter using any method known in the art.

所述解旋酶和所述衔接体可以分别制备,然后连接在一起。所述这两个组分可以任何构型连接。例如,它们可以通过它们的末端(即氨基端或羧基端)氨基酸进行连接。适合的构型包括但不限于,所述衔接体的氨基末端与所述解旋酶的羧基末端连接并且反之亦然。替换的,这两个组分可以通过其序列中的氨基酸连接。例如,所述多核苷酸结合部分可以连接到解旋酶环状区域中的一个或多个氨基酸。在一个优选的实施方式中,所述多核苷酸结合部分的末端氨基酸与所述解旋酶的环状区域中的一个或多个氨基酸连接。The helicase and the adapter can be prepared separately and then connected together. The two components can be connected in any configuration. For example, they can be connected by their terminal (i.e., amino terminal or carboxyl terminal) amino acids. Suitable configurations include, but are not limited to, the amino terminal of the adapter is connected to the carboxyl terminal of the helicase and vice versa. Alternatively, the two components can be connected by the amino acids in their sequences. For example, the polynucleotide binding portion can be connected to one or more amino acids in the annular region of the helicase. In a preferred embodiment, the terminal amino acid of the polynucleotide binding portion is connected to one or more amino acids in the annular region of the helicase.

在一个优选的实施方式中,所述解旋酶化学连接到所述衔接体, 例如,通过一个或多个连接器分子连接。每个连接器可以具有两个或更多个功能性末端,诸如两个、三个或四个功能性末端。连接器中适合的末端构型在本领域是已知的。在另一个优选的实施方式中,所述解旋酶与所述衔接体基因融合。如果整个构建体是由单多核苷酸序列表达而来的,所述解旋酶与所述衔接体基因融合。所述解旋酶和所述多衔接体的编码序列可以任意方式结合以形成编码所述构建体的单多核酸序列。In a preferred embodiment, the helicase is chemically linked to the adaptor. For example, connected by one or more connector molecules. Each connector can have two or more functional ends, such as two, three or four functional ends. Suitable terminal configurations in connectors are known in the art. In another preferred embodiment, the helicase is fused to the adapter gene. If the entire construct is expressed from a single polynucleotide sequence, the helicase is fused to the adapter gene. The coding sequences of the helicase and the multi-adapter can be combined in any manner to form a single polynucleotide sequence encoding the construct.

所述解旋酶和所述衔接体可以任何构型基因融合。所述解旋酶和所述多核苷酸结合部分可以通过它们的末端氨基酸融合。例如,所述多核苷酸结合部分的氨基末端可以与所述解旋酶的羧基末端融合并且反之亦然。所述多核苷酸结合部分的氨基酸序列优选在框架内添加到解旋酶的氨基酸序列中。换句话,优选将所述多核苷酸结合部分插入到所述解旋酶的序列中。在该实施方式中,所述解旋酶和所述分布通常在两个位点进行连接,即通过所述衔接体的氨基末端和羧基末端的氨基酸。如果将所述衔接体插入到所述解旋酶的序列中,优选所述部分的氨基末端和羧基末端的氨基酸非常接近并且各自连接到解旋酶或其变体的序列中相邻的氨基酸。在一个优选的实施方式中,将所述衔接体插入到所述解旋酶的环状区域中。The helicase and the adapter can be genetically fused in any configuration. The helicase and the polynucleotide binding portion can be fused through their terminal amino acids. For example, the amino terminus of the polynucleotide binding portion can be fused with the carboxyl terminus of the helicase and vice versa. The amino acid sequence of the polynucleotide binding portion is preferably added to the amino acid sequence of the helicase in frame. In other words, the polynucleotide binding portion is preferably inserted into the sequence of the helicase. In this embodiment, the helicase and the distribution are usually connected at two sites, i.e., by the amino terminus and carboxyl terminus of the adapter. If the adapter is inserted into the sequence of the helicase, the amino terminus and carboxyl terminus of the preferred part are very close and are each connected to adjacent amino acids in the sequence of the helicase or its variant. In a preferred embodiment, the adapter is inserted into the annular region of the helicase.

所述解旋酶可以直接与所述衔接体连接。所述解旋酶优选使用一个或多个,诸如两个或三个,如上所述的连接器连接到所述部分。一个或多个连接器可以设计为限制所述部分的活动性。所述解旋酶和/或所述衔接体可以被修饰以促进如上所述的一个或多个连接器的连接。The helicase can be directly connected to the adapter. The helicase is preferably connected to the part using one or more, such as two or three, connectors as described above. One or more connectors can be designed to limit the mobility of the part. The helicase and/or the adapter can be modified to facilitate the connection of one or more connectors as described above.

可剪切的连接器可以用于帮助将构建体与非连接组分分离并可以用于进一步控制合成反应。例如,异质-双官能连接器可以与解旋酶反应,但不与所述部分反应。如果所述连接器的自由末端用于将解旋酶蛋白连接到一个表面,则可从混合物中除去来自第一个反应的未反应的解旋酶。随后,可以将所述连接器剪切以暴露与所述衔接体反应的基团。另外,按照这种连接反应的顺序,可以首先优化与解旋酶的反应条件,然后在剪切掉连接器后优化与所述衔接体的反应条件。第二个反应可以更加直接的朝向与所述衔接体进行反应的正确的位点进行,因为连接器被限制在其已经连接的区域。The shearable connector can be used to help separate the construct from the non-connected components and can be used to further control the synthesis reaction. For example, a heterogeneous-bifunctional connector can react with a helicase, but not with the partial reaction. If the free end of the connector is used to connect the helicase protein to a surface, the unreacted helicase from the first reaction can be removed from the mixture. Subsequently, the connector can be sheared to expose the group reacted with the adapter. In addition, according to the order of this ligation reaction, the reaction conditions with the helicase can be optimized first, and then the reaction conditions with the adapter can be optimized after the connector is sheared off. The second reaction can be carried out more directly towards the correct site for reacting with the adapter, because the connector is confined to the region where it has been connected.

在所述解旋酶/交联剂复合体共价连接所述衔接体之前,所述解旋 酶可以共价连接到所述双官能交联剂。替换的,在双官能交联剂/部分复合体连接所述解旋酶之前,所述部分可以共价连接到所述双官能交联剂。所述解旋酶和所述衔接体可以同时共价连接所述化学交联剂。Before the helicase/crosslinker complex is covalently linked to the adaptor, the The enzyme may be covalently linked to the bifunctional cross-linker. Alternatively, the moiety may be covalently linked to the bifunctional cross-linker before the bifunctional cross-linker/moiety complex is linked to the helicase. The helicase and the adaptor may be covalently linked to the chemical cross-linker simultaneously.

解旋酶或部分的自身的交联可以通过保持连接器浓度极大的过量于所述解旋酶和/或部分而防止。替换的,"锁和钥匙"设置可以在使用两个连接器的情况下使用。每个连接器的仅一个末端可以一起反应以形成更长的连接器,并且连接器的其他末端分别与所述构建体的不同部分(即解旋酶或部分)反应。Cross-linking of the helicase or moiety itself can be prevented by maintaining a large excess of the connector concentration to the helicase and/or moiety. Alternatively, a "lock and key" setup can be used where two connectors are used. Only one end of each connector can be reacted together to form a longer connector, and the other ends of the connectors are reacted separately with different parts of the construct (i.e., the helicase or moiety).

选择连接位点使得,当所述构建体与所述多核苷酸接触时,所述解旋酶和所述衔接体均可以结合到所述多核苷酸并控制其运动。The attachment site is selected so that, when the construct is contacted with the polynucleotide, both the helicase and the adaptor can bind to the polynucleotide and control its movement.

用所述解旋酶和所述衔接体可以促进连接。例如,可以使用互补的多核苷酸,用于在所述解旋酶和衔接体进行杂交时,将所述解旋酶和所述衔接体结合在一起。所述解旋酶能结合到一个多核苷酸并且所述衔接体可以被结合到互补的多核苷酸。这两个多核苷酸然后可以相互杂交。这使得解旋酶与所述衔接体近距离接触,使得连接反应更有效进行。这尤其有利于以正确的方向连接两个或更多个解旋酶来控制目标多核苷酸的移动。The helicase and the adapter can be used to promote connection. For example, complementary polynucleotides can be used to combine the helicase and the adapter together when the helicase and the adapter are hybridized. The helicase can be bound to a polynucleotide and the adapter can be bound to a complementary polynucleotide. The two polynucleotides can then hybridize with each other. This allows the helicase to be in close contact with the adapter, so that the ligation reaction is more effectively carried out. This is particularly conducive to connecting two or more helicases in the correct direction to control the movement of the target polynucleotide.

可以在所述构建体中添加标签使得所述构建体的纯化变得容易。这些标签然后可以化学剪切掉或酶剪切掉一一如果需要将它们去除的话。也可以包括荧光团或显色团,它们也可以被剪切。Tags can be added to the construct to facilitate purification of the construct. These tags can then be chemically or enzymatically cleaved if they need to be removed. Fluorophores or chromophores can also be included, which can also be cleaved.

纯化所述构建体的简单的方式为在每个蛋白质上(即所述解旋酶和所述多核苷酸结合部分)含有不同的纯化标签,诸如6Hi s标签和Str印标篆如果两个蛋白质相互不同,该方法特别有用。使用两个标签仅能使具有这两个标签的物质被容易的纯化。如果这两个蛋白质不具有两个不同的标签,可以使用其他的方法。A simple way to purify the construct is to include different purification tags on each protein (i.e. the helicase and the polynucleotide binding portion), such as a 6His tag and a Strep tag. This method is particularly useful if the two proteins are different from each other. Using two tags only allows substances with both tags to be easily purified. If the two proteins do not have two different tags, other methods can be used.

构建体可以基于不同的DNA进行性性质从未反应蛋白质中纯化出。特别的,构建体可以基于以下从未反应蛋白质中纯化出:提高的对目标分析物优选目标多核苷酸的亲和力,减小的结合后从分析物优选多核苷酸上解脱的可能性和/或在其控制分析物优选多核苷酸移位穿过纳米孔时增加的分析物优选多核苷酸的读取长度。The constructs can be purified from unreacted proteins based on different DNA processivity properties. In particular, the constructs can be purified from unreacted proteins based on: increased affinity for the target analyte, preferably the target polynucleotide, reduced likelihood of dissociation from the analyte, preferably the polynucleotide after binding, and/or increased read length of the analyte, preferably the polynucleotide when it controls translocation of the analyte, preferably the polynucleotide through the nanopore.

也可以设计结合到特异性多核苷酸序列的目标构建体。所述衔接体可以结合特异性的多核苷酸序列并由此将构建体的解旋酶部分靶向 到该特异性的序列。Targeting constructs can also be designed to bind to specific polynucleotide sequences. The adapter can bind to a specific polynucleotide sequence and thereby target the helicase portion of the construct. to that specific sequence.

衔接体Connector

在一个实施例中,构建体包括衔接体。在一个实施例中,衔接体为能结合到分析物优选多核苷酸的多肽。所述衔接体优选能特异性结合到限定的多核苷酸序列。在一些实施方式中,所述多衔接体结合到特异性的多核苷酸序列,而结合到不同的序列不能被检测到。结合到特异性序列的部分可以被用于设计靶向到所述序列的构建体。In one embodiment, the construct includes an adapter. In one embodiment, the adapter is a polypeptide that can bind to an analyte, preferably a polynucleotide. The adapter preferably can specifically bind to a defined polynucleotide sequence. In some embodiments, the multi-adapter binds to a specific polynucleotide sequence, while binding to a different sequence cannot be detected. The portion that binds to a specific sequence can be used to design a construct that targets the sequence.

在一个实施例中,所述衔接体具有与多核苷酸相互作用或修饰多核苷酸的至少一个性质。所述衔接体可以通过将所述多核苷酸剪切形成单个的核苷酸或更短的核苷酸链诸如两个核苷酸或三个核苷酸而对其进行修饰。所述衔接体可以通过定位所述多核苷酸或将其移动到特定位置,即控制其移动而对其进行修饰。In one embodiment, the adapter has at least one property of interacting with or modifying a polynucleotide. The adapter can modify the polynucleotide by shearing it to form a single nucleotide or a shorter nucleotide chain such as two or three nucleotides. The adapter can modify the polynucleotide by positioning it or moving it to a specific position, i.e. controlling its movement.

在一个实施例中,衔接体衍生自多核苷酸结合酶。多核苷酸结合酶是能结合多核苷酸并能与所述多核苷酸相互作用以及修饰其至少一个性质的多肽。所述酶可以通过将所述多核苷酸剪切为单个核苷酸或更短的核苷酸链,诸如二核苷酸或三核苷酸对所述多核苷酸进行修饰。所述酶可以通过将所述多核苷酸定向或移动到一特定位置而对其进行修饰。所述多核苷酸结合部分不需要显示酶活性,只要其能结合所述多核苷酸并控制其移动即可。例如,所述部分可以衍生自己被修饰为去除了其酶活性或可以在防止其作为酶发挥作用的条件下使用的酶。In one embodiment, the adapter is derived from a polynucleotide binding enzyme. A polynucleotide binding enzyme is a polypeptide that can bind to a polynucleotide and can interact with the polynucleotide and modify at least one of its properties. The enzyme can modify the polynucleotide by shearing the polynucleotide into a single nucleotide or a shorter nucleotide chain, such as a dinucleotide or a trinucleotide. The enzyme can modify the polynucleotide by directing or moving it to a specific position. The polynucleotide binding portion does not need to show enzymatic activity, as long as it can bind to the polynucleotide and control its movement. For example, the portion can be derived from an enzyme that has been modified to remove its enzymatic activity or can be used under conditions that prevent it from acting as an enzyme.

在一个实施例中,衔接体衍生自溶核酶。在一个实施例中,优选的酶为核酸外切酶,聚合酶,解旋酶和拓扑异构酶,诸如促旋酶。In one embodiment, the adaptor is derived from a nucleolytic enzyme.In one embodiment, preferred enzymes are exonucleases, polymerases, helicases and topoisomerases, such as gyrase.

在一个实施例中,衔接体与解旋酶和目标分析物(优选目标多核苷酸)连接。在一个实施例中,衔接体是核酸。在一个实施例中,衔接体是多核苷酸。在另一个实施例中,衔接体包括双链核酸区,此区能够连接其他双链核酸,例如目标多核苷酸。在一个实施例中,衔接体是Y-型衔接体。在一个实施例中,衔接体是3条DNA链组成的。在一个实施例中,衔接体包括间隔区,优选间隔区插在衔接体中,或间隔区位于一条链的5’端或3’端。在一个实施例中,衔接体是包含间隔区的多核苷酸。在一个实施例中,衔接体是包含间隔区的Y-型多核苷酸。在一个实施例中,衔接体是含有间隔区的多核苷酸、多肽、多糖或脂质。解旋酶可以停滞在衔接体的间隔区上。通过将解旋酶和衔 接体上的多核苷酸(或多肽或多糖或脂质)与跨膜孔接触并施加电势,可以移动停滞的解旋酶穿过衔接体上的间隔区。由于解旋酶不能穿过跨膜孔,衔接体的多核苷酸(或多肽或多糖或脂质)沿电势移动穿过跨膜孔产生的力将解旋酶移动穿过间隔区。在一个实施例中,包括多个衔接体。在一个实施例中,所述衔接体是所述目标分析物的一部分,或者不是所述目标分析物的一部分。在一个实施例中,其他衔接体可穿插于目标分析物(优选目标多核苷酸)中。In one embodiment, the adapter is connected to the helicase and the target analyte (preferably the target polynucleotide). In one embodiment, the adapter is a nucleic acid. In one embodiment, the adapter is a polynucleotide. In another embodiment, the adapter includes a double-stranded nucleic acid region that can connect to other double-stranded nucleic acids, such as target polynucleotides. In one embodiment, the adapter is a Y-shaped adapter. In one embodiment, the adapter is composed of three DNA chains. In one embodiment, the adapter includes a spacer, preferably the spacer is inserted in the adapter, or the spacer is located at the 5' end or 3' end of one chain. In one embodiment, the adapter is a polynucleotide comprising a spacer. In one embodiment, the adapter is a Y-shaped polynucleotide comprising a spacer. In one embodiment, the adapter is a polynucleotide, polypeptide, polysaccharide or lipid containing a spacer. The helicase can be arrested on the spacer of the adapter. By connecting the helicase and the adapter The polynucleotide (or polypeptide or polysaccharide or lipid) on the adapter contacts the transmembrane pore and applies an electric potential, which can move the stalled helicase through the spacer on the adapter. Since the helicase cannot pass through the transmembrane pore, the polynucleotide (or polypeptide or polysaccharide or lipid) of the adapter moves through the transmembrane pore along the electric potential, and the force generated moves the helicase through the spacer. In one embodiment, a plurality of adapters are included. In one embodiment, the adapter is part of the target analyte, or is not part of the target analyte. In one embodiment, other adapters may be interspersed in the target analyte (preferably the target polynucleotide).

多核苷酸序列Polynucleotide sequence

本申请实施例提供了含有编码本申请实施例的解旋酶、本申请实施例的多肽或本申请实施例的构建体的序列的多核苷酸。所述多核苷酸可以含有这样的序列。所述多核苷酸可以是上述讨论的任意的多核苷酸。The present application embodiment provides a polynucleotide containing a sequence encoding a helicase of the present application embodiment, a polypeptide of the present application embodiment, or a construct of the present application embodiment. The polynucleotide may contain such a sequence. The polynucleotide may be any polynucleotide discussed above.

本文描述的任意的蛋白可以使用本领域已知的方法表达。可以使用本领域标准方法分离和复制多核苷酸序列。染色体DNA可以从产生解旋酶的生物体诸如伯顿拟甲烷球菌和/或产生SSB的生物体,诸如大肠杆菌提取。可以使用具有特异性引物的PCR扩增感兴趣的基因编码序列。然后可将扩增的序列插入到重组复制载体诸如克隆载体中。所述载体可以用于在相容的宿主细胞中复制所述多核苷酸。因此可以通过将编码感兴趣序列的多核苷酸引入可复制载体中,包括将该载体引入相容的宿主细胞中,以及在引发载体复制的条件下使所述宿主细胞生长,而制备多核苷酸序列。所述载体可以从所述宿主细胞回收。适合的用于克隆多核苷酸的宿主细胞为本领域已知的并在下文更详细描述。Any of the proteins described herein can be expressed using methods known in the art. Polynucleotide sequences can be isolated and replicated using standard methods in the art. Chromosomal DNA can be extracted from organisms that produce helicases, such as Methanococcus burtonii and/or organisms that produce SSB, such as Escherichia coli. PCR with specific primers can be used to amplify the gene coding sequence of interest. The amplified sequence can then be inserted into a recombinant replication vector such as a cloning vector. The vector can be used to replicate the polynucleotide in a compatible host cell. Therefore, a polynucleotide sequence can be prepared by introducing a polynucleotide encoding a sequence of interest into a replicable vector, including introducing the vector into a compatible host cell, and growing the host cell under conditions that cause the vector to replicate. The vector can be recovered from the host cell. Suitable host cells for cloning polynucleotides are known in the art and are described in more detail below.

可将所述多核苷酸序列克隆进适合的表达载体中。在表达载体中,多核苷酸序列通常可操作性连接到控制序列,该控制序列能通过宿主细胞提供所述编码序列的表达。该表达载体可用于表达构建体。The polynucleotide sequence can be cloned into a suitable expression vector. In the expression vector, the polynucleotide sequence is usually operably linked to a control sequence that can provide expression of the coding sequence by a host cell. The expression vector can be used to express a construct.

术语"可操作性连接"指的是并列放置,其中描述的多个组分处于使它们以预期方式发挥功能的关系。控制序列"可操作性连接"到编码序列指的是在与控制序列兼容的条件下以实现编码序列的表达的方式连接。相同或不同的多核苷酸的多个拷贝可以被引入载体中。The term "operably linked" refers to a juxtaposition wherein the various components described are in a relationship permitting them to function in their intended manner. A control sequence "operably linked" to a coding sequence is linked under conditions compatible with the control sequences in a manner that achieves expression of the coding sequence. Multiple copies of the same or different polynucleotides may be introduced into a vector.

所述表达载体可以随后被引入适合的宿主细胞中。因此,可以通过将编码构建体的多核苷酸序列引入表达载体中,包括将所述载体引 入兼容的细菌宿主细胞中,并在引发多核苷酸序列表达的条件下使所述宿主细胞生长,而制备构建体。The expression vector can then be introduced into a suitable host cell. Thus, the polynucleotide sequence encoding the construct can be introduced into an expression vector, including introducing the vector into The construct is prepared by introducing the polynucleotide sequence into a compatible bacterial host cell and growing the host cell under conditions that result in expression of the polynucleotide sequence.

所述载体可以为例如具有复制起点的质粒、病毒或噬菌体载体,可选的为用于表达所述多核苷酸序列的启动子,以及可选的为所述启动子的调控因子。所述载体可含有一个或多个选择性标记基因,例如氨苄青霉素抗性基因。可选择与宿主细胞(针对其设计了所述表达载体)兼容的启动子和其他表达调控信号。通常使用T7,trc,lac,ara或k启动子。The vector can be, for example, a plasmid, virus or phage vector with a replication origin, optionally a promoter for expressing the polynucleotide sequence, and optionally a regulatory factor for the promoter. The vector can contain one or more selectable marker genes, such as an ampicillin resistance gene. Promoters and other expression control signals compatible with the host cell (for which the expression vector is designed) can be selected. T7, trc, lac, ara or k promoters are usually used.

所述宿主细胞通常以高水平表达所述构建体。选择与用于转化所述宿主细胞的表达载体兼容的宿主细胞进行多核苷酸序列转化。所述宿主细胞通常为细菌的并优选大肠杆菌。任何具有λDE3溶原菌(lysogen)的细胞,例如Rosetta2(DE3)pLys,C41(DE3),BL21(DE3),JM109(DE3),B834(DE3),TUNER,0rigami和Origami B,均能表达含有T7启动子的载体。The host cell typically expresses the construct at a high level. A host cell compatible with the expression vector used to transform the host cell is selected for polynucleotide sequence transformation. The host cell is typically bacterial and preferably E. coli. Any cell with a λDE3 lysogen, such as Rosetta2(DE3)pLys, C41(DE3), BL21(DE3), JM109(DE3), B834(DE3), TUNER, Origami and Origami B, can express vectors containing a T7 promoter.

解旋酶的组Helicase group

本申请实施例还提供过了连接(或结合)多核苷酸的两个或更多个解旋酶的组(series),其中所述两个或更多个解旋酶中的至少一个是本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。所述组可以包括任意数量的解旋酶,诸如2,3,4,5,6,7,8,9,10或更多个解旋酶。任意数量的解旋酶可以是本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。两个或更多个解旋酶的全部优选为本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。本申请实施例的一个或多个解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶可以是以上讨论的任意的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。The present application embodiment also provides the group (series) of two or more helicases that have been connected (or combined) polynucleotide, wherein at least one of the two or more helicases is the helicase of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase.The group can include any number of helicases, such as 2,3,4,5,6,7,8,9,10 or more helicases.Any number of helicases can be the helicase of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase.All of two or more helicases are preferably the helicase of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase.One or more helicases of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase can be any helicase discussed above, preferably Dda helicase, more preferably T4-Dda helicase.

所述两个或更多个解旋酶可以是相同的或不同的解旋酶。例如,如果所述组包括两个或更多个本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶,本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶可以是相同的或不同的。The two or more helicases can be the same or different helicases. For example, if the group includes two or more helicases of the present application embodiments, preferably Dda helicases, more preferably T4-Dda helicases, the helicases of the present application embodiments, preferably Dda helicases, more preferably T4-Dda helicases can be the same or different.

所述组可以包括任意数量和组合的本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。所述两个或更多个解旋酶的组优选包括至少两个本申请实施例的解旋酶,优选Dda解旋酶,更优选 T4-Dda解旋酶。所述组可以包括两个或更多个解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。每个解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶包括上文所述的修饰,包括1A结构域和2A结构域形成异位键。The group may include any number and combination of helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases. The group of two or more helicases preferably includes at least two helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicase. The group may include two or more helicases, preferably Dda helicases, more preferably T4-Dda helicases. Each helicase, preferably Dda helicase, more preferably T4-Dda helicase comprises the modification described above, including that the 1A domain and the 2A domain form an ectopic bond.

除了本申请实施例的一个或多个解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶,所述组可以包括不是本申请实施例的一部分的一个或多个解旋酶。所述一个或多个解旋酶可以是或衍生自Hel308解旋酶,RecD解旋酶,诸如Tral解旋酶或TrwC解旋酶,XPD解旋酶或Dda解旋酶。所述一个或多个解旋酶可以是下述国际申请中公开的解旋酶、修饰的解旋酶或解旋酶构建体中的任意解旋酶:国际申请No.PCT/GB2012/052579(公布号为W0 2013/057495);PCT/GB2012/053274(公布号为W0 2013/098562);PCT/GB2012/053273(公布号为W02013/098561);PCT/GB2013/051925(公布号为WO 2014/013260);PCT/GB2013/051924(公布号为WO 2014/013259)和PCT/GB2013/051928(公布号为WO 2014/013262);以及2013年10月18日申请的UK申请No.1318464.3。特别的,所述一个或多个解旋酶优选被修饰以减少在多核苷酸结合结构域的开口的尺寸,所述多核苷酸可以至少一种构型状态穿过所述开口而从所述解旋酶解脱。这在WO 2014/013260中有公开。In addition to one or more helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases, the group may include one or more helicases that are not part of the embodiments of the present application. The one or more helicases may be or are derived from Hel308 helicases, RecD helicases, such as Tral helicases or TrwC helicases, XPD helicases or Dda helicases. The one or more helicases may be any helicase in a helicase, modified helicase or helicase construct disclosed in the following international applications: International Application No. PCT/GB2012/052579 (Publication No. WO 2013/057495); PCT/GB2012/053274 (Publication No. WO 2013/098562); PCT/GB2012/053273 (Publication No. WO 2013/098564); 13/098561); PCT/GB2013/051925 (published as WO 2014/013260); PCT/GB2013/051924 (published as WO 2014/013259) and PCT/GB2013/051928 (published as WO 2014/013262); and UK application No. 1318464.3 filed on 18 October 2013. In particular, the one or more helicases are preferably modified to reduce the size of the opening in the polynucleotide binding domain through which the polynucleotide can be released from the helicase in at least one conformational state. This is disclosed in WO 2014/013260.

在所述组中的所述两个或更多个解旋酶可以是相互分离的。所述组中的所述两个或更多个解旋酶可以在随着多核苷酸移动穿过所述孔而通过跨膜孔结合在一起。所述组中的所述两个或更多个解旋酶可以相互接触。The two or more helicases in the group may be separated from each other. The two or more helicases in the group may be bound together by the transmembrane pore as the polynucleotide moves through the pore. The two or more helicases in the group may contact each other.

优选所述两个或更多个解旋酶除了通过多核苷酸相互连接之外不相互连接。所述两个或更多个解旋酶优选不相互共价连接。Preferably, the two or more helicases are not linked to each other except by the polynucleotide. The two or more helicases are preferably not covalently linked to each other.

所述两个或更多个解旋酶可以相互连接或相互共价连接。所述解旋酶可以任何次序以及使用任何方法连接。一组连接的解旋酶可以称为一队。The two or more helicases can be connected to each other or covalently connected to each other. The helicases can be connected in any order and using any method. A group of connected helicases can be called a team.

控制目标分析物优选目标多核苷酸移动的方法Method for controlling the movement of a target analyte, preferably a target polynucleotide

本申请实施例提供了一种控制目标分析物优选目标多核苷酸移动的方法。所述方法包括使目标分析物优选目标多核苷酸与实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶或本申请实施例的构建 体接触,并由此控制所述目标分析物优选目标多核苷酸的移动。该方法优选在跨所述孔施加的电势的下实施。如下详细描述的,施加的电势通常导致在孔和解旋酶或构建体之间形成复合体。施加的电势可以为电压电势。替换的,所施加电势可以为化学电势。其一个实例为使用跨两性分子层的盐梯度。盐梯度在Holden等,J Am Chem Soc.2007Jul 11;129(27):8650-5中公开。The present invention provides a method for controlling the movement of a target analyte, preferably a target polynucleotide. The method comprises allowing the target analyte, preferably a target polynucleotide, to move with a helicase of the embodiment, preferably a Dda helicase, more preferably a T4-Dda helicase or a construct of the present invention. The method is preferably carried out under a potential applied across the pore. As described in detail below, the applied potential generally results in the formation of a complex between the pore and the helicase or construct. The applied potential may be a voltage potential. Alternatively, the applied potential may be a chemical potential. An example of this is the use of a salt gradient across the amphiphilic layer. Salt gradients are disclosed in Holden et al., J Am Chem Soc. 2007 Jul 11; 129(27): 8650-5.

本申请实施例还提供了表征目标分析物优选目标多核苷酸的方法。所述方法包括(a)将目标分析物优选目标多核苷酸与孔和实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶或本申请实施例的构建体接触,使得所述解旋酶或构建体控制目标分析物优选目标多核苷酸穿过所述孔的移动。所述方法还包括(b)随着所述目标分析物优选多核苷酸相对于所述孔移动,获取一个或多个测量值,其中所述测量值代表所述目标分析物优选目标核苷酸的一个或多个特征,并由此表征所述目标分析物优选目标多核苷酸。The present application embodiment also provides a method for characterizing the preferred target polynucleotide of target analyte. The method includes (a) contacting the preferred target polynucleotide of target analyte with the helicase of hole and embodiment, preferably Dda helicase, more preferably T4-Dda helicase or the construct of the present application embodiment, so that the helicase or construct control the preferred target polynucleotide of target analyte through the movement of the hole. The method also includes (b) as the preferred polynucleotide of target analyte moves relative to the hole, obtaining one or more measured values, wherein the measured value represents one or more features of the preferred target nucleotide of the target analyte, and thus characterizing the preferred target polynucleotide of target analyte.

在本申请实施例的所有方法中,所述解旋酶可以是如上针对本申请实施例的构建体所述的任意的解旋酶,包括本申请实施例的修饰的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶和未根据本申请实施例修饰的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。In all methods of the embodiments of the present application, the helicase can be any helicase as described above for the constructs of the embodiments of the present application, including the modified helicases of the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases, and helicases not modified according to the embodiments of the present application, preferably Dda helicases, more preferably T4-Dda helicases.

可以在这些方法中使用任意数量的本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。例如,可以使用1,2,3,4,5,6,7,8,9,10或更多个解旋酶。如果使用本申请实施例的两个或更多个解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶,它们可以相同或不同。适合的数目和组合如上述针对本申请实施例的组中所表述的。它们可同等应用于本申请实施例的方法。Can use any number of helicases of the present application embodiment in these methods, preferably Dda helicase, more preferably T4-Dda helicase.For example, 1,2,3,4,5,6,7,8,9,10 or more helicases can be used.If use two or more helicases of the present application embodiment, preferably Dda helicase, more preferably T4-Dda helicase, they can be identical or different.Suitable number and combination are as described in the group for the present application embodiment above.They can be equally applied to the method for the present application embodiment.

如果使用两个或更多个解旋酶,它们就可以相互连接。所述两个或更多个解旋酶可以相互共价连接。所述解旋酶可以任何次序以及使用任何方法连接。If two or more helicases are used, they may be linked to each other. The two or more helicases may be covalently linked to each other. The helicases may be linked in any order and using any method.

如果使用两个或更多个解旋酶,它们优选除了通过多核苷酸相互连接之外不彼此连接。所述两个或更多个解旋酶更优选不相互共价连接。If two or more helicases are used, they are preferably not linked to each other except by the polynucleotide. The two or more helicases are more preferably not covalently linked to each other.

步骤(a)和(b)优选在跨如上所述的跨孔施加电势的条件下实施。在一些情况下,随着多核苷酸相对于孔移动,使用穿过孔的电流确定所 述目标多核苷酸的序列。这就是链测序。Steps (a) and (b) are preferably performed under conditions where a potential is applied across the pore as described above. In some cases, the current passing through the pore is used to determine the amount of polynucleotide that moves relative to the pore. The sequence of the target polynucleotide is determined by the DNA sequencing. This is called strand sequencing.

本申请实施例的方法为用于表征目标分析物优选目标多核苷酸。多核苷酸如上所限定。The method of the embodiment of the present application is used to characterize the target analyte, preferably the target polynucleotide. The polynucleotide is as defined above.

所述目标多核苷酸的整个或仅部分可以使用该方法表征。所述目标多核苷酸能够为任何长度。例如,多核苷酸可以为至少10个,至少50个,至少100,至少150,至少200,至少250,至少300,至少400或至少500个核苷酸对长度。所述多核苷酸可以为1000或更多个核苷酸对,5000或更多个核苷酸对长度或100000或更多个核苷酸对长度。The whole or only part of the target polynucleotide can be characterized using this method. The target polynucleotide can be any length. For example, the polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotide pairs in length. The polynucleotide can be 1000 or more nucleotide pairs, 5000 or more nucleotide pairs in length or 100000 or more nucleotide pairs in length.

所述目标多核苷酸通常存在于任何合适的样品中。本申请实施例通常在已知含有或怀疑含有目标多核苷酸的样品上实施。可替换的,可以对样本实施本申请实施例,以确认已知或预期存在于所述样本中的一个或多个目标多核苷酸的同一性。The target polynucleotide is usually present in any suitable sample. The present application embodiment is usually implemented on a sample known to contain or suspected to contain the target polynucleotide. Alternatively, the present application embodiment can be implemented to a sample to confirm the identity of one or more target polynucleotides known or expected to be present in the sample.

所述样品可以是生物样品。本申请实施例可以针对从任何生物体或微生物中获得或提取的样品在体外实施。所述生物体或微生物通常是古核的,原核的或真核的,并且通常属于以下五界中的一个:植物界,动物界,真菌,无核原生物和原生生物。本申请实施例针对从任何病毒中获得或提取的样品在体外实施。所述样品优选是液体样品。样品通常包括患者的体液。所述样品可以是尿液,淋巴液,唾液,粘液或羊水,但优选血液,血浆或血清。通常,所述样品是来源于人的,但可替代地可以是来自其他哺乳动物的,如来自商业上养殖的动物如马,牛,绵羊或猪,或者可以是宠物如猫或狗。或者,植物来源的样品通常从商业作物获得,如谷类,豆类,水果或蔬菜,例如小麦,大麦,燕麦,芸苔,玉米,大豆,水稻,香蕉,苹果,番茄,土豆,葡萄,烟草,菜豆,小扁豆,甘蔗,可可,棉花。The sample can be a biological sample. The present application embodiment can be implemented in vitro for samples obtained or extracted from any organism or microorganism. The organism or microorganism is usually ancient nuclear, prokaryotic or eukaryotic, and usually belongs to one of the following five kingdoms: plant kingdom, animal kingdom, fungi, ankaryotic protozoa and protists. The present application embodiment is implemented in vitro for samples obtained or extracted from any virus. The sample is preferably a liquid sample. The sample usually includes the patient's body fluid. The sample can be urine, lymph, saliva, mucus or amniotic fluid, but preferably blood, plasma or serum. Usually, the sample is derived from humans, but can alternatively be from other mammals, such as from commercially raised animals such as horses, cattle, sheep or pigs, or can be pets such as cats or dogs. Alternatively, samples of plant origin are usually obtained from commercial crops, such as cereals, beans, fruits or vegetables, such as wheat, barley, oats, Brassica, corn, soybeans, rice, bananas, apples, tomatoes, potatoes, grapes, tobacco, beans, lentils, sugarcane, cocoa, cotton.

所述样品可以是非生物样品。非生物样品优选为液体样品。非生物样品的示例包括外科手术液体,水如饮用水,海水或河水,以及用于实验室测试的试剂。The sample may be a non-biological sample. The non-biological sample is preferably a liquid sample. Examples of non-biological samples include surgical fluids, water such as drinking water, sea water or river water, and reagents for laboratory tests.

样品通常在用于本申请实施例前进行处理,例如通过离心或通过膜滤除不需要的分子或细胞,例如红血细胞。可以在获取所述样本后立即进行检测。通常也可以在分析前储存所述样本,优选低于-70℃。The sample is usually processed before being used in the present application embodiment, for example, by centrifugation or filtering through a membrane to remove unwanted molecules or cells, such as red blood cells. The test can be performed immediately after obtaining the sample. The sample can also be stored before analysis, preferably below -70°C.

孔是一定程度跨越膜的结构。它允许通过施加的电势驱动水合离子使其跨膜流动或在膜内流动。孔通常跨越整个膜以使得水合离子可 以从膜的一侧流动到所述膜的另一侧。然而,孔不是必须跨越膜。其可以在一端封闭。例如,孔可以是膜中的井,水合离子可以沿着其流动或流入。A pore is a structure that spans the membrane to some extent. It allows hydrated ions to flow across or within the membrane by driving them through an applied potential. A pore usually spans the entire membrane to allow hydrated ions to flow through the membrane. The pores may be formed by a membrane having a plurality of pores, each of which is a well in the membrane, along which hydrated ions may flow or flow.

本申请实施例可使用任何孔。该孔可以是生物的或人造的。合适的孔包括但不限于,蛋白质孔,多核苷酸孔和固态孔。Any pore may be used in the embodiments of the present application. The pore may be biological or artificial. Suitable pores include, but are not limited to, protein pores, polynucleotide pores, and solid-state pores.

适的膜优选为两亲层。两亲层是由两亲分子形成的层,例如,磷脂,其具有亲水性和亲油性。两亲分子可以是合成的或天然存在的。两亲层可以是单层或双层。两亲层通常是平面的。两亲层可以是弯曲的。可以对两亲层进行支撑。膜可以为脂质双层。脂质双层是由脂质的两个相对的层形成的。脂质的两层被排列为使得它们的疏水性尾部基团面向彼此以形成疏水性内部。脂质的亲水性头部基团面向外朝向该双层的每一侧上的含水环境。膜包括固态层。固态层可以由有机和无机材料形成。如果膜包括固态层,则孔通常存在于两亲膜中或固态层内包括的层中,例如,固态层内的孔洞、阱、间隙、通道、沟槽或狭缝中。Suitable membranes are preferably amphiphilic layers. An amphiphilic layer is a layer formed by amphiphilic molecules, such as phospholipids, which have hydrophilicity and lipophilicity. Amphiphilic molecules can be synthetic or naturally occurring. An amphiphilic layer can be a monolayer or a bilayer. An amphiphilic layer is typically planar. An amphiphilic layer can be curved. The amphiphilic layer can be supported. The membrane can be a lipid bilayer. A lipid bilayer is formed by two opposing layers of lipids. The two layers of lipids are arranged so that their hydrophobic tail groups face each other to form a hydrophobic interior. The hydrophilic head groups of the lipids face outward toward the aqueous environment on each side of the bilayer. The membrane includes a solid layer. The solid layer can be formed by organic and inorganic materials. If the membrane includes a solid layer, holes are typically present in the amphiphilic membrane or in a layer included in the solid layer, such as holes, wells, gaps, channels, grooves or slits in the solid layer.

所述多核苷酸可以连接到膜上。这可以使用任何已知的方法来完成。如果膜是两性分子层,如脂质双分子层,所述多核苷酸优选通过在所述膜中存在的多肽或通过在所述膜中存在的疏水锚被连接到该膜上。疏水锚优选脂质,脂肪酸,甾醇,碳纳米管或氨基酸。The polynucleotide may be attached to the membrane. This may be accomplished using any known method. If the membrane is an amphiphilic layer, such as a lipid bilayer, the polynucleotide is preferably attached to the membrane via a polypeptide present in the membrane or via a hydrophobic anchor present in the membrane. The hydrophobic anchor is preferably a lipid, a fatty acid, a sterol, a carbon nanotube or an amino acid.

所述多核苷酸可以直接连接到膜上。所述多核苷酸优选通过接器连接到膜上。优选的接器包括但不限于,聚合物,如多核苷酸,聚乙二醇(PEG)和多肽。如果所述多核苷酸直接连接到所述膜,由于该膜和所述解旋酶之间的距离,导致表征不能进行到所述多核苷酸的末端,则将丢失一些数据。如果使用接器,则多核苷酸能够被表征完全。如果使用接器时,可将接器连接到所述多核苷酸的任何位置。接器通常连接到所述多核苷酸的尾部聚合物处。The polynucleotide can be directly connected to the membrane. The polynucleotide is preferably connected to the membrane through a connector. Preferred connectors include, but are not limited to, polymers, such as polynucleotides, polyethylene glycol (PEG) and polypeptides. If the polynucleotide is directly connected to the membrane, due to the distance between the membrane and the helicase, the characterization cannot be carried out to the end of the polynucleotide, and some data will be lost. If a connector is used, the polynucleotide can be fully characterized. If a connector is used, the connector can be connected to any position of the polynucleotide. The connector is usually connected to the tail polymer of the polynucleotide.

该连接可以是稳定的或暂时的。对于某些应用,该连接的暂时性的性质是是优选的。如果稳定的连接分子直接地连接到多核苷酸的5'末端或3'末端,则由于该膜和解旋酶的活性位点之间的距离,表征不能进行到多核苷酸的末端,则将会导致数据的丢失。如果连接是暂时性的,那么当连接的末端随机变成没有膜时,则多核苷酸能被表征完全。在优选的实施例中,多核苷酸连接到两性分子层。 The connection can be stable or temporary. For some applications, the temporary nature of the connection is preferred. If the stable linker is directly connected to the 5' end or 3' end of the polynucleotide, then due to the distance between the active site of the membrane and the helicase, the characterization can not proceed to the end of the polynucleotide, which will cause the loss of data. If the connection is temporary, then when the end of the connection randomly becomes without membrane, the polynucleotide can be characterized completely. In a preferred embodiment, the polynucleotide is connected to an amphiphilic layer.

跨膜孔或纳米孔优选跨膜蛋白孔,其限定了允许分子和离子从膜的一侧易位到另一侧的通道或孔。离子物质通过孔的易位可以由施加到孔任一侧的电位差驱动。“纳米孔”是一种蛋白孔或跨膜孔,其中分子或离子所通过的通道的最小直径为纳米级(10-9米)。在一些实施方案中,蛋白孔可以是跨膜蛋白孔或跨膜孔或纳米孔。跨膜孔的跨膜蛋白结构本质上可以是单体或寡聚体。通常,孔包含多个围绕中心轴排列的多肽亚基,从而形成基本上垂直于孔所驻留的膜延伸的蛋白内衬通道。多肽亚基的数量没有限制。通常,亚基的数量为5至30,合适地亚基的数量为6至10。可替代地,亚基的数量不像在产气荚膜梭菌溶素(perfringolysin)或相关大膜孔的情况下那样定义。孔内形成蛋白内衬通道的蛋白亚基部分通常包含可包括一个或多个跨膜β-桶和/或α-螺旋部分的二级结构基序。Transmembrane pores or nanopores are preferably transmembrane protein pores, which define channels or holes that allow molecules and ions to be translocated from one side of the membrane to the other side. The translocation of ionic substances through the hole can be driven by a potential difference applied to either side of the hole."Nanopore" is a protein pore or transmembrane pore, in which the minimum diameter of the channel through which molecules or ions pass is nanometer-scale ( 10-9 meters). In some embodiments, the protein pore can be a transmembrane protein pore or a transmembrane pore or a nanopore. The transmembrane protein structure of the transmembrane pore can be a monomer or an oligomer in nature. Typically, the hole comprises a plurality of polypeptide subunits arranged around a central axis, thereby forming a protein-lined channel extending substantially perpendicular to the membrane where the hole resides. The number of polypeptide subunits is not limited. Typically, the number of subunits is 5 to 30, and the number of subunits is suitably 6 to 10. Alternatively, the number of subunits is not defined as in the case of perfringolysin or related large membrane pores. The portion of the protein subunit that forms the protein-lined channel within the pore typically comprises secondary structural motifs that may include one or more transmembrane β-barrels and/or α-helical portions.

实施例的方法可以测量多核苷酸的两个、三个、四个或五个或更多个特征。所述一个或多个特征优选选自(i)多核苷酸的长度,(ii)多核苷酸的身份,(iii)多核苷酸的序列,(iv)多核苷酸的二级结构,以及(v)多核苷酸是否被修饰。在一个实施例中,可以测量(i)至(v)的任意组合。The method of the embodiment can measure two, three, four or five or more characteristics of the polynucleotide. The one or more characteristics are preferably selected from (i) the length of the polynucleotide, (ii) the identity of the polynucleotide, (iii) the sequence of the polynucleotide, (iv) the secondary structure of the polynucleotide, and (v) whether the polynucleotide is modified. In one embodiment, any combination of (i) to (v) can be measured.

对于(i),可以例如通过确定多核苷酸和孔之间相互作用的数量或多核苷酸和孔之间相互作用的持续时间对多核苷酸的长度进行测量。For (i), the length of the polynucleotide may be measured, for example, by determining the number of interactions between the polynucleotide and the pore or the duration of interactions between the polynucleotide and the pore.

对于(ii),可以以多种方式测量多核苷酸的身份,多核苷酸的身份可以结合多核苷酸序列的测量或不结合多核苷酸序列的测量进行测量。前者较为简单;对多核苷酸进行测序进而进行识别。后者可以通过几种不同方式完成。例如,可以测量多核苷酸中特定基序的存在(无需测量多核苷酸的其余序列)。或者,所述方法中特定的电和/或光信号的测量可以识别出所述多核苷酸来自特定来源。With respect to (ii), the identity of a polynucleotide can be measured in a variety of ways, and the identity of a polynucleotide can be measured in conjunction with or without measurement of the polynucleotide sequence. The former is relatively simple; the polynucleotide is sequenced and identified. The latter can be accomplished in several different ways. For example, the presence of a particular motif in a polynucleotide can be measured (without measuring the rest of the sequence of the polynucleotide). Alternatively, the measurement of a specific electrical and/or optical signal in the method can identify that the polynucleotide is from a particular source.

对于(iii),多核苷酸的序列可以如先前所述进行测定。For (iii), the sequence of the polynucleotide can be determined as previously described.

对于(iv),二级结构可以采用多种方法测量。例如,如果所述方法涉及电学测量方法,则可以使用停留时间的变化或流过孔的电流的变化来测量所述二级结构。这允许区分单链和双链多核苷酸的区域。For (iv), secondary structure can be measured using a variety of methods. For example, if the method involves an electrical measurement method, the secondary structure can be measured using a change in residence time or a change in the current flowing through the pore. This allows for the differentiation of regions of single-stranded and double-stranded polynucleotides.

对于(v),可以测量是否存在任何修饰。所述方法优选包括,测定多核苷酸是否通过甲基化,氧化,损伤,用一种或多种蛋白或用一种或多种标记,标签或进行无碱基或缺少核碱基和糖的修饰。特定的修 饰将导致与所述孔的特异性相互作用,其可以使用下述方法进行测量。例如,甲基胞嘧啶可以基于其与每个核苷酸相互作用期间流过所述孔的电流而与胞嘧啶区分开来。For (v), the presence or absence of any modification may be measured. The method preferably comprises determining whether the polynucleotide is modified by methylation, oxidation, damage, treatment with one or more proteins or with one or more markers, tags or is abasic or lacking nucleobases and sugars. The decoration will result in a specific interaction with the pore, which can be measured using the methods described below. For example, methylcytosine can be distinguished from cytosine based on the current flowing through the pore during its interaction with each nucleotide.

可以进行各种不同类型的测量。这包括但不限于电学测量和光学测量。电学测量包括电压测量、电容测量、电流测量,阻抗测量,隧道测量(tunnelling measurement)(Ivanov AP et al.,Nano Lett.2011Jan 12;11(I):279-85)以及FET测量(国际申请WO 2005/124888)。光学测量可以与电学测量结合(Soni GV et al.,Rev Sci Instrum.2010Jan;81(1)014301)。所述测量可以是跨膜电流测量,例如流过所述孔的离子电流的测量。在一个实施例中,电学测量或光学测量可采用常规的电学或光学测量。Various different types of measurements can be performed. This includes, but is not limited to, electrical measurements and optical measurements. Electrical measurements include voltage measurements, capacitance measurements, current measurements, impedance measurements, tunneling measurements (Ivanov AP et al., Nano Lett. 2011 Jan 12; 11(I): 279-85) and FET measurements (International Application WO 2005/124888). Optical measurements can be combined with electrical measurements (Soni GV et al., Rev Sci Instrum. 2010 Jan; 81(1) 014301). The measurement can be a transmembrane current measurement, such as a measurement of the ionic current flowing through the pore. In one embodiment, the electrical measurement or optical measurement can use conventional electrical or optical measurements.

电学测量可以使用描述在Stoddart D et al·,ProC Natl Acad Sci,12;106(19)7702-7,Lieberman KR et al,J Am Chem SoC.2010;132(50)17961-72和国际申请WO 2000/28312中的标准单通道记录设备进行。替代地,电学测量可以使用多通道系统进行,例如如国际申请W02009/077734和国际申请WO 2011/067559中描述的。Electrical measurements can be made using standard single channel recording equipment as described in Stoddart D et al., Proc Natl Acad Sci, 12; 106 (19) 7702-7, Lieberman KR et al., J Am Chem Soc. 2010; 132 (50) 17961-72 and international application WO 2000/28312. Alternatively, electrical measurements can be made using a multi-channel system, such as described in international application WO 2009/077734 and international application WO 2011/067559.

所述方法优选采用跨膜施加的电势进行。所述施加的电势可以是电压电势。替代地,所施加的电势可以是化学电势。其一实例为采用跨膜,例如双亲性分子层的盐梯度进行。盐梯度被公开在Holden et al.,J Am Chem SoC.2007Jul 11;129(27):8650-5中。在一些情况下,多核苷酸相对所述孔移动时流过所述孔的电流用于估算或确定所述多核苷酸的序列。这就是链测序。The method is preferably performed using an applied potential across the membrane. The applied potential may be a voltage potential. Alternatively, the applied potential may be a chemical potential. An example of this is the use of a salt gradient across a membrane, such as a layer of amphiphilic molecules. Salt gradients are disclosed in Holden et al., J Am Chem SOC. 2007 Jul 11; 129(27): 8650-5. In some cases, the current flowing through the pore as the polynucleotide moves relative to the pore is used to estimate or determine the sequence of the polynucleotide. This is strand sequencing.

所述方法可以包括测量多核苷酸相对所述孔移动时流过所述孔的电流。因此用于所述方法的设备也可以包括能够施加电势并测量穿过膜和孔的电信号的电路。所述方法可以采用膜片钳或电压钳进行。The method may include measuring the current flowing through the hole when the polynucleotide moves relative to the hole. Therefore, the equipment used for the method may also include a circuit capable of applying an electric potential and measuring the electrical signal passing through the membrane and the hole. The method may be performed using a patch clamp or a voltage clamp.

可以包括测量多核苷酸相对所述孔移动时流过所述孔的电流。测量通过跨膜蛋白孔的离子流的合适条件是本领域已知的并且在实施例中公开。所述方法通常通过施加在所述膜和所述孔上的电压进行。所使用的电压通常为从+5V至-5V,例如从从+4V至-4V,从+3V至-3V或从+2V至-2V。所使用的电压通常为从-600mV至+600V或-400mV至+400mV。所使用的电压优选在具有选自-400mV,-300mV,-200mV,-150mV,-100mV,-50mV,-20mV和0mV的下限和独立地选自+10mV, +20mV,+50mV,+100mV,+150mV,+200mV,+300nA^P+400mV的上限的范围内。所使用的电压更优选在100mV至240mV的范围内并且最优选在120mV至220mV的范围内。通过使用增加的施加电势,可以增加孔对不同核苷酸的识别。Can comprise measuring the current flowing through the hole when the polynucleotide moves relative to the hole. Suitable conditions for measuring the ion flow through the transmembrane protein pore are known in the art and disclosed in the embodiments. The method is usually carried out by applying a voltage to the membrane and the hole. The voltage used is usually from +5V to -5V, for example from +4V to -4V, from +3V to -3V or from +2V to -2V. The voltage used is usually from -600mV to +600V or -400mV to +400mV. The voltage used is preferably selected from -400mV, -300mV, -200mV, -150mV, -100mV, -50mV, -20mV and 0mV lower limit and independently selected from +10mV, The voltage used is more preferably in the range of 100mV to 240mV and most preferably in the range of 120mV to 220mV. By using an increased applied potential, the recognition of different nucleotides by the pore can be increased.

所述方法通常在存在任何电荷载体的情况下进行,例如金属盐例如碱金属盐,卤化物盐例如氯化物盐,例如碱金属氯化物盐。电荷载体可以包括离子液体或有机盐,例如四甲基氯化铵,三甲基苯基氯化铵,苯基三甲基氯化铵或1-乙基-3-甲基咪唑鑰氯化物。在上述示例性装置中,盐存在于所述腔室中的水性溶液中。通常使用氯化钾(KCl),氯化钠(NaCl),氯化铯(CsCl)或亚铁氰化钾和铁氰化钾的混合物。KCl,NaCl和亚铁氰化钾和铁氰化钾的混合物是优选的。电荷载体在所述膜上可以是不对称的。例如,电荷载体的类型和/或浓度可以在所述膜的每一侧上不同。The method is generally carried out in the presence of any charge carrier, such as a metal salt such as an alkali metal salt, a halide salt such as a chloride salt, such as an alkali metal chloride salt. The charge carrier may include an ionic liquid or an organic salt, such as tetramethylammonium chloride, trimethylphenylammonium chloride, phenyltrimethylammonium chloride or 1-ethyl-3-methylimidazolium chloride. In the exemplary device described above, the salt is present in an aqueous solution in the chamber. Potassium chloride (KCl), sodium chloride (NaCl), cesium chloride (CsCl) or a mixture of potassium ferrocyanide and potassium ferrocyanide are generally used. KCl, NaCl and a mixture of potassium ferrocyanide and potassium ferrocyanide are preferred. The charge carrier may be asymmetric on the membrane. For example, the type and/or concentration of the charge carrier may be different on each side of the membrane.

所述盐的浓度可以是饱和的。所述盐的浓度可以为3M或更低,并且通常为0.1至2.5M,0.3至1.9M,0.5至1.8M,0.7至1.7M,0.9至1.6M或1M至1.4M。所述盐的浓度优选为150mM至1M。所述方法优选使用至少0.3M,例如至少0.4M,至少0.5M,至少0.6M,至少0.8M,至少1.0M,至少1.5M,至少2.0M,至少2.5M或至少3.0M的盐浓度进行。高盐浓度提供高的信噪比,并允许通过电流指示在正常电流波动背景下待识别的核苷酸的存在。The concentration of the salt may be saturated. The concentration of the salt may be 3 M or less, and is typically 0.1 to 2.5 M, 0.3 to 1.9 M, 0.5 to 1.8 M, 0.7 to 1.7 M, 0.9 to 1.6 M or 1 M to 1.4 M. The concentration of the salt is preferably 150 mM to 1 M. The method is preferably performed using a salt concentration of at least 0.3 M, such as at least 0.4 M, at least 0.5 M, at least 0.6 M, at least 0.8 M, at least 1.0 M, at least 1.5 M, at least 2.0 M, at least 2.5 M or at least 3.0 M. High salt concentrations provide a high signal-to-noise ratio and allow the presence of nucleotides to be identified in the context of normal current fluctuations to be indicated by current.

所述方法通常在存在缓冲液的情况下进行。在上述示例性装置中,所述缓冲液存在于所述腔室中的水性溶液中。任意缓冲液可以用于本申请实施例的方法。通常地,所述缓冲液为磷酸缓冲液。其他合适的缓冲液为HEPES或Tris-HCl缓冲液。所述方法通常在pH为4.0至12.0、4.5至10.0、5.0至9.0、5.5至8.8、6.0至8.7、7.0至8.8、或7.5至8.5下进行。使用的pH值优选约7.5。The method is usually carried out in the presence of a buffer. In the exemplary device described above, the buffer is present in an aqueous solution in the chamber. Any buffer can be used in the method of the present application embodiment. Typically, the buffer is a phosphate buffer. Other suitable buffers are HEPES or Tris-HCl buffer. The method is usually carried out at a pH of 4.0 to 12.0, 4.5 to 10.0, 5.0 to 9.0, 5.5 to 8.8, 6.0 to 8.7, 7.0 to 8.8 or 7.5 to 8.5. The pH used is preferably about 7.5.

所述方法可以在0℃至100℃,15℃至95℃,16℃至90℃,17℃至85℃,18℃至80℃,19℃至70℃或20℃至60℃温度下进行。所述方法通常在室温下进行。所述方法任选的在支持酶功能的温度下进行,例如约37℃。The method can be carried out at a temperature of 0°C to 100°C, 15°C to 95°C, 16°C to 90°C, 17°C to 85°C, 18°C to 80°C, 19°C to 70°C or 20°C to 60°C. The method is typically carried out at room temperature. The method is optionally carried out at a temperature that supports enzyme function, such as about 37°C.

在一个实施例中,该方法包括:In one embodiment, the method includes:

(a)将所述目标分析物优选目标多核苷酸与孔和本申请实施例的解 旋酶或本申请实施例的构建体接触,使得所述目标分析物优选目标多核苷酸移动穿过所述孔并且所述解旋酶或构建体控制所述目标分析物优选目标多核苷酸移动穿过所述孔;以及(a) the target analyte, preferably the target polynucleotide, is contacted with the pore and the solution of the embodiment of the present application contacting the pore with a helicase or a construct of an embodiment of the present application such that the target analyte, preferably a target polynucleotide, moves through the pore and the helicase or construct controls the movement of the target analyte, preferably a target polynucleotide, through the pore; and

(b)随着所述目标分析物优选多核苷酸相对于所述孔移动,获取穿过所述孔的电流,其中所述电流代表所述目标分析物优选目标多核苷酸的一个或多个特征,并由此表征所述目标分析物优选目标多核苷酸。(b) acquiring an electric current across the pore as the target analyte, preferably the polynucleotide, moves relative to the pore, wherein the electric current represents one or more characteristics of the target analyte, preferably the polynucleotide, and thereby characterizes the target analyte, preferably the polynucleotide.

所述方法可以使用任何适于研究膜/孔系统(其中孔嵌入膜中)的设备进行实施。该方法可以使用适合于感测孔的设备实施。例如,所述设备包括一个室,所述室包括水溶液和将该室分割为两部分的屏障(barrier)。所述屏障通常具有缝隙,其中在缝隙中形成包括孔的膜。或者该屏障形成其中存在孔的膜。该方法可以使用在国际申请No.PCT/GB08/000562(W0 2008/102120)中描述的设备实施。The method can be implemented using any device suitable for studying membrane/pore systems (wherein the pores are embedded in the membrane). The method can be implemented using a device suitable for sensing the pores. For example, the device comprises a chamber comprising an aqueous solution and a barrier that divides the chamber into two parts. The barrier typically has a gap in which a membrane comprising the pores is formed. Or the barrier forms a membrane in which the pores are present. The method can be implemented using the device described in International Application No. PCT/GB08/000562 (WO 2008/102120).

该方法可以包括随着目标分析物优选多核苷酸相对于所述孔移动,测量通过所述孔的电流。因此该装置也可以包括能够跨膜和孔施加电势并测量电流信号的电路。该方法可以使用膜片钳或电压钳实施。所述方法优选包含使用电压钳。所述目标多核苷酸可以与解旋酶或构建体以及孔以任何次序接触。优选的,当目标分析物优选目标多核苷酸与解旋酶或构建体以及孔接触时,目标分析物优选目标多核苷酸先与解旋酶或构建体形成复合体。当跨所述孔施加电势时,目标分析物优选目标多核苷酸/解旋酶或构建体复合体则与孔形成复合体并控制目标分析物优选多核苷酸移动穿过所述孔。The method may include measuring the current through the hole as the target analyte, preferably the polynucleotide, moves relative to the hole. Therefore, the device may also include a circuit capable of applying a potential across the membrane and the hole and measuring the current signal. The method may be implemented using a patch clamp or a voltage clamp. The method preferably comprises the use of a voltage clamp. The target polynucleotide may be contacted with a helicase or construct and the hole in any order. Preferably, when the target analyte, preferably the target polynucleotide, contacts the helicase or construct and the hole, the target analyte, preferably the target polynucleotide, first forms a complex with the helicase or construct. When a potential is applied across the hole, the target analyte, preferably the target polynucleotide/helicase or construct complex then forms a complex with the hole and controls the target analyte, preferably the polynucleotide, to move through the hole.

传感器sensor

本申请实施例还提供形成用于表征目标分析物优选目标多核苷酸的传感器及其方法。所述方法包括在孔和实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶、酶或本申请实施例的构建体之间形成复合体。所述解旋酶可以是以上针对本申请实施例的构建体所描述的任意的解旋酶,包括本申请实施例的解旋酶和未根据本申请实施例修饰的解旋酶。可以使用任意数量和组合的上述针对本申请实施例的组和方法讨论的本申请实施例的Dda解旋酶。The present application embodiment also provides the sensor and method thereof formed for characterizing the preferred target polynucleotide of target analyte.Described method is included in the helicase of hole and embodiment, preferably Dda helicase, more preferably forms complex between the construct of T4-Dda helicase, enzyme or the present application embodiment.Described helicase can be any helicase described above for the construct of the present application embodiment, comprises the helicase of the present application embodiment and the helicase not modified according to the present application embodiment.The Dda helicase of the present application embodiment discussed for the group and method of the present application embodiment of any number and combination can be used.

复合体可以通过将孔和解旋酶或构建体在目标分析物优选目标多核苷酸的存在下相接触,然后对所述孔施加电势而形成。所施加的电势可以如上所述是化学电势或电压电势。替换的,所述复合体可以通 过孔共价连接到所述解旋酶或构建体而形成。共价连接的方法是本领域已知的并例如,在国际申请号PCT/GB09/001679(公布号为W0 2010/004265)和PCT/GB10/000133(公布号为W0 2010/086603)中有公开。所述复合体是用于表征目标分析物优选目标多核苷酸的传感器。所述方法优选包括在源自Msp的孔与本申请实施例的解旋酶或本申请实施例的构建体之间形成复合体。上面论述的关于本申请实施例方法的任一实施方式同样适用于该方法。本申请实施例还提供了使用本申请实施例的方法制备的传感器。The complex can be formed by contacting the pore and the helicase or construct in the presence of a target analyte, preferably a target polynucleotide, and then applying an electric potential to the pore. The applied electric potential can be a chemical potential or a voltage potential as described above. Alternatively, the complex can be formed by The via hole is covalently connected to the helicase or construct to form. The method of covalent connection is known in the art and, for example, disclosed in International Application No. PCT/GB09/001679 (publication number is WO 2010/004265) and PCT/GB10/000133 (publication number is WO 2010/086603). The complex is a sensor for characterizing a target analyte, preferably a target polynucleotide. The method preferably includes forming a complex between a hole derived from Msp and a helicase of the present application embodiment or a construct of the present application embodiment. Any embodiment of the present application embodiment method discussed above is equally applicable to the method. The present application embodiment also provides a sensor prepared using the method of the present application embodiment.

试剂盒Reagent test kit

本申请实施例还提供了用于表征目标分析物优选目标多核苷酸的试剂盒。所述试剂盒包括(a)孔和(b)实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶,或本申请实施例的构建体。上面针对本申请实施例方法论述的任一实施方式同样适用于该试剂盒。所述解旋酶可以是以上针对本申请实施例的构建体所描述的任意解旋酶,包括本申请实施例的解旋酶和未根据本申请实施例修饰的解旋酶。所述试剂盒可以包括使用任意数量和组合的上述针对本申请实施例的组和方法讨论的本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。The present application embodiment also provides a test kit for characterizing the preferred target polynucleotide of target analyte. The test kit includes the helicase of (a) hole and (b) embodiment, preferably Dda helicase, more preferably T4-Dda helicase, or the construct of the present application embodiment. Any embodiment discussed above for the present application embodiment method is equally applicable to the test kit. The helicase can be any helicase described for the construct of the present application embodiment above, including the helicase of the present application embodiment and the helicase not modified according to the present application embodiment. The test kit can include the helicase of the present application embodiment discussed for the group and method of the present application embodiment using any number and combination, preferably Dda helicase, more preferably T4-Dda helicase.

所述试剂盒可进一步包括膜的组分,例如形成两性分子层诸如脂质双分子层所需要的磷脂。本申请实施例的试剂盒可以另外包括一种或多种其他试剂或能使上面描述的任何实施方案实施的仪器。所述试剂或仪器包括以下的一种或多种:合适的缓冲剂(水溶液),用于从受体获得样本的工具(诸如导管或含有针的仪器),用于扩增和/或表达多核苷酸的工具,如上所定义的膜或者电压钳或膜片钳装置。存在于所述试剂盒中的试剂可以是干态的,使得液体样本使所述试剂重悬。所述试剂盒还可选地包括使该试剂盒能在本申请实施例方法中使用的仪器或关于该方法可以用于哪些患者的说明书。所述试剂盒可选地包括核苷酸。The kit may further include components of the membrane, such as phospholipids required to form an amphiphilic layer such as a lipid bilayer. The kit of the present application embodiment may additionally include one or more other reagents or instruments that enable any of the embodiments described above to be implemented. The reagents or instruments include one or more of the following: a suitable buffer (aqueous solution), an instrument for obtaining a sample from a receptor (such as a catheter or an instrument containing a needle), an instrument for amplifying and/or expressing polynucleotides, a membrane as defined above or a voltage clamp or patch clamp device. The reagent present in the kit may be dry so that the liquid sample resuspends the reagent. The kit may also optionally include an instrument that enables the kit to be used in the present application embodiment method or instructions for which patients the method may be used for. The kit may optionally include nucleotides.

装置Device

本申请实施例还提供了用于表征目标分析物优选目标多核苷酸的装置。所述装置包括多个孔和多个解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶,多个本申请实施例的解旋酶或多个本申请实施例的构 建体。所述装置优选进一步包括用于实施本申请实施例方法的说明书。所述装置可以是任何用于目标分析物优选多核苷酸分析的常规装置,诸如阵列或芯片。上面论述的关于本申请实施例方法的任一实施方式可以同等适用于本申请实施例的装置。所述解旋酶可以是以上针对本申请实施例的构建体所描述的任意的解旋酶,包括本申请实施例的解旋酶和未根据本申请实施例被修饰的解旋酶。所述装置可以包括使用任意数量和组合的上述针对本申请实施例的组和方法讨论的本申请实施例的解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶。所述装置优选被装配成用于实施本申请实施例的方法。The present application also provides a device for characterizing a target analyte, preferably a target polynucleotide. The device comprises a plurality of holes and a plurality of helicases, preferably a Dda helicase, more preferably a T4-Dda helicase, a plurality of helicases of the present application or a plurality of constructs of the present application. Construct. The device preferably further includes instructions for implementing the method of the embodiment of the present application. The device can be any conventional device for target analyte, preferably polynucleotide analysis, such as an array or a chip. Any embodiment of the method of the embodiment of the present application discussed above can be equally applicable to the device of the embodiment of the present application. The helicase can be any helicase described above for the construct of the embodiment of the present application, including the helicase of the embodiment of the present application and the helicase that is not modified according to the embodiment of the present application. The device may include the helicase of the embodiment of the present application discussed above for the group and method of the embodiment of the present application using any number and combination, preferably Dda helicase, more preferably T4-Dda helicase. The device is preferably assembled to implement the method of the embodiment of the present application.

在一个实施例中,所述装置优选包括:In one embodiment, the device preferably comprises:

传感器设备,能支撑多个孔,并能可操作的使用所述孔和解旋酶或构建体进行目标分析物优选多核苷酸的表征;以及A sensor device capable of supporting a plurality of pores and operable to use the pores and the helicase or construct to characterize a target analyte, preferably a polynucleotide; and

至少一个口,用于传递进行表征的材料。At least one port for delivering a material to be characterized.

替换的,所述装置优选包括:Alternatively, the device preferably comprises:

传感器设备,能支撑多个孔,并能可操作的使用所述孔和解旋酶或构建体进行目标分析物优选多核苷酸的表征;以及A sensor device capable of supporting a plurality of pores and operable to use the pores and the helicase or construct to characterize a target analyte, preferably a polynucleotide; and

至少一个存储器,存储用于进行表征的材料。At least one memory for storing materials for characterization.

所述装置更优选包括:The device more preferably comprises:

传感器设备,能支撑所述膜和多个孔,并能可操作的使用所述孔和解旋酶或构建体进行目标分析物优选多核苷酸的表征;a sensor device capable of supporting the membrane and the plurality of pores and operable to characterize a target analyte, preferably a polynucleotide, using the pores and the helicase or construct;

至少一个存储器,存储用于进行表征的材料;at least one memory storing material for characterization;

流控系统,配置为从至少一个存储器可控地向所述传感器设备供应材料;以及一个或多个容器,用于接收各样本,所述流控系统被配置为选择性地从一个或多个容器向所述传感器设备供应样本。所述设备可以是那些在国际申请NO.PCT/GB08/004127(公布号为W0 2009/077734),PCT/GB10/000789(公布号为W0 2010/122293),国际申请No.PCT/GB10/002206(公布号为W0 2011/067559)或国际申请No.PCT/US99/25679(公布号为WO 00/28312)中描述的任一设备。a fluidics system configured to controllably supply material to the sensor device from at least one reservoir; and one or more containers for receiving each sample, the fluidics system being configured to selectively supply samples from the one or more containers to the sensor device. The device may be any of those described in International Application No. PCT/GB08/004127 (published as WO 2009/077734), PCT/GB10/000789 (published as WO 2010/122293), International Application No. PCT/GB10/002206 (published as WO 2011/067559) or International Application No. PCT/US99/25679 (published as WO 00/28312).

制备解旋酶的方法Method for preparing helicase

本申请实施例还提供了制备本申请实施例的修饰的解旋酶的方法。所述方法包括提供解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶以形成本申请实施例的修饰的解旋酶。 The present application also provides a method for preparing the modified helicase of the present application. The method includes providing a helicase, preferably a Dda helicase, more preferably a T4-Dda helicase to form the modified helicase of the present application.

所述方法优选进一步包括确定所述解旋酶是否能够控制目标分析物优选多核苷酸的移动。对它进行的分析在上文进行了描述。如果目标分析物优选多核苷酸的移动能被控制,则所述解旋酶被正确的修饰并且制得了本申请实施例的解旋酶。如果目标分析物优选多核苷酸的移动不能被控制,则没有制得本申请实施例的解旋酶。The method preferably further comprises determining whether the helicase can control the movement of the target analyte, preferably a polynucleotide. The analysis performed on it is described above. If the movement of the target analyte, preferably a polynucleotide, can be controlled, the helicase is correctly modified and the helicase of the present application embodiment is obtained. If the movement of the target analyte, preferably a polynucleotide, cannot be controlled, the helicase of the present application embodiment is not obtained.

制备构建体的方法Methods of making constructs

本申请实施例还提供了制备构建体的方法。所述方法包括将本申请解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶连接、优选共价连接到衔接体。如上所述的任意解旋酶和衔接体可以在该方法中使用。The present application also provides a method for preparing the construct. The method comprises connecting, preferably covalently connecting, a helicase of the present application, preferably a Dda helicase, more preferably a T4-Dda helicase to an adapter. Any helicase and adapter as described above can be used in the method.

所述方法优选进一步包括确定所述构建体能否控制目标分析物优选多核苷酸的移动。对它进行的分析在上文进行了描述。如果目标分析物优选多核苷酸的移动能被控制,所述解旋酶和衔接体被正确地连接,并且制得了本申请实施例的构建体。如果目标分析物优选多核苷酸的移动不能被控制,则没有制得本申请实施例的构建体。The method preferably further comprises determining whether the construct can control the movement of the target analyte, preferably polynucleotide. The analysis performed on it is described above. If the movement of the target analyte, preferably polynucleotide, can be controlled, the helicase and adapter are correctly connected, and the construct of the present application embodiment is obtained. If the movement of the target analyte, preferably polynucleotide, cannot be controlled, the construct of the present application embodiment is not obtained.

上述提及的现有技术以全文引用的方式并入本文。The above-mentioned prior art is incorporated herein by reference in its entirety.

以下实施例用以阐述本申请实施例,但不具有限制作用。The following examples are used to illustrate the embodiments of the present application but are not intended to be limiting.

实施例1.1:不同催化体系下介导的酶交联Example 1.1: Enzyme cross-linking mediated by different catalytic systems

主体链S1:5’-333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT(SEQ ID NO:1)-8888- T(SEQ ID NO:2)-3’Main strand S1: 5'-333333333333333333333333333333- GCGGAGTCAAACGGTAGAAG TCGTTTTTTTTTT (SEQ ID NO: 1)-8888- T(SEQ ID NO:2)-3'

反义链S2:5’-CGACTTCTACCGTTTGACTCCGC(SEQ ID NO:3)-3’Antisense strand S2: 5'- CGACTTCTACCGTTTGACTCCGC (SEQ ID NO: 3)-3'

连接链YB:(SEQ ID NO:4)/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’分别合成主体链S1,反义链S2和连接链YB,并且将三者以1:1.1:1.1的比例进行混合和退火处理,形成衔接体。退火处理具体为从95℃缓慢降温到25℃,期间降温幅度不超过0.1℃/s。Connection chain YB: (SEQ ID NO: 4)/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3' respectively synthesize the main strand S1, the antisense strand S2 and the connecting strand YB, and mix and anneal the three at a ratio of 1:1.1:1.1 to form an adapter. The annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.

配制4个反应体系,均取1uM衔接体,加入7.5倍物质的量的T53(为突变型T4 DDA-E94Y/N365Y,其野生型T4DDA序列如SEQ ID  NO:5所示),以及分别不加催化剂,加入100uM H2O2,50uM Hemin/100uM H2O2,0.1U HRP/100uM H2O2混合并且放置于30℃孵育18h,之后用TBE PAGE凝胶在160V下运行40min并且检测其酶交联情况,结果如图1所示。如图1所示,核酸蛋白复合物为衔接体与解旋酶的复合物。添加H2O2后能形成有效的交联产物,HRP酶催化的效果要显著优于其它催化剂。Prepare 4 reaction systems, take 1uM adapter, add 7.5 times the amount of T53 (mutant T4 DDA-E94Y/N365Y, the wild-type T4DDA sequence is as shown in SEQ ID NO: 5), and no catalyst was added, 100uM H 2 O 2 , 50uM Hemin/100uM H 2 O 2 , 0.1U HRP/100uM H 2 O 2 were added, mixed and placed at 30°C for 18h, and then TBE PAGE gel was run at 160V for 40min and the enzyme cross-linking was detected. The results are shown in Figure 1. As shown in Figure 1, the nucleic acid protein complex is a complex of the adapter and the helicase. After adding H 2 O 2 , an effective cross-linking product can be formed, and the effect of HRP enzyme catalysis is significantly better than other catalysts.

配制2个反应体系,均取1uM衔接体,加入5倍物质的量的Q1(为突变型T4DDA-E93Q)蛋白,以及分别不加催化剂,加入0.1U微生物转谷氨酰胺酶作为催化剂混合并且放置于37℃孵育36h,之后用TBE PAGE凝胶在160V下运行40min并且检测其酶交联情况,结果如图2所示,核酸蛋白复合物为衔接体与解旋酶的复合物。添加微生物转谷氨酰胺酶后能形成有效的交联产物。Two reaction systems were prepared, each with 1uM adapter, 5 times the amount of Q1 (mutant T4DDA-E93Q) protein, and no catalyst, 0.1U microbial transglutaminase as a catalyst, mixed and incubated at 37°C for 36h, then run TBE PAGE gel at 160V for 40min and detect the enzyme cross-linking. The results are shown in Figure 2. The nucleic acid protein complex is a complex of the adapter and the helicase. After adding microbial transglutaminase, an effective cross-linking product can be formed.

配制2个反应体系,均取1uM衔接体,加入5倍物质的量的Q11(为突变型T4DDA-E94G)蛋白,以及分别不加催化剂,加入5mM EDC作为催化剂混合并且放置于30℃孵育4h,之后用TBE PAGE凝胶在160V下运行40min并且检测其酶交联情况。结果如图3所示,核酸蛋白复合物为衔接体与解旋酶的复合物。添加EDC后能形成有效的交联产物。Two reaction systems were prepared, each with 1uM adapter, 5 times the amount of Q11 (mutant T4DDA-E94G) protein, and no catalyst, 5mM EDC as a catalyst, mixed and incubated at 30°C for 4h, then run TBE PAGE gel at 160V for 40min and detect the enzyme cross-linking. The results are shown in Figure 3, the nucleic acid protein complex is a complex of the adapter and the helicase. After adding EDC, an effective cross-linking product can be formed.

实施例1.2:Ru催化的Tyr-氨基锁酶反应Example 1.2: Ru-catalyzed Tyr-amino lock enzyme reaction

主体链S1:5’-333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT(SEQ ID NO:1)-8888-ACTGCTCATTCGGTCCTGCTGAC T(SEQ ID NO:2)-3’Main chain S1: 5’-333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT(SEQ ID NO:1)-8888-ACTGCTCATTCGGTCCTGCTGAC T(SEQ ID NO:2)-3’

反义链S2:5’-Ru-CGACTTCTACCGTTTGACTCCGC(SEQ ID NO:3)-3’Antisense strand S2: 5’-Ru-CGACTTCTACCGTTTGACTCCGC (SEQ ID NO: 3)-3’

连接链YB:5’P-GTCAGCAGGACCGAATGA(SEQ ID NO:4)/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’Connection chain YB: 5’P-GTCAGCAGGACCGAATGA(SEQ ID NO:4)/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OM eU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’

分别合成主体链S1,反义链S2和连接链YB,并且将三者以1:1.1:1.1的比例进行混合和退火处理,形成衔接体。退火处理具体为从95℃缓慢降温到25℃,期间降温幅度不超过0.1℃/s。 The main strand S1, antisense strand S2 and connecting strand YB are synthesized separately, and the three are mixed and annealed at a ratio of 1:1.1:1.1 to form a linker. The annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.

配制2个反应体系,均取2uM衔接体,加入4倍物质的量的T49(为突变型T4 DDA-E94Y)或4倍物质的量T53蛋白/1mM 1,4-丁二胺,混合并且放置于30℃孵育30min,之后分别加入1.25mM过硫酸钠并且放置于LED 450nM光下进行交联(光功率为50mw/cm2;距离15cm),之后用TBE PAGE凝胶在160V下运行40min并且检测其酶交联情况,结果如图4所示。核酸蛋白复合物为衔接体与解旋酶的复合物。在Ru光照催化情况下可以明显发现酶发生了交联(T49中的Y94-K364;T53中的Y94和N365Y与1,4-丁二胺交联)。Two reaction systems were prepared, each with 2uM adapter, 4 times the amount of T49 (mutant T4 DDA-E94Y) or 4 times the amount of T53 protein/1mM 1,4-diamine were added, mixed and placed at 30°C for incubation for 30min, then 1.25mM sodium persulfate was added and placed under LED 450nM light for crosslinking (light power of 50mw/ cm2 ; distance 15cm), then TBE PAGE gel was run at 160V for 40min and the enzyme crosslinking was detected, the results are shown in Figure 4. The nucleic acid protein complex is a complex of the adapter and the helicase. Under Ru light catalysis, it can be clearly found that the enzyme has been crosslinked (Y94-K364 in T49; Y94 and N365Y in T53 are crosslinked with 1,4-diamine).

实施例1.3:不同Tyr-Tyr突变体的HRP交联反应Example 1.3: HRP cross-linking reaction of different Tyr-Tyr mutants

配制3个反应体系,均取1uM衔接体,加入7.5倍物质的量的T52(为突变型T4DDA-E94Y/G357Y)/T53/T61(为突变型T4DDA-A372Y)蛋白,和0.1U HRP/100uM H2O2,混合并且放置于30℃孵育18h,之后用TBE PAGE凝胶在160V下运行40min并且检测其酶交联情况,结果如图5所示,对于选取的3种Tyr-Tyr突变体在HRP/H2O2的催化下均能看到较好的锁酶交联效果。Three reaction systems were prepared, each with 1uM adapter, 7.5 times the amount of T52 (mutant T4DDA-E94Y/G357Y)/T53/T61 (mutant T4DDA-A372Y) proteins and 0.1U HRP/100uM H 2 O 2 added, mixed and incubated at 30°C for 18h, then run TBE PAGE gel at 160V for 40min and detect the enzyme cross-linking. The results are shown in Figure 5. For the three selected Tyr-Tyr mutants, good lock-enzyme cross-linking effects can be seen under the catalysis of HRP/H 2 O 2 .

实施列1.4:基于Tyr-Tyr交联的核酸蛋白复合物的制备Example 1.4: Preparation of nucleic acid-protein complexes based on Tyr-Tyr cross-linking

将实施例1.1中加入HRP/H2O2催化的酶交联产物加入DNAPac PA200柱,用洗脱缓冲液进行纯化,以将没有结合到衔接体上的酶以及未完全反应的EDC等杂质从柱子上除掉。The enzyme cross-linking product catalyzed by HRP/H 2 O 2 in Example 1.1 was added to a DNAPac PA200 column and purified using an elution buffer to remove impurities such as enzymes that were not bound to the adapter and incompletely reacted EDC from the column.

然后用10倍柱体积的缓冲液A和缓冲液B的混合物对核酸蛋白复合物进行洗脱。然后汇集主洗脱峰,测量其浓度,并用TBE PAGE凝胶160V下运行40分钟。其中,缓冲液A:20mMNa-CHES,250mM NaCl,4%(W/V)甘油,pH 8.6;缓冲液B:20mM Na-CHES,1MNaCl,4%(W/V)甘油,pH 8.6,其洗脱过程如图6所示,将洗脱过程中的各个峰收集并且用TBE PAGE凝胶在160V下运行40min进行检测,结果如图7所示,E3为所需的核酸蛋白复合物。Then, the nucleic acid-protein complex was eluted with a mixture of buffer A and buffer B with 10 column volumes. The main elution peaks were then collected, their concentrations were measured, and TBE PAGE gel was run at 160V for 40 minutes. Among them, buffer A: 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6; buffer B: 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6. The elution process is shown in Figure 6. The peaks in the elution process were collected and detected by TBE PAGE gel running at 160V for 40 minutes. The results are shown in Figure 7. E3 is the desired nucleic acid-protein complex.

实施例1.5:基于Tyr-Tyr交联的核酸蛋白复合物的上机测试Example 1.5: On-machine testing of nucleic acid-protein complexes based on Tyr-Tyr cross-linking

通过末端修复方式制备长为10kb的文库,并且与实施例1.4中的核酸蛋白复合物与文库进行连接建库。最后在齐碳科技有限公司QNome-3841上进行测序,其实际测序信号如图8所示,该核酸蛋白复合物可以正常产生测序信号,且信号台阶清晰可以进行后续碱基推定。可以看到完整地文库过孔信号。这说明形成1A结构域和2A结构域之 间形成Tyr-Tyr异位键的解旋酶可以控制目标多核苷酸的运动,并可用于纳米孔对多核苷酸的测序,后续我们对其测序信号进行了分析,发现其在噪声和信噪比方面要优于传统的二硫键交联(指T4Dda-E94C/A360C形成的核酸蛋白复合物)形式,分析结果如表3所示。A 10 kb library was prepared by end repair, and the nucleic acid protein complex in Example 1.4 was connected to the library. Finally, sequencing was performed on the QNome-3841 of Qi Carbon Technology Co., Ltd. The actual sequencing signal is shown in Figure 8. The nucleic acid protein complex can generate sequencing signals normally, and the signal steps are clear for subsequent base inference. The complete library pass-through signal can be seen. This shows that the formation of the 1A domain and the 2A domain The helicase that forms the Tyr-Tyr ectopic bond between the two molecules can control the movement of the target polynucleotide and can be used for nanopore sequencing of polynucleotides. We subsequently analyzed its sequencing signal and found that it is superior to the traditional disulfide cross-linking (referring to the nucleic acid-protein complex formed by T4Dda-E94C/A360C) in terms of noise and signal-to-noise ratio. The analysis results are shown in Table 3.

实施例1.6:基于Tyr-Tyr交联的测序准确率分析Example 1.6: Analysis of sequencing accuracy based on Tyr-Tyr cross-linking

通过末端修复方式制备随机打断的Human基因组文库,并且与实施例1.4中的核酸蛋白复合物或二硫键交联的SAC-M1(T4Dda-E94C/A360C形成的核酸蛋白复合物)与文库进行连接建库。最后在齐碳科技有限公司QNome-3841上进行测序,在同等小数据集训练规模下,基于Tyr-Tyr交联所得准确率要高于传统的二硫键交联形式,分析结果如表4所示。A randomly interrupted Human genome library was prepared by end repair, and the nucleic acid-protein complex or disulfide-crosslinked SAC-M1 (nucleic acid-protein complex formed by T4Dda-E94C/A360C) in Example 1.4 was connected to the library. Finally, sequencing was performed on the QNome-3841 of Qi Carbon Technology Co., Ltd. Under the same small data set training scale, the accuracy based on Tyr-Tyr cross-linking was higher than the traditional disulfide cross-linking form, and the analysis results are shown in Table 4.

表3-基于Tyr-Tyr交联的核酸蛋白复合物的测序信号
Table 3 - Sequencing signals of nucleic acid-protein complexes based on Tyr-Tyr crosslinking

表4-基于Tyr-Tyr交联的测序准确率分析
Table 4 - Analysis of sequencing accuracy based on Tyr-Tyr cross-linking

表3和表4中的参数含义如下:The meanings of the parameters in Table 3 and Table 4 are as follows:

归一化噪声:对每个read的信号值进行中位数绝对偏差(Median Absolute Deviation,MAD)标准化,通过中值滤波进行信号、噪声的分离,计算噪声的均方根(Root Mean Square,RMS)值,作为归一化噪声的指标,归一化噪声值越小表示信号水平受到的噪声干扰较小,即信号的清晰度越高。Normalized noise: The signal value of each read is standardized by the Median Absolute Deviation (MAD), the signal and noise are separated by median filtering, and the root mean square (RMS) value of the noise is calculated as an indicator of normalized noise. The smaller the normalized noise value, the less noise interference the signal level is, that is, the higher the signal clarity.

噪声:对每个read的信号值进行中值滤波,获取滤波后的噪声信号,计算噪声信号的均方根(Root Mean Square,RMS)值作为噪声的指标,反映噪声的相对强度,噪声值越小表示噪声干扰越小。 Noise: Perform median filtering on the signal value of each read to obtain the filtered noise signal. Calculate the root mean square (RMS) value of the noise signal as an indicator of noise, reflecting the relative intensity of noise. The smaller the noise value, the smaller the noise interference.

信噪比:对每个read的信号值进行中位数绝对偏差(Median Absolute Deviation,MAD)标准化,通过中值滤波进行信号、噪声的分离,用信号功率与噪声功率的比值来表示信噪比,信噪值越大表示信号质量越好、噪声干扰越小。Signal-to-noise ratio: The signal value of each read is normalized by the Median Absolute Deviation (MAD), and the signal and noise are separated by median filtering. The signal-to-noise ratio is expressed as the ratio of signal power to noise power. The larger the signal-to-noise value, the better the signal quality and the smaller the noise interference.

Train loss:是训练数据集训练过程中在神经网络模型预测结果与真实结果的误差。Train loss: It is the error between the predicted results of the neural network model and the actual results during the training of the training data set.

val loss:是验证数据集通过神经网络模型预测结果与真实结果的误差。val loss: It is the error between the predicted result of the validation data set through the neural network model and the actual result.

ACC:是验证数据集通过神经网络模型预测结果对比真实结果的准确率。ACC: It is the accuracy of the prediction results of the validation data set through the neural network model compared with the actual results.

实施例2.1:基于Tyr-Tyr连接的核酸蛋白复合物的制备Example 2.1: Preparation of nucleic acid-protein complex based on Tyr-Tyr linkage

分别合成主体链S1,反义链S2和连接链YB,其具体序列如下所示:The main strand S1, antisense strand S2 and connecting strand YB were synthesized respectively, and their specific sequences are shown below:

主体链S1:5’-333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT-8888-ACTGCTCATTCGGTCCTGCTGACT-3’Main strand S1: 5’-3333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT-8888-ACTGCTCATTCGGTCCTGCTGACT-3’

反义链S2:5’-Ru-CGACTTCTACCGTTTGACTCCGC-3’Antisense strand S2: 5’-Ru-CGACTTCTACCGTTTGACTCCGC-3’

连接链YB:5’-GTCAGCAGGACCGAATGA/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’Connection chain YB: 5’-GTCAGCAGGACCGAATGA/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2O MeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’

其中:in:

主体链S1依次包含前导序列iSpC3(即缺少糖和碱基的核苷酸,表示为3),其连接到SEQ ID NO:1的5’端,SEQ ID NO:1的3’端依次连接到阻断链iSpC18(表示为8888)和SEQ ID NO:2的5’端。iSpC3和iSpC18购自Integrated DNA Technologies。The main strand S1 sequentially comprises the leading sequence iSpC3 (i.e., a nucleotide lacking sugar and base, indicated as 3), which is connected to the 5' end of SEQ ID NO: 1, and the 3' end of SEQ ID NO: 1 is sequentially connected to the blocking strand iSpC18 (indicated as 8888) and the 5' end of SEQ ID NO: 2. iSpC3 and iSpC18 were purchased from Integrated DNA Technologies.

反义链S2的序列如SEQ ID NO:3所示。The sequence of the antisense strand S2 is shown in SEQ ID NO:3.

连接链YB包含如SEQ ID NO:4所示的序列,其3’端连接有多个甲氧基修饰,包括i2OMeA、i2OMeC、i2OMeG、i2OMeU。The connecting chain YB comprises the sequence shown in SEQ ID NO:4, and its 3’ end is connected to multiple methoxy modifications, including i2OMeA, i2OMeC, i2OMeG, and i2OMeU.

将主体链S1,反义链S2和连接链YB以1:1.1:1.1的比例进行混合和退火处理,形成衔接体。退火处理具体为从95℃缓慢降温到25℃,期间降温幅度不超过0.1℃/s。 The main strand S1, the antisense strand S2 and the connecting strand YB are mixed and annealed at a ratio of 1:1.1:1.1 to form an adapter. The annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.

取2μM衔接体,将其与4倍物质的量的野生型T4 Dda蛋白(SEQ ID NO:5)、以及分别加入1mM的1,3丙二胺和1,4丁二胺混合并且放置于30℃孵育0.5h,之后加入2.5mM过硫酸钠并且放置于蓝光LED 450nM光源下(距离15cm,辐射功率50mW/cm2)处理1min,之后用TBE PAGE凝胶在160V下运行40min并且检测其酶交联情况,结果如图9所示。结果显示,可在Ru的催化下氧化酪氨酸与氨基linker形成有效的核酸蛋白复合物。2 μM linker was taken, mixed with 4 times the amount of wild-type T4 Dda protein (SEQ ID NO: 5), and 1 mM 1,3-propanediamine and 1,4-butanediamine respectively, and incubated at 30°C for 0.5 h, then 2.5 mM sodium persulfate was added and placed under a blue LED 450 nM light source (distance 15 cm, radiation power 50 mW/cm 2 ) for 1 min, and then TBE PAGE gel was run at 160 V for 40 min and the enzyme cross-linking was detected, the results are shown in Figure 9. The results show that tyrosine can be oxidized under the catalysis of Ru to form an effective nucleic acid protein complex with the amino linker.

实施例2.2:基于Lys-Glu连接的核酸蛋白复合物的制备Example 2.2: Preparation of nucleic acid-protein complex based on Lys-Glu linkage

分别合成主体链S1,反义链S2和连接链YB,其具体序列如下所示:The main strand S1, antisense strand S2 and connecting strand YB were synthesized respectively, and their specific sequences are shown below:

主体链S1:5’-333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT-8888-ACTGCTCATTCGGTCCTGCTGACT-3’Main strand S1: 5’-3333333333333333333333333333333-GCGGAGTCAAACGGTAGAAGTCGTTTTTTTTTT-8888-ACTGCTCATTCGGTCCTGCTGACT-3’

反义链S2:5’-CGACTTCTACCGTTTGACTCCGC-3’Antisense strand S2: 5’-CGACTTCTACCGTTTGACTCCGC-3’

连接链YB:5’-GTCAGCAGGACCGAATGA/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2OMeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’Connection chain YB: 5’-GTCAGCAGGACCGAATGA/i2OMeG//i2OMeC//i2OMeA//i2OMeG//i2OMeU//i2OMeA//i2OMeG//i2OMeU//i2O MeC//i2OMeC//i2OMeA//i2OMeG//i2OMeC//i2OMeA//i2OMeC//i2OMeC//i2OMeG//i2OMeA//i2OMeC//i2OMeC/-3’

其中:in:

主体链S1依次包含前导序列iSpC3(即缺少糖和碱基的核苷酸,表示为3),其连接到SEQ ID NO:1的5’端,SEQ ID NO:1的3’端依次连接到阻断链iSpC18(表示为8888)和SEQ ID NO:2的5’端。iSpC3和iSpC18购自Integrated DNA Technologies。The main strand S1 sequentially comprises the leading sequence iSpC3 (i.e., a nucleotide lacking sugar and base, indicated as 3), which is connected to the 5' end of SEQ ID NO: 1, and the 3' end of SEQ ID NO: 1 is sequentially connected to the blocking strand iSpC18 (indicated as 8888) and the 5' end of SEQ ID NO: 2. iSpC3 and iSpC18 were purchased from Integrated DNA Technologies.

反义链S2的序列如SEQ ID NO:3所示。The sequence of the antisense strand S2 is shown in SEQ ID NO:3.

连接链YB包含如SEQ ID NO:4所示的序列,其3’端连接有多个甲氧基修饰,包括i2OMeA、i2OMeC、i2OMeG、i2OMeU。The connecting chain YB comprises the sequence shown in SEQ ID NO:4, and its 3’ end is connected to multiple methoxy modifications, including i2OMeA, i2OMeC, i2OMeG, and i2OMeU.

将主体链S1,反义链S2和连接链YB以1:1.1:1.1的比例进行混合和退火处理,形成衔接体。退火处理具体为从95℃缓慢降温到25℃,期间降温幅度不超过0.1℃/s。The main strand S1, the antisense strand S2 and the connecting strand YB are mixed and annealed at a ratio of 1:1.1:1.1 to form an adapter. The annealing treatment is specifically to slowly cool from 95°C to 25°C, during which the cooling amplitude does not exceed 0.1°C/s.

取1μM衔接体,分别将其0、1、2、5倍物质的量的野生型T4 Dda蛋白(SEQ ID NO:5),以及5mM EDC混合并且放置于30℃孵育4h,加入20mMβ-ME终止反应,之后用TBE PAGE凝胶在160V下运行 40min并且检测其酶交联情况,结果如图10所示。结果显示,随着加入蛋白浓度的增加,核酸蛋白复合物的比例也逐渐增加,可以形成有效的核酸蛋白复合物。Take 1 μM adapter, mix 0, 1, 2, 5 times the amount of wild-type T4 Dda protein (SEQ ID NO: 5), and 5 mM EDC, incubate at 30 ° C for 4 h, add 20 mM β-ME to terminate the reaction, and then run TBE PAGE gel at 160 V. The enzyme cross-linking was detected for 40 min, and the results are shown in Figure 10. The results show that with the increase of the added protein concentration, the proportion of the nucleic acid-protein complex gradually increases, and an effective nucleic acid-protein complex can be formed.

实施例2.3:核酸蛋白复合物的纯化与质检Example 2.3: Purification and quality control of nucleic acid-protein complexes

将实施例2.2中加入5倍物质的量的孵育结合产物加入DNAPac PA200柱,用缓冲液A和缓冲液B进行纯化,以将没有结合到衔接体上的酶以及未完全反应的EDC等杂质从柱子上除掉。Add 5 times the amount of substance added in Example 2.2 to the DNAPac PA200 column and purify it with buffer A and buffer B to remove impurities such as enzymes that are not bound to the linker and incompletely reacted EDC from the column.

然后用10倍柱体积的缓冲液A和缓冲液B的混合物对核酸蛋白复合物进行洗脱。然后汇集主洗脱峰,测量其浓度,并用TBE PAGE凝胶160V下运行40min。The nucleic acid-protein complex was then eluted with a mixture of 10 column volumes of buffer A and buffer B. The main elution peaks were then pooled, their concentrations measured, and run on a TBE PAGE gel at 160V for 40 min.

其中,缓冲液A的成分为:20mM Na-CHES,250mM NaCl,4%(W/V)甘油,pH 8.6;缓冲液B的成分为:20mM Na-CHES,1M NaCl,4%(W/V)甘油,pH 8.6。Among them, the composition of buffer A is: 20mM Na-CHES, 250mM NaCl, 4% (W/V) glycerol, pH 8.6; the composition of buffer B is: 20mM Na-CHES, 1M NaCl, 4% (W/V) glycerol, pH 8.6.

其洗脱过程如图11所示,将洗脱过程中的各个峰收集并且用TBE PAGE凝胶在160V下运行40min进行检测,结果如图12所示,E2为所需的核酸蛋白复合物。The elution process is shown in Figure 11. The peaks in the elution process were collected and detected using TBE PAGE gel running at 160V for 40 minutes. The results are shown in Figure 12. E2 is the desired nucleic acid-protein complex.

将洗脱下来的主峰,用Qubit进行定量,并且取10ng进行质检,分别加入1*Seq buffer(10mM HEPES7.0、600mM KCl、25mM ATP、25mM MgCl2)、NF-H20(对照组,加入一定量超纯水补足反应体系)、或0.1% SDS在35℃处理30min之后用TBE PAGE凝胶在160V下运行40min进行检测,结果如图13所示。结果显示,spacer能够成功将该复合物中的解旋酶挡住,质检酶脱落产生的核酸量较少;而SDS处理组未能见到核酸蛋白复合物残留,说明构建的锁酶复合物未和核酸形成共价交联。The eluted main peak was quantified by Qubit, and 10 ng was taken for quality inspection. 1*Seq buffer (10 mM HEPES7.0, 600 mM KCl, 25 mM ATP, 25 mM MgCl 2 ), NF-H 2 0 (control group, a certain amount of ultrapure water was added to supplement the reaction system), or 0.1% SDS was added and treated at 35°C for 30 minutes, and then tested by TBE PAGE gel running at 160V for 40 minutes. The results are shown in Figure 13. The results show that the spacer can successfully block the helicase in the complex, and the amount of nucleic acid produced by the detachment of the quality inspection enzyme is small; while no nucleic acid-protein complex residues were found in the SDS treatment group, indicating that the constructed lock-enzyme complex did not form a covalent cross-link with the nucleic acid.

实施例2.4:核酸蛋白复合物的ATP消耗检测Example 2.4: ATP consumption detection of nucleic acid-protein complexes

将实施例2.3中纯化后的核酸蛋白复合物取约1ng,加入到200μL1*Seq Buffer(10mM HEPES7.0、600mM KCl、25mM ATP、25mM MgCl2)中,以不添加核酸蛋白复合物添加无酶水为对照,之后分别取0h、4h、8h并且用HPLC根据在C18柱上洗脱的出峰时间检测其中ADP的含量,其结果如图14所示。结果显示,该锁酶方式下,核酸蛋白复合物仍然具有ATP酶活性,蛋白仍然能正常工作。About 1 ng of the purified nucleic acid-protein complex in Example 2.3 was taken and added to 200 μL 1*Seq Buffer (10 mM HEPES7.0, 600 mM KCl, 25 mM ATP, 25 mM MgCl 2 ), and enzyme-free water was added without adding nucleic acid-protein complex as a control. After that, 0 h, 4 h, and 8 h were taken and the ADP content was detected by HPLC according to the peak time of elution on the C18 column, and the results are shown in Figure 14. The results show that under this enzyme locking mode, the nucleic acid-protein complex still has ATPase activity and the protein can still work normally.

实施例2.5:核酸蛋白复合物的测序功能测试 Example 2.5: Sequencing function test of nucleic acid-protein complex

通过末端修复方式制备长为10kb的文库,并且用实施例2.2中制备的核酸蛋白复合物与文库进行连接建库,按照齐碳科技的建库试剂盒QLK-V1.1.1进行。最后在齐碳科技有限公司QNome-3841测序仪上进行测序,其实际测序信号如图15所示。结果显示,该核酸蛋白复合物可以正常产生测序信号,且信号台阶清晰可以进行后续碱基推定。A 10 kb library was prepared by end repair, and the nucleic acid protein complex prepared in Example 2.2 was connected to the library for library construction according to the library construction kit QLK-V1.1.1 of Qi Carbon Technology. Finally, sequencing was performed on the QNome-3841 sequencer of Qi Carbon Technology Co., Ltd., and the actual sequencing signal is shown in Figure 15. The results show that the nucleic acid protein complex can generate sequencing signals normally, and the signal steps are clear for subsequent base inference.

对比例2.1:未形成交联的核酸蛋白复合物的测序功能测试Comparative Example 2.1: Sequencing function test of nucleic acid-protein complex without cross-linking

未形成交联的核酸蛋白复合物的制备方法与实施例2.2类似,其不同之处在于:制备过程中未添加EDC,即酶未形成交联。The method for preparing the nucleic acid-protein complex without forming cross-links is similar to that in Example 2.2, except that no EDC is added during the preparation process, that is, the enzyme does not form cross-links.

按照实施例2.5相同的方法进行建库和测序,结果如表5所示。结果显示,与实施例2.5相比,未交联的酶在测序时复合物中的酶易脱落,无法获得有效测序信号,更不能获得10kb的完整测序信号。The library construction and sequencing were performed in the same manner as in Example 2.5, and the results are shown in Table 5. The results showed that compared with Example 2.5, the enzyme in the complex of the non-cross-linked enzyme was easily detached during sequencing, and no effective sequencing signal could be obtained, let alone a complete sequencing signal of 10 kb.

表5实施例2.5与对比例2.1的测试结果
Table 5 Test results of Example 2.5 and Comparative Example 2.1

注:有效信号即后续可以通过basecalling识别并且转化为序列的信号。 Note: A valid signal is a signal that can be subsequently identified by basecalling and converted into a sequence.

Claims (31)

一种解旋酶,其A helicase, which (a)包含一个多核苷酸结合域和两个RecA样结构域,其中位于围绕所述多核苷酸结合域的所述解旋酶表面上的两个或多个氨基酸残基之间通过异位键连接,使得所述多核苷酸结合域中结合的多核苷酸被所述连接后形成的结构包围;或(a) comprising a polynucleotide binding domain and two RecA-like domains, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain are connected by an ectopic bond, so that the polynucleotide bound to the polynucleotide binding domain is surrounded by the structure formed after the connection; or (b)包含一个多核苷酸结合域和两个RecA样结构域,其中位于围绕所述多核苷酸结合域的所述解旋酶表面上的两个或多个氨基酸残基之间能够通过异位键连接,使得所述多核苷酸结合域中结合的多核苷酸被所述连接后的结构包围,其中所述两个或多个氨基酸残基中的至少一个为突变氨基酸残基;或(b) comprising a polynucleotide binding domain and two RecA-like domains, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain can be connected by an ectopic bond, so that the polynucleotide bound in the polynucleotide binding domain is surrounded by the connected structure, and at least one of the two or more amino acid residues is a mutant amino acid residue; or (c)包含多核苷酸结合结构域,所述多核苷酸结合结构域在至少一个构象状态下包含开口,多核苷酸可通过所述开口与所述解旋酶解结合,其中所述解旋酶在所述开口处的两个或多个氨基酸之间形成异位键,并且其中所述解旋酶保留其控制目标多核苷酸移动的能力;或(c) comprises a polynucleotide binding domain, wherein the polynucleotide binding domain comprises an opening in at least one conformational state through which a polynucleotide can be debound by the helicase, wherein the helicase forms an ectopic bond between two or more amino acids at the opening, and wherein the helicase retains its ability to control the movement of the target polynucleotide; or (d)包含多核苷酸结合结构域,所述多核苷酸结合结构域在至少一个构象状态下包含开口,多核苷酸可通过所述开口与所述解旋酶解结合,其中所述解旋酶被修饰,使得其在所述开口处的两个或多个氨基酸之间能够形成异位键,并且其中所述解旋酶保留其控制目标多核苷酸移动的能力。(d) comprising a polynucleotide binding domain, wherein the polynucleotide binding domain comprises an opening in at least one conformational state, through which a polynucleotide can be debound by the helicase, wherein the helicase is modified so that it can form an ectopic bond between two or more amino acids at the opening, and wherein the helicase retains its ability to control the movement of the target polynucleotide. 根据权利要求1所述的解旋酶,其中所述异位键不包括半胱氨酸参与形成的异位键,所述异位键包括除半胱氨酸外的任意天然氨基酸参与形成的异位键,The helicase according to claim 1, wherein the ectopic bond does not include an ectopic bond formed with the participation of cysteine, and the ectopic bond includes an ectopic bond formed with the participation of any natural amino acid except cysteine, 所述除半胱氨酸外的任意天然氨基酸选自包括如下氨基酸的组:Ile、Val、Leu、Phe、Met、Ala、Gly、Thr、Ser、Trp、Tyr、Pro、His、Glu、Gln、Asp、Asn、Lys、Arg,The arbitrary natural amino acid except cysteine is selected from the group consisting of the following amino acids: Ile, Val, Leu, Phe, Met, Ala, Gly, Thr, Ser, Trp, Tyr, Pro, His, Glu, Gln, Asp, Asn, Lys, Arg, 优选的,其中所述异位键包括Lys-His、Lys-Ser、Lys-Thr、Lys-Tyr、Lys-Lys、Lys-Glu、Lys-Asp、Lys-Gln、Lys-Arg、Arg-Glu、Lys-Met、Arg-Asp、Arg-Arg、Tyr-Tyr、Tyr-Trp、Met-Met异位键或其组合;Preferably, the ectopic bond comprises Lys-His, Lys-Ser, Lys-Thr, Lys-Tyr, Lys-Lys, Lys-Glu, Lys-Asp, Lys-Gln, Lys-Arg, Arg-Glu, Lys-Met, Arg-Asp, Arg-Arg, Tyr-Tyr, Tyr-Trp, Met-Met ectopic bond or a combination thereof; 和/或,所述解旋酶由一个或多个单体组成, and/or, the helicase consists of one or more monomers, 和/或,形成所述异肽键的两个氨基酸位于同一结构域或不同结构域。And/or, the two amino acids forming the isopeptide bond are located in the same domain or in different domains. 根据前述权利要求任一项所述的解旋酶,所述解旋酶源自天然或经修饰的下组的解旋酶家族的成员:Dda解旋酶、Pifl样解旋酶、Upfl样解旋酶、UvrD/Rep解旋酶、Ski样解旋酶、Rad3/XPD解旋酶、NS3/NPH-II解旋酶、DEAD解旋酶、DEAHi RHA解旋酶、RecG样解旋酶、REcQ样解旋酶、T1R样解旋酶、Swi/Snf样解旋酶和Rig-I样解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶;The helicase according to any one of the preceding claims, wherein the helicase is derived from a member of the natural or modified helicase family of the following group: Dda helicase, Pifl-like helicase, Upfl-like helicase, UvrD/Rep helicase, Ski-like helicase, Rad3/XPD helicase, NS3/NPH-II helicase, DEAD helicase, DEAHi RHA helicase, RecG-like helicase, REcQ-like helicase, T1R-like helicase, Swi/Snf-like helicase and Rig-I-like helicase, preferably Dda helicase, more preferably T4-Dda helicase; 和/或,所述解旋酶源自天然或经修饰的下组的解旋酶家族的成员:RecD解旋酶、Upfl解旋酶、PcrA解旋酶、Rep解旋酶、UvrD解旋酶、Hel308解旋酶、Mtr4解旋酶、XPD解旋酶、NS3解旋酶、Mssl 16解旋酶、Prp43解旋酶,RecG解旋酶、RecQ解旋酶、T1R解旋酶、RapA解旋酶和Hef解旋酶。And/or, the helicase is derived from a natural or modified member of the helicase family of the following groups: RecD helicase, Upfl helicase, PcrA helicase, Rep helicase, UvrD helicase, Hel308 helicase, Mtr4 helicase, XPD helicase, NS3 helicase, Mssl 16 helicase, Prp43 helicase, RecG helicase, RecQ helicase, T1R helicase, RapA helicase and Hef helicase. 根据前述权利要求任一项所述的解旋酶,其中所述解旋酶源自T4-Dda解旋酶,并且所述解旋酶包括:The helicase according to any one of the preceding claims, wherein the helicase is derived from a T4-Dda helicase and comprises: (1)x1与x2形成的Tyr-Tyr异位键;(1) Tyr-Tyr heterotopic bond formed by x1 and x2; (2)x3与x4形成的Lys-Gln异位键;(2) Lys-Gln heterotopic bond formed by x3 and x4; (3)x1与x3形成的Tyr-Lys异位键;(3) Tyr-Lys heterotopic bond formed by x1 and x3; (4)x1与x4形成的Tyr-Gln异位键;(4) Tyr-Gln heterotopic bond formed by x1 and x4; (5)x2与x3形成的Tyr-Lys异位键;(5) Tyr-Lys heterotopic bond formed by x2 and x3; (6)x2与x4形成的Tyr-Gln异位键;(6) Tyr-Gln heterotopic bond formed by x2 and x4; (7)x3与x5形成的Lys-Glu异位键;或(7) Lys-Glu ectopic bond formed by x3 and x5; or (8)以上任何组合。(8)Any combination of the above. 根据前述权利要求任一项所述的解旋酶,其中所述解旋酶包括1A结构域的任意突变体和/或2A结构域的任意突变体,只要所述解旋酶保留其控制多核苷酸移动的能力。The helicase according to any of the preceding claims, wherein the helicase comprises any mutant of the 1A domain and/or any mutant of the 2A domain, as long as the helicase retains its ability to control the movement of polynucleotides. 一种多肽,包括来自前述权利要求任一项所述的解旋酶的形成异位键的1A结构域和2A结构域,和多核苷酸结合域,并且不包括所述解旋酶的其他结构域。A polypeptide comprising the 1A domain and the 2A domain that form an ectopic bond, and a polynucleotide binding domain from the helicase according to any one of the preceding claims, and excluding other domains of the helicase. 一种构建体,包括权利要求1-6任一项所述的解旋酶或多肽,和与其多核苷酸结合域结合的目标多核苷酸,其中位于围绕多核苷酸结合域的解旋酶表面上两个或多个氨基酸残基之间通过异肽键连接,其 中结合的目标多核苷酸被所述异肽键连接后的结构包围。A construct comprising a helicase or polypeptide according to any one of claims 1 to 6, and a target polynucleotide bound to its polynucleotide binding domain, wherein two or more amino acid residues on the surface of the helicase surrounding the polynucleotide binding domain are connected by an isopeptide bond, The target polynucleotide bound in the peptide bond is surrounded by the structure connected by the isopeptide bonds. 根据权利要求7所述的构建体,其包括衔接体,所述目标多核苷酸与衔接体连接,所述构建体具有控制所述目标多核苷酸移动的能力。The construct according to claim 7, which comprises an adapter, the target polynucleotide is connected to the adapter, and the construct has the ability to control the movement of the target polynucleotide. 根据权利要求7所述的构建体,其中所述构建体包括两个以上权利要求1-6任一项所述的解旋酶或多肽。The construct according to claim 7, wherein the construct comprises two or more helicases or polypeptides according to any one of claims 1-6. 根据权利要求8所述的构建体,其中所述衔接体中的目标多核苷酸与目标分析物连接,所述构建体具有控制述目标分析物移动的能力。The construct according to claim 8, wherein the target polynucleotide in the adapter is connected to the target analyte, and the construct has the ability to control the movement of the target analyte. 一种多核苷酸,包括编码权利要求1-5任一项所述的解旋酶、权利要求6所述的多肽、权利要求7-10任一项所述的构建体的序列,或由其序列组成。A polynucleotide comprising a sequence encoding the helicase according to any one of claims 1 to 5, the polypeptide according to claim 6, or the construct according to any one of claims 7 to 10, or consisting of their sequences. 一种载体,包括可操作性连接到启动子的权利要求11所述的多核苷酸。A vector comprising the polynucleotide of claim 11 operably linked to a promoter. 一种宿主细胞,包括权利要求12所述的载体。A host cell comprising the vector according to claim 12. 一种制备权利要求1-5任一项所述的解旋酶、权利要求6所述的多肽、或权利要求7-10任一项所述的构建体的方法,包括表达权利要求11所述的多核苷酸、用权利要求12所述的载体转染细胞,或培养权利要求13所述的宿主细胞。A method for preparing the helicase according to any one of claims 1 to 5, the polypeptide according to claim 6, or the construct according to any one of claims 7 to 10, comprising expressing the polynucleotide according to claim 11, transfecting cells with the vector according to claim 12, or culturing the host cell according to claim 13. 一种控制目标分析物移动的方法,包括将所述目标分析物与权利要求1-6任一项所述的解旋酶或多肽、或权利要求7-10任一项所述的构建体接触,并由此控制所述目标分析物的移动。A method for controlling the movement of a target analyte, comprising contacting the target analyte with the helicase or polypeptide described in any one of claims 1-6, or the construct described in any one of claims 7-10, and thereby controlling the movement of the target analyte. 根据权利要求15所述的方法,其中所述方法用于控制目标分析物穿过孔的移动。The method of claim 15, wherein the method is used to control the movement of a target analyte through a pore. 一种表征目标分析物的方法,包括:A method for characterizing a target analyte, comprising: (a)将目标分析物与孔和权利要求1-6任一项所述的解旋酶或多肽或权利要求7-10任一项所述的构建体接触,使得所述解旋酶控制所述目标分析物穿过所述孔移动;以及(a) contacting a target analyte with a pore and a helicase or polypeptide according to any one of claims 1 to 6 or a construct according to any one of claims 7 to 10, such that the helicase controls the movement of the target analyte through the pore; and (b)随着所述目标分析物相对于所述孔移动,获取一个或多个测量值,其中所述测量值代表所述目标分析物的一个或多个特征并由此表征所述目标分析物。(b) obtaining one or more measurements as the target analyte moves relative to the pore, wherein the measurements represent one or more characteristics of the target analyte and thereby characterize the target analyte. 根据权利要求10所述构建体、15-17所述的方法,其中所述 目标分析物选自多核苷酸、多肽、多糖和脂质中的一种或多种,优选为多核苷酸或与多核苷酸连接的多肽、多糖和脂质,更优选为单链多核苷酸、双链多核苷酸或部分双链多核苷酸;或者The construct according to claim 10, 15-17, wherein the The target analyte is selected from one or more of polynucleotides, polypeptides, polysaccharides and lipids, preferably polynucleotides or polypeptides, polysaccharides and lipids linked to polynucleotides, more preferably single-stranded polynucleotides, double-stranded polynucleotides or partially double-stranded polynucleotides; or 所述目标分析物是目标多核苷酸,并且所述一个或多个特征选自(i)所述目标多核苷酸的长度,(ii)所述目标多核苷酸的同一性,(iii)所述目标多核苷酸的序列,(iv)所述目标多核苷酸的二级结构;以及(v)所述目标多核苷酸是否是经修饰的。The target analyte is a target polynucleotide, and the one or more characteristics are selected from (i) the length of the target polynucleotide, (ii) the identity of the target polynucleotide, (iii) the sequence of the target polynucleotide, (iv) the secondary structure of the target polynucleotide; and (v) whether the target polynucleotide is modified. 根据权利要求17-18任一项所述的方法,其中所述目标分析物的一个或多个特征通过电测量和/或光测量进行测量。The method according to any one of claims 17-18, wherein one or more characteristics of the target analyte are measured by electrical measurement and/or optical measurement. 根据权利要求19所述的方法,其中所述电测量为电流测量、阻抗测量、隧道测量或场效应晶体管测量。The method of claim 19, wherein the electrical measurement is a current measurement, an impedance measurement, a tunneling measurement, or a field effect transistor measurement. 根据权利要求17-20任一项所述的方法,其中所述方法进一步包括跨所述孔施加电压以在所述孔和所述解旋酶或构建体之间形成复合体的步骤。A method according to any one of claims 17 to 20, wherein the method further comprises the step of applying a voltage across the pore to form a complex between the pore and the helicase or construct. 根据权利要求17-21任一项所述的方法,其中所述孔为纳米孔或跨膜孔,或所述孔选自生物孔、固态孔或生物与固态杂交的孔。The method according to any one of claims 17 to 21, wherein the pore is a nanopore or a transmembrane pore, or the pore is selected from a biological pore, a solid-state pore, or a biological and solid-state hybrid pore. 根据权利要求22所述的方法,所述生物孔衍生自溶血素、杀白细胞素、CsGG、耻垢分枝杆菌孔蛋白A(MspA)、孔蛋白B、孔蛋白C、孔蛋白D、外膜孔蛋白F、外膜孔蛋白G、外膜磷脂酶A、奈瑟氏菌属自转运脂蛋白、和WZA;The method of claim 22, wherein the biological pore is derived from hemolysin, leukocidin, CsGG, Mycobacterium smegmatis porin A (MspA), porin B, porin C, porin D, outer membrane porin F, outer membrane porin G, outer membrane phospholipase A, Neisseria autotransporter, and WZA; 所述固态孔衍生自石墨烯纳米孔、MoS2纳米孔、BN纳米孔或PA63纳米孔。The solid-state pores are derived from graphene nanopores, MoS2 nanopores, BN nanopores or PA63 nanopores. 一种形成用于表征目标分析物的传感器的方法,包括在(a)孔和(b)权利要求1-6任一项所述的解旋酶或多肽或权利要求7-10任一项所述的构建体之间形成复合体并由此形成用于表征所述目标分析物的传感器。A method for forming a sensor for characterizing a target analyte, comprising forming a complex between (a) a pore and (b) a helicase or polypeptide as described in any one of claims 1 to 6 or a construct as described in any one of claims 7 to 10 and thereby forming a sensor for characterizing the target analyte. 一种用于表征目标分析物的传感器,包括(a)孔和(b)权利要求1-6任一项所述的解旋酶或多肽、或权利要求7-10任一项所述的构建体之间的复合体。A sensor for characterizing a target analyte, comprising a complex between (a) a pore and (b) a helicase or polypeptide according to any one of claims 1 to 6, or a construct according to any one of claims 7 to 10. 权利要求1-6任一项所述的解旋酶或多肽、或权利要求7-10任一项所述的构建体在控制目标分析物穿过孔的移动中的应用。Use of the helicase or polypeptide according to any one of claims 1 to 6, or the construct according to any one of claims 7 to 10, in controlling the movement of a target analyte through a pore. 一种用于表征目标分析物的试剂盒,包括(a)孔和(b)权 利要求1-6任一项所述的解旋酶或多肽、或权利要求7-10任一项所述的构建体。A kit for characterizing a target analyte, comprising (a) a well and (b) a weight The helicase or polypeptide according to any one of claims 1-6, or the construct according to any one of claims 7-10. 一种用于表征目标分析物的装置,包括(a)多个孔和(b)多个权利要求1-6任一项所述的解旋酶或多肽、或权利要求7-10任一项所述的构建体。A device for characterizing a target analyte, comprising (a) a plurality of pores and (b) a plurality of helicases or polypeptides according to any one of claims 1 to 6, or a construct according to any one of claims 7 to 10. 一种制备权利要求1-5任一项所述的解旋酶的方法,包括:A method for preparing the helicase according to any one of claims 1 to 5, comprising: (a)提供解旋酶,优选Dda解旋酶,更优选T4-Dda解旋酶;和(a) providing a helicase, preferably a Dda helicase, more preferably a T4-Dda helicase; and (b)在所述解旋酶的蛋白内形成异位键以制备权利要求1-5任一项所述的解旋酶;(b) forming an ectopic bond within the protein of the helicase to prepare the helicase according to any one of claims 1 to 5; 优选的,在步骤(b)中,还包括将所述解旋酶与多核苷酸接触的步骤,Preferably, in step (b), the step of contacting the helicase with the polynucleotide is also included. 优选的,所述异位键包括使用H2O2、Hemin和H2O2、HRP酶和H2O2、转谷氨酰胺酶、EDC、或Ru催化形成,更优选的,HRP酶和H2O2、或EDC。Preferably, the ectopic bond is formed by catalysis using H 2 O 2 , Hemin and H 2 O 2 , HRP enzyme and H 2 O 2 , transglutaminase, EDC, or Ru, more preferably, HRP enzyme and H 2 O 2 , or EDC. 一种制备权利要求7-10任一项所述的构建体的方法,包括将权利要求1-6任一项所述的解旋酶或多肽与所述衔接体连接并由此制备所述的构建体。A method for preparing the construct according to any one of claims 7-10, comprising connecting the helicase or polypeptide according to any one of claims 1-6 to the adapter and thereby preparing the construct. 连接到目标分析物的两个或更多个解旋酶的组,其中所述两个或更多个解旋酶中至少一个为权利要求1-5任一项所述的解旋酶。 A group of two or more helicases attached to a target analyte, wherein at least one of the two or more helicases is the helicase described in any one of claims 1-5.
PCT/CN2024/121291 2023-09-27 2024-09-26 Helicase and use thereof Pending WO2025067293A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN202311261708.8 2023-09-27
CN202311266570.0A CN119709690A (en) 2023-09-27 2023-09-27 Enzyme modification method for nanopore sequencing
CN202311261708.8A CN119709689A (en) 2023-09-27 2023-09-27 Helicase and its application
CN202311266570.0 2023-09-27

Publications (1)

Publication Number Publication Date
WO2025067293A1 true WO2025067293A1 (en) 2025-04-03

Family

ID=95204362

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/121291 Pending WO2025067293A1 (en) 2023-09-27 2024-09-26 Helicase and use thereof

Country Status (1)

Country Link
WO (1) WO2025067293A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105899678A (en) * 2013-10-18 2016-08-24 牛津纳米孔技术公司 Modified enzymes
CN107109483A (en) * 2014-10-14 2017-08-29 牛津纳米孔技术公司 Method
CN107109380A (en) * 2014-10-07 2017-08-29 牛津纳米孔技术公司 Modified enzymes
WO2021253410A1 (en) * 2020-06-19 2021-12-23 北京齐碳科技有限公司 Pif1-like helicase and application thereof
WO2022213253A1 (en) * 2021-04-06 2022-10-13 成都齐碳科技有限公司 Modified prp43 helicase and use thereof
CN116497102A (en) * 2021-12-21 2023-07-28 成都齐碳科技有限公司 Adaptors, methods for characterizing polynucleotides of interest and uses thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105899678A (en) * 2013-10-18 2016-08-24 牛津纳米孔技术公司 Modified enzymes
CN107109380A (en) * 2014-10-07 2017-08-29 牛津纳米孔技术公司 Modified enzymes
CN107109483A (en) * 2014-10-14 2017-08-29 牛津纳米孔技术公司 Method
WO2021253410A1 (en) * 2020-06-19 2021-12-23 北京齐碳科技有限公司 Pif1-like helicase and application thereof
WO2022213253A1 (en) * 2021-04-06 2022-10-13 成都齐碳科技有限公司 Modified prp43 helicase and use thereof
CN116497102A (en) * 2021-12-21 2023-07-28 成都齐碳科技有限公司 Adaptors, methods for characterizing polynucleotides of interest and uses thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DATABASE PROTEIN 28 April 1993 (1993-04-28), "DNA helicase, partial [Escherichia phage T4]", XP093297050, Database accession no. AAA32488.1 *
HE, XIAOPING ET AL.: "The T4 Phage SF1B Helicase Dda is Structurally Optimized to Perform DNA Strand Separation", STRUCTURE, vol. 20, 3 July 2012 (2012-07-03), pages 1189 - 1200, XP028399006, DOI: 10.1016/j.str.2012.04.013 *
WANUNU MENI: "Back and forth with nanopore peptide sequencing", NATURE BIOTECHNOLOGY, NATURE PUBLISHING GROUP US, NEW YORK, vol. 40, no. 2, 20 January 2022 (2022-01-20), New York, pages 172 - 173, XP037691462, ISSN: 1087-0156, DOI: 10.1038/s41587-021-01205-x *

Similar Documents

Publication Publication Date Title
US20230374583A1 (en) Method of target molecule characterisation using a molecular pore
US11560589B2 (en) Enzyme stalling method
CN104126018B (en) Enzymatic process
EP3619224B1 (en) Transmembrane pore consisting of two csgg pores
CN106460061B (en) Methods for characterizing double-stranded nucleic acid molecules using nanopores and anchor molecules at both ends of the double-stranded nucleic acid molecules
EP3440098B1 (en) Mutant pore
EP2836506B1 (en) Mutant lysenin pores
EP3207157B1 (en) Methods for delivering an analyte to transmembrane pores
EP3259281B1 (en) Hetero-pores
CN113912683B (en) Mutant of porin monomer, protein hole and application thereof
CN113773373B (en) Mutant of porin monomer, protein hole and application thereof
CN116601297A (en) A modified helicase and its application
CN114854826A (en) Sequences, linkers comprising sequences and uses thereof
US20240240248A1 (en) Methods for complement strand sequencing
WO2025067293A1 (en) Helicase and use thereof
CN119709689A (en) Helicase and its application
EP4362028A1 (en) Mutant aerolysin and uses thereof
CN119753110A (en) Adapter for characterizing analytes, characterization method and use thereof
HK40075988A (en) Sequences, linkers comprising sequences and uses thereof
WO2023060422A1 (en) Mutant of porin monomer, protein pore and use thereof
WO2023060419A1 (en) Mutant of porin monomer, protein pore and use thereof
WO2023060420A1 (en) Mutant of porin monomer, protein pore, and use thereof
WO2023060421A1 (en) Mutant of porin monomer, protein pore and use thereof
WO2023050031A1 (en) Mutant of porin monomer, protein pore and use thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24870837

Country of ref document: EP

Kind code of ref document: A1