[go: up one dir, main page]

WO2023030146A1 - Polypeptide composition analysis method based on copper ion modified mspa nanopores - Google Patents

Polypeptide composition analysis method based on copper ion modified mspa nanopores Download PDF

Info

Publication number
WO2023030146A1
WO2023030146A1 PCT/CN2022/114670 CN2022114670W WO2023030146A1 WO 2023030146 A1 WO2023030146 A1 WO 2023030146A1 CN 2022114670 W CN2022114670 W CN 2022114670W WO 2023030146 A1 WO2023030146 A1 WO 2023030146A1
Authority
WO
WIPO (PCT)
Prior art keywords
amino acid
medium
nanopore
signal
polypeptide
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2022/114670
Other languages
French (fr)
Chinese (zh)
Inventor
耿佳
张明
王紫纯
魏于全
陈山川
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan University
Original Assignee
Sichuan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan University filed Critical Sichuan University
Publication of WO2023030146A1 publication Critical patent/WO2023030146A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/483Physical analysis of biological material
    • G01N33/487Physical analysis of biological material of liquid biological material
    • G01N33/48707Physical analysis of biological material of liquid biological material by electrical means
    • G01N33/48721Investigating individual macromolecules, e.g. by translocation through nanopores
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P13/00Preparation of nitrogen-containing organic compounds
    • C12P13/04Alpha- or beta- amino acids
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N27/00Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
    • G01N27/26Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating electrochemical variables; by using electrolysis or electrophoresis
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/483Physical analysis of biological material
    • G01N33/487Physical analysis of biological material of liquid biological material
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/483Physical analysis of biological material
    • G01N33/487Physical analysis of biological material of liquid biological material
    • G01N33/48707Physical analysis of biological material of liquid biological material by electrical means
    • G01N33/48728Investigating individual cells, e.g. by patch clamp, voltage clamp
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6806Determination of free amino acids

Definitions

  • the invention belongs to the field of nanopore detection, and in particular relates to a method for analyzing polypeptide components based on copper ion-modified MspA nanopores.
  • Nanopore sensing is a single-molecule sensing technology with a detection principle similar to that of a Kurt counter.
  • the technique features real-time and direct monitoring at the single-molecule level and generally does not require labeling or modification of the analyte.
  • the present invention provides a method for analyzing polypeptide components based on M2MspA-N91H nanopore, which is characterized in that it comprises the following steps:
  • S1 mixes the exopeptidase and the polypeptide to be detected evenly and reacts to obtain a hydrolyzate
  • the nanopore system comprising: M2MspA-N91H
  • the M2MspA-N91H nanopore is embedded in the insulating film, and the M2MspA-N91H nanopore provides communication between the first medium and the second medium channel, the hydrolyzate is added to the first medium;
  • S3 applies a driving force between the first medium and the second medium, and free amino acids in the hydrolyzate pass through the M2MspA-N91H nanopore and generate an electrical signal;
  • S4 performs data analysis on the electrical signal to obtain amino acid type information of the free amino acid.
  • the free amino acids include natural amino acids and unnatural amino acids.
  • the natural amino acids include glycine, alanine, valine, leucine, isoleucine, phenylalanine, tryptophan, tyrosine, aspartic acid, histamine
  • acid asparagine, glutamic acid, lysine, glutamine, methionine, arginine, serine, threonine, cysteine, proline.
  • the unnatural amino acid includes one or more of citrulline, L-norvaline, and amarine.
  • the exopeptidase comprises aminopeptidase and/or carboxypeptidase.
  • the M2MspA-N91H nanopore is bound to copper ions before the hydrolyzate is added to the nanopore system.
  • the pH values of the first medium and the second medium comprise 6.5-7.5.
  • the final concentration of the polypeptide solution to be detected is greater than 10 ⁇ M.
  • the data analysis step described in S4 further includes:
  • the signal filtering condition is to exclude electrical signals whose SD value is greater than the SD threshold, wherein the SD value of the electrical signal is calculated according to the following formula:
  • x i is the current value of each collection point of the signal
  • x is the average value of the signal current point
  • n is the number of signal collection points.
  • the method further includes S5 comparing the amino acid type information of the free amino acid with the expected sequence information of the polypeptide to be detected, so as to determine that the actual sequence of the polypeptide to be detected is different from that of the polypeptide to be detected. whether there is a difference in the expected sequence of ; or
  • S5 compares the amino acid type information of the free amino acid combined with the information of the exopeptidase with the polypeptide sequence information in the database to predict the actual sequence of the polypeptide to be detected.
  • Nanopores of the prior art can only detect amino acids with specific functional groups, but experiments in the present invention show that M2MspA-N91H nanopores (porin A derived from Mycobacterium smegmatis) have versatility in detecting amino acids , which can robustly and sensitively detect and distinguish 20 common amino acids and unnatural amino acids at the single-molecule level. Furthermore, 20 common amino acids and unnatural amino acids can block M2MspA-N91H nanopores and cause corresponding blockage without additional introduction of enzymes into the M2MspA-N91H nanopore system and/or modification of amino acids (e.g., adding luminescent labels). current signal. The direct detection and differentiation of various amino acids can be realized by analyzing different current signals.
  • the present invention provides a method for analyzing polypeptide components based on M2MspA-N91H nanopores. According to the blocking rate of the standard signal in the established standard signal database, the amino acid type corresponding to the detected free amino acid signal can be judged, and then the component analysis of the polypeptide to be detected can be realized to a certain extent.
  • the polypeptide component analysis method provided by the present invention is suitable for various application scenarios such as the expression of tumor neoantigens, the judgment of the synthesis quality of polypeptide drugs, and the prediction of unknown polypeptide sequences.
  • Figure 1 is a diagram of the structure and electrophysiological properties of M2MspA-N91H nanopores
  • Fig. 2 is when adding different concentrations of copper ions, M2MspA-N91H nanopore single hole current signal diagram (1M KCl, pH 6);
  • Fig. 3 is the electric current signal figure that M2MspA-N91H nanopore carries out amino acid detection under the concentration of 3M KCl;
  • Fig. 4 is a scatter diagram of amino acid volume vs. current signal blocking rate (labeled letters are amino acid abbreviations);
  • Figure 5 is a diagram of blocking rate of amino acid current signal vs. blocking time (labeled letters are amino acid abbreviations);
  • Figure 6 is the experimental diagram of single amino acid signal extracted by Clampfit (A is the single-channel statistical result of Clampfit; B is the Amp S.D. In the second experiment, the blank group, the asp experimental group, and the asp+glu experimental group counted all the signals for 2 minutes); D is the source of the -5pA background signal; E is the source of the -12pA background signal);
  • Figure 7 is the current signal diagram measured after the carboxypeptidase hydrolyzate was added to the nanopore system
  • A is carboxypeptidase under the conditions of 0.5M KCl, 10mM MOPS, pH 7.5, 10U carboxypeptidase hydrolyzed 10mM EF polypeptide for 20 minutes and then added
  • B is a scatter diagram involving the blocking rate and passage time of all current signals obtained in the experiment
  • C is the nanopore detection of three polypeptides (P1, P2, P3) hydrolyzate);
  • Figure 8 is the experimental diagram of the M2MspA-N91H nanopore detection of unnatural amino acids (A is the statistical results of three unnatural amino acids; B is the current of the blank group (Blank) and the addition of amarin in the same experiment, within 1 minute blocking signal scatterplot);
  • Fig. 9 is the current trajectory diagram of the M2MspA-N91H nanopore after adding copper ions under three kinds of pH conditions;
  • Fig. 10 is a schematic diagram of the nanopore detection polypeptide hydrolyzate of the present invention.
  • FIG. 11 is a flowchart of establishing a standard signal database in an embodiment of the present invention.
  • Fig. 12 is a flow chart of data analysis in the embodiment of the present invention.
  • the term "about” typically means +/- 5% of the stated value, more typically +/- 4% of the stated value, more typically +/- 4% of the stated value /-3%, more typically +/-2% of the stated value, even more typically +/-1% of the stated value, even more typically +/-0.5% of the stated value.
  • a “nanopore system” includes pores having nanoscale dimensions (referred to simply as “nanopores”), an insulating membrane, a first medium, and a second medium.
  • the nanopores are M2MspA-N91H nanopores.
  • the pores having nanoscale dimensions allow translocation of the analyte from one side of the insulating film to the other.
  • those skilled in the art can also modify the above-mentioned M2MspA-N91H nanopore according to the actual situation (for example, any mutation, truncation, fusion, chemical modification, etc.) to obtain the corresponding variant, and the modification means are the present well known in the field.
  • the nanoscale-sized pores are embedded in the insulating film, and the insulating film (also can be understood as a composite of the nanoscale-sized pores and the insulating film ) separates the first medium from the second medium, and the channels having nanometer-sized pores provide passages connecting the first medium and the second medium; After a driving force is applied between the second medium, the analyte located in the first medium interacts with the M2MspA-N91H nanopore to form a current (ie, an electrical signal).
  • first medium refers to the medium in which the analyte is added to the nanopore system
  • second medium refers to the two parts of the medium separated by the insulating film. , the other side of the "first medium”.
  • the driving force refers to the force that drives the analyte to interact with the nanopore by means of potential, electroosmotic flow, concentration gradient and the like.
  • the first medium and the second medium may be the same or different, and the first medium and the second medium may comprise electrically conductive fluids.
  • the conductive liquid is an aqueous alkali metal halide solution, specifically sodium chloride (NaCl), lithium chloride (LiCl), cesium chloride (CsCl), potassium chloride (KCl), and sodium bromide (NaBr).
  • the concentrations of the conductive liquid contained in the first medium and the second medium are different, in other words, the concentrations of the conductive liquid in the first medium and the second medium exist The difference, and then there is a difference in the osmotic pressure on both sides of the insulating film.
  • the first medium and/or the second medium may further comprise a buffer, such as HEPES, MOPS.
  • the concentration range of the first medium and/or the second medium may be 0.3M-3M.
  • the pH range of the first medium and/or the second medium is 6-9 (preferably 6.5-7.5).
  • An insulating membrane refers to a membrane that has the ability to host nanopores and block ionic currents passing through non-nanopores.
  • the insulating film may include a phospholipid film and/or a polymer film.
  • Exemplary phospholipid membranes include DPHPC, DOPC, E.coli lipid, and exemplary polymer membranes include triblock copolymer polymer membranes.
  • the nanopore system includes two electrolyte chambers, which are separated by an insulating membrane to form a trans (-trans) compartment and a cis (-cis) compartment, the The pores of the M2MspA-N91H channel are embedded in the insulating film, and only the M2MspA-N91H channel on the insulating film communicates with the above two electrolyte chambers.
  • electrolyte ions in solution in the electrolyte compartments move by electrophoresis and pass through the M2MspA-N91H channel.
  • the analyte is a charged substance. An analyte is charged if it has a net charge.
  • the analyte can be negatively charged or positively charged.
  • An analyte is negatively charged if it has a net negative charge.
  • An analyte is positively charged if it has a net positive charge.
  • Suitable analytes are preferably amino acids.
  • the amino acid may be a natural amino acid.
  • Natural amino acids are naturally occurring amino acids and their post-translationally modified variants, such as the 20 common amino acids (glycine, alanine, valine, leucine, isoleucine, methionine, proline , tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine), etc.
  • the analyte may be an unnatural amino acid.
  • "Unnatural amino acid” means an amino acid that is not a proteinogenic amino acid or a post-translationally modified variant thereof (i.e. an amino acid other than the 20 common amino acids or pyrrolysine or selenocysteine or a post-translationally modified variant thereof) .
  • the unnatural amino acid is citrulline, L-norvaline, and amanitine.
  • the analyte may be in contact with either side of the nanopore on both sides of the insulating film.
  • the analyte may be in contact with either side of the insulating film, so that the analyte passes through the channel of the nanopore to reach the other side of the insulating film.
  • the analyte interacts with the nanopore as it passes through the insulating membrane via the passage of the pore.
  • the analyte can be in contact with the side of the insulating film, and the side of the insulating film can make the analyte interact with the nanopore so that it is separated from the nanopore and stays in the nanopore. on the same side as the insulating film.
  • the analyte can interact with the nanopore in any manner and at any location.
  • the analyte can also hit the nanopore, interact with the nanopore, make it separate from the nanopore and stay on the same side of the insulating membrane.
  • the analyte affects the current flowing through the nanopore in a manner specific to the analyte, that is, the current flowing through the nanopore
  • the current is characteristic for a particular DUT.
  • Control experiments can be performed to determine the effect of a particular analyte on the current flowing through the nanopore, and then to identify the particular analyte in the sample or to determine the presence of the particular analyte in the sample. More specifically, the presence or absence or concentration of the analyte can be identified based on the comparison of the current pattern obtained by detecting the analyte with a known current pattern obtained using a known analyte under the same conditions wait.
  • the nanopore system of the present invention may also include one or more measurement devices that measure the current flowing through the nanopore, such as patch clamp amplifiers or data acquisition devices.
  • the present invention also provides a method for analyzing polypeptide components based on the M2MspA-N91H nanopore, which is characterized in that it comprises the following steps:
  • S1 mixes the exopeptidase and the polypeptide to be detected evenly and reacts to obtain a hydrolyzate
  • the nanopore system comprising: M2MspA-N91H
  • the M2MspA-N91H nanopore is embedded in the insulating film, and the M2MspA-N91H nanopore provides communication between the first medium and the second medium channel, the hydrolyzate is added to the first medium;
  • S3 applies a driving force between the first medium and the second medium, and the free amino acids in the hydrolyzate pass through the M2MspA-N91H nanopore and generate an electrical signal;
  • S4 performs data analysis on the electrical signal to obtain amino acid type information of the free amino acid.
  • Polypeptide refers to a peptide or protein comprising two or more amino acids linked by peptide bonds. Polypeptides may contain natural, modified or synthetic amino acids. Polypeptides can also be modified naturally (e.g., by post-translational processing) or chemically (e.g., amidation, acylation, cross-linking, etc.). In the present invention, the sequence information of the polypeptide may be known or unknown.
  • Exopeptidase refers to a hydrolase that removes terminal amino acids of polypeptides by cleaving peptide bonds. Terminal amino acids are amino acids within about 10 amino acids of the N- or C-terminus of a polypeptide. Accordingly, "exopeptidases” are classified into aminopeptidases that hydrolyze amino acids one by one from the N-terminus of polypeptides and carboxypeptidases that hydrolyze amino acids one by one from the C-terminus of polypeptides. Exemplary aminopeptidases include cysteinyl aminopeptidase, prolyl aminopeptidase, arginyl aminopeptidase. Exemplary carboxypeptidases include carboxypeptidases A, B, C, Y. In the present invention, the exopeptidase may be carboxypeptidase and/or aminopeptidase.
  • the exopeptidase can completely or partially hydrolyze the polypeptide to be detected.
  • carboxypeptidase A1 can cut C-terminal amino acids except lysine (K), arginine (R), proline (P) (that is, the stop point of hydrolysis is R, K, P).
  • the polypeptide to be detected does not contain the three amino acids R, K, and P, the polypeptide to be detected will be completely hydrolyzed by carboxypeptidase A, and the amino acid type information of the free amino acids in the obtained hydrolyzate is The amino acid composition of the polypeptide to be detected.
  • the polypeptide to be detected contains one or more of the three amino acids R, K, and P, the polypeptide to be detected will be partially hydrolyzed by carboxypeptidase A (stop hydrolysis at the stop point), and the obtained hydrolyzate
  • the amino acid type information of the free amino acid is the amino acid composition of the polypeptide to be detected before the stop point.
  • the amino acid type information of the free amino acids in the obtained hydrolyzate can be compared with the expected sequence information of the polypeptide to be detected to determine whether the actual sequence of the polypeptide to be detected is consistent with the expected sequence of the polypeptide to be detected. Describe whether there are differences in the expected sequence of the polypeptide to be detected (for example, some (some) amino acids that should appear in the hydrolyzate do not appear, and some (some) amino acids that should not appear in the hydrolyzate appear), which is suitable for tumor neogenesis Application scenarios such as the expression of antigens and the judgment of the synthetic quality of peptide drugs.
  • the sequence of the polypeptide to be detected is unknown, it can be based on the amino acid type information of the free amino acids in the obtained hydrolyzate and in combination with the information of the exopeptidase used (for example, carboxypeptidase or aminopeptidase, whether there is a "hydrolysis stop point” ), compare in the database by means of machine learning, etc., and then predict the sequence of the polypeptide to be detected or narrow the possible range of the polypeptide to be detected to a certain extent (for example, starting at the N-terminus or C-terminus, amino acid Composition (including defined species of free amino acids and possible species corresponding to "hydrolysis stop points").
  • Buffer 0.3M-3M salt solution such as KCl and NaCl (preferably 1M aqueous KCl solution), 10mM MOPS;
  • Copper ion concentration range 1nM to 200 ⁇ M
  • pH range 6-9 (preferably 6.5-7.5).
  • Single-channel current recordings were amplified using an Axopatch 200B amplifier (Molecular Devices) and filtered at 2 kHz using a built-in four-hole low-pass Bessel filter. Data was digitized by a Digidata 1550B converter (Molecular Devices) at a sampling rate of 100kHz. Extract the current signal file obtained by sampling to obtain amino acid signal data.
  • Single amino acid signals can be identified and defined by data such as signal blocking rate, blocking time, and SD.
  • the SD value is calculated to obtain the degree of deviation between the current value of each sampling point of the signal and the overall average value, so as to distinguish the single amino acid signal from the background signal and the superimposed signal of multiple amino acids.
  • the buffer conditions for the experiments were as described above.
  • the experimental temperature can be set at 15°C-60°C (preferably 25°C).
  • mix exopeptidase such as carboxypeptidase
  • polypeptide according to a certain concentration ratio, such as 10U carboxypeptidase, 0.5M KCl, 10mM MOPS, pH 7.5, 10mM polypeptide (Figure 7A), 25°C
  • concentration ratio such as 10U carboxypeptidase, 0.5M KCl, 10mM MOPS, pH 7.5, 10mM polypeptide ( Figure 7A)
  • the M2MspA-N91H nanopore has an 8-mer structure, and the 8 amino acid residues at the narrowest part of the channel are oriented toward the central axis of the M2MspA-N91H nanopore.
  • Adding Cu 2+ into the trans chamber and observing the current trajectory, Cu 2+ can significantly reduce the current fluctuation (Fig. 1).
  • the reduction of the current level is related to the concentration of Cu 2+ added (Fig. 2), and the copper ion concentration can range from 1 nM to 200 ⁇ M.
  • MspA is assembled by eight identical monomers to form a channel
  • amino acid sequence of M2MspA-N91H is:
  • non-overlapping amino acids include G, S, A, T, N, R, V, Q, M, L, I, D, F, W, E, a total of 15
  • the signals of these amino acids do not overlap and can be distinguished from each other.
  • the above-mentioned method of directly detecting 20 kinds of amino acids may generate background noise and interfere with subsequent analysis.
  • the experimenters of the present invention calculated the SD value of the current value distribution of a single amino acid signal using the following formula, and obtained the degree of deviation between the current value of each collection point of the signal and the overall average value. In this way, the single amino acid signal can be distinguished from the background signal and the superimposed signal of multiple amino acids.
  • x i is the current value of each collection point of the signal
  • x is the average value of the signal current point
  • n is the number of signal collection points.
  • the upper, middle and lower figures are the blank group (Blank), the addition of aspartic acid (Asp) and the addition of a mixture of aspartic acid and glutamic acid (Asp+Glu) in the same experiment, respectively.
  • the SD threshold can be set to filter the signal, for example, the SD threshold can be set to 2, where the signal (blue) with an SD value less than 2 is considered to be a valid signal; the signal with an SD value greater than 2 is considered to be a background signal Simultaneous passage of one or more amino acids produces superimposed signals, which can therefore be filtered. After filtering, only look at the blue signal, and you can clearly see the newly generated signal peak after adding aspartic acid and glutamic acid. Using the background noise processing method, amino acid signals can be identified and counted more accurately.
  • the experimenters of the present invention used Clampfit software to extract single amino acid signals.
  • the background signal and double-step signal were filtered out, and the single amino acid signal was retained.
  • the "Amp S.D.” value indicates the signal fitting effect.
  • the SD value corresponding to the specific signal is shown in Figure 6B; it can be seen from the six signals shown that the SD value of the double-step signal is generally greater than 4.
  • a single-step signal means that the current density distribution curve of the electrical signal presents a single-peak distribution, and the current has no distribution between 0 and 60% of the peak value.
  • the double-step signal means that the current density distribution curve of the electrical signal presents a bimodal distribution.
  • the SD threshold is set in the range of 0-2, which can filter most multi-step signals.
  • background signal interference still exists around -5pA and -12pA.
  • the signal source near -5pA is shown in Figure 6D, because its corresponding level 0 deviates too much from the baseline and can be filtered out in the later stage.
  • the signal around -12pA (Fig. 6E) is a normal background signal with a short blocking time. The background signal only interferes with the nearby signals of gly, ser, and ala, and the signals of these three amino acids can be distinguished.
  • the reserved electrical signal is a single-step electrical signal: if the reserved electrical signal is a single-step electrical signal, then include the single-step electrical signal into the standard signal database, and regard the single-step electrical signal as the amino acid Standard signal; if the retained electrical signal is not a single-step electrical signal (such as a double-step electrical signal), reset the filter condition (for example, increase the SD threshold), and regain the retained electrical signal, and then judge again the retained Whether the electrical signal is a single-step electrical signal. Finally, a standard signal database is established using single-step electrical signals.
  • Example 3 Using M2MspA-N91H nanopore to detect three kinds of unnatural amino acids
  • the upper and lower graphs are the scatter diagrams of the current blocking signal within 1 minute of the blank group (Blank) and the addition of amarin in the same experiment, respectively. Signals with SD values less than 2 were considered valid signals, and after filtering, it was clearly seen that a new signal peak was generated after the addition of amarine.
  • Polypeptides can be used as substrates to participate in enzymatic reactions, and the above methods for detecting amino acids can be used to continuously monitor enzymatic reactions: By detecting the types of free amino acids in the reaction system, it is possible to determine whether the enzymatic reaction occurs and the progress of the enzymatic cascade reaction .
  • Example 4 Method for analyzing polypeptide components based on M2MspA-N91H nanopore:
  • Carboxypeptidase A1 (a type of exopeptidase) can hydrolyze amino acids one by one from the C-terminus of the polypeptide, and the free amino acid products obtained by hydrolysis are captured by nanopores to be detected. According to the appearance of amino acid signals The type and quantity can determine the peptide composition.
  • the final concentration of the polypeptide solution can be calculated according to the following formula (assuming that the polypeptide can be completely hydrolyzed):
  • a is the length of the polypeptide
  • b is the amount of substance ( ⁇ mol)
  • d is the activity of exopeptidase (U)
  • c is the reaction time (min).
  • the detection effect when the concentration of free amino acids is 1 ⁇ M, the detection effect is better; if the final concentration of free amino acids is lower than 1 ⁇ M, the signal of free amino acids may be reduced, thereby affecting the detection effect.
  • the hydrolyzate (reaction product) may be diluted at least 10 times after being added to the M2MspA-N91H nanopore system, therefore, based on the above formula, the final concentration of the polypeptide solution in the reaction system should be greater than 10 times of 1 ⁇ M (ie 10 ⁇ M) .
  • the terminal amino acid of the polypeptide is hydrolyzed by exopeptidase to obtain a hydrolyzate (reaction product).
  • reaction product After the above reaction product is added to the M2MspA-N91H nanopore system and a driving force is applied, free amino acids in the reaction product pass through the M2MspA-N91H nanopore to generate an electrical signal, and obtain corresponding electrical signal data.
  • the specific data analysis process is shown in Figure 12: First, the above reaction products are detected through the M2MspA-N91H nanopore to obtain electrical signal data. Second, build a standard signal database (as described above). Third, filter conditions (SD threshold of electrical signals) are set. Fourth, according to the filter condition, some electrical signals are excluded to obtain retained electrical signals. Fifth, determine the type of amino acid corresponding to the retained electrical signal based on the blocking rate of the standard signal in the standard signal database. Sixth, the amino acid type information of the free amino acid is obtained.
  • SD threshold of electrical signals filter conditions
  • FIG. 7 shows the analysis of the composition of four different polypeptides based on the above polypeptide composition analysis method and data analysis method.
  • Figure 7A is the current signal diagram obtained by adding the reaction solution to the nanopore system after hydrolyzing 10mM EF polypeptide with 10U carboxypeptidase under the conditions of 0.5M KCl, 10mM MOPS, and pH 7.5 for 20 minutes.
  • FIG. 7B is a scatter diagram of FIG. 7A related to the blocking rate and hole-through time of all current signals obtained in the experiment.
  • the signal (red) with SD value between 1-2 has 3 distribution peaks, and the signal of EF polypeptide near the blocking rate of 0.4; the left two peaks are related to glutamic acid (E) and phenylalanine (The blocking rate of F) is the closest, so it is judged as the signal of amino acids E and F.
  • Fig. 7 C shows that M2MspA-N91H nanopore detects respectively three kinds of polypeptides (P1: APRLR FYSL ; P2: DRVYIHP FHL ; P3: RPVKVYPNGAEDESAEAFP LEF ; wherein underline represents that these amino acids of polypeptide C terminal can be hydrolyzed as free amino acids by carboxypeptidase A1)
  • Experimental results of hydrolyzate show that the polypeptide component analysis method provided by the present invention can analyze the terminal amino acid composition of the polypeptide.
  • polypeptide composition analysis method provided by the present invention can be applied to many different scenarios, such as the identification of known polypeptides and the prediction of unknown polypeptides:
  • Tumor neoantigens which are not expressed in normal tissues but only expressed in tumor tissues, are ideal targets for immunotherapy.
  • Each patient has its unique neoantigen spectrum, and the above-mentioned peptide component analysis method can be used to screen tumor neoantigen peptides: first, use a highly specific extraction method to extract and purify known tumor cell surface antigen peptides, and the purified The mixture theoretically contains only a small amount of antigenic peptides with known components; secondly, use the above-mentioned peptide composition analysis method to detect the purified mixture; finally, compare the amino acid species detected in the test results with the amino acid species of known antigenic peptides In contrast, it can be judged whether certain amino acids are produced or lacked, and then judged whether neoplastic antigens may exist on the surface of tumor cells.
  • the detection of the purity of the polypeptide drug can be realized: comparing the detection results of the amino acid composition of the polypeptide drug to be detected with the expected synthetic amino acid type can judge the synthetic quality of the polypeptide drug to a certain extent .
  • polypeptide composition analysis method can also be combined with machine learning to predict unknown polypeptides: hydrolyze the unknown polypeptide to be detected with exonuclease and detect free amino acids; use machine learning methods to combine the type of exonuclease , hydrolysis characteristics, and amino acid composition detection results, compared with known peptides in the database, can predict the sequence of the unknown peptide to a certain extent, and at the same time play a role in narrowing the range of unknown peptides.

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Immunology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Analytical Chemistry (AREA)
  • General Physics & Mathematics (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Pathology (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Biophysics (AREA)
  • Organic Chemistry (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Zoology (AREA)
  • Electrochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Cell Biology (AREA)
  • Wood Science & Technology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Nanotechnology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Chemical & Material Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Peptides Or Proteins (AREA)

Abstract

Provided is a polypeptide composition analysis method based on M2MspA-N91H nanopores, comprising the following steps: S1: uniformly mixing and reacting exopeptidase with a polypeptide to be tested, and obtaining a hydrolysate; S2: adding the hydrolysate into a nanopore system, where the nanopore system comprises: M2MspA-N91H nanopores, an insulating film, a first medium, and a second medium, wherein the M2MspA-N91H nanopores are embedded in the insulating film, the M2MspA-N91H nanopores provide a channel communicating the first medium and the second medium, and the hydrolysate is added into the first medium; S3: applying a driving force between the first medium and the second medium, free amino acids in the hydrolysate passing through the M2MspA-N91H nanopores and generating electrical signals; S4: performing data analysis on the electrical signals to obtain amino acid type information for the free amino acids.

Description

一种基于铜离子修饰MspA纳米孔的多肽成分分析方法A method for analyzing peptide components based on copper ion-modified MspA nanopores

本申请要求2021年08月30日提交的中国发明专利申请【CN2021110067648】、名称为“基于纳米孔检测多种氨基酸的方法及多肽组成成分分析方法”的优先权,该优先权发明专利申请以引用方式全文并入。This application claims the priority of the Chinese invention patent application [CN2021110067648] filed on August 30, 2021, entitled "A method for detecting multiple amino acids based on nanopores and a method for analyzing polypeptide components". The method is incorporated in its entirety.

技术领域technical field

本发明属于纳米孔检测领域,具体涉及一种基于铜离子修饰MspA纳米孔的多肽成分分析方法。The invention belongs to the field of nanopore detection, and in particular relates to a method for analyzing polypeptide components based on copper ion-modified MspA nanopores.

背景技术Background technique

纳米孔传感是一种单分子传感技术,有着类似于库尔特计数器(Kurt counter)的检测原理。该技术具有在单分子水平上实时和直接监测的特点,且一般不需要对分析物进行标记或修饰。这些优点使纳米孔成为生物传感和生物检测的新兴技术。Nanopore sensing is a single-molecule sensing technology with a detection principle similar to that of a Kurt counter. The technique features real-time and direct monitoring at the single-molecule level and generally does not require labeling or modification of the analyte. These advantages make nanopores an emerging technology for biosensing and biodetection.

由于氨基酸通过纳米孔时,易位过程中信噪比低和构象变化难以预测,这对于纳米孔直接区分并检测氨基酸仍是一项挑战。现有技术无法直接对20种氨基酸进行检测并实现良好区分,往往需要对纳米孔进行化学修饰或对氨基酸进行衍生化处理等。Due to the low signal-to-noise ratio and unpredictable conformational changes during translocation of amino acids through nanopores, it is still a challenge for nanopores to directly distinguish and detect amino acids. The existing technology cannot directly detect and distinguish the 20 kinds of amino acids, and often requires chemical modification of nanopores or derivatization of amino acids.

发明内容Contents of the invention

有鉴于此,本发明提供了一种基于M2MspA-N91H纳米孔的多肽成分分析方法,其特征在于,包括以下步骤:In view of this, the present invention provides a method for analyzing polypeptide components based on M2MspA-N91H nanopore, which is characterized in that it comprises the following steps:

S1将外肽酶与待检测多肽混合均匀并反应,得到水解产物;S1 mixes the exopeptidase and the polypeptide to be detected evenly and reacts to obtain a hydrolyzate;

S2将所述水解产物加入纳米孔系统中,所述纳米孔系统包括:M2MspA-N91HS2 adding the hydrolyzate into the nanopore system, the nanopore system comprising: M2MspA-N91H

纳米孔、绝缘膜、第一介质、第二介质,其中所述M2MspA-N91H纳米孔嵌入在所述绝缘膜中,所述M2MspA-N91H纳米孔提供连通所述第一介质和所述第二介质的通道,所述水解产物被加入至所述第一介质;nanopore, an insulating film, a first medium, and a second medium, wherein the M2MspA-N91H nanopore is embedded in the insulating film, and the M2MspA-N91H nanopore provides communication between the first medium and the second medium channel, the hydrolyzate is added to the first medium;

S3向所述第一介质和所述第二介质之间施加驱动力,所述水解产物中游离氨 基酸穿过所述M2MspA-N91H纳米孔并产生电信号;S3 applies a driving force between the first medium and the second medium, and free amino acids in the hydrolyzate pass through the M2MspA-N91H nanopore and generate an electrical signal;

S4对所述电信号进行数据分析,获得所述游离氨基酸的氨基酸种类信息。S4 performs data analysis on the electrical signal to obtain amino acid type information of the free amino acid.

在一个实施方案中,所述游离氨基酸包括天然氨基酸和非天然氨基酸。In one embodiment, the free amino acids include natural amino acids and unnatural amino acids.

在一个实施方案中,所述天然氨基酸包括甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、苯丙氨酸、色氨酸、酪氨酸、天冬氨酸、组氨酸、天冬酰胺、谷氨酸、赖氨酸、谷氨酰胺、甲硫氨酸、精氨酸、丝氨酸、苏氨酸、半胱氨酸、脯氨酸中的一种或多种。In one embodiment, the natural amino acids include glycine, alanine, valine, leucine, isoleucine, phenylalanine, tryptophan, tyrosine, aspartic acid, histamine One or more of acid, asparagine, glutamic acid, lysine, glutamine, methionine, arginine, serine, threonine, cysteine, proline.

在一个实施方案中,所述非天然氨基酸包括瓜氨酸、L-正缬氨酸、鹅膏氨酸中的一种或多种。In one embodiment, the unnatural amino acid includes one or more of citrulline, L-norvaline, and amarine.

在一个实施方案中,所述外肽酶包括氨肽酶和/或羧肽酶。In one embodiment, the exopeptidase comprises aminopeptidase and/or carboxypeptidase.

在一个实施方案中,在所述水解产物加入所述纳米孔系统之前,将所述M2MspA-N91H纳米孔与铜离子结合。In one embodiment, the M2MspA-N91H nanopore is bound to copper ions before the hydrolyzate is added to the nanopore system.

在一个实施方案中,所述第一介质和所述第二介质的pH值包括6.5-7.5。In one embodiment, the pH values of the first medium and the second medium comprise 6.5-7.5.

在一个实施方案中,所述待检测多肽溶液的终浓度大于10μM。In one embodiment, the final concentration of the polypeptide solution to be detected is greater than 10 μM.

在一个实施方案中,S4中所述数据分析步骤进一步包括:In one embodiment, the data analysis step described in S4 further includes:

i)建立标准信号数据库;i) Establish a standard signal database;

ii)设定信号过滤条件;ii) Set signal filtering conditions;

iii)排除部分电信号,获得保留的电信号;iii) Excluding part of the electrical signal to obtain the retained electrical signal;

iv)通过标准信号数据库中所述标准信号的阻塞率,判断所述保留的电信号对应的氨基酸种类;iv) judging the type of amino acid corresponding to the retained electrical signal according to the blocking rate of the standard signal in the standard signal database;

v)获得所述游离氨基酸的氨基酸种类信息。v) obtaining amino acid type information of the free amino acid.

在一个实施方案中,所述信号过滤条件为排除SD值大于SD阈值的电信号,其中所述电信号的SD值按照下列公式进行计算:In one embodiment, the signal filtering condition is to exclude electrical signals whose SD value is greater than the SD threshold, wherein the SD value of the electrical signal is calculated according to the following formula:

Figure PCTCN2022114670-appb-000001
Figure PCTCN2022114670-appb-000001

式中,x i为信号每个采集点的电流值,x为信号电流点的平均值,n为信号采集点的数量。 In the formula, x i is the current value of each collection point of the signal, x is the average value of the signal current point, and n is the number of signal collection points.

在一个实施方案中,所述方法进一步包括S5将所述游离氨基酸的氨基酸种类信息与所述待检测多肽的预期序列信息进行对比,以判断所述待检测多肽的实际序列与所述待检测多肽的预期序列是否存在差异;或In one embodiment, the method further includes S5 comparing the amino acid type information of the free amino acid with the expected sequence information of the polypeptide to be detected, so as to determine that the actual sequence of the polypeptide to be detected is different from that of the polypeptide to be detected. whether there is a difference in the expected sequence of ; or

S5将所述游离氨基酸的氨基酸种类信息结合所述外肽酶的信息与数据库中的多肽序列信息进行对比,以预测所述待检测多肽的实际序列。S5 compares the amino acid type information of the free amino acid combined with the information of the exopeptidase with the polypeptide sequence information in the database to predict the actual sequence of the polypeptide to be detected.

有益效果Beneficial effect

现有技术的纳米孔往往只能检测特定官能团的氨基酸,而本发明实验表明,M2MspA-N91H纳米孔(源自耻垢分枝杆菌(Mycobacterium smegmatis)的孔蛋白A)在检测氨基酸方面具有通用性,即可以在单分子水平上稳健且灵敏地检测和区分20种常见氨基酸和非天然氨基酸。此外,无需向M2MspA-N91H纳米孔系统中额外引入酶和/或对氨基酸进行修饰(例如,添加发光标记),20种常见氨基酸和非天然氨基酸都可以阻塞M2MspA-N91H纳米孔并引起相应的阻塞电流信号。通过分析不同电流信号可以实现对多种氨基酸的直接检测并区分。Nanopores of the prior art can only detect amino acids with specific functional groups, but experiments in the present invention show that M2MspA-N91H nanopores (porin A derived from Mycobacterium smegmatis) have versatility in detecting amino acids , which can robustly and sensitively detect and distinguish 20 common amino acids and unnatural amino acids at the single-molecule level. Furthermore, 20 common amino acids and unnatural amino acids can block M2MspA-N91H nanopores and cause corresponding blockage without additional introduction of enzymes into the M2MspA-N91H nanopore system and/or modification of amino acids (e.g., adding luminescent labels). current signal. The direct detection and differentiation of various amino acids can be realized by analyzing different current signals.

基于此,结合外肽酶(例如羧肽酶)的水解特性,本发明提供一种基于M2MspA-N91H纳米孔的多肽成分分析方法。可以根据建立的标准信号数据库中标准信号的阻塞率,判断检测到的游离氨基酸的信号对应的氨基酸种类,进而一定程度上实现对待检测多肽的成分分析。本发明提供的多肽成分分析方法适用于肿瘤新生抗原的表达和多肽药物的合成质量的判断,以及未知多肽序列的预测等多种应用场景。Based on this, combined with the hydrolysis properties of exopeptidases (such as carboxypeptidases), the present invention provides a method for analyzing polypeptide components based on M2MspA-N91H nanopores. According to the blocking rate of the standard signal in the established standard signal database, the amino acid type corresponding to the detected free amino acid signal can be judged, and then the component analysis of the polypeptide to be detected can be realized to a certain extent. The polypeptide component analysis method provided by the present invention is suitable for various application scenarios such as the expression of tumor neoantigens, the judgment of the synthesis quality of polypeptide drugs, and the prediction of unknown polypeptide sequences.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍。显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the following briefly introduces the drawings required for the description of the embodiments or the prior art. Apparently, the drawings in the following description are some embodiments of the present invention, and those skilled in the art can obtain other drawings according to these drawings without any creative effort.

图1为M2MspA-N91H纳米孔的结构与电生理特性图;Figure 1 is a diagram of the structure and electrophysiological properties of M2MspA-N91H nanopores;

图2为加入不同浓度的铜离子时,M2MspA-N91H纳米孔单孔电流信号图(1M KCl,pH 6);Fig. 2 is when adding different concentrations of copper ions, M2MspA-N91H nanopore single hole current signal diagram (1M KCl, pH 6);

图3为M2MspA-N91H纳米孔在3M KCl浓度下进行氨基酸检测的电流信号图;Fig. 3 is the electric current signal figure that M2MspA-N91H nanopore carries out amino acid detection under the concentration of 3M KCl;

图4为氨基酸体积v.s.电流信号阻塞率的散点图(标注字母为氨基酸缩写);Fig. 4 is a scatter diagram of amino acid volume vs. current signal blocking rate (labeled letters are amino acid abbreviations);

图5为氨基酸电流信号阻塞率v.s.阻塞时间图(标注字母为氨基酸缩写);Figure 5 is a diagram of blocking rate of amino acid current signal vs. blocking time (labeled letters are amino acid abbreviations);

图6为Clampfit提取单氨基酸信号的实验图(A为Clampfit单通道统计结果;B为信号的Amp S.D.值;C为根据SD值区分散点图中的单台阶信号(从上到下分别为同一次实验中空白组、asp实验组、asp+glu实验组统计2分钟的所有信号);D为-5pA背景信号来源;E为-12pA背景信号来源);Figure 6 is the experimental diagram of single amino acid signal extracted by Clampfit (A is the single-channel statistical result of Clampfit; B is the Amp S.D. In the second experiment, the blank group, the asp experimental group, and the asp+glu experimental group counted all the signals for 2 minutes); D is the source of the -5pA background signal; E is the source of the -12pA background signal);

图7为羧肽酶水解产物加入纳米孔体系后测得的电流信号图(A为羧肽酶在0.5M KCl、10mM MOPS、pH 7.5条件下,10U羧肽酶水解10mM EF多肽20分钟后加入纳米孔体系检测得到的电流信号图;B为A涉及实验中统计得到的所有电流信号的阻塞率及过孔时间的散点图;C为纳米孔分别检测三种多肽(P1,P2,P3)的水解产物);Figure 7 is the current signal diagram measured after the carboxypeptidase hydrolyzate was added to the nanopore system (A is carboxypeptidase under the conditions of 0.5M KCl, 10mM MOPS, pH 7.5, 10U carboxypeptidase hydrolyzed 10mM EF polypeptide for 20 minutes and then added The current signal diagram detected by the nanopore system; B is a scatter diagram involving the blocking rate and passage time of all current signals obtained in the experiment; C is the nanopore detection of three polypeptides (P1, P2, P3) hydrolyzate);

图8为M2MspA-N91H纳米孔检测非天然氨基酸的实验图(A为三种非天然氨基酸统计结果;B为同一次实验中空白组(Blank)和加入鹅膏氨酸后,1分钟内的电流阻塞信号散点图);Figure 8 is the experimental diagram of the M2MspA-N91H nanopore detection of unnatural amino acids (A is the statistical results of three unnatural amino acids; B is the current of the blank group (Blank) and the addition of amarin in the same experiment, within 1 minute blocking signal scatterplot);

图9为在3种pH值条件下,M2MspA-N91H纳米孔加入铜离子后的电流轨迹图;Fig. 9 is the current trajectory diagram of the M2MspA-N91H nanopore after adding copper ions under three kinds of pH conditions;

图10为本发明的纳米孔检测多肽水解产物示意图;Fig. 10 is a schematic diagram of the nanopore detection polypeptide hydrolyzate of the present invention;

图11为本发明实施例中建立标准信号数据库的流程图;FIG. 11 is a flowchart of establishing a standard signal database in an embodiment of the present invention;

图12为本发明实施例中数据分析的流程图。Fig. 12 is a flow chart of data analysis in the embodiment of the present invention.

具体实施方式Detailed ways

为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述。显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention more clear, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Apparently, the described embodiments are some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置 中还存在另外的相同要素。It should be noted that, in this document, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not preclude the presence of additional identical elements in the process, method, article, or apparatus comprising that element.

如在本说明书中使用的,术语“大约”,典型地表示为所述值的+/-5%,更典型的是所述值的+/-4%,更典型的是所述值的+/-3%,更典型的是所述值的+/-2%,甚至更典型的是所述值的+/-1%,甚至更典型的是所述值的+/-0.5%。As used in this specification, the term "about" typically means +/- 5% of the stated value, more typically +/- 4% of the stated value, more typically +/- 4% of the stated value /-3%, more typically +/-2% of the stated value, even more typically +/-1% of the stated value, even more typically +/-0.5% of the stated value.

在本说明书中,某些实施方式可能以一种处于某个范围的格式公开。应该理解,这种“处于某个范围”的描述仅仅是为了方便和简洁,且不应该被解释为对所公开范围的僵化限制。因此,范围的描述应该被认为是已经具体地公开了所有可能的子范围以及在此范围内的独立数字值。例如,范围

Figure PCTCN2022114670-appb-000002
的描述应该被看作已经具体地公开了子范围如从1到3,从1到4,从1到5,从2到4,从2到6,从3到6等,以及此范围内的单独数字,例如1,2,3,4,5和6。无论该范围的广度如何,均适用以上规则。 In this specification, certain embodiments may be disclosed in a range of formats. It should be understood that this description "within a certain range" is merely for convenience and brevity and should not be construed as an inflexible limitation on the disclosed scope. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, range
Figure PCTCN2022114670-appb-000002
The description should be read as having specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6, etc., and within this range Individual numbers such as 1, 2, 3, 4, 5 and 6. The above rules apply regardless of the breadth of the scope.

纳米孔系统nanopore system

“纳米孔系统”包括具有纳米级尺寸的孔(简称为“纳米孔”)、绝缘膜、第一介质和第二介质。在本发明的一个实施例中,所述具有纳米级尺寸的孔为M2MspA-N91H纳米孔。所述具有纳米级尺寸的孔允许所述待测物从所述绝缘膜的一侧易位到另一侧。另外,本领域技术人员还可以根据实际情况,对上述M2MspA-N91H纳米孔进行修饰(例如,任何的突变、截短、融合、化学修饰等),来获得相应变体,且修饰手段都是本领域公知的。A "nanopore system" includes pores having nanoscale dimensions (referred to simply as "nanopores"), an insulating membrane, a first medium, and a second medium. In one embodiment of the present invention, the nanopores are M2MspA-N91H nanopores. The pores having nanoscale dimensions allow translocation of the analyte from one side of the insulating film to the other. In addition, those skilled in the art can also modify the above-mentioned M2MspA-N91H nanopore according to the actual situation (for example, any mutation, truncation, fusion, chemical modification, etc.) to obtain the corresponding variant, and the modification means are the present well known in the field.

在本发明的一个实施例中,所述具有纳米级尺寸的孔被嵌入所述绝缘膜中,所述绝缘膜(也可以理解为,所述纳米级尺寸的孔和所述绝缘膜的复合体)将所述第一介质与所述第二介质分隔开,所述具有纳米级尺寸的孔的孔道提供连通所述第一介质与所述第二介质的通道;向所述第一介质和所述第二介质之间施加驱动力后,位于所述第一介质的待测物与所述M2MspA-N91H纳米孔相互作用以形成电流(即电信号)。在本发明中,“第一介质”是指所述待测物被加入所述纳米孔系统时位于的介质;“第二介质”则是指被所述绝缘膜分隔开的两部分介质中,“第一介质”的另一侧。在本发明中,驱动力是指通过电势、电渗流、浓度梯度等方式驱动待测物与所述纳米孔相互作用的力。In one embodiment of the present invention, the nanoscale-sized pores are embedded in the insulating film, and the insulating film (also can be understood as a composite of the nanoscale-sized pores and the insulating film ) separates the first medium from the second medium, and the channels having nanometer-sized pores provide passages connecting the first medium and the second medium; After a driving force is applied between the second medium, the analyte located in the first medium interacts with the M2MspA-N91H nanopore to form a current (ie, an electrical signal). In the present invention, "first medium" refers to the medium in which the analyte is added to the nanopore system; "second medium" refers to the two parts of the medium separated by the insulating film. , the other side of the "first medium". In the present invention, the driving force refers to the force that drives the analyte to interact with the nanopore by means of potential, electroosmotic flow, concentration gradient and the like.

所述第一介质和所述第二介质可以相同或不同,并且所述第一介质和所述第二介质可以包括电导液。所述电导液为碱金属卤化物水溶液,具体为氯化钠 (NaCl)、氯化锂(LiCl)、氯化铯(CsCl)、氯化钾(KCl)、溴化钠(NaBr)。在本发明一个实施例中,所述第一介质和所述第二介质含有的电导液的浓度是不同的,换句话说,所述第一介质和所述第二介质中电导液的浓度存在差值,进而使得所述绝缘膜两侧的渗透压存在差值。所述第一介质和/或所述第二介质还可以包括缓冲液,例如HEPES、MOPS。所述第一介质和/或所述第二介质的浓度范围可以是0.3M-3M。所述第一介质和/或所述第二介质的pH范围为6-9(优选为6.5-7.5)。The first medium and the second medium may be the same or different, and the first medium and the second medium may comprise electrically conductive fluids. The conductive liquid is an aqueous alkali metal halide solution, specifically sodium chloride (NaCl), lithium chloride (LiCl), cesium chloride (CsCl), potassium chloride (KCl), and sodium bromide (NaBr). In one embodiment of the present invention, the concentrations of the conductive liquid contained in the first medium and the second medium are different, in other words, the concentrations of the conductive liquid in the first medium and the second medium exist The difference, and then there is a difference in the osmotic pressure on both sides of the insulating film. The first medium and/or the second medium may further comprise a buffer, such as HEPES, MOPS. The concentration range of the first medium and/or the second medium may be 0.3M-3M. The pH range of the first medium and/or the second medium is 6-9 (preferably 6.5-7.5).

绝缘膜是指具有搭载纳米孔并阻塞非纳米孔通过的离子电流的能力的膜。所述绝缘膜可以包括磷脂膜和/或高分子膜。示例性的磷脂膜包括DPHPC、DOPC、E.coli lipid,示例性的高分子膜包括三嵌段共聚物高分子膜。An insulating membrane refers to a membrane that has the ability to host nanopores and block ionic currents passing through non-nanopores. The insulating film may include a phospholipid film and/or a polymer film. Exemplary phospholipid membranes include DPHPC, DOPC, E.coli lipid, and exemplary polymer membranes include triblock copolymer polymer membranes.

在本发明的一个具体实施例中,所述纳米孔系统包括两个电解液室,其被绝缘膜分隔开而形成反式(-trans)隔室和顺式(-cis)隔室,所述M2MspA-N91H通道的孔嵌入绝缘膜中,绝缘膜上只有M2MspA-N91H通道来连通上述两个电解液室。当向上述两个电解液室施加电势时,电解液室中溶液中的电解质离子通过电泳移动并穿过所述M2MspA-N91H通道。In a specific embodiment of the present invention, the nanopore system includes two electrolyte chambers, which are separated by an insulating membrane to form a trans (-trans) compartment and a cis (-cis) compartment, the The pores of the M2MspA-N91H channel are embedded in the insulating film, and only the M2MspA-N91H channel on the insulating film communicates with the above two electrolyte chambers. When a potential is applied to the above two electrolyte compartments, electrolyte ions in solution in the electrolyte compartments move by electrophoresis and pass through the M2MspA-N91H channel.

待测物Analyte

所述待测物是荷电物质。如果待测物带有净电荷则它是荷电的。所述待测物可以荷负电也可以荷正电。如果待测物带有净负电荷则它是荷负电的。如果待测物带有净正电荷则它是荷正电的。合适的待测物优选为氨基酸。The analyte is a charged substance. An analyte is charged if it has a net charge. The analyte can be negatively charged or positively charged. An analyte is negatively charged if it has a net negative charge. An analyte is positively charged if it has a net positive charge. Suitable analytes are preferably amino acids.

在本发明的一个实施例中,所述氨基酸可以为天然氨基酸。天然氨基酸是天然存在的氨基酸及其翻译后修饰的变体,例如20种常见的氨基酸(甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、甲硫氨酸、脯氨酸、色氨酸、丝氨酸、酪氨酸、半胱氨酸、苯丙氨酸、天冬酰胺、谷氨酰胺、苏氨酸、天冬氨酸、谷氨酸、赖氨酸、精氨酸和组氨酸)等。In one embodiment of the present invention, the amino acid may be a natural amino acid. Natural amino acids are naturally occurring amino acids and their post-translationally modified variants, such as the 20 common amino acids (glycine, alanine, valine, leucine, isoleucine, methionine, proline , tryptophan, serine, tyrosine, cysteine, phenylalanine, asparagine, glutamine, threonine, aspartic acid, glutamic acid, lysine, arginine and histidine), etc.

在本发明的另一个实施例中,所述待测物可以为非天然氨基酸。“非天然氨基酸”是指不是蛋白氨基酸或其翻译后修饰的变体的氨基酸(即不是20种常见氨基酸或吡咯赖氨酸或硒代半胱氨酸或其翻译后修饰的变体的氨基酸)。在本发明的一个具体实施例中,所述非天然氨基酸为瓜氨酸、L-正缬氨酸、鹅膏氨酸。In another embodiment of the present invention, the analyte may be an unnatural amino acid. "Unnatural amino acid" means an amino acid that is not a proteinogenic amino acid or a post-translationally modified variant thereof (i.e. an amino acid other than the 20 common amino acids or pyrrolysine or selenocysteine or a post-translationally modified variant thereof) . In a specific embodiment of the present invention, the unnatural amino acid is citrulline, L-norvaline, and amanitine.

所述纳米孔和待测物之间的相互作用The interaction between the nanopore and the analyte

所述待测物可与所述纳米孔在所述绝缘膜两侧的任一侧接触。所述待测物可以与所述绝缘膜两侧中的任一侧相接触,使得所述待测物穿过所述纳米孔的通道以到达所述绝缘膜的另一侧。在这种情况下,所述待测物在其经由所述孔的通道穿过所述绝缘膜时,与所述纳米孔相互作用。或者,所述待测物可与所述绝缘膜的侧面接触,所述绝缘膜的侧面可使所述待测物与所述纳米孔相互作用,使其与所述纳米孔分离并停留在所述绝缘膜的同一侧。所述待测物可以以任何方式并在任何位点与所述纳米孔相互作用。所述待测物还可以撞击到所述纳米孔,与所述纳米孔相互作用,使其与所述纳米孔分离并停留在所述绝缘膜的同一侧。The analyte may be in contact with either side of the nanopore on both sides of the insulating film. The analyte may be in contact with either side of the insulating film, so that the analyte passes through the channel of the nanopore to reach the other side of the insulating film. In this case, the analyte interacts with the nanopore as it passes through the insulating membrane via the passage of the pore. Alternatively, the analyte can be in contact with the side of the insulating film, and the side of the insulating film can make the analyte interact with the nanopore so that it is separated from the nanopore and stays in the nanopore. on the same side as the insulating film. The analyte can interact with the nanopore in any manner and at any location. The analyte can also hit the nanopore, interact with the nanopore, make it separate from the nanopore and stay on the same side of the insulating membrane.

在所述待测物与所述纳米孔相互作用的过程中,所述待测物会以该待测物特异性的方式影响流过所述纳米孔的电流,即流经所述纳米孔的电流对特定待测物是特征性的。可进行对照实验以测定特定待测物对流过所述纳米孔的电流的效应,然后以鉴定样本中的特定待测物或测定样本中是否存在特定待测物。更具体地,可以根据通过检测待测物所获得的电流模式与在相同的条件下使用已知的待测物获得的已知的电流模式进行比较,以鉴定待测物的存在与否或浓度等。During the interaction of the analyte with the nanopore, the analyte affects the current flowing through the nanopore in a manner specific to the analyte, that is, the current flowing through the nanopore The current is characteristic for a particular DUT. Control experiments can be performed to determine the effect of a particular analyte on the current flowing through the nanopore, and then to identify the particular analyte in the sample or to determine the presence of the particular analyte in the sample. More specifically, the presence or absence or concentration of the analyte can be identified based on the comparison of the current pattern obtained by detecting the analyte with a known current pattern obtained using a known analyte under the same conditions wait.

本发明的纳米孔系统还可以包括一个或多个测量流过所述纳米孔的电流的测量装置,例如膜片钳放大器或数据采集设备。The nanopore system of the present invention may also include one or more measurement devices that measure the current flowing through the nanopore, such as patch clamp amplifiers or data acquisition devices.

一种基于M2MspA-N91H纳米孔的多肽成分分析方法A method for analyzing peptide components based on M2MspA-N91H nanopore

本发明还提供了一种基于M2MspA-N91H纳米孔的多肽成分分析方法,其特征在于,包括以下步骤:The present invention also provides a method for analyzing polypeptide components based on the M2MspA-N91H nanopore, which is characterized in that it comprises the following steps:

S1将外肽酶与待检测多肽混合均匀并反应,得到水解产物;S1 mixes the exopeptidase and the polypeptide to be detected evenly and reacts to obtain a hydrolyzate;

S2将所述水解产物加入纳米孔系统中,所述纳米孔系统包括:M2MspA-N91HS2 adding the hydrolyzate into the nanopore system, the nanopore system comprising: M2MspA-N91H

纳米孔、绝缘膜、第一介质、第二介质,其中所述M2MspA-N91H纳米孔嵌入在所述绝缘膜中,所述M2MspA-N91H纳米孔提供连通所述第一介质和所述第二介质的通道,所述水解产物被加入至所述第一介质;nanopore, an insulating film, a first medium, and a second medium, wherein the M2MspA-N91H nanopore is embedded in the insulating film, and the M2MspA-N91H nanopore provides communication between the first medium and the second medium channel, the hydrolyzate is added to the first medium;

S3向所述第一介质和所述第二介质之间施加驱动力,所述水解产物中游离氨基酸穿过所述M2MspA-N91H纳米孔并产生电信号;S3 applies a driving force between the first medium and the second medium, and the free amino acids in the hydrolyzate pass through the M2MspA-N91H nanopore and generate an electrical signal;

S4对所述电信号进行数据分析,获得所述游离氨基酸的氨基酸种类信息。S4 performs data analysis on the electrical signal to obtain amino acid type information of the free amino acid.

“多肽”是指含有通过肽键连接的两个或多个氨基酸的肽或蛋白。多肽可以含有天然的、修饰的或合成的氨基酸。多肽也可以被天然修饰(如通过翻译后加 工)或化学修饰(如酰胺化酰化、交联等)。在本发明中,所述多肽的序列信息可以是已知的或未知的。"Polypeptide" refers to a peptide or protein comprising two or more amino acids linked by peptide bonds. Polypeptides may contain natural, modified or synthetic amino acids. Polypeptides can also be modified naturally (e.g., by post-translational processing) or chemically (e.g., amidation, acylation, cross-linking, etc.). In the present invention, the sequence information of the polypeptide may be known or unknown.

“外肽酶”是指通过切割肽键去除多肽的末端氨基酸的水解酶。末端氨基酸是在多肽的N-末端或C-末端的大约10个氨基酸内的氨基酸。因此,“外肽酶”分为从多肽的N-末端逐个水解氨基酸的氨肽酶和从多肽的C-末端逐个水解氨基酸的羧肽酶。示例性的氨肽酶包括半胱氨酰氨肽酶、脯氨酰氨肽酶、精氨酰氨肽酶。示例性的羧肽酶包括羧肽酶A、B、C、Y。在本发明中,所述外肽酶可以为羧肽酶和/或氨肽酶。"Exopeptidase" refers to a hydrolase that removes terminal amino acids of polypeptides by cleaving peptide bonds. Terminal amino acids are amino acids within about 10 amino acids of the N- or C-terminus of a polypeptide. Accordingly, "exopeptidases" are classified into aminopeptidases that hydrolyze amino acids one by one from the N-terminus of polypeptides and carboxypeptidases that hydrolyze amino acids one by one from the C-terminus of polypeptides. Exemplary aminopeptidases include cysteinyl aminopeptidase, prolyl aminopeptidase, arginyl aminopeptidase. Exemplary carboxypeptidases include carboxypeptidases A, B, C, Y. In the present invention, the exopeptidase may be carboxypeptidase and/or aminopeptidase.

在适宜的条件下,所述外肽酶可以将待检测多肽完全水解或部分水解。例如,如图10所示,羧肽酶A1可以切割C-末端的除了赖氨酸(K)、精氨酸(R)、脯氨酸(P)的氨基酸(即水解的停止点为R、K、P)。换句话说,如果待检测多肽中不含R、K、P这三种氨基酸,则所述待检测多肽将被羧肽酶A完全水解,得到的水解产物中的游离氨基酸的氨基酸种类信息则为所述待检测多肽的氨基酸组成。如果待检测多肽中含R、K、P这三种氨基酸中的一种或多种,则所述待检测多肽将被羧肽酶A部分水解(在停止点停止水解),得到的水解产物中的游离氨基酸的氨基酸种类信息则为所述待检测多肽在停止点前的氨基酸组成。Under appropriate conditions, the exopeptidase can completely or partially hydrolyze the polypeptide to be detected. For example, as shown in Figure 10, carboxypeptidase A1 can cut C-terminal amino acids except lysine (K), arginine (R), proline (P) (that is, the stop point of hydrolysis is R, K, P). In other words, if the polypeptide to be detected does not contain the three amino acids R, K, and P, the polypeptide to be detected will be completely hydrolyzed by carboxypeptidase A, and the amino acid type information of the free amino acids in the obtained hydrolyzate is The amino acid composition of the polypeptide to be detected. If the polypeptide to be detected contains one or more of the three amino acids R, K, and P, the polypeptide to be detected will be partially hydrolyzed by carboxypeptidase A (stop hydrolysis at the stop point), and the obtained hydrolyzate The amino acid type information of the free amino acid is the amino acid composition of the polypeptide to be detected before the stop point.

当所述待检测多肽的序列已知时,可以根据得到的水解产物中游离氨基酸的氨基酸种类信息与所述待检测多肽的预期序列信息进行对比,以判断所述待检测多肽的实际序列与所述待检测多肽的预期序列是否存在差异(例如,某个(些)应该出现在水解产物的氨基酸没有出现,某个(些)不应该出现在水解产物的氨基酸却出现了),适用于肿瘤新生抗原的表达和多肽药物的合成质量的判断等应用场景。When the sequence of the polypeptide to be detected is known, the amino acid type information of the free amino acids in the obtained hydrolyzate can be compared with the expected sequence information of the polypeptide to be detected to determine whether the actual sequence of the polypeptide to be detected is consistent with the expected sequence of the polypeptide to be detected. Describe whether there are differences in the expected sequence of the polypeptide to be detected (for example, some (some) amino acids that should appear in the hydrolyzate do not appear, and some (some) amino acids that should not appear in the hydrolyzate appear), which is suitable for tumor neogenesis Application scenarios such as the expression of antigens and the judgment of the synthetic quality of peptide drugs.

当所述待检测多肽的序列未知时,可以根据得到的水解产物中游离氨基酸的氨基酸种类信息并结合所用外肽酶的信息(例如,羧肽酶或氨肽酶,是否具有“水解停止点”),通过机器学习等方式在数据库中进行比对,进而预测所述待检测多肽的序列或在一定程度上缩小所述待检测多肽的可能范围(例如,在N端或C端开始,氨基酸的组成(包括游离氨基酸的确定种类以及“水解停止点”对应的可能种类)。When the sequence of the polypeptide to be detected is unknown, it can be based on the amino acid type information of the free amino acids in the obtained hydrolyzate and in combination with the information of the exopeptidase used (for example, carboxypeptidase or aminopeptidase, whether there is a "hydrolysis stop point" ), compare in the database by means of machine learning, etc., and then predict the sequence of the polypeptide to be detected or narrow the possible range of the polypeptide to be detected to a certain extent (for example, starting at the N-terminus or C-terminus, amino acid Composition (including defined species of free amino acids and possible species corresponding to "hydrolysis stop points").

实施例一Embodiment one

材料与方法Materials and Methods

氨基酸检测:Amino Acid Detection:

使用经典的垂直脂质双层装置(Warner Instruments)进行单通道实验,包括通过150μm直径孔径连接的两个室(顺式cis和反式trans)。两个Ag-AgCl电极用于施加跨膜电压并测量跨膜离子电流。顺式室处于电压接地。氨基酸检测的缓冲液条件为:Single-channel experiments were performed using a classical vertical lipid bilayer setup (Warner Instruments), consisting of two chambers (cis in cis and trans in trans) connected by a 150 μm diameter aperture. Two Ag-AgCl electrodes are used to apply the transmembrane voltage and measure the transmembrane ionic current. The cis chamber is at voltage ground. The buffer conditions for amino acid detection are:

1)缓冲液:0.3M-3M的KCl和NaCl等盐溶液(优选为1M的KCl水溶液),10mM MOPS;1) Buffer: 0.3M-3M salt solution such as KCl and NaCl (preferably 1M aqueous KCl solution), 10mM MOPS;

2)铜离子浓度范围:1nM到200μM;2) Copper ion concentration range: 1nM to 200μM;

3)pH值范围:6-9(优选为6.5-7.5)。3) pH range: 6-9 (preferably 6.5-7.5).

随着缓冲液盐浓度或pH进一步降低,氨基酸检测效率和区分度可能会下降。所有实验温度可设置为15℃-60℃(优选为22.0℃±0.5℃)。通过在连接两个腔室的孔上涂上溶解在癸烷(40mg/ml)中的1,2-diphytanoyl-sn-glycero-3-phosphocholine(DPhPC)(Avanti Polar Lipids)薄膜,形成平面脂质双层膜。之后将蛋白溶液加入至顺式腔,在蛋白嵌入磷脂膜形成单通道后,将氯化铜加入到反式腔,并将氨基酸加入到顺式腔,轻轻混匀溶液后记录电流信号。As the buffer salt concentration or pH is further reduced, amino acid detection efficiency and discrimination may decrease. All experimental temperatures can be set at 15°C-60°C (preferably 22.0°C±0.5°C). Formation of planar lipids by coating the well connecting the two chambers with a thin film of 1,2-diphytanoyl-sn-glycero-3-phosphocholine (DPhPC) (Avanti Polar Lipids) dissolved in decane (40 mg/ml) Double membrane. Then add the protein solution to the cis cavity, after the protein is embedded in the phospholipid membrane to form a single channel, add copper chloride to the trans cavity, and add amino acids to the cis cavity, mix the solution gently and record the current signal.

数据处理:data processing:

使用Axopatch 200B放大器(Molecular Devices)放大单通道电流记录,并使用内置四孔低通贝塞尔滤波器以2kHz进行过滤。数据由Digidata 1550B转换器(Molecular Devices)以100kHz的采样率进行数字化。提取采样得到的电流信号文件,获得氨基酸信号数据。可通过信号阻塞率、阻塞时间、SD等数据对单氨基酸信号进行识别和定义。Single-channel current recordings were amplified using an Axopatch 200B amplifier (Molecular Devices) and filtered at 2 kHz using a built-in four-hole low-pass Bessel filter. Data was digitized by a Digidata 1550B converter (Molecular Devices) at a sampling rate of 100kHz. Extract the current signal file obtained by sampling to obtain amino acid signal data. Single amino acid signals can be identified and defined by data such as signal blocking rate, blocking time, and SD.

背景噪音的处理:Handling of background noise:

针对单个氨基酸信号的电流值分布,计算其SD值,得到信号各个采样点的电流值与总体平均值的偏离程度,从而将单氨基酸信号与背景信号,多氨基酸的叠加信号区分。For the current value distribution of a single amino acid signal, the SD value is calculated to obtain the degree of deviation between the current value of each sampling point of the signal and the overall average value, so as to distinguish the single amino acid signal from the background signal and the superimposed signal of multiple amino acids.

多肽组成成分分析方法:Peptide composition analysis method:

实验的缓冲液条件如上所述。实验温度可设置为15℃-60℃(优选为25℃)。 在纳米孔体系外,将外肽酶(例如羧肽酶)与多肽按照一定浓度比例进行混合,如10U羧肽酶、0.5M KCl、10mM MOPS、pH 7.5、10mM多肽(图7A),25℃反应20分钟后,加入纳米孔样品槽进行检测。根据出现的氨基酸信号种类和数量判断多肽成分。The buffer conditions for the experiments were as described above. The experimental temperature can be set at 15°C-60°C (preferably 25°C). Outside the nanopore system, mix exopeptidase (such as carboxypeptidase) and polypeptide according to a certain concentration ratio, such as 10U carboxypeptidase, 0.5M KCl, 10mM MOPS, pH 7.5, 10mM polypeptide (Figure 7A), 25℃ After reacting for 20 minutes, add the nanopore sample tank for detection. Determine the peptide composition based on the type and quantity of amino acid signals that appear.

实施例二:利用M2MspA-N91H纳米孔检测20种天然氨基酸Example 2: Detection of 20 Natural Amino Acids Using M2MspA-N91H Nanopore

使用铜离子修饰纳米孔的结构和特性:Use copper ions to modify the structure and properties of nanopores:

如图1所示,M2MspA-N91H纳米孔为8聚体结构,孔道最窄处的8个氨基酸残基朝向M2MspA-N91H纳米孔的中心轴线。将Cu 2+添加到反式室中,观察电流轨迹,Cu 2+可显著降低电流波动(图1)。电流水平的降低与Cu 2+的添加浓度有关(图2),铜离子浓度范围可以是1nM至200μM。 As shown in Figure 1, the M2MspA-N91H nanopore has an 8-mer structure, and the 8 amino acid residues at the narrowest part of the channel are oriented toward the central axis of the M2MspA-N91H nanopore. Adding Cu 2+ into the trans chamber and observing the current trajectory, Cu 2+ can significantly reduce the current fluctuation (Fig. 1). The reduction of the current level is related to the concentration of Cu 2+ added (Fig. 2), and the copper ion concentration can range from 1 nM to 200 μM.

20种天然氨基酸的检测:Detection of 20 natural amino acids:

本实施例利用M2MspA-N91H纳米孔检测了20种天然氨基酸。其中,MspA单体(MspA由八个相同的单体组装形成通道)的氨基酸序列为:In this example, 20 natural amino acids were detected by using the M2MspA-N91H nanopore. Among them, the amino acid sequence of the MspA monomer (MspA is assembled by eight identical monomers to form a channel) is:

Figure PCTCN2022114670-appb-000003
Figure PCTCN2022114670-appb-000003

M2MspA-N91H的氨基酸序列为:The amino acid sequence of M2MspA-N91H is:

Figure PCTCN2022114670-appb-000004
Figure PCTCN2022114670-appb-000004

Figure PCTCN2022114670-appb-000005
Figure PCTCN2022114670-appb-000005

当目标待测物通过Cu 2+螯合的M2MspA-N91H时,产生特定的电流信号,如图3所示。在3种pH值条件下,加入铜离子后的电流轨迹图,如图9所示。结果表明,在pH 7.9时铜离子结合不稳定,而在pH 7.5和6.5条件下铜离子稳定结合。鉴于较高的溶液pH值有利于更多氨基酸带负电,从而提高检测效率。故本实施例选择在溶液中添加MOPS缓冲盐并将溶液pH值稳定至7.5左右,以期获得更优的检测效果。 When the target analyte passes through Cu 2+ chelated M2MspA-N91H, a specific current signal is generated, as shown in Figure 3. Under the conditions of three pH values, the current trajectory diagram after adding copper ions is shown in Fig. 9 . The results showed that copper ion binding was unstable at pH 7.9, while copper ion binding was stable at pH 7.5 and 6.5. In view of the fact that a higher solution pH value is conducive to more amino acids being negatively charged, thereby improving the detection efficiency. Therefore, in this embodiment, MOPS buffer salt is added to the solution and the pH value of the solution is stabilized to about 7.5, in order to obtain a better detection effect.

从氨基酸体积与电流信号阻塞率的散点图(图4)可以看出,氨基酸引起的阻塞率与其疏水体积存在正相关的联系。From the scatter plot of amino acid volume and current signal blocking rate (Fig. 4), it can be seen that there is a positive correlation between the blocking rate caused by amino acid and its hydrophobic volume.

而从氨基酸电流信号阻塞率与阻塞时间图(图5)可以得到,由20种天然氨基酸引起的阻塞率分别为:甘氨酸(G):0.1207±0.0006;丙氨酸(A):0.1471±0.0025;缬氨酸(V):0.1904±0.0012;亮氨酸(L):0.1995±0.0003;异亮氨酸(I):0.2072±0.0003;苯丙氨酸(F):0.2201±0.0007;色氨酸(W):0.2274±0.0002;酪氨酸(Y):0.2127±0.0040;天冬氨酸(D):0.2140±0.0028; 组氨酸(H):0.2465±0.0009;天冬酰胺(N):0.1651±0.0002;谷氨酸(E):0.2452±0.0016;赖氨酸(K):0.1687±0.0031;谷氨酰胺(Q):0.1882±0.0009;甲硫氨酸(M):0.1977±0.0005;精氨酸(R):0.1714±0.0022;丝氨酸(S):0.1313±0.0002;苏氨酸(T):0.1610±0.0004;半胱氨酸(C):0.1862±0.0023;脯氨酸(P):0.2183±0.0010。根据氨基酸的阻塞率以及其标准差进行判断,互不重叠的氨基酸有G,S,A,T,N,R,V,Q,M,L,I,D,F,W,E,共15种氨基酸的信号互不重叠,可互相区分。From the amino acid current signal blocking rate and blocking time diagram (Figure 5), it can be obtained that the blocking rates caused by 20 natural amino acids are: glycine (G): 0.1207±0.0006; alanine (A): 0.1471±0.0025; Valine (V): 0.1904±0.0012; Leucine (L): 0.1995±0.0003; Isoleucine (I): 0.2072±0.0003; Phenylalanine (F): 0.2201±0.0007; Tryptophan ( W): 0.2274±0.0002; Tyrosine (Y): 0.2127±0.0040; Aspartic acid (D): 0.2140±0.0028; Histidine (H): 0.2465±0.0009; Asparagine (N): 0.1651± 0.0002; Glutamic acid (E): 0.2452±0.0016; Lysine (K): 0.1687±0.0031; Glutamine (Q): 0.1882±0.0009; Methionine (M): 0.1977±0.0005; Arginine (R): 0.1714±0.0022; Serine (S): 0.1313±0.0002; Threonine (T): 0.1610±0.0004; Cysteine (C): 0.1862±0.0023; Proline (P): 0.2183±0.0010 . Judging by the blocking rate of amino acids and their standard deviations, non-overlapping amino acids include G, S, A, T, N, R, V, Q, M, L, I, D, F, W, E, a total of 15 The signals of these amino acids do not overlap and can be distinguished from each other.

背景噪音的处理:Handling of background noise:

上述直接检测20种氨基酸的方法,可能会产生背景噪音干扰后续的分析。基于此,本发明实验人员针对单个氨基酸信号的电流值分布,用下述的公式计算其SD值,得到信号各个采集点的电流值与总体平均值的偏离程度。从而将单氨基酸信号与背景信号、多氨基酸的叠加信号区分。The above-mentioned method of directly detecting 20 kinds of amino acids may generate background noise and interfere with subsequent analysis. Based on this, the experimenters of the present invention calculated the SD value of the current value distribution of a single amino acid signal using the following formula, and obtained the degree of deviation between the current value of each collection point of the signal and the overall average value. In this way, the single amino acid signal can be distinguished from the background signal and the superimposed signal of multiple amino acids.

Figure PCTCN2022114670-appb-000006
Figure PCTCN2022114670-appb-000006

式中,x i为信号每个采集点的电流值,x为信号电流点的平均值,n为信号采集点的数量。 In the formula, x i is the current value of each collection point of the signal, x is the average value of the signal current point, and n is the number of signal collection points.

如图6C所示,上、中、下图分别为同一次实验中空白组(Blank)、加入天冬氨酸(Asp)和加入天冬氨酸与谷氨酸混合物(Asp+Glu)后,2分钟内的电流阻塞信号散点图。可以设定SD阈值来过滤信号,例如,可以将SD阈值设定为2,其中,SD值小于2的信号(蓝色)被认为是有效信号;SD值大于2的信号,被认为是背景信号或多氨基酸同时通过产生的叠加信号,因此可将其过滤。过滤后,只看蓝色的信号,可以清楚地看到加入天冬氨酸与谷氨酸后,新产生的信号峰。利用本背景噪音处理方法,可更准确地对氨基酸信号进行识别和统计。As shown in Figure 6C, the upper, middle and lower figures are the blank group (Blank), the addition of aspartic acid (Asp) and the addition of a mixture of aspartic acid and glutamic acid (Asp+Glu) in the same experiment, respectively. Scatterplot of current blocking signal over 2 minutes. The SD threshold can be set to filter the signal, for example, the SD threshold can be set to 2, where the signal (blue) with an SD value less than 2 is considered to be a valid signal; the signal with an SD value greater than 2 is considered to be a background signal Simultaneous passage of one or more amino acids produces superimposed signals, which can therefore be filtered. After filtering, only look at the blue signal, and you can clearly see the newly generated signal peak after adding aspartic acid and glutamic acid. Using the background noise processing method, amino acid signals can be identified and counted more accurately.

为了更好地展示处理过程,本发明实验人员使用了Clampfit软件来提取单氨基酸信号。首先,利用拟合SD值,将背景信号及双台阶信号过滤掉,并保留单氨基酸信号。如图6A所示,在Clampfit单通道统计结果中(只设定level 0和level 1),“Amp S.D.”值表示信号拟合效果。具体信号对应的SD值如图6B所示;在所示的6个信号中可以看出,双台阶信号的SD值普遍大于4。单台阶信号是指电信号的电流密度分布曲线呈现单峰分布,且电流在0与峰值的60%之间没有分布。而双台阶信号是指电信号的电流密度分布曲线呈现双峰分布。In order to better demonstrate the processing process, the experimenters of the present invention used Clampfit software to extract single amino acid signals. First, using the fitted SD value, the background signal and double-step signal were filtered out, and the single amino acid signal was retained. As shown in Figure 6A, in the Clampfit single-channel statistical results (only level 0 and level 1 are set), the "Amp S.D." value indicates the signal fitting effect. The SD value corresponding to the specific signal is shown in Figure 6B; it can be seen from the six signals shown that the SD value of the double-step signal is generally greater than 4. A single-step signal means that the current density distribution curve of the electrical signal presents a single-peak distribution, and the current has no distribution between 0 and 60% of the peak value. The double-step signal means that the current density distribution curve of the electrical signal presents a bimodal distribution.

SD阈值设定在0-2范围内,可过滤大部分多台阶信号。然而,在-5pA以及-12pA附近仍然存在背景信号干扰。-5pA附近的信号来源如图6D所示,因为其对应的level 0偏离基线过多,可在后期被过滤掉。-12pA附近的信号(图6E)是正常的背景信号,阻塞时间短。背景信号只是会干扰到附近的gly、ser、ala的信号,而这三种氨基酸的信号是可以区分的。The SD threshold is set in the range of 0-2, which can filter most multi-step signals. However, background signal interference still exists around -5pA and -12pA. The signal source near -5pA is shown in Figure 6D, because its corresponding level 0 deviates too much from the baseline and can be filtered out in the later stage. The signal around -12pA (Fig. 6E) is a normal background signal with a short blocking time. The background signal only interferes with the nearby signals of gly, ser, and ala, and the signals of these three amino acids can be distinguished.

基于此,可以使用已知检测数据,建立标准信号数据库,具体流程如图11所示:首先,通过M2MspA-N91H纳米孔检测标准品游离氨基酸,获得电信号数据。第二,设置过滤条件(例如将电信号的SD阈值设定为2)。第三,根据过滤条件,将部分电信号排除,获得保留的电信号。第四,判断保留的电信号是否为单台阶电信号:如果保留的电信号为单台阶电信号,则将该单台阶电信号纳入标准信号数据库,且将该单台阶电信号视为该氨基酸的标准信号;如果保留的电信号并非单台阶电信号(例如双台阶电信号),则重新设置过滤条件(例如将SD阈值上调),并重新获得保留的电信号,进而再次判断重新获得的保留的电信号是否为单台阶电信号。最后,使用单台阶电信号建立标准信号数据库。Based on this, known detection data can be used to establish a standard signal database, and the specific process is shown in Figure 11: First, standard free amino acids are detected through M2MspA-N91H nanopores to obtain electrical signal data. Second, filter conditions are set (for example, the SD threshold of the electrical signal is set to 2). Thirdly, according to the filter condition, some electrical signals are excluded to obtain retained electrical signals. Fourth, judge whether the reserved electrical signal is a single-step electrical signal: if the reserved electrical signal is a single-step electrical signal, then include the single-step electrical signal into the standard signal database, and regard the single-step electrical signal as the amino acid Standard signal; if the retained electrical signal is not a single-step electrical signal (such as a double-step electrical signal), reset the filter condition (for example, increase the SD threshold), and regain the retained electrical signal, and then judge again the retained Whether the electrical signal is a single-step electrical signal. Finally, a standard signal database is established using single-step electrical signals.

实施例三:利用M2MspA-N91H纳米孔检测3种非天然氨基酸Example 3: Using M2MspA-N91H nanopore to detect three kinds of unnatural amino acids

基于上述方法,本实施例进一步检测了3种代表性非天然氨基酸,分别为瓜氨酸、L-正缬氨酸、鹅膏氨酸。如图8A所示,这3种非天然氨基酸引起的阻塞率分别为:瓜氨酸:0.1912±0.0011;L-正缬氨酸:0.1806±0.0017;鹅膏氨酸:0.2333±0.0029。根据氨基酸阻塞率以及其标准差进行判断,3种非天然氨基酸的信号互不重叠,可相互区分。Based on the above method, three representative unnatural amino acids were further detected in this example, namely citrulline, L-norvaline and amarine. As shown in Figure 8A, the blocking rates caused by these three unnatural amino acids were: citrulline: 0.1912±0.0011; L-norvaline: 0.1806±0.0017; amarine: 0.2333±0.0029. According to the amino acid blocking rate and its standard deviation, the signals of the three unnatural amino acids do not overlap and can be distinguished from each other.

如图8B所示,上图和下图分别为同一次实验中空白组(Blank)和加入鹅膏氨酸后,1分钟内的电流阻塞信号散点图。SD值小于2的信号被认为是有效信号,过滤之后,可以清楚地看到加入鹅膏氨酸之后产生新的信号峰。As shown in FIG. 8B , the upper and lower graphs are the scatter diagrams of the current blocking signal within 1 minute of the blank group (Blank) and the addition of amarin in the same experiment, respectively. Signals with SD values less than 2 were considered valid signals, and after filtering, it was clearly seen that a new signal peak was generated after the addition of amarine.

多肽可以作为底物参与酶促反应,使用上述检测氨基酸的方法可以对酶促反应实现连续监测:通过检测反应体系中游离氨基酸的种类,可以判断酶促反应是否发生以及酶促级联反应的进展。Polypeptides can be used as substrates to participate in enzymatic reactions, and the above methods for detecting amino acids can be used to continuously monitor enzymatic reactions: By detecting the types of free amino acids in the reaction system, it is possible to determine whether the enzymatic reaction occurs and the progress of the enzymatic cascade reaction .

实施例四:基于M2MspA-N91H纳米孔的多肽成分分析方法:Example 4: Method for analyzing polypeptide components based on M2MspA-N91H nanopore:

在M2MspA-N91H纳米孔蛋白嵌入后,立即将电压调整至+50mV,记录10分钟。接着,在样品槽反式面加入1μL 20mM氯化铜溶液,记录10分钟。同时, 取8μL浓度为2mM的多肽溶液置于PCR管中,37℃孵育5分钟。加入2μL羧肽酶A1溶液,充分混匀,在37℃反应5分钟。反应完成后,将反应产物全部加入样品槽顺式面,充分混匀,在+50mV电压下记录2小时。检测原理如图10所示,羧肽酶A1(外肽酶的一种)可以从多肽C-末端逐一水解氨基酸,水解得到的游离氨基酸产物被纳米孔捕获从而被检测,根据出现的氨基酸信号的种类和数量可以判断多肽成分。Immediately after M2MspA-N91H nanopore intercalation, the voltage was adjusted to +50 mV and recorded for 10 min. Next, add 1 μL of 20 mM copper chloride solution to the trans side of the sample tank, and record for 10 minutes. At the same time, 8 μL of the peptide solution with a concentration of 2 mM was placed in a PCR tube, and incubated at 37° C. for 5 minutes. Add 2 μL carboxypeptidase A1 solution, mix thoroughly, and react at 37°C for 5 minutes. After the reaction is completed, add all the reaction products to the cis side of the sample tank, mix thoroughly, and record at +50mV for 2 hours. The detection principle is shown in Figure 10. Carboxypeptidase A1 (a type of exopeptidase) can hydrolyze amino acids one by one from the C-terminus of the polypeptide, and the free amino acid products obtained by hydrolysis are captured by nanopores to be detected. According to the appearance of amino acid signals The type and quantity can determine the peptide composition.

由于1U外肽酶在1min内能够切割1μmol多肽的末端1个氨基酸,可以根据以下公式计算多肽溶液的终浓度(假设该多肽能够被完全水解):Since 1 U of exopeptidase can cut 1 amino acid at the end of 1 μmol of polypeptide within 1 min, the final concentration of the polypeptide solution can be calculated according to the following formula (assuming that the polypeptide can be completely hydrolyzed):

Figure PCTCN2022114670-appb-000007
Figure PCTCN2022114670-appb-000007

式中,a为多肽长度,b为物质的量(μmol),d为外肽酶酶活(U),c为反应时间(min)。In the formula, a is the length of the polypeptide, b is the amount of substance (μmol), d is the activity of exopeptidase (U), and c is the reaction time (min).

在纳米孔系统中,游离氨基酸的浓度为1μM时,检测效果较好;如果游离氨基酸的终浓度低于1μM时,可能会使游离氨基酸的信号减少,进而影响检测效果。考虑到将水解产物(反应产物)加入到M2MspA-N91H纳米孔系统后,可能会被稀释至少10倍,因此,基于上述公式,反应体系中多肽溶液终浓度应大于1μM的10倍(即10μM)。In the nanopore system, when the concentration of free amino acids is 1 μM, the detection effect is better; if the final concentration of free amino acids is lower than 1 μM, the signal of free amino acids may be reduced, thereby affecting the detection effect. Considering that the hydrolyzate (reaction product) may be diluted at least 10 times after being added to the M2MspA-N91H nanopore system, therefore, based on the above formula, the final concentration of the polypeptide solution in the reaction system should be greater than 10 times of 1 μM (ie 10 μM) .

利用外肽酶水解多肽的末端氨基酸,获得水解产物(反应产物)。上述反应产物加入M2MspA-N91H纳米孔系统并施加驱动力后,反应产物中游离氨基酸穿过所述M2MspA-N91H纳米孔并产生电信号,并获得相应的电信号数据。具体的数据分析流程如图12所示:首先,通过M2MspA-N91H纳米孔检测上述反应产物,获得电信号数据。第二,建立标准信号数据库(如上所述)。第三,设置过滤条件(电信号的SD阈值)。第四,根据过滤条件,将部分电信号排除,获得保留的电信号。第五,通过标准信号数据库中所述标准信号的阻塞率,判断所述保留的电信号对应的氨基酸种类。第六,获得所述游离氨基酸的氨基酸种类信息。The terminal amino acid of the polypeptide is hydrolyzed by exopeptidase to obtain a hydrolyzate (reaction product). After the above reaction product is added to the M2MspA-N91H nanopore system and a driving force is applied, free amino acids in the reaction product pass through the M2MspA-N91H nanopore to generate an electrical signal, and obtain corresponding electrical signal data. The specific data analysis process is shown in Figure 12: First, the above reaction products are detected through the M2MspA-N91H nanopore to obtain electrical signal data. Second, build a standard signal database (as described above). Third, filter conditions (SD threshold of electrical signals) are set. Fourth, according to the filter condition, some electrical signals are excluded to obtain retained electrical signals. Fifth, determine the type of amino acid corresponding to the retained electrical signal based on the blocking rate of the standard signal in the standard signal database. Sixth, the amino acid type information of the free amino acid is obtained.

图7显示了基于上述多肽成分分析方法和数据分析方法分析四种不同多肽的组成。图7A为在0.5M KCl、10mM MOPS、pH 7.5条件下,10U羧肽酶水解10mM EF多肽20分钟后,将反应液加入纳米孔体系检测得到的电流信号图。图7B为图7A涉及实验中统计得到的所有电流信号的阻塞率及过孔时间的散点 图。可观察到SD值位于1-2之间的信号(红色)有3个分布峰,阻塞率0.4附近的为EF多肽的信号;左边两个峰与谷氨酸(E)和苯丙氨酸(F)的阻塞率最为接近,故判断为氨基酸E和F的信号。Fig. 7 shows the analysis of the composition of four different polypeptides based on the above polypeptide composition analysis method and data analysis method. Figure 7A is the current signal diagram obtained by adding the reaction solution to the nanopore system after hydrolyzing 10mM EF polypeptide with 10U carboxypeptidase under the conditions of 0.5M KCl, 10mM MOPS, and pH 7.5 for 20 minutes. FIG. 7B is a scatter diagram of FIG. 7A related to the blocking rate and hole-through time of all current signals obtained in the experiment. It can be observed that the signal (red) with SD value between 1-2 has 3 distribution peaks, and the signal of EF polypeptide near the blocking rate of 0.4; the left two peaks are related to glutamic acid (E) and phenylalanine ( The blocking rate of F) is the closest, so it is judged as the signal of amino acids E and F.

图7C显示了M2MspA-N91H纳米孔分别检测三种多肽(P1:APRLR FYSL;P2:DRVYIHP FHL;P3:RPVKVYPNGAEDESAEAFP LEF;其中下划线表示多肽C端的这些氨基酸能够被羧肽酶A1水解为游离氨基酸)的水解产物的实验结果。上面的分析结果表明,本发明提供的多肽成分分析方法能够对多肽末端氨基酸组成进行分析。 Fig. 7 C shows that M2MspA-N91H nanopore detects respectively three kinds of polypeptides (P1: APRLR FYSL ; P2: DRVYIHP FHL ; P3: RPVKVYPNGAEDESAEAFP LEF ; wherein underline represents that these amino acids of polypeptide C terminal can be hydrolyzed as free amino acids by carboxypeptidase A1) Experimental results of hydrolyzate. The above analysis results show that the polypeptide component analysis method provided by the present invention can analyze the terminal amino acid composition of the polypeptide.

本发明提供的上述多肽成分分析方法,可以应用于多种不同场景,例如已知多肽的鉴定和未知多肽的预测:The above-mentioned polypeptide composition analysis method provided by the present invention can be applied to many different scenarios, such as the identification of known polypeptides and the prediction of unknown polypeptides:

肿瘤新生抗原作为在正常组织中不表达,而仅在肿瘤组织表达的抗原,是理想的免疫治疗靶标。每位患者都有其独特的新抗原谱,可以用上述多肽成分分析方法来筛选肿瘤新生抗原肽:首先,利用高特异性的提取方法,提取并纯化已知肿瘤细胞表面抗原多肽,纯化后的混合物理论上仅含有少量且成分已知的抗原肽;其次,使用上述多肽成分分析方法对纯化后的混合物进行检测;最后,将检测结果中检测出的氨基酸种类与已知抗原肽的氨基酸种类进行对比,可以判断某些氨基酸是否产生或缺少,进而判断肿瘤新生抗原是否可能存在于肿瘤细胞表面。Tumor neoantigens, which are not expressed in normal tissues but only expressed in tumor tissues, are ideal targets for immunotherapy. Each patient has its unique neoantigen spectrum, and the above-mentioned peptide component analysis method can be used to screen tumor neoantigen peptides: first, use a highly specific extraction method to extract and purify known tumor cell surface antigen peptides, and the purified The mixture theoretically contains only a small amount of antigenic peptides with known components; secondly, use the above-mentioned peptide composition analysis method to detect the purified mixture; finally, compare the amino acid species detected in the test results with the amino acid species of known antigenic peptides In contrast, it can be judged whether certain amino acids are produced or lacked, and then judged whether neoplastic antigens may exist on the surface of tumor cells.

目前,一些多肽药物已被开发并应用于临床,但由于合成工艺的限制,导致多肽药物的产品质量和纯度等方面达不到要求。通过上述多肽组成成分分析方法,可以实现对多肽药物的纯度的检测:将待检测的多肽药物的氨基酸成分检测结果,与预期合成的氨基酸种类进行对比,能够一定程度上判断该多肽药物的合成质量。At present, some peptide drugs have been developed and applied clinically, but due to the limitation of the synthesis process, the product quality and purity of peptide drugs cannot meet the requirements. Through the above-mentioned polypeptide composition analysis method, the detection of the purity of the polypeptide drug can be realized: comparing the detection results of the amino acid composition of the polypeptide drug to be detected with the expected synthetic amino acid type can judge the synthetic quality of the polypeptide drug to a certain extent .

此外,本发明提供的多肽成分分析方法还可以与机器学习结合,来预测未知多肽:将待检测的未知多肽用外切酶水解并检测游离氨基酸;使用机器学习的方法,结合外切酶的种类、水解特点以及氨基酸成分检测结果,与数据库中已知多肽进行比对,能够在一定程度上预测该未知多肽的序列,同时起到缩小未知多肽范围的作用。In addition, the polypeptide composition analysis method provided by the present invention can also be combined with machine learning to predict unknown polypeptides: hydrolyze the unknown polypeptide to be detected with exonuclease and detect free amino acids; use machine learning methods to combine the type of exonuclease , hydrolysis characteristics, and amino acid composition detection results, compared with known peptides in the database, can predict the sequence of the unknown peptide to a certain extent, and at the same time play a role in narrowing the range of unknown peptides.

上面结合附图对本发明的实施例进行了描述,但是本发明并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领 域的普通技术人员在本发明的启示下,在不脱离本发明宗旨和权利要求所保护的范围情况下,还可做出很多形式,这些均属于本发明的保护之内。Embodiments of the present invention have been described above in conjunction with the accompanying drawings, but the present invention is not limited to the above-mentioned specific implementations, and the above-mentioned specific implementations are only illustrative, rather than restrictive, and those of ordinary skill in the art will Under the enlightenment of the present invention, many forms can also be made without departing from the gist of the present invention and the protection scope of the claims, and these all belong to the protection of the present invention.

Claims (10)

一种基于M2MspA-N91H纳米孔的多肽成分分析方法,其特征在于,包括以下步骤:A method for analyzing polypeptide components based on M2MspA-N91H nanopore, characterized in that it comprises the following steps: S1将外肽酶与待检测多肽混合均匀并反应,得到水解产物;S1 mixes the exopeptidase and the polypeptide to be detected evenly and reacts to obtain a hydrolyzate; S2将所述水解产物加入纳米孔系统中,所述纳米孔系统包括:M2MspA-N91H纳米孔、绝缘膜、第一介质、第二介质,其中所述M2MspA-N91H纳米孔嵌入在所述绝缘膜中,所述M2MspA-N91H纳米孔提供连通所述第一介质和所述第二介质的通道,所述水解产物被加入至所述第一介质;S2 adds the hydrolyzate to the nanopore system, the nanopore system includes: M2MspA-N91H nanopore, insulating film, first medium, second medium, wherein the M2MspA-N91H nanopore is embedded in the insulating film Among them, the M2MspA-N91H nanopore provides a channel connecting the first medium and the second medium, and the hydrolyzate is added to the first medium; S3向所述第一介质和所述第二介质之间施加驱动力,所述水解产物中游离氨基酸穿过所述M2MspA-N91H纳米孔并产生电信号;S3 applies a driving force between the first medium and the second medium, and the free amino acids in the hydrolyzate pass through the M2MspA-N91H nanopore and generate an electrical signal; S4对所述电信号进行数据分析,获得所述游离氨基酸的氨基酸种类信息。S4 performs data analysis on the electrical signal to obtain amino acid type information of the free amino acid. 如权利要求1所述的方法,其特征在于,所述游离氨基酸包括天然氨基酸和非天然氨基酸。The method according to claim 1, wherein the free amino acids include natural amino acids and unnatural amino acids. 如权利要求2所述的方法,其特征在于,所述天然氨基酸包括甘氨酸、丙氨酸、缬氨酸、亮氨酸、异亮氨酸、苯丙氨酸、色氨酸、酪氨酸、天冬氨酸、组氨酸、天冬酰胺、谷氨酸、赖氨酸、谷氨酰胺、甲硫氨酸、精氨酸、丝氨酸、苏氨酸、半胱氨酸、脯氨酸中的一种或多种;所述非天然氨基酸可选地包括瓜氨酸、L-正缬氨酸、鹅膏氨酸中的一种或多种。The method according to claim 2, wherein said natural amino acid comprises glycine, alanine, valine, leucine, isoleucine, phenylalanine, tryptophan, tyrosine, Aspartic acid, histidine, asparagine, glutamic acid, lysine, glutamine, methionine, arginine, serine, threonine, cysteine, proline One or more; the unnatural amino acid may optionally include one or more of citrulline, L-norvaline, and amarine. 如权利要求1所述的方法,其特征在于,所述外肽酶包括氨肽酶和/或羧肽酶。The method of claim 1, wherein the exopeptidase comprises aminopeptidase and/or carboxypeptidase. 如权利要求1所述的方法,其特征在于,在所述水解产物加入所述纳米孔系统之前,将所述M2MspA-N91H纳米孔与铜离子结合。The method of claim 1, wherein said M2MspA-N91H nanopore is bound to copper ions before said hydrolyzate is added to said nanopore system. 如权利要求1所述的方法,其特征在于,所述第一介质和所述第二介质的pH值包括6.5-7.5。The method according to claim 1, wherein the pH values of the first medium and the second medium include 6.5-7.5. 如权利要求1所述的方法,其特征在于,所述待检测多肽溶液的终浓度大于10μM。The method according to claim 1, characterized in that the final concentration of the polypeptide solution to be detected is greater than 10 μM. 如权利要求1所述的方法,其特征在于,S4中所述数据分析步骤进一步包括:The method according to claim 1, wherein the data analysis step in S4 further comprises: i)建立标准信号数据库;i) Establish a standard signal database; ii)设定信号过滤条件;ii) Set signal filtering conditions; iii)排除部分电信号,获得保留的电信号;iii) Excluding part of the electrical signal to obtain the retained electrical signal; iv)通过标准信号数据库中所述标准信号的阻塞率,判断所述保留的电信号 对应的氨基酸种类;iv) judging the amino acid type corresponding to the retained electrical signal through the blocking rate of the standard signal in the standard signal database; v)获得所述游离氨基酸的氨基酸种类信息。v) obtaining amino acid type information of the free amino acid. 如权利要求8所述的方法,其特征在于,所述信号过滤条件为排除SD值大于SD阈值的电信号,其中所述电信号的SD值按照下列公式进行计算:The method according to claim 8, wherein the signal filtering condition is to exclude electrical signals whose SD value is greater than the SD threshold, wherein the SD value of the electrical signal is calculated according to the following formula:
Figure PCTCN2022114670-appb-100001
Figure PCTCN2022114670-appb-100001
式中,x i为信号每个采集点的电流值,x为信号电流点的平均值,n为信号采集点的数量。 In the formula, x i is the current value of each collection point of the signal, x is the average value of the signal current point, and n is the number of signal collection points.
如权利要求1所述的方法,其特征在于,所述方法进一步包括S5将所述游离氨基酸的氨基酸种类信息与所述待检测多肽的预期序列信息进行对比,以判断所述待检测多肽的实际序列与所述待检测多肽的预期序列是否存在差异;或The method according to claim 1, characterized in that, the method further comprises S5 comparing the amino acid type information of the free amino acid with the expected sequence information of the polypeptide to be detected to determine the actual sequence information of the polypeptide to be detected. Whether the sequence differs from the expected sequence of the polypeptide to be detected; or S5将所述游离氨基酸的氨基酸种类信息结合所述外肽酶的信息与数据库中的多肽序列信息进行对比,以预测所述待检测多肽的实际序列。S5 compares the amino acid type information of the free amino acid combined with the information of the exopeptidase with the polypeptide sequence information in the database to predict the actual sequence of the polypeptide to be detected.
PCT/CN2022/114670 2021-08-30 2022-08-25 Polypeptide composition analysis method based on copper ion modified mspa nanopores Ceased WO2023030146A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111006764 2021-08-30
CN202111006764.8 2021-08-30

Publications (1)

Publication Number Publication Date
WO2023030146A1 true WO2023030146A1 (en) 2023-03-09

Family

ID=85411912

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/114670 Ceased WO2023030146A1 (en) 2021-08-30 2022-08-25 Polypeptide composition analysis method based on copper ion modified mspa nanopores

Country Status (2)

Country Link
CN (1) CN116165373A (en)
WO (1) WO2023030146A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118362715B (en) * 2024-06-20 2024-08-23 四川大学华西医院 A method to improve nanopore detection performance

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120055792A1 (en) * 2008-09-22 2012-03-08 The Uab Research Foundation Msp nanopores and related methods
WO2014071250A1 (en) * 2012-11-01 2014-05-08 University Of Washington Through Its Center For Commercialization Methods for detecting and mapping modifications to nucleic acid polymers using nanopore systems
WO2015051378A1 (en) * 2013-10-04 2015-04-09 University Of Washington Through Its Center For Commercialization Systems and methods for nanopore-based analysis of nucleic acids
CN105801676A (en) * 2016-04-13 2016-07-27 东南大学 Mutant MspA protein monomer and expressed gene and application thereof
CN105992634A (en) * 2013-08-30 2016-10-05 华盛顿大学商业中心 Selective modification of polymer subunits for improved nanopore-based assays
CN106133513A (en) * 2014-02-19 2016-11-16 华盛顿大学 The analysis based on nano-pore of protein characteristic
CN110954445A (en) * 2019-10-31 2020-04-03 四川大学华西医院 A kind of living cell biosensor and its preparation method and application

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120055792A1 (en) * 2008-09-22 2012-03-08 The Uab Research Foundation Msp nanopores and related methods
WO2014071250A1 (en) * 2012-11-01 2014-05-08 University Of Washington Through Its Center For Commercialization Methods for detecting and mapping modifications to nucleic acid polymers using nanopore systems
CN105992634A (en) * 2013-08-30 2016-10-05 华盛顿大学商业中心 Selective modification of polymer subunits for improved nanopore-based assays
WO2015051378A1 (en) * 2013-10-04 2015-04-09 University Of Washington Through Its Center For Commercialization Systems and methods for nanopore-based analysis of nucleic acids
CN106133513A (en) * 2014-02-19 2016-11-16 华盛顿大学 The analysis based on nano-pore of protein characteristic
US20170199149A1 (en) * 2014-02-19 2017-07-13 University Of Washington Nanopore-based analysis of protein characteristics
CN105801676A (en) * 2016-04-13 2016-07-27 东南大学 Mutant MspA protein monomer and expressed gene and application thereof
CN110954445A (en) * 2019-10-31 2020-04-03 四川大学华西医院 A kind of living cell biosensor and its preparation method and application

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG XIALIN, DOU LINQIN, ZHANG MING, WANG YU, JIANG XIN, LI XINQIONG, WEI LONG, CHEN YUEJIA, ZHOU CUISONG, GENG JIA: "Real-time sensing of neurotransmitters by functionalized nanopores embedded in a single live cell", MOLECULAR BIOMEDICINE, vol. 2, no. 1, 1 December 2021 (2021-12-01), XP093041675, DOI: 10.1186/s43556-021-00026-3 *

Also Published As

Publication number Publication date
CN116165373A (en) 2023-05-26

Similar Documents

Publication Publication Date Title
JP5328362B2 (en) How to use pores
US20100099198A1 (en) Apparatus and system for pattern recognition sensing for biomolecules
CN105074458A (en) Hybrid nanopores and uses thereof for detection of analytes
Hagan et al. Chemically tailoring nanopores for single-molecule sensing and glycomics
US9453833B2 (en) System and method for detecting and monitoring proteolysis of protein matrices
US20210172944A1 (en) Apparatus for analyzing and detecting interactions and reactions of molecules
CN105259229B (en) A kind of single molecule analysis method for detecting medicine
US20140248608A1 (en) Methods for characterizing a device component based on a contrast signal to noise ratio
CN115725685A (en) A method for the detection of drug molecules based on biological angstrompores
Ge et al. Aerolysin nanopore-based identification of proteinogenic amino acids using a bipolar peptide probe
CN106932448A (en) A kind of chiral Molecular Recognition and enantiomeric excess detection method based on nanometer pore single-molecule technology
CN114487399A (en) Method for joint analysis of particle size and surface marker
WO2023030146A1 (en) Polypeptide composition analysis method based on copper ion modified mspa nanopores
KR102094241B1 (en) Method for Measuring Concentration of Fibrinogen in Blood Sample and Nanoparticles for the Same
Han et al. Multifunctional peptide-oligonucleotide conjugate promoted sensitive electrochemical biosensing of cardiac troponin I
Chen et al. Single-particle fibrinogen detection using platelet membrane-coated fluorescent polystyrene nanoparticles
Liu et al. Reduction chemistry-assisted nanopore determination method for immunoglobulin isotypes
WO2024160029A1 (en) METHOD FOR REALIZING NUCLEIC ACID SINGLE-MOLECULE FRAGMENT-TYPE SEQUENCING ON THE BASIS OF γ-HEMOLYSIN NANOPORE CONTRACTION REGION
CN108732222B (en) A method for simultaneous rapid detection of glycated hemoglobin and glycated serum protein in blood
CN111912892A (en) Application of aerolysin nanopore channel in biological phosphorylation and related enzyme analysis
WO2024138570A1 (en) Biological nanopore sensor, preparation method therefor, and use thereof
CN115728353A (en) Application of MscCG in the preparation of test kits for detecting valproic acid drugs
CN116381228A (en) Reagent for capturing and detecting circulating exosomes and application thereof
CN118561971B (en) Protein mutant, nanopore detection system and use thereof
CN118362715B (en) A method to improve nanopore detection performance

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22863284

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 11202402185U

Country of ref document: SG

122 Ep: pct application non-entry in european phase

Ref document number: 22863284

Country of ref document: EP

Kind code of ref document: A1