WO2025138284A1 - Array and method for detecting spatial information of nucleic acids - Google Patents
Array and method for detecting spatial information of nucleic acids Download PDFInfo
- Publication number
- WO2025138284A1 WO2025138284A1 PCT/CN2023/143677 CN2023143677W WO2025138284A1 WO 2025138284 A1 WO2025138284 A1 WO 2025138284A1 CN 2023143677 W CN2023143677 W CN 2023143677W WO 2025138284 A1 WO2025138284 A1 WO 2025138284A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sequence
- nucleic acid
- capture
- probe
- molecule
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12M—APPARATUS FOR ENZYMOLOGY OR MICROBIOLOGY; APPARATUS FOR CULTURING MICROORGANISMS FOR PRODUCING BIOMASS, FOR GROWING CELLS OR FOR OBTAINING FERMENTATION OR METABOLIC PRODUCTS, i.e. BIOREACTORS OR FERMENTERS
- C12M1/00—Apparatus for enzymology or microbiology
- C12M1/34—Measuring or testing with condition measuring or sensing means, e.g. colony counters
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6841—In situ hybridisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/686—Polymerase chain reaction [PCR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B20/00—Methods specially adapted for identifying library members
- C40B20/02—Identifying library members by their fixed physical location on a support or substrate
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B40/00—Libraries per se, e.g. arrays, mixtures
- C40B40/04—Libraries containing only organic compounds
- C40B40/06—Libraries containing nucleotides or polynucleotides, or derivatives thereof
Definitions
- the present application relates to the field of biomolecule spatial detection. Specifically, the present application provides a nucleic acid array for detecting nucleic acid spatial information in a sample, a method for detecting nucleic acid spatial information in a sample based on the array, and a method for generating the nucleic acid array.
- Spatial omics technology is a technology that obtains omics information such as transcriptome, genome, epigenome and proteome in cells in situ in tissues. Spatial omics technology was named the technical method of the year by Nature methods in 2020, and was listed as one of Nature's seven most noteworthy technologies of the year in 2022. Spatial omics technology can be divided into four categories based on different principles, namely microdissection, in situ hybridization, in situ sequencing, chip in situ capture and high-throughput sequencing. Spatial omics technology based on chip in situ capture and high-throughput sequencing occupies a dominant position due to its simple operation and high throughput, among which the capture chip is the key to this technology.
- Spatial omics technology based on chip capture uses probes with spatial information on the chip to capture nucleic acids and other molecules in tissue sections in situ, uses high-throughput sequencing technology to obtain spatial information on the chip and nucleic acid molecules in cells, and then restores the molecular information in the tissue through algorithm analysis. Since Joakim Lundeberg of the Royal Institute of Technology in Sweden first published spatial transcriptomics technology in Science in 2016, this field has developed rapidly in recent years due to the advantages of high-throughput whole transcriptome technology.
- HDST technology High-definition spatial transcriptomics
- slide-seq technology sequencing by oligonucleotide ligation and detection
- barcode microbeads barcode microbeads
- DBiT-seq technology Deterministic Barcoding in Tissue
- Seq-Scope technology based on Ill ⁇ Mina sequencing platform
- stereo-seq technology spatial enhanced resolution omics sequencing
- 10x Genomics has formed another spatial omics product line in addition to the single-cell production line, including instruments, analysis software and reagents and consumables.
- Biomarker Biotech also released spatial transcriptomics products based on this technology in 2022.
- the core of the existing spatial omics technology based on chip in situ capture lies in the resolution and capture efficiency of the capture chip.
- the resolution of the chip is determined by the spatial information density on the chip, which is related to whether the cells on the tissue slice can be finely analyzed with spatial single cell or subcellular resolution, while the capture efficiency is determined by the capture information on the chip, which is related to whether all molecular information in the cell can be captured without bias.
- the capture function area and spatial information area of the chip are on the same molecule of the nucleic acid array, but it is limited by The resolution of the sequencing chip is affected.
- the nucleic acid molecules containing the spatial information region in the nucleic acid array need to be separated by a certain distance to be recognized by the sequencer. If the separation distance is too small, each nucleic acid molecule cannot be accurately identified.
- the capture function region and the spatial information region are on the same nucleic acid molecule, the resolution is limited, and the capture efficiency is also limited.
- the present application splits the capture area and spatial information on the capture chip probe in the prior art into two independent molecules, namely the positioning probe and the capture probe, wherein the positioning probe is used to provide the spatial position information of spatial omics, and the capture probe is used to capture molecules in cells, thereby getting rid of the limitation that the number of capture probes is affected by spatial density, so as to improve the capture efficiency of molecules while ensuring high resolution.
- nucleic acid array e.g., capture chip
- the nucleic acid array can also realize the acquisition of 5' transcript information.
- the localization probe and the capture probe are each independently linked to the solid support via the same or different click chemistry reactions.
- the first universal sequence, the positioning sequence, the second universal sequence, the fixed sequence, and the capture sequence each independently comprise or do not comprise non-natural nucleotide residues (eg, modified nucleotide residues).
- (A) and (B) can be carried out in any order or simultaneously (for example, simultaneously in the same reaction system).
- (A) is performed before (B), and the solid support in (B) It is a solid support to which the localization probe has been attached.
- (B) is performed before (A), and the solid support in (A) is a solid support to which the capture probe has been attached.
- (A) and (B) are performed simultaneously, and the solid phase support in (A) and (B) is the same solid phase support.
- step (A) comprises:
- step (A) further comprises step (3): performing bridge amplification on the positioning probes on a solid support to which the positioning probes are attached, thereby obtaining multi-copy clusters of each positioning probe.
- any part of the positioning probe can be used to modify the molecule or group A, as long as it does not affect the positioning probe to perform its function (e.g., annealing with the target sequence and/or the function of initiating an extension reaction at the 3' end).
- the positioning probe is capable of annealing to the target sequence and the 3' end of the positioning probe is capable of initiating an extension reaction.
- any part of the positioning probe can be used to modify the molecule or group A, as long as it does not affect the annealing of the positioning probe to the target sequence and/or the initiation of an extension reaction at the 3' end.
- the positioning probe is capable of annealing to the target sequence and the 3' end of the positioning probe is closed. In this case, any part of the positioning probe can be used to modify the molecule or group A, as long as it does not affect the annealing of the positioning probe to the target sequence.
- the positioning sequence of the positioning probe does not modify the molecule or group A.
- the first universal sequence of the positioning probe does not modify the molecule or group A.
- neither the first universal sequence nor the positioning sequence of the positioning probe modifies the molecule or group A.
- the second universal sequence of the localization probe is modified with the molecule or group A.
- the 5' end (e.g., the 5' terminus) of the localization probe is modified with the molecule or group A.
- step (A) comprises:
- each vector comprising at least one copy of a vector sequence
- the vector sequence comprises: a complementary sequence of a positioning sequence and a complementary sequence of a first universal sequence; the first universal sequence and the positioning sequence are as defined above; wherein the complementary sequence of the positioning sequence of each vector sequence is different from each other;
- steps (3) and (4) are performed in any order (for example, step (3) is performed before or after step (4), or step (3) and step (4) are performed simultaneously).
- step (2) the carrier is placed on the surface of the solid support by forming a connection (eg, a non-covalent connection or a covalent connection) with the surface of the solid support.
- a connection eg, a non-covalent connection or a covalent connection
- the vector sequence comprises, in order from the 5' end to the 3' end: a complementary sequence to the universal sequence, and a complementary sequence to the positioning sequence.
- the extension product comprises, in order from 5' to 3', the positioning sequence, and the first universal sequence.
- each vector is a DNB formed from concatemers of multiple copies of the vector sequence.
- the method optionally comprises step (5): digesting the vector sequence, and/or separating (eg, melting) the extension product in step (3) from the vector sequence annealed thereto.
- DNB DNA nano ball
- RCA rolling circle amplification
- the RCA product is a multi-copy single-stranded DNA sequence, which can form a "spherical” structure due to the interaction between the bases of the internal DNA sequence.
- the library molecule is circularized to form a single-stranded circular DNA, and then the single-stranded circular DNA can be amplified by multiple orders of magnitude using the rolling circle amplification technique, and the resulting amplification product is called DNB.
- the fixed primer comprises the second universal sequence or a partial sequence thereof (e.g., the fixed primer comprises the second universal sequence or a partial sequence at the 5' end thereof).
- the second strand is not directly attached to the solid support.
- the 3' end of the second strand comprises the capture sequence, and the capture sequence does not anneal to the first strand.
- the molecule or group A' is DBCO and the molecule or group B' is azido.
- the first strand of the capture probe is modified with
- the surface of the solid support is modified with an azide group, and the first chain of the capture probe and the solid support are connected through a click chemical reaction between DBCO and the azide group.
- the solid support has one or more characteristics selected from the group consisting of:
- the solid support is planar, spherical or porous
- the solid support can be used in a sequencing platform; in certain embodiments, the solid support is a sequencing chip for use in an Ill ⁇ Mina, MGI or Thermo Fisher sequencing platform; and
- the solid support is capable of releasing the localization probe and/or capture probe spontaneously or upon exposure to one or more stimuli (e.g., temperature change, pH change, exposure to a specific chemical or phase, exposure to light, a reducing agent, etc.).
- stimuli e.g., temperature change, pH change, exposure to a specific chemical or phase, exposure to light, a reducing agent, etc.
- the same type of positioning probes i.e., positioning probes comprising the same positioning sequence
- different types of positioning probes occupy different areas on the surface of the support
- the center distance between any two adjacent areas is less than 1 ⁇ m (e.g., less than 900 nm, less than 800 nm, less than 700 nm, less than 600 nm, less than 500 nm, less than 400 nm, less than 300 nm, less than 200 nm).
- the distribution of the capture probe and the positioning probe on the solid support is configured as follows: the nucleic acid molecule connected to the capture probe (for example, a nucleic acid molecule annealed to the capture probe, or a nucleic acid molecule obtained by extending the capture probe through a nucleic acid polymerization reaction) is able to contact the positioning probe adjacent to it.
- the nucleic acid molecule connected to the capture probe for example, a nucleic acid molecule annealed to the capture probe, or a nucleic acid molecule obtained by extending the capture probe through a nucleic acid polymerization reaction
- each of the positioning probes is adjacent to at least 100 capture probes.
- At least 100 capture probes are distributed within a radius of 200 nm to 300 nm around the center of the position occupied by each positioning probe on the solid support (e.g., the center of the area occupied by each positioning probe on the surface of the solid support).
- there are 500-10000 (e.g., 1000-1500, 1000-2000, 1000-3000, 1000-4000, 1000-5000, 1000-8000, 1000-10000) capture probes distributed within a radius of 200nm-300nm around the center of the position occupied by each of the positioning probes on the solid support (e.g., the center of the area occupied by each of the positioning probes on the surface of the solid support).
- the present application provides a method for preparing a nucleic acid array as described above, comprising the following steps:
- nucleic acid array to be processed comprises a solid support, and the solid support is connected with at least two oligonucleotide molecules;
- Each oligonucleotide molecule occupies a different position on the solid support
- the oligonucleotide molecule contains a positioning sequence and a first universal sequence in sequence from the 5' end to the 3' end, wherein the positioning sequence has a nucleotide sequence corresponding to the position of the oligonucleotide molecule on the solid support; the first universal sequence is as defined above;
- each oligonucleotide molecule consists of one or more oligonucleotide molecules.
- the localization sequences contained in the same oligonucleotide molecules are identical to each other, and the localization sequences contained in different oligonucleotide molecules are different from each other.
- oligonucleotide molecules of the same oligonucleotide molecule have the same positioning sequence, however, it is not necessary for every oligonucleotide molecule of each oligonucleotide molecule to have the same complete sequence.
- the oligonucleotide molecule does not comprise a capture sequence at the 3' end of the first universal sequence, the capture sequence being as defined above.
- the oligonucleotide molecule comprises a capture sequence at the 3′ end of the first universal sequence, the capture sequence being as defined above, and the method further comprises step (B′): removing the capture sequence contained in the oligonucleotide molecule;
- steps (B) and (B') can be carried out in any order or simultaneously (for example, simultaneously in the same reaction system).
- the second universal sequence is located 5' to the localization sequence.
- the oligonucleotide molecule further comprises a MID sequence.
- the MID sequence is located at the 5' end of the first universal sequence, and/or, the MID sequence is located at the 3' end of the second universal sequence.
- the MID sequences comprised by the same oligonucleotide molecule are different from each other.
- the oligonucleotide molecules are covalently or non-covalently linked to the solid support.
- the molecule or group pair capable of the click chemistry reaction is selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol. In certain embodiments, the molecule or group pair capable of the click chemistry reaction is azido/alkynyl.
- any part of the capture probe can be used to modify the molecule or group A', as long as it does not affect the function of the capture probe (e.g., annealing with the target sequence and the function of the 3' end initiation extension reaction).
- the capture sequence of the capture probe does not modify the molecule or group A'.
- the 5' end (e.g., 5' end) of the capture probe is modified with the molecule or group A'.
- the capture sequence is located at the 3' end of the capture probe.
- the capture probe is present in a duplex form containing a single-stranded region comprising a capture sequence, wherein the capture sequence is as defined above, and step (B) comprises:
- (I)(1) providing: (a) a free first strand of the capture probe, wherein the first strand is modified with a molecule or group A'; and, (b) the nucleic acid array to be treated or the nucleic acid array treated in step (B'), wherein the solid support surface of the nucleic acid array is modified with a molecule or group B', wherein the molecule or group A' is capable of forming a connection (e.g., covalent and/or non-covalent connection) with the molecule or group B'; and, (c) the free second strand of the capture probe, wherein the second strand comprises the capture sequence and is capable of annealing with the first strand to form a duplex;
- a connection e.g., covalent and/or non-covalent connection
- step (3) contacting the free second strand with the solid support connected to the first strand formed in step (2) under conditions allowing annealing, thereby obtaining a solid support connected to the capture probe;
- (II)(1) providing: (a) a duplex formed by annealing the first strand and the second strand of the capture probe, wherein the first strand is connected to a molecule or group A', and the second strand contains the capture sequence; and, (b) the nucleic acid array to be treated or the nucleic acid array treated in step (B'), wherein the solid support surface of the nucleic acid array is modified with a molecule or group B', and the molecule or group A' can form a connection (e.g., covalent and/or non-covalent connection) with the molecule or group B'; and,
- any portion of the first strand can be used to bind to the solid support.
- the first strand is connected to the second strand, as long as the connection does not affect the first strand to perform its function (e.g., the function of annealing with the second strand).
- the portion of the first strand that is not annealed to the second strand is modified with the molecule or group A'.
- the end (e.g., 5' end or 3' end) of the first strand is modified with the molecule or group A'.
- the second strand is not directly attached to the solid support.
- the 3' end of the second strand comprises the capture sequence, and the capture sequence does not anneal to the first strand.
- the molecule or group A' is DBCO and the molecule or group B' is azido.
- the first extension product comprises a capture sequence and a complementary sequence of a nucleic acid molecule or a partial sequence thereof that anneals to the capture sequence, and the first extension product is connected to the solid support via the capture sequence contained therein;
- the method before performing step (6), the method further comprises a step of purifying the released nucleic acid molecules.
- the method has one or more features selected from the group consisting of:
- the sample is a tissue sample (eg, a tissue section) or a single cell sample (eg, a single cell suspension).
- the analysis comprises sequencing and/or sequence-specific PCR reaction.
- the method before sequencing, further comprises the step of constructing a sequencing library for the nucleic acid molecule containing the spatial information marker or its amplified product obtained in step (5).
- the methods are used to detect the spatial information of RNA (eg, mRNA) of cells in a sample.
- RNA eg, mRNA
- the nucleic acid derived from the sample to be tested is RNA (eg, mRNA) in cells of the sample to be tested, and the capture sequence comprises a poly (dT) sequence or a random oligonucleotide sequence.
- step (3) comprises:
- the first extension product comprises: a capture sequence, the cDNA chain sequence or a partial sequence thereof, and a complementary sequence of the common sequence.
- the 3' terminal overhang has a length of at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, 1-10, 1-5, or 2-10 nucleotides.
- the 3' terminal overhang is an overhang of 2-5 cytosine nucleotides (e.g., a CCC overhang).
- the method further comprises, before step (4), a step of removing the template switching sequence bound to the first extension product (eg, removing the template switching sequence bound to the first extension product by enzyme cleavage or denaturing melting).
- a step of removing the template switching sequence bound to the first extension product eg, removing the template switching sequence bound to the first extension product by enzyme cleavage or denaturing melting.
- the positioning probe does not contain a MID sequence and the template switch sequence contains a MID sequence; alternatively, the positioning probe contains a MID sequence and the template switch sequence does not contain a MID sequence; alternatively, both the positioning probe and the template switch sequence contain a MID sequence.
- the method is used to detect the spatial information of a target nucleic acid (eg, target DNA and/or RNA) of a cell in a sample, wherein the target nucleic acid comprises a target nucleotide sequence.
- a target nucleic acid eg, target DNA and/or RNA
- the nucleic acid derived from the sample to be tested that anneals to the capture sequence is a single-stranded nucleic acid or a double-stranded nucleic acid containing a single-stranded region comprising the first segment.
- RNA e.g., mRNA
- the second capture sequence comprises a sequence that specifically recognizes the first segment of the target nucleotide sequence; in step (4), the first universal sequence II comprises a sequence that specifically recognizes the complementary sequence of the second segment of the target nucleotide sequence, or the first universal sequence II comprises a random oligonucleotide sequence. In certain embodiments, in the target nucleotide sequence, the first segment is located at the 3' end of the second segment.
- the first segment is located at the 3' end of the second segment.
- the definition/description of the positioning probe above also applies to the first positioning probe and the second positioning probe
- the definition/description of the first universal sequence above also applies to the first universal sequence I and the first universal sequence II.
- An exemplary capture chip of the present application comprises a positioning probe and a capture probe, wherein the capture probe comprises a linker sequence and a capture sequence in sequence from the 5’ end to the 3’ end, and the positioning probe comprises a first linker sequence, a spatial position sequence (i.e., a positioning sequence) and a second linker sequence in sequence from the 5’ end to the 3’ end.
- the capture probe comprises a linker sequence and a capture sequence in sequence from the 5’ end to the 3’ end
- the positioning probe comprises a first linker sequence, a spatial position sequence (i.e., a positioning sequence) and a second linker sequence in sequence from the 5’ end to the 3’ end.
- the capture chip preparation steps are as follows:
- Chip surface modification Prepare 0.1% poly-lysine solution (PLL, sigma P8920), add it to the above-treated chip, and incubate at 30°C for 3 h to modify the chip surface with amino groups. Discard the reaction solution, wash the chip with ddH2O, and dry the chip for 1 h. Prepare NHS-PEG-N3 reaction solution (sigma JKA5088) according to the instructions and add it to the above-mentioned chip. Incubate at 37°C for 5 h to modify the chip surface with azide groups.
- Oligo dT modification of chip capture probe Bioengineering synthesized a capture probe with a linker and oligo dT: 5’-CCTCCGACTGTGTGACTTAGACTCCTGCCACCTCCTGATGTGCTTTTTTTTTTTTTTTTTTTTV-3’ (SEQ ID NO: 2). The 5’ end of the probe was modified with DBCO (dibenzocyclooctyne). The probe was diluted to 1 ⁇ M with PBS and added to the chip for reaction at 37°C overnight. This allowed the azide groups on the chip surface to undergo a click reaction with the DBCO-modified probe, thereby connecting the oligo dT probe to the chip to obtain a chip containing capture probes.
- DBCO dibenzocyclooctyne
- tissue slices such as mouse brain slices
- the slices are permeabilized to release mRNA, so that the mRNA in the tissue can be captured by the capture probes on the chip.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Immunology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Sustainable Development (AREA)
- Plant Pathology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Apparatus Associated With Microorganisms And Enzymes (AREA)
Abstract
Description
本申请涉及生物分子空间检测领域。具体地,本申请提供了用于检测样品中核酸空间信息的核酸阵列,以及基于所述阵列的检测样品中核酸空间信息的方法,以及产生所述核酸阵列的方法。The present application relates to the field of biomolecule spatial detection. Specifically, the present application provides a nucleic acid array for detecting nucleic acid spatial information in a sample, a method for detecting nucleic acid spatial information in a sample based on the array, and a method for generating the nucleic acid array.
空间组学技术是一种在组织原位获取细胞中转录组、基因组、表观组及蛋白组等组学信息的技术。空间组学技术于2020年被Nature methods评为年度技术方法,2022年又上榜Nature年度最值得关注的七大榜单技术之一。空间组学技术基于原理不同可以分为四类,即显微切割、原位杂交、原位测序、芯片原位捕获及高通量测序的空间组学技术。基于芯片原位捕获及高通量测序的空间组学技术因操作简便、通量高等优势占据主导地位,其中捕获芯片是该技术的关键。Spatial omics technology is a technology that obtains omics information such as transcriptome, genome, epigenome and proteome in cells in situ in tissues. Spatial omics technology was named the technical method of the year by Nature methods in 2020, and was listed as one of Nature's seven most noteworthy technologies of the year in 2022. Spatial omics technology can be divided into four categories based on different principles, namely microdissection, in situ hybridization, in situ sequencing, chip in situ capture and high-throughput sequencing. Spatial omics technology based on chip in situ capture and high-throughput sequencing occupies a dominant position due to its simple operation and high throughput, among which the capture chip is the key to this technology.
基于芯片捕获的空间组学技术,是采用芯片上带有空间信息的探针将组织切片的核酸等分子进行原位捕获后,采用高通量测序技术获取芯片上的空间及细胞内的核酸分子信息,再通过算法分析还原组织内的分子信息。自2016年,瑞典皇家理工学院Joakim Lundeberg在Science首次发表空间转录组技术以来,因该技术具备高通量全转录组等优势,该领域近年来得到迅猛发展。类似的其他空间转录组技术,基于硅胶磁珠的HDST技术(High-definition spatial transcriptomics),基于条形码微珠开发的slide-seq技术(sequencing by oligonucleotide ligation and detection),基于微流控技术开发的DBiT-seq技术(Deterministic Barcoding in Tissue)、基于IllμMina测序平台的Seq-Scope技术、以及基于DNB测序平台的stereo-seq技术(spatial enhanced resolution omics sequencing)。其中Joakim的空间转录组技术于2018年底被10x Genomics公司收购并商业化,于2019年发布VisiμM产品,至今基于该技术,10x Genomics公司已形成了除单细胞产线外的另一个包含仪器设备、分析软件及试剂耗材的空间组学产品线,同时百迈客生物科技公司也基于该技术在2022年发布了空间转录组产品。Spatial omics technology based on chip capture uses probes with spatial information on the chip to capture nucleic acids and other molecules in tissue sections in situ, uses high-throughput sequencing technology to obtain spatial information on the chip and nucleic acid molecules in cells, and then restores the molecular information in the tissue through algorithm analysis. Since Joakim Lundeberg of the Royal Institute of Technology in Sweden first published spatial transcriptomics technology in Science in 2016, this field has developed rapidly in recent years due to the advantages of high-throughput whole transcriptome technology. Other similar spatial transcriptomics technologies include HDST technology (High-definition spatial transcriptomics) based on silica magnetic beads, slide-seq technology (sequencing by oligonucleotide ligation and detection) based on barcode microbeads, DBiT-seq technology (Deterministic Barcoding in Tissue) based on microfluidics, Seq-Scope technology based on IllμMina sequencing platform, and stereo-seq technology (spatial enhanced resolution omics sequencing) based on DNB sequencing platform. Among them, Joakim's spatial transcriptomics technology was acquired and commercialized by 10x Genomics at the end of 2018, and the VisiμM product was released in 2019. Based on this technology, 10x Genomics has formed another spatial omics product line in addition to the single-cell production line, including instruments, analysis software and reagents and consumables. At the same time, Biomarker Biotech also released spatial transcriptomics products based on this technology in 2022.
现有基于芯片原位捕获的空间组学技术其核心在于捕获芯片的分辨率和捕获效率。芯片的分辨率由芯片上的空间信息密度决定,它关系到对组织切片上的细胞能否实现空间单细胞或亚细胞分辨率的精细解析,而捕获效率由芯片上的捕获信息决定,它关系到能否对细胞内所有分子信息进行无偏差的全面捕获。而现有的以测序芯片作为捕获芯片的技术,芯片的捕获功能区域和空间信息区域在核酸阵列的同一条分子上,然而受限于 测序芯片的分辨率影响,核酸阵列中含有空间信息区域的核酸分子之间需要间隔一定距离以上才可以被测序仪识别,若间隔距离太小则无法准确识别各核酸分子。当捕获功能区域和空间信息区域在同一条核酸分子上时,分辨率受到限制,则捕获效率也受到限制。The core of the existing spatial omics technology based on chip in situ capture lies in the resolution and capture efficiency of the capture chip. The resolution of the chip is determined by the spatial information density on the chip, which is related to whether the cells on the tissue slice can be finely analyzed with spatial single cell or subcellular resolution, while the capture efficiency is determined by the capture information on the chip, which is related to whether all molecular information in the cell can be captured without bias. In the existing technology that uses sequencing chips as capture chips, the capture function area and spatial information area of the chip are on the same molecule of the nucleic acid array, but it is limited by The resolution of the sequencing chip is affected. The nucleic acid molecules containing the spatial information region in the nucleic acid array need to be separated by a certain distance to be recognized by the sequencer. If the separation distance is too small, each nucleic acid molecule cannot be accurately identified. When the capture function region and the spatial information region are on the same nucleic acid molecule, the resolution is limited, and the capture efficiency is also limited.
此外,由于现有的捕获芯片都是基于poly T探针进行mRNA的捕获,为了同时获取转录本的表达图谱和空间信息,目前只能进行mRNA 3’端转录本的捕获,而丢失了其5’端转录本的信息,而对于免疫组库而言,重要的分子信息集中在5’端,因而现有技术在5’端转录本的捕获上是欠缺的。In addition, because existing capture chips are based on poly T probes to capture mRNA, in order to simultaneously obtain the expression map and spatial information of the transcripts, they can only capture the 3’ end transcripts of mRNA, while losing the information of the 5’ end transcripts. For the immune repertoire, important molecular information is concentrated at the 5’ end, so the existing technology is lacking in capturing the 5’ end transcripts.
发明内容Summary of the invention
为了同时实现高分辨以及高捕获效率的空间组学技术,本申请将现有技术中捕获芯片探针上的捕获区和空间信息拆分设计为分别存在于两个独立的分子上,即定位探针和捕获探针,其中定位探针用于提供空间组学的空间位置信息,而捕获探针用于捕获细胞内的分子,从而摆脱了捕获探针数量受空间密度影响的限制,以实现在保证高分辨率的同时,提升分子的捕获效率。In order to achieve both high-resolution and high-capture efficiency spatial omics technology, the present application splits the capture area and spatial information on the capture chip probe in the prior art into two independent molecules, namely the positioning probe and the capture probe, wherein the positioning probe is used to provide the spatial position information of spatial omics, and the capture probe is used to capture molecules in cells, thereby getting rid of the limitation that the number of capture probes is affected by spatial density, so as to improve the capture efficiency of molecules while ensuring high resolution.
此外,本申请提供的核酸阵列(例如,捕获芯片)还可以实现5’转录本信息的获取。In addition, the nucleic acid array (e.g., capture chip) provided in the present application can also realize the acquisition of 5' transcript information.
核酸阵列Nucleic acid array
因此,在一方面,本申请提供了一种用于检测样品中核酸空间信息的核酸阵列,其包括固相支持物,所述固相支持物连接有一种或多种捕获探针以及至少两种定位探针;Therefore, in one aspect, the present application provides a nucleic acid array for detecting spatial information of nucleic acids in a sample, comprising a solid support to which one or more capture probes and at least two localization probes are attached;
所述捕获探针与所述定位探针各自独立地与所述固相支持物连接;The capture probe and the localization probe are each independently connected to the solid support;
每种定位探针在所述固相支持物上占据不同位置;Each localization probe occupies a different position on the solid support;
所述定位探针含有第一通用序列和定位序列,其中,所述定位序列具有与该种定位探针在固相支持物的位置相对应的核苷酸序列;The positioning probe contains a first universal sequence and a positioning sequence, wherein the positioning sequence has a nucleotide sequence corresponding to the position of the positioning probe on the solid support;
所述捕获探针含有捕获序列,所述捕获序列能够与待捕获的核酸分子退火并起始延伸反应。The capture probe contains a capture sequence that is capable of annealing to the nucleic acid molecule to be captured and initiating an extension reaction.
在某些实施方案中,每种所述定位探针由一个或多个定位探针分子组成。In certain embodiments, each of the localization probes consists of one or more localization probe molecules.
在某些实施方案中,属于同种定位探针的定位探针分子所含有的定位序列彼此相同,不同种定位探针的定位探针分子含有的定位序列彼此不同。In certain embodiments, the localization sequences contained in localization probe molecules belonging to the same type of localization probe are identical to each other, and the localization sequences contained in localization probe molecules belonging to different types of localization probes are different from each other.
本领域技术人员易于理解,同种定位探针的一个或多个定位探针分子具有相同的定位序列,然而,不需要每种定位探针的每一个定位探针分子都具有相同的完整序列。It is easy for a person skilled in the art to understand that one or more localization probe molecules of the same localization probe have the same localization sequence, however, it is not necessary for every localization probe molecule of each localization probe to have the same complete sequence.
在某些实施方案中,所述核酸阵列的所述定位探针分子和所述捕获探针是相互分离的。 In certain embodiments, the localization probe molecules and the capture probes of the nucleic acid array are separated from each other.
在某些实施方案中,所述第一通用序列位于所述定位序列的3’端。In certain embodiments, the first universal sequence is located 3' to the localization sequence.
在某些实施方案中,所述定位探针进一步包含第二通用序列。In certain embodiments, the localization probe further comprises a second universal sequence.
在某些实施方案中,所述第二通用序列位于所述定位序列的5’端。In certain embodiments, the second universal sequence is located 5' to the localization sequence.
在某些实施方案中,所述固相支持物所连接的不同种定位探针所包含的所述第二通用序列相同或不相同。在某些实施方案中,所述固相支持物所连接的不同种定位探针所包含的所述第二通用序列相同。在某些实施方案中,所述固相支持物所连接的同种定位探针所包含的所述第二通用序列相同或不相同。In certain embodiments, the second universal sequence comprised by different types of localization probes attached to the solid support is the same or different. In certain embodiments, the second universal sequence comprised by different types of localization probes attached to the solid support is the same. In certain embodiments, the second universal sequence comprised by the same type of localization probes attached to the solid support is the same or different.
在某些实施方案中,所述固相支持物所连接的同种定位探针所包含的所述第一通用序列相同或不相同。In certain embodiments, the first universal sequence comprised by the same type of localization probes attached to the solid support is the same or different.
在某些实施方案中,所述定位探针不包含或者进一步包含分子标签序列(MID,molecular identifier)。In some embodiments, the localization probe does not contain or further contains a molecular tag sequence (MID, molecular identifier).
在某些实施方案中,所述定位探针进一步包含MID序列。In certain embodiments, the localization probe further comprises a MID sequence.
在某些实施方案中,所述MID序列位于所述第一通用序列的5’端,和/或,所述MID序列位于所述第二通用序列的3’端。In certain embodiments, the MID sequence is located at the 5' end of the first universal sequence, and/or, the MID sequence is located at the 3' end of the second universal sequence.
在某些实施方案中,同种定位探针所包含的MID序列彼此不同。In certain embodiments, cognate localization probes comprise MID sequences that are different from each other.
在某些实施方案中,所述捕获序列位于所述捕获探针的3’末端。In certain embodiments, the capture sequence is located at the 3' end of the capture probe.
在某些实施方案中,所述捕获序列的3’末端具有自由羟基(-OH)。In certain embodiments, the capture sequence has a free hydroxyl group (-OH) at the 3' end.
在某些实施方案中,所述捕获探针进一步包含固定序列。In certain embodiments, the capture probe further comprises an immobilization sequence.
在某些实施方案中,所述固相支持物所连接的不同捕获探针所包含的所述固定序列相同或不相同。在某些实施方案中,所述固相支持物所连接的不同捕获探针所包含的所述固定序列相同。In certain embodiments, the fixed sequences comprised by different capture probes attached to the solid support are identical or different. In certain embodiments, the fixed sequences comprised by different capture probes attached to the solid support are identical.
在某些实施方案中,所述固定序列位于所述捕获序列的5’端。In certain embodiments, the fixed sequence is located 5' to the capture sequence.
在某些实施方案中,所述捕获探针含有包含所述捕获序列的单链区域。In certain embodiments, the capture probe contains a single-stranded region comprising the capture sequence.
在某些实施方案中,所述捕获探针以含有所述单链区域的单链形式或双链体形式存在。In certain embodiments, the capture probe exists in a single-stranded form or in a duplex form containing the single-stranded region.
在某些实施方案中,所述捕获探针以单链形式存在,并且,所述捕获序列位于所述捕获探针的3’末端;或者,In certain embodiments, the capture probe is in a single-stranded form, and the capture sequence is located at the 3' end of the capture probe; or
所述捕获探针以含有单链区域的双链体形式存在,其包含直接与所述固相支持物连接的第一链,以及与所述第一链杂交的第二链;并且,所述第二链的3’端包含单链区域,所述捕获序列位于所述单链区域的3’末端。 The capture probe exists in the form of a double-stranded body containing a single-stranded region, which includes a first chain directly connected to the solid support and a second chain hybridized with the first chain; and the 3' end of the second chain includes a single-stranded region, and the capture sequence is located at the 3' end of the single-stranded region.
在某些实施方案中,所述第二链包含所述捕获序列与所述固定序列。In certain embodiments, the second strand comprises the capture sequence and the immobilization sequence.
在某些实施方案中,所述核酸阵列具备选自以下的一项或多项特征:In certain embodiments, the nucleic acid array has one or more features selected from the following:
(1)所述定位序列由5-50个(例如5-25个、5-35个、5-45个、10-25个、10-35个、10-45个、15-25个、15-35个、15-45个、20-25个、20-35个或20-45个)随机核苷酸组成的核苷酸序列,在某些实施方案中,每个随机核苷酸各自独立地为脱氧核糖核苷酸A、C、G和T中的任何一种;(1) the positioning sequence is a nucleotide sequence consisting of 5-50 (e.g., 5-25, 5-35, 5-45, 10-25, 10-35, 10-45, 15-25, 15-35, 15-45, 20-25, 20-35 or 20-45) random nucleotides, and in certain embodiments, each random nucleotide is independently any one of deoxyribonucleotides A, C, G and T;
(2)所述固相支持物选自乳胶珠、葡聚糖珠、聚苯乙烯表面、聚丙烯表面、聚丙烯酰胺凝胶、金表面、玻璃表面、芯片、传感器、电极和硅晶片;在某些实施方案中,所述固相支持物是芯片;在某些实施方案中,所述固相支持物能够用于测序平台;在某些实施方案中,所述固相支持物是测序芯片(例如,高通量测序芯片),例如用于Illumina、MGI或Thermo Fisher测序平台的测序芯片(例如,高通量测序芯片);(2) The solid support is selected from latex beads, dextran beads, polystyrene surfaces, polypropylene surfaces, polyacrylamide gels, gold surfaces, glass surfaces, chips, sensors, electrodes and silicon wafers; in some embodiments, the solid support is a chip; in some embodiments, the solid support can be used in a sequencing platform; in some embodiments, the solid support is a sequencing chip (e.g., a high-throughput sequencing chip), such as a sequencing chip for an Illumina, MGI or Thermo Fisher sequencing platform (e.g., a high-throughput sequencing chip);
(3)同种所述定位探针(也即,包含相同定位序列的定位探针)占据所述固相支持物表面同一个区域,不同种所述定位探针占据所述支持物表面的不同区域,并且,任意相邻近的两个所述区域之间的中心距离小于1μm(例如,小于900nm、小于800nm、小于700nm、小于600nm、小于500nm、小于400nm、小于300nm、小于200nm);(3) The same type of positioning probes (i.e., positioning probes containing the same positioning sequence) occupy the same area on the surface of the solid support, different types of positioning probes occupy different areas on the surface of the support, and the center distance between any two adjacent areas is less than 1 μm (e.g., less than 900 nm, less than 800 nm, less than 700 nm, less than 600 nm, less than 500 nm, less than 400 nm, less than 300 nm, less than 200 nm);
(4)所述第一通用序列能够与(i)所述捕获序列捕获的核酸分子,或者,(ii)衍生自(i)的核酸分子退火;(4) the first universal sequence is capable of annealing with (i) a nucleic acid molecule captured by the capture sequence, or (ii) a nucleic acid molecule derived from (i);
(5)所述固相支持物所连接的不同种定位探针所包含的第一通用序列相同或不相同;(5) the first universal sequences contained in the different types of localization probes connected to the solid support are the same or different;
(6)所述固相支持物能够自发地或在暴露于一种或多种刺激(例如,温度变化、pH变化、暴露于特定化学物质或相、暴露于光、还原剂等)时释放所述定位探针和/或捕获探针。(6) The solid support is capable of releasing the localization probe and/or capture probe spontaneously or when exposed to one or more stimuli (e.g., temperature change, pH change, exposure to specific chemicals or phases, exposure to light, reducing agents, etc.).
在某些实施方案中,所述定位探针和/或所述捕获探针与所述固相支持物共价连接或者非共价连接。In certain embodiments, the localization probe and/or the capture probe is covalently or non-covalently attached to the solid support.
在某些实施方案中,所述定位探针和/或所述捕获探针与所述固相支持物共价连接。In certain embodiments, the localization probe and/or the capture probe is covalently linked to the solid support.
在某些实施方案中,所述定位探针和所述捕获探针各自独立地通过相同或不同的点击化学反应与所述固相支持物形成连接。In certain embodiments, the localization probe and the capture probe are each independently linked to the solid support via the same or different click chemistry reactions.
在某些实施方案中,能够发生所述点击化学反应的分子或基团对选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇。在某些实施方案中,所述能够发生点击化学反应的分子或基团对为叠氮基/炔基。In certain embodiments, the molecule or group pair capable of the click chemistry reaction is selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol. In certain embodiments, the molecule or group pair capable of the click chemistry reaction is azido/alkynyl.
在某些实施方案中,所述固相支持物表面修饰有分子或基团X,所述定位探针修饰 有分子或基团Y,所述分子或基团X能够与所述分子或基团Y能够发生点击化学反应,所述定位探针通过所述分子或基团X与所述分子或基团Y发生的点击化学反应与所述固相支持物形成连接;和/或,所述固相支持物表面修饰有所述分子或基团X’,所述捕获探针修饰有分子或基团Y’,所述分子或基团X’能够与所述分子或基团Y’能够发生点击化学反应,所述捕获探针通过所述分子或基团X’与所述分子或基团Y’发生的点击化学反应与所述固相支持物形成连接。In certain embodiments, the surface of the solid support is modified with a molecule or group X, and the localization probe is modified with There is a molecule or group Y, the molecule or group X can undergo a click chemical reaction with the molecule or group Y, and the positioning probe is connected to the solid support through the click chemical reaction between the molecule or group X and the molecule or group Y; and/or, the surface of the solid support is modified with the molecule or group X', the capture probe is modified with a molecule or group Y', the molecule or group X' can undergo a click chemical reaction with the molecule or group Y', and the capture probe is connected to the solid support through the click chemical reaction between the molecule or group X' and the molecule or group Y'.
在某些实施方案中,所述分子或基团对X-Y和X’-Y’各自独立地选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇、叠氮基/炔基、氰基/叠氮基、烯/胺、烯/硫醇、炔/硫醇、1,3-二醇/醛、1,3-二醇/酮。In certain embodiments, the molecules or groups X-Y and X'-Y' are each independently selected from the group consisting of: alkynyl/azide, azide/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol, azide/alkynyl, cyano/azide, ene/amine, ene/thiol, alkyne/thiol, 1,3-diol/aldehyde, 1,3-diol/ketone.
在某些实施方案中,所述分子或基团X和/或X’为叠氮基团,所述分子或基团Y和/或Y’为 In certain embodiments, the molecule or group X and/or X' is an azide group, and the molecule or group Y and/or Y' is
在某些实施方案中,所述固相支持物表面修饰有叠氮基团,所述定位探针和/或所述捕获探针修饰有所述定位探针和/或所述捕获探针通过所述叠氮基团与所述DBCO发生的点击化学反应与所述固相支持物形成连接。In certain embodiments, the surface of the solid support is modified with an azide group, and the localization probe and/or the capture probe is modified with The positioning probe and/or the capture probe is connected to the solid support through a click chemistry reaction between the azide group and the DBCO.
本领域技术人员易于理解,所述定位探针的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述定位探针行使其功能(例如,与目标序列的退火和/或3’末端起始延伸反应的功能)即可。在某些实施方案中,所述定位探针能够与目标序列退火并且所述定位探针的3’末端能够起始延伸反应,在这种情况下,所述定位探针的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述定位探针与目标序列的退火和/或3’末端起始延伸反应即可。在某些实施方案中,所述定位探针能够与目标序列退火并且所述定位探针的3’末端是封闭的,在这种情况下,所述定位探针的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述定位探针与目标序列的退火即可。在某些实施方案中,所述定位探针的所述定位序列不与所述固相支持物直接连接。在某些实施方案中,所述定位探针的所述第一通用序列不与所述固相支持物直接连接。在某些实施方案中,所述定位探针的所述第一通用序列和所述定位序列均不与所述固相支持物直接连接。在某些实施方案中,所述定位探针通过所述第二通用序列与所述固相支持物进行连接。在某些实施方案中,所述定位探针通过5’端(例如,5’末端) 与所述固相支持物进行连接。It is easy for a person skilled in the art to understand that any part of the positioning probe can be used to connect to the solid support, as long as the connection does not affect the positioning probe to perform its function (for example, annealing with the target sequence and/or the function of initiating an extension reaction at the 3' end). In certain embodiments, the positioning probe is capable of annealing to the target sequence and the 3' end of the positioning probe is capable of initiating an extension reaction. In this case, any part of the positioning probe can be used to connect to the solid support, as long as the connection does not affect the annealing of the positioning probe to the target sequence and/or the initiation of an extension reaction at the 3' end. In certain embodiments, the positioning probe is capable of annealing to the target sequence and the 3' end of the positioning probe is closed. In this case, any part of the positioning probe can be used to connect to the solid support, as long as the connection does not affect the annealing of the positioning probe to the target sequence. In certain embodiments, the positioning sequence of the positioning probe is not directly connected to the solid support. In certain embodiments, the first universal sequence of the positioning probe is not directly connected to the solid support. In some embodiments, neither the first universal sequence nor the localization sequence of the localization probe is directly attached to the solid support. In some embodiments, the localization probe is attached to the solid support via the second universal sequence. In some embodiments, the localization probe is attached to the solid support via the 5' end (e.g., the 5' end) Connected to the solid support.
本领域技术人员易于理解,所述捕获探针的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述捕获探针行使其功能(例如,与目标序列的退火以及3’末端起始延伸反应的功能)即可。在某些实施方案中,所述捕获探针以单链形式存在,在这种情况下,所述捕获探针的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述捕获探针与目标序列的退火以及3’末端起始延伸反应的功能。在某些实施方案中,所述捕获探针的所述捕获序列不与所述固相支持物直接连接。在某些实施方案中,所述捕获探针通过其5’端(例如,5’末端)与所述固相支持物进行连接。It is easy for a person skilled in the art to understand that any part of the capture probe can be used to connect to the solid support, as long as the connection does not affect the function of the capture probe (e.g., annealing with the target sequence and the function of the 3' end initiation extension reaction). In certain embodiments, the capture probe exists in a single-stranded form. In this case, any part of the capture probe can be used to connect to the solid support, as long as the connection does not affect the annealing of the capture probe to the target sequence and the function of the 3' end initiation extension reaction. In certain embodiments, the capture sequence of the capture probe is not directly connected to the solid support. In certain embodiments, the capture probe is connected to the solid support via its 5' end (e.g., 5' end).
在某些实施方案中,所述捕获探针以含有单链区域的双链体形式存在,所述双链体包含直接与所述固相支持物连接的第一链以及与所述第一链杂交的第二链,在这种情况下,所述第一链的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述第一链行使其功能(例如,与所述第二链退火的功能)即可。在某些实施方案中,所述第一链通过其中未与所述第二链退火的部分与所述固相支持物进行连接。在某些实施方案中,所述第一链通过其末端(例如,5’末端或3’末端)与所述固相支持物进行连接。在某些实施方案中,所述第二链不与所述固相支持物直接连接。In certain embodiments, the capture probe exists in the form of a duplex containing a single-stranded region, the duplex comprising a first chain directly connected to the solid support and a second chain hybridized to the first chain, in which case any portion of the first chain can be used to connect to the solid support, as long as the connection does not affect the first chain to perform its function (e.g., the function of annealing to the second chain). In certain embodiments, the first chain is connected to the solid support through a portion that is not annealed to the second chain. In certain embodiments, the first chain is connected to the solid support through its end (e.g., 5' end or 3' end). In certain embodiments, the second chain is not directly connected to the solid support.
在某些实施方案中,所述捕获探针和所述定位探针在所述固相支持物上的分布按照如下方式进行设置:与所述捕获探针连接的核酸分子(例如,与所述捕获探针退火的核酸分子,或者,由所述捕获探针经核酸聚合反应延伸获得的核酸分子)能够和与其邻近的定位探针相接触。In certain embodiments, the distribution of the capture probe and the positioning probe on the solid support is configured as follows: the nucleic acid molecule connected to the capture probe (for example, a nucleic acid molecule annealed to the capture probe, or a nucleic acid molecule obtained by extending the capture probe through a nucleic acid polymerization reaction) is able to contact the positioning probe adjacent to it.
在某些实施方案中,每种所述定位探针的邻近分布有至少100个所述捕获探针。例如,至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1000个、至少1500个、至少2000个、至少2500个、至少3000个、至少3500个、至少4000个、至少4500个、至少5000个、至少5500个、至少6000个、至少6500个、至少7000个、至少7500个、至少8000个、至少8500个、至少9000个、至少9500个或至少10000个所述捕获探针。In certain embodiments, each of the positioning probes is adjacent to at least 100 capture probes. For example, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 3500, at least 4000, at least 4500, at least 5000, at least 5500, at least 6000, at least 6500, at least 7000, at least 7500, at least 8000, at least 8500, at least 9000, at least 9500 or at least 10000 capture probes.
在某些实施方案中,每种所述定位探针的邻近分布有500-10000个(例如,1000-1500个、1000-2000个、1000-3000个、1000-4000个、1000-5000个、1000-8000个、1000-10000个)所述捕获探针。In certain embodiments, there are 500-10,000 (eg, 1,000-1,500, 1,000-2,000, 1,000-3,000, 1,000-4,000, 1,000-5,000, 1,000-8,000, 1,000-10,000) capture probes distributed adjacent to each of the localization probes.
在某些实施方案中,表述“每种所述定位探针的邻近”表示固相支持物表面该种定位探针附近的区域,其中,与分布于该区域的捕获探针相连接的核酸分子(例如,与所 述捕获探针退火的核酸分子,或者,由所述捕获探针经核酸聚合反应延伸获得的核酸分子)能够与该种定位探针相接触。In certain embodiments, the expression "in the vicinity of each of the positioning probes" refers to an area on the surface of a solid support near the positioning probe, wherein a nucleic acid molecule (e.g., The nucleic acid molecule to which the capture probe is annealed, or the nucleic acid molecule obtained by extending the capture probe through nucleic acid polymerization reaction) can be contacted with the positioning probe.
在某些实施方案中,每种所述定位探针在所述固相支持物所占位置的中心(例如,每种所述定位探针在所述所述固相支持物表面所占据的区域的中心)周围半径200nm-300nm的范围内分布有至少100个所述捕获探针。例如,至少100个、至少200个、至少300个、400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1000个、至少1500个、至少2000个、至少2500个、至少3000个、至少3500个、至少4000个、至少4500个、至少5000个、至少5500个、至少6000个、至少6500个、至少7000个、至少7500个、至少8000个、至少8500个、至少9000个、至少9500个或至少10000个所述捕获探针。In certain embodiments, at least 100 capture probes are distributed within a radius of 200nm-300nm around the center of the position occupied by each of the positioning probes on the solid support (e.g., the center of the area occupied by each of the positioning probes on the surface of the solid support). For example, at least 100, at least 200, at least 300, 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 3500, at least 4000, at least 4500, at least 5000, at least 5500, at least 6000, at least 6500, at least 7000, at least 7500, at least 8000, at least 8500, at least 9000, at least 9500 or at least 10000 of the capture probes.
在某些实施方案中,每种所述定位探针在所述固相支持物所占位置的中心(例如,每种所述定位探针在所述固相支持物表面所占据的区域的中心)周围半径200nm-300nm的范围内分布有500-10000个(例如,1000-1500个、1000-2000个、1000-3000个、1000-4000个、1000-5000个、1000-8000个、1000-10000个)所述捕获探针。In certain embodiments, there are 500-10000 (e.g., 1000-1500, 1000-2000, 1000-3000, 1000-4000, 1000-5000, 1000-8000, 1000-10000) capture probes distributed within a radius of 200nm-300nm around the center of the position occupied by each of the positioning probes on the solid support (e.g., the center of the area occupied by each of the positioning probes on the surface of the solid support).
如本文所使用的,表述“每种所述定位探针在所述固相支持物表面所占据的区域”或类似表述,与“每种所述定位探针在所述固相支持物所占位置”具有相同含义,可互换使用。As used herein, the expression "the area occupied by each of the positioning probes on the surface of the solid support" or similar expressions has the same meaning as "the position occupied by each of the positioning probes on the solid support" and can be used interchangeably.
在某些实施方案中,所述核酸阵列包含一种或多种所述捕获探针。In certain embodiments, the nucleic acid array comprises one or more of the capture probes.
在某些实施方案中,不同种所述捕获探针含有的捕获序列不同。In certain embodiments, different species of capture probes contain different capture sequences.
在某些实施方案中,所述待捕获的核酸分子为RNA(例如,mRNA),所述捕获探针的捕获序列含有poly(dT)序列或随机寡核苷酸序列;In certain embodiments, the nucleic acid molecule to be captured is RNA (e.g., mRNA), and the capture sequence of the capture probe contains a poly (dT) sequence or a random oligonucleotide sequence;
所述待捕获的核酸分子为目标靶核酸(例如,目标靶DNA和/或RNA)或衍生自所述目标靶核酸的核酸分子,所述目标靶核酸或衍生自所述目标靶核酸的核酸分子具有所述目标靶核酸所含有的目标靶核苷酸序列,所述捕获探针的捕获序列包含能够与所述目标靶核苷酸序列特异性退火的序列;或者,The nucleic acid molecule to be captured is a target nucleic acid (e.g., a target DNA and/or RNA) or a nucleic acid molecule derived from the target nucleic acid, the target nucleic acid or the nucleic acid molecule derived from the target nucleic acid has a target nucleotide sequence contained in the target nucleic acid, and the capture sequence of the capture probe comprises a sequence that can specifically anneal to the target nucleotide sequence; or,
所述待捕获的核酸分子为RNA(例如,mRNA)和目标靶核酸(例如,目标靶DNA和/或RNA),所述核酸阵列包含用于捕获RNA(例如,mRNA)的第一所述捕获探针以及用于捕获所述目标靶核酸或衍生自所述目标靶核酸的核酸分子的第二捕获探针,所述目标靶核酸或衍生自所述目标靶核酸的核酸分子具有所述目标靶核酸所含有的目标靶核苷酸序列;其中,所述第一捕获探针的捕获序列含有poly(dT)序列或随机寡核苷酸序列,所述第二捕获探针的捕获序列含有能够与所述目标靶核苷酸序列特异性退火的序列。 The nucleic acid molecules to be captured are RNA (e.g., mRNA) and target nucleic acids (e.g., target DNA and/or RNA), and the nucleic acid array comprises a first capture probe for capturing RNA (e.g., mRNA) and a second capture probe for capturing the target nucleic acid or a nucleic acid molecule derived from the target nucleic acid, wherein the target nucleic acid or the nucleic acid molecule derived from the target nucleic acid has a target nucleotide sequence contained in the target nucleic acid; wherein the capture sequence of the first capture probe contains a poly (dT) sequence or a random oligonucleotide sequence, and the capture sequence of the second capture probe contains a sequence that can specifically anneal to the target nucleotide sequence.
本领域技术人员易于理解,除非本文另外指明或根据上下文明显矛盾,否则上文中有关捕获探针的定义/描述同样适用于所述第一捕获探针和所述第二捕获探针。It is easy for a person skilled in the art to understand that, unless otherwise specified herein or clearly contradicted by the context, the above definition/description of the capture probe is also applicable to the first capture probe and the second capture probe.
在某些实施方案中,所述核酸的空间信息包括核酸的定位、分布和/或丰度。In certain embodiments, the spatial information of the nucleic acid includes the location, distribution and/or abundance of the nucleic acid.
在某些实施方案中,所述第一通用序列为由5-80个(例如5-20个、5-30个、5-40个、5-50个、5-70个、10-30个、10-50个、10-70个、20-30个、20-50个、20-70个、25-40个、25-50个、25-70个)核苷酸组成的核苷酸序列。In certain embodiments, the first universal sequence is a nucleotide sequence consisting of 5-80 (e.g., 5-20, 5-30, 5-40, 5-50, 5-70, 10-30, 10-50, 10-70, 20-30, 20-50, 20-70, 25-40, 25-50, 25-70) nucleotides.
在某些实施方案中,所述第二通用序列为由5-80个(例如5-20个、5-30个、5-40个、5-50个、5-70个、10-30个、10-50个、10-70个、20-30个、20-50个、20-70个、25-40个、25-50个、25-70个)核苷酸组成的核苷酸序列。In certain embodiments, the second universal sequence is a nucleotide sequence consisting of 5-80 (e.g., 5-20, 5-30, 5-40, 5-50, 5-70, 10-30, 10-50, 10-70, 20-30, 20-50, 20-70, 25-40, 25-50, 25-70) nucleotides.
在某些实施方案中,所述固定序列为由5-80个(例如5-20个、5-30个、5-40个、5-50个、5-70个、10-30个、10-50个、10-70个、20-30个、20-50个、20-70个、25-40个、25-50个、25-70个)核苷酸组成的核苷酸序列。In certain embodiments, the fixed sequence is a nucleotide sequence consisting of 5-80 (e.g., 5-20, 5-30, 5-40, 5-50, 5-70, 10-30, 10-50, 10-70, 20-30, 20-50, 20-70, 25-40, 25-50, 25-70) nucleotides.
在某些实施方案中,所述捕获序列为由5-50个(例如5-25个、5-35个、5-45个、10-25个、10-35个、10-45个、15-25个、15-35个、15-45个、20-25个、20-35个或20-45个)核苷酸组成的核苷酸序列。In certain embodiments, the capture sequence is a nucleotide sequence consisting of 5-50 (e.g., 5-25, 5-35, 5-45, 10-25, 10-35, 10-45, 15-25, 15-35, 15-45, 20-25, 20-35, or 20-45) nucleotides.
在某些实施方案中,所述随机寡核苷酸序列由5-50个(例如5-25个、5-35个、5-45个、10-25个、10-35个、10-45个、15-25个、15-35个、15-45个、20-25个、20-35个或20-45个)随机核苷酸组成的核苷酸序列。在某些实施方案中,每个随机核苷酸各自独立地为脱氧核糖核苷酸A、C、G和T中的任意一种。In certain embodiments, the random oligonucleotide sequence is a nucleotide sequence consisting of 5-50 (e.g., 5-25, 5-35, 5-45, 10-25, 10-35, 10-45, 15-25, 15-35, 15-45, 20-25, 20-35 or 20-45) random nucleotides. In certain embodiments, each random nucleotide is independently any one of deoxyribonucleotides A, C, G and T.
在某些实施方案中,所述poly(dT)序列由5-50个(例如5-25个、5-35个、5-45个、10-25个、10-35个、10-45个、15-25个、15-35个、15-45个、20-25个、20-35个或20-45个)胸腺嘧啶脱氧核糖核苷酸残基组成。In certain embodiments, the poly (dT) sequence consists of 5-50 (e.g., 5-25, 5-35, 5-45, 10-25, 10-35, 10-45, 15-25, 15-35, 15-45, 20-25, 20-35, or 20-45) thymidine deoxyribonucleotide residues.
在某些实施方案中,所述第一通用序列、所述定位序列、所述第二通用序列、所述固定序列、所述捕获序列各自独立地包含或者不包含非天然核苷酸残基(例如,经修饰的核苷酸残基)。In certain embodiments, the first universal sequence, the positioning sequence, the second universal sequence, the fixed sequence, and the capture sequence each independently comprise or do not comprise non-natural nucleotide residues (eg, modified nucleotide residues).
核酸阵列制备方法Nucleic acid array preparation method
在另一方面,本申请提供了制备如上所述的核酸阵列的方法,其包括以下步骤:In another aspect, the present application provides a method for preparing a nucleic acid array as described above, comprising the following steps:
(A)在固相支持物上连接和/或合成定位探针,所述定位探针如上文所定义;以及,(A) attaching and/or synthesizing a localization probe on a solid support, the localization probe being as defined above; and,
(B)在固相支持物上连接和/或合成捕获探针,所述捕获探针如上文所定义;(B) attaching and/or synthesizing a capture probe on a solid support, wherein the capture probe is as defined above;
其中,所述(A)和(B)可以以任意顺序进行或同时进行(例如,在同一反应体系中同时进行)。Wherein, (A) and (B) can be carried out in any order or simultaneously (for example, simultaneously in the same reaction system).
在某些实施方案中,所述(A)在所述(B)之前进行,所述(B)中的固相支持物 是已经连接有所述定位探针的固相支持物。In certain embodiments, (A) is performed before (B), and the solid support in (B) It is a solid support to which the localization probe has been attached.
在某些实施方案中,所述(B)在所述(A)之前进行,所述(A)中的固相支持物是已经连接有所述捕获探针的固相支持物。In certain embodiments, (B) is performed before (A), and the solid support in (A) is a solid support to which the capture probe has been attached.
在某些实施方案中,所述(A)与所述(B)同时进行,所述(A)和所述(B)中的固相支持物是同一个固相支持物。In certain embodiments, (A) and (B) are performed simultaneously, and the solid phase support in (A) and (B) is the same solid phase support.
在某些实施方案中,步骤(A)包含:In certain embodiments, step (A) comprises:
(1)提供:(a)游离的所述定位探针,其中,所述定位探针修饰有分子或基团A;以及,(b)所述固相支持物,所述固相支持物表面修饰有分子或基团B,所述分子或基团A能够与所述分子或基团B形成连接(例如,共价或非共价连接);和,(1) providing: (a) the free positioning probe, wherein the positioning probe is modified with a molecule or group A; and, (b) the solid support, wherein the surface of the solid support is modified with a molecule or group B, wherein the molecule or group A can form a connection (e.g., a covalent or non-covalent connection) with the molecule or group B; and,
(2)在适于使所述分子或基团A与所述分子或基团B形成连接的条件下,将所述定位探针与所述固相支持物接触,从而获得连接有所述定位探针的固相支持物。(2) contacting the positioning probe with the solid support under conditions suitable for forming a connection between the molecule or group A and the molecule or group B, thereby obtaining a solid support to which the positioning probe is connected.
在某些实施方案中,步骤(A)进一步包括步骤(3):在连接有所述定位探针的固相支持物上对所述定位探针进行桥式扩增,从而获得每种定位探针的多拷贝簇。In certain embodiments, step (A) further comprises step (3): performing bridge amplification on the positioning probes on a solid support to which the positioning probes are attached, thereby obtaining multi-copy clusters of each positioning probe.
本领域技术人员易于理解,所述定位探针的任意部分均可用于修饰所述分子或基团A,只要其不影响所述定位探针行使其功能(例如,与目标序列的退火和/或3’末端起始延伸反应的功能)即可。在某些实施方案中,所述定位探针能够与目标序列退火并且所述定位探针的3’末端能够起始延伸反应,在这种情况下,所述定位探针的任意部分均可用于修饰所述分子或基团A,只要其不影响所述定位探针与目标序列的退火和/或3’末端起始延伸反应即可。在某些实施方案中,所述定位探针能够与目标序列退火并且所述定位探针的3’末端是封闭的,在这种情况下,所述定位探针的任意部分均可用于修饰所述分子或基团A,只要其不影响所述定位探针与目标序列的退火即可。在某些实施方案中,所述定位探针的所述定位序列没有修饰所述分子或基团A。在某些实施方案中,所述定位探针的所述第一通用序列没有修饰所述分子或基团A。在某些实施方案中,所述定位探针的所述第一通用序列和所述定位序列均没有修饰所述分子或基团A。在某些实施方案中,所述定位探针的所述第二通用序列修饰有所述分子或基团A。在某些实施方案中,所述定位探针的5’端(例如,5’末端)修饰有所述分子或基团A。It is easy for a person skilled in the art to understand that any part of the positioning probe can be used to modify the molecule or group A, as long as it does not affect the positioning probe to perform its function (e.g., annealing with the target sequence and/or the function of initiating an extension reaction at the 3' end). In certain embodiments, the positioning probe is capable of annealing to the target sequence and the 3' end of the positioning probe is capable of initiating an extension reaction. In this case, any part of the positioning probe can be used to modify the molecule or group A, as long as it does not affect the annealing of the positioning probe to the target sequence and/or the initiation of an extension reaction at the 3' end. In certain embodiments, the positioning probe is capable of annealing to the target sequence and the 3' end of the positioning probe is closed. In this case, any part of the positioning probe can be used to modify the molecule or group A, as long as it does not affect the annealing of the positioning probe to the target sequence. In certain embodiments, the positioning sequence of the positioning probe does not modify the molecule or group A. In certain embodiments, the first universal sequence of the positioning probe does not modify the molecule or group A. In certain embodiments, neither the first universal sequence nor the positioning sequence of the positioning probe modifies the molecule or group A. In certain embodiments, the second universal sequence of the localization probe is modified with the molecule or group A. In certain embodiments, the 5' end (e.g., the 5' terminus) of the localization probe is modified with the molecule or group A.
在某些实施方案中,所述分子或基团A能够与所述分子或基团B发生点击化学反应。In certain embodiments, the molecule or group A is capable of undergoing a click chemistry reaction with the molecule or group B.
在某些实施方案中,所述分子或基团A/B选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇、叠氮基/炔基、氰基/叠氮基、烯/胺、烯/硫醇、炔/硫醇、1,3/二醇-醛、1,3/二醇-酮。In certain embodiments, the molecules or groups A/B are selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol, azido/alkynyl, cyano/azido, ene/amine, ene/thiol, alkyne/thiol, 1,3/diol-aldehyde, 1,3/diol-ketone.
在某些实施方案中,所述分子或基团A/B为叠氮/炔基或炔基/叠氮。In certain embodiments, the molecules or groups A/B are azide/alkynyl or alkynyl/azide.
在某些实施方案中,所述分子或基团A为DBCO,所述分子或基团B为叠氮基。 In certain embodiments, the molecule or group A is DBCO and the molecule or group B is azide.
在某些实施方案中,所述定位探针修饰有所述固相支持物表面修饰有叠氮基,所述定位探针和所述固相支持物通过DBCO与叠氮基发生的点击化学反应形成连接。In certain embodiments, the localization probe is modified with The surface of the solid support is modified with an azide group, and the positioning probe and the solid support are connected through a click chemical reaction between DBCO and the azide group.
在某些实施方案中,步骤(A)包含:In certain embodiments, step (A) comprises:
(1)提供至少两种载体,每种载体包含至少一个拷贝的载体序列,所述载体序列包含:定位序列的互补序列以及第一通用序列的互补序列;所述第一通用序列、定位序列如上文所定义;其中,每种载体序列的定位序列的互补序列互不相同;(1) providing at least two vectors, each vector comprising at least one copy of a vector sequence, wherein the vector sequence comprises: a complementary sequence of a positioning sequence and a complementary sequence of a first universal sequence; the first universal sequence and the positioning sequence are as defined above; wherein the complementary sequence of the positioning sequence of each vector sequence is different from each other;
(2)将至少两种所述载体置于所述固相支持物表面;(2) placing at least two of the carriers on the surface of the solid support;
(3)提供固定引物,并以所述载体序列为模板,进行核酸聚合反应,生成延伸产物,所述延伸产物即为定位探针;其中,所述固定引物能与所述载体序列退火并起始延伸反应;和,(3) providing a fixed primer, and using the vector sequence as a template, performing a nucleic acid polymerization reaction to generate an extension product, wherein the extension product is a positioning probe; wherein the fixed primer can anneal with the vector sequence and initiate an extension reaction; and,
(4)将所述固定引物与所述固相支持物表面连接;(4) connecting the immobilized primer to the surface of the solid support;
其中,步骤(3)与(4)以任意顺序进行(例如,步骤(3)在步骤(4)之前或之后进行,或者,步骤(3)与步骤(4)同时进行)。Wherein, steps (3) and (4) are performed in any order (for example, step (3) is performed before or after step (4), or step (3) and step (4) are performed simultaneously).
在某些实施方案中,步骤(2)中,所述载体通过与固相支持物表面形成连接(例如,非共价连接或共价连接)从而被置于所述固相支持物表面。In certain embodiments, in step (2), the carrier is placed on the surface of the solid support by forming a connection (eg, a non-covalent connection or a covalent connection) with the surface of the solid support.
在某些实施方案中,所述载体序列从5’端至3’端的方向上依次包含:所述通用序列的互补序列,以及,所述定位序列的互补序列。In certain embodiments, the vector sequence comprises, in order from the 5' end to the 3' end: a complementary sequence to the universal sequence, and a complementary sequence to the positioning sequence.
在某些实施方案中,所述延伸产物从5’到3’的方向上依次包含:所述定位序列,以及所述第一通用序列。In certain embodiments, the extension product comprises, in order from 5' to 3', the positioning sequence, and the first universal sequence.
在某些实施方案中,每种载体是由多个拷贝的载体序列的多联体所形成的DNB。In certain embodiments, each vector is a DNB formed from concatemers of multiple copies of the vector sequence.
在某些实施方案中,所述方法任选地包括步骤(5):消化所述载体序列,和/或,使步骤(3)中的延伸产物和与其退火的载体序列相分离(例如,解链)。In certain embodiments, the method optionally comprises step (5): digesting the vector sequence, and/or separating (eg, melting) the extension product in step (3) from the vector sequence annealed thereto.
在某些实施方案中,所述载体序列还包含切割位点。在某些实施方案中,所述切割选自切刻酶(nicking enzyme)酶切、USER酶切、光切除、化学切除或CRISPR切除。在某些实施方案中,步骤(5)中,通过对所述载体序列所包含的切割位点进行切割,使得所述载体序列和与其退火的步骤(3)的延伸产物相分离。In some embodiments, the vector sequence further comprises a cleavage site. In some embodiments, the cleavage is selected from nicking enzyme cleavage, USER cleavage, photoexcision, chemical excision or CRISPR excision. In some embodiments, in step (5), the cleavage site contained in the vector sequence is cut so that the vector sequence and the extension product of step (3) annealed thereto are separated.
在某些实施方案中,步骤(1)中通过以下步骤提供所述载体:In certain embodiments, the vector is provided in step (1) by:
(i)提供载体模板序列,所述载体模板序列包含所述载体序列的互补序列; (i) providing a vector template sequence, wherein the vector template sequence comprises a complementary sequence to the vector sequence;
(ii)以载体模板序列为模板,进行核酸扩增反应,以获得载体模板序列的扩增产物,所述扩增产物包含至少一个拷贝的载体序列;在某些实施方案中,进行滚环复制,以获得由所述载体序列的多联体所形成的DNB。(ii) performing a nucleic acid amplification reaction using the vector template sequence as a template to obtain an amplification product of the vector template sequence, wherein the amplification product comprises at least one copy of the vector sequence; in certain embodiments, performing a rolling circle replication to obtain a DNB formed by concatemers of the vector sequence.
如本文所用,“DNB”(DNA nano ball,DNA纳米球)是一种典型的RCA(rolling circle amplification,RCA)产物,其具有RCA产物的特征。其中,所述RCA产物是一种多拷贝的单链DNA序列,因内部DNA序列的碱基间的相互作用力,可以形成类似“球形“结构。典型地,文库分子环化形成单链环状DNA,随后使用滚环扩增技术可将单链环状DNA扩增多个数量级,所产生的扩增产物称为DNB。As used herein, "DNB" (DNA nano ball) is a typical RCA (rolling circle amplification, RCA) product, which has the characteristics of an RCA product. The RCA product is a multi-copy single-stranded DNA sequence, which can form a "spherical" structure due to the interaction between the bases of the internal DNA sequence. Typically, the library molecule is circularized to form a single-stranded circular DNA, and then the single-stranded circular DNA can be amplified by multiple orders of magnitude using the rolling circle amplification technique, and the resulting amplification product is called DNB.
在某些实施方案中,所述载体序列进一步包含第二通用序列或其部分序列的互补序列(例如,所述载体序列进一步包含所述第二通用序列或其3’端部分序列的互补序列),所述第二通用序列如上文所定义。In certain embodiments, the vector sequence further comprises a complementary sequence to a second universal sequence or a partial sequence thereof (e.g., the vector sequence further comprises a complementary sequence to the second universal sequence or a partial sequence at its 3' end), and the second universal sequence is as defined above.
在某些实施方案中,所述第二通用序列或其部分序列的互补序列位于所述定位序列的互补序列的3’端。In certain embodiments, the complementary sequence of the second universal sequence or a partial sequence thereof is located 3' to the complementary sequence of the positioning sequence.
在某些实施方案中,所述固定引物包含所述第二通用序列或其部分序列(例如,所述固定引物包含所述第二通用序列或其5’端部分序列)。In certain embodiments, the fixed primer comprises the second universal sequence or a partial sequence thereof (e.g., the fixed primer comprises the second universal sequence or a partial sequence at the 5' end thereof).
在某些实施方案中,所述延伸产物从5’到3’的方向上依次包含:所述第二通用序列,所述定位序列,以及所述第一通用序列。In certain embodiments, the extension product comprises, in order from 5' to 3' direction: the second universal sequence, the positioning sequence, and the first universal sequence.
在某些实施方案中,所述载体序列不包含或者进一步包含MID序列的模板序列。In certain embodiments, the vector sequence does not comprise or further comprises a template sequence for a MID sequence.
在某些实施方案中,所述载体序列进一步包含MID序列的模板序列。在某些实施方案中,所述MID序列的模板序列位于所述第一通用序列的互补序列的3’端,和/或,所述MID序列的模板序列位于所述第二通用序列或其部分序列的互补序列的5’端。在某些实施方案中,所述MID序列由5-50个(例如5-25个、5-35个、5-45个、10-25个、10-35个、10-45个、15-25个、15-35个、15-45个、20-25个、20-35个或20-45个)简并脱氧核糖核苷酸残基组成。In certain embodiments, the vector sequence further comprises a template sequence of a MID sequence. In certain embodiments, the template sequence of the MID sequence is located at the 3' end of the complementary sequence of the first universal sequence, and/or, the template sequence of the MID sequence is located at the 5' end of the complementary sequence of the second universal sequence or a partial sequence thereof. In certain embodiments, the MID sequence consists of 5-50 (e.g., 5-25, 5-35, 5-45, 10-25, 10-35, 10-45, 15-25, 15-35, 15-45, 20-25, 20-35 or 20-45) degenerate deoxyribonucleotide residues.
在某些实施方案中,所述固定引物与所述固相支持物共价连接或者非共价连接。In certain embodiments, the immobilized primer is covalently or non-covalently attached to the solid support.
在某些实施方案中,所述固定引物与所述固相支持物共价连接。In certain embodiments, the immobilized primer is covalently linked to the solid support.
在某些实施方案中,所述固定引物通过点击化学反应与所述固相支持物形成连接。In certain embodiments, the immobilized primer is linked to the solid support via a click chemistry reaction.
在某些实施方案中,能够发生所述点击化学反应的分子或基团对选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇。在某些实施方案中,能够发生所述点击化学反应的分子或基团对为叠氮基/炔基。 In certain embodiments, the molecule or group pair capable of the click chemistry reaction is selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol. In certain embodiments, the molecule or group pair capable of the click chemistry reaction is azido/alkynyl.
在某些实施方案中,所述固定引物修饰有所述固相支持物表面修饰有叠氮基,所述固定引物和所述固相支持物通过DBCO与叠氮基发生的点击化学反应形成连接。In certain embodiments, the fixed primer is modified with The surface of the solid phase support is modified with an azide group, and the fixed primer and the solid phase support are connected through a click chemical reaction between DBCO and the azide group.
在某些实施方案中,所述捕获探针以包含含有捕获序列的单链区域的单链形式存在,其中,所述捕获序列如上文所定义,并且,所述步骤(B)包括:In certain embodiments, the capture probe is present in a single-stranded form comprising a single-stranded region comprising a capture sequence, wherein the capture sequence is as defined above, and step (B) comprises:
(1)提供:(a)游离的所述捕获探针,其中,所述捕获探针修饰有分子或基团A’;以及,(b)所述固相支持物,所述固相支持物表面修饰有分子或基团B’,所述分子或基团A’能够与所述分子或基团B’形成连接(例如,共价或非共价连接);和,(1) providing: (a) the free capture probe, wherein the capture probe is modified with a molecule or group A'; and, (b) the solid support, wherein the surface of the solid support is modified with a molecule or group B', wherein the molecule or group A' is capable of forming a connection (e.g., covalent or non-covalent connection) with the molecule or group B'; and,
(2)在适于使所述分子或基团A’与所述分子或基团B’形成连接的条件下,将所述捕获探针与所述固相支持物接触,从而获得连接有所述捕获探针的固相支持物。(2) contacting the capture probe with the solid support under conditions suitable for forming a connection between the molecule or group A' and the molecule or group B', thereby obtaining a solid support to which the capture probe is connected.
本领域技术人员易于理解,所述捕获探针的任意部分均可用于修饰所述分子或基团A’,只要其不影响所述捕获探针行使其功能(例如,与目标序列的退火以及3’末端起始延伸反应的功能)即可。在某些实施方案中,所述捕获探针的所述捕获序列没有修饰所述分子或基团A’。在某些实施方案中,所述捕获探针的5’端(例如,5’末端)修饰有所述分子或基团A’。It is easy for a person skilled in the art to understand that any part of the capture probe can be used to modify the molecule or group A', as long as it does not affect the function of the capture probe (e.g., annealing with the target sequence and the function of the 3' end initiation extension reaction). In some embodiments, the capture sequence of the capture probe does not modify the molecule or group A'. In some embodiments, the 5' end (e.g., 5' end) of the capture probe is modified with the molecule or group A'.
在某些实施方案中,所述捕获序列位于所述捕获探针的3’末端。In certain embodiments, the capture sequence is located at the 3' end of the capture probe.
在某些实施方案中,所述捕获探针以含有包含捕获序列的单链区域的双链体形式存在,其中,所述捕获序列如上文所定义,并且,所述步骤(B)包括:In certain embodiments, the capture probe is present in a duplex form containing a single-stranded region comprising a capture sequence, wherein the capture sequence is as defined above, and step (B) comprises:
(I)(1)提供:(a)游离的所述捕获探针的第一链,所述第一链修饰有分子或基团A’;以及,(b)所述固相支持物,所述固相支持物表面修饰了分子或基团B’,所述分子或基团A’能够与所述分子或基团B’之间形成连接(例如,共价和/或非共价连接);以及,(c)游离的所述捕获探针的所述第二链,所述第二链包含所述捕获序列,并能与所述第一链退火形成双链体;(I)(1) providing: (a) a free first strand of the capture probe, the first strand being modified with a molecule or group A'; and, (b) the solid support, the surface of the solid support being modified with a molecule or group B', the molecule or group A' being capable of forming a connection (e.g., covalent and/or non-covalent connection) with the molecule or group B'; and, (c) the free second strand of the capture probe, the second strand comprising the capture sequence and being capable of annealing with the first strand to form a duplex;
(2)在适于使所述分子或基团A’与所述分子或基团B’形成连接的条件下,将所述第一链与所述固相支持物接触,从而获得连接有所述第一链的固相支持物;和,(2) contacting the first chain with the solid support under conditions suitable for forming a connection between the molecule or group A' and the molecule or group B', thereby obtaining a solid support to which the first chain is connected; and,
(3)在允许退火的条件下,将所述游离的第二链与步骤(2)形成的连接有所述第一链的固相支持物接触,从而获得连接有所述捕获探针的固相支持物;(3) contacting the free second strand with the solid support connected to the first strand formed in step (2) under conditions allowing annealing, thereby obtaining a solid support connected to the capture probe;
或者,or,
(II)(1)提供:(a)由所述捕获探针的第一链与第二链退火形成的双链体,所述第一链连接有分子或基团A’,所述第二链含有所述捕获序列;以及,(b)固相支持物,所 述固相支持物表面修饰了分子或基团B’,所述分子或基团A’能够与所述分子或基团B’之间形成连接(例如,共价和/或非共价连接);和,(II)(1) providing: (a) a duplex formed by annealing a first strand and a second strand of the capture probe, wherein the first strand is connected to a molecule or group A', and the second strand contains the capture sequence; and (b) a solid support, wherein the The surface of the solid support is modified with a molecule or group B', and the molecule or group A' can form a connection (eg, covalent and/or non-covalent connection) with the molecule or group B'; and,
(2)在适于使所述分子或基团A’与所述分子或基团B’形成连接的条件下,将所述双链体与所述固相支持物接触,从而获得连接有所述捕获探针的固相支持物。(2) contacting the duplex with the solid support under conditions suitable for forming a connection between the molecule or group A' and the molecule or group B', thereby obtaining a solid support connected with the capture probe.
在某些实施方案中,所述第一链的任意部分均可用于与所述固相支持物进行连接,只要所述连接不影响所述第一链行使其功能(例如,与所述第二链退火的功能)即可。在某些实施方案中,所述第一链中未与所述第二链退火的部分修饰有所述分子或基团A’。在某些实施方案中,所述第一链的末端(例如,5’末端或3’末端)修饰有所述分子或基团A’。In certain embodiments, any portion of the first strand can be used to connect to the solid support, as long as the connection does not affect the first strand in performing its function (e.g., the function of annealing with the second strand). In certain embodiments, the portion of the first strand that is not annealed to the second strand is modified with the molecule or group A'. In certain embodiments, the end (e.g., 5' end or 3' end) of the first strand is modified with the molecule or group A'.
在某些实施方案中,所述第二链不与所述固相支持物直接连接。In certain embodiments, the second strand is not directly attached to the solid support.
在某些实施方案中,所述捕获探针的所述第一链和所述第二链退火形成的双链体中,所述第二链的3’末端包含所述捕获序列,并且所述捕获序列不与所述第一链发生退火。In certain embodiments, in the duplex formed by annealing of the first strand and the second strand of the capture probe, the 3' end of the second strand comprises the capture sequence, and the capture sequence does not anneal to the first strand.
在某些实施方案中,所述分子或基团A’能够与所述分子或基团B’发生点击化学反应。In certain embodiments, the molecule or group A' is capable of undergoing a click chemistry reaction with the molecule or group B'.
在某些实施方案中,所述分子或基团A’/B’选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇、叠氮基/炔基、氰基/叠氮基、烯/胺、烯/硫醇、炔/硫醇、1,3-二醇/醛、1,3-二醇/酮。In certain embodiments, the molecules or groups A'/B' are selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol, azido/alkynyl, cyano/azido, ene/amine, ene/thiol, alkyne/thiol, 1,3-diol/aldehyde, 1,3-diol/ketone.
在某些实施方案中,所述分子或基团A’/B’为叠氮/炔基或炔基/叠氮。In certain embodiments, the molecules or groups A'/B' are azide/alkynyl or alkynyl/azide.
在某些实施方案中,所述分子或基团A’为DBCO,所述分子或基团B’为叠氮基。In certain embodiments, the molecule or group A' is DBCO and the molecule or group B' is azido.
在某些实施方案中,所述捕获探针的所述第一链修饰有所述固相支持物表面修饰有叠氮基,所述捕获探针的所述第一链和所述固相支持物通过DBCO与叠氮基发生的点击化学反应形成连接。In certain embodiments, the first strand of the capture probe is modified with The surface of the solid support is modified with an azide group, and the first chain of the capture probe and the solid support are connected through a click chemical reaction between DBCO and the azide group.
在某些实施方案中,所述固相支持物具有选自下列的一个或多个特征:In certain embodiments, the solid support has one or more characteristics selected from the group consisting of:
(1)所述固相支持物选自乳胶珠、葡聚糖珠、聚苯乙烯表面、聚丙烯表面、聚丙烯酰胺凝胶、金表面、玻璃表面、芯片、传感器、电极和硅晶片;在某些实施方案中,所述固相支持物是芯片;(1) The solid support is selected from the group consisting of latex beads, dextran beads, polystyrene surfaces, polypropylene surfaces, polyacrylamide gels, gold surfaces, glass surfaces, chips, sensors, electrodes, and silicon wafers; in certain embodiments, the solid support is a chip;
(2)所述固相支持物为平面的、球形的或多孔的;(2) The solid support is planar, spherical or porous;
(3)所述固相支持物能够用于测序平台;在某些实施方案中,所述固相支持物是用于IllμMina、MGI或Thermo Fisher测序平台的测序芯片;和(3) the solid support can be used in a sequencing platform; in certain embodiments, the solid support is a sequencing chip for use in an IllμMina, MGI or Thermo Fisher sequencing platform; and
(4)所述固相支持物能够自发地或在暴露于一种或多种刺激(例如,温度变化、pH变化、暴露于特定化学物质或相、暴露于光、还原剂等)时释放所述定位探针和/或捕获 探针。(4) The solid support is capable of releasing the localization probe and/or capture probe spontaneously or upon exposure to one or more stimuli (e.g., temperature change, pH change, exposure to a specific chemical or phase, exposure to light, a reducing agent, etc.). Probe.
在某些实施方案中,同种所述定位探针(也即,包含相同定位序列的定位探针)占据所述固相支持物表面同一个区域,不同种所述定位探针占据所述支持物表面的不同区域,并且,任意相邻近的两个所述区域之间的中心距离小于1μm(例如,小于900nm、小于800nm、小于700nm、小于600nm、小于500nm、小于400nm、小于300nm、小于200nm)。In certain embodiments, the same type of positioning probes (i.e., positioning probes comprising the same positioning sequence) occupy the same area on the surface of the solid support, different types of positioning probes occupy different areas on the surface of the support, and the center distance between any two adjacent areas is less than 1 μm (e.g., less than 900 nm, less than 800 nm, less than 700 nm, less than 600 nm, less than 500 nm, less than 400 nm, less than 300 nm, less than 200 nm).
在某些实施方案中,所述捕获探针和所述定位探针在所述固相支持物上的分布按照如下方式进行设置:与所述捕获探针连接的核酸分子(例如,与所述捕获探针退火的核酸分子,或者,由所述捕获探针经核酸聚合反应延伸获得的核酸分子)能够和与其邻近的定位探针相接触。In certain embodiments, the distribution of the capture probe and the positioning probe on the solid support is configured as follows: the nucleic acid molecule connected to the capture probe (for example, a nucleic acid molecule annealed to the capture probe, or a nucleic acid molecule obtained by extending the capture probe through a nucleic acid polymerization reaction) is able to contact the positioning probe adjacent to it.
在某些实施方案中,每种所述定位探针的邻近分布有至少100个所述捕获探针。例如,至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1000个、至少1500个、至少2000个、至少2500个、至少3000个、至少3500个、至少4000个、至少4500个、至少5000个、至少5500个、至少6000个、至少6500个、至少7000个、至少7500个、至少8000个、至少8500个、至少9000个、至少9500个或至少10000个所述捕获探针。In certain embodiments, each of the positioning probes is adjacent to at least 100 capture probes. For example, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 3500, at least 4000, at least 4500, at least 5000, at least 5500, at least 6000, at least 6500, at least 7000, at least 7500, at least 8000, at least 8500, at least 9000, at least 9500 or at least 10000 capture probes.
在某些实施方案中,每种所述定位探针的邻近分布有500-10000个(例如,1000-1500个、1000-2000个、1000-3000个、1000-4000个、1000-5000个、1000-8000个、1000-10000个)所述捕获探针。In certain embodiments, there are 500-10,000 (eg, 1,000-1,500, 1,000-2,000, 1,000-3,000, 1,000-4,000, 1,000-5,000, 1,000-8,000, 1,000-10,000) capture probes distributed adjacent to each of the localization probes.
在某些实施方案中,表述“每种所述定位探针的邻近”表示固相支持物表面该种定位探针附近的区域,其中,与分布于该区域的捕获探针相连接的核酸分子(例如,与所述捕获探针退火的核酸分子,或者,由所述捕获探针经核酸聚合反应延伸获得的核酸分子)能够与该种定位探针相接触。In certain embodiments, the expression "adjacent to each of the positioning probes" refers to an area near the positioning probe on the surface of a solid support, wherein nucleic acid molecules connected to capture probes distributed in the area (e.g., nucleic acid molecules annealed to the capture probes, or nucleic acid molecules obtained by extending the capture probes through nucleic acid polymerization reaction) are able to contact the positioning probes.
在某些实施方案中,每种所述定位探针在所述固相支持物所占位置的中心(例如,每种所述定位探针在所述所述固相支持物表面所占据的区域的中心)周围半径200nm-300nm的范围内分布有至少100个所述捕获探针。例如,至少100个、至少200个、至少300个、400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1000个、至少1500个、至少2000个、至少2500个、至少3000个、至少3500个、至少4000个、至少4500个、至少5000个、至少5500个、至少6000个、至少6500个、至少7000个、至少7500个、至少8000个、至少8500个、至少9000个、至少9500个或至 少10000个所述捕获探针。In certain embodiments, at least 100 capture probes are distributed within a radius of 200 nm to 300 nm around the center of the position occupied by each positioning probe on the solid support (e.g., the center of the area occupied by each positioning probe on the surface of the solid support). For example, at least 100, at least 200, at least 300, 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 3500, at least 4000, at least 4500, at least 5000, at least 5500, at least 6000, at least 6500, at least 7000, at least 7500, at least 8000, at least 8500, at least 9000, at least 9500 or more. At least 10,000 of the capture probes.
在某些实施方案中,每种所述定位探针在所述固相支持物所占位置的中心(例如,每种所述定位探针在所述固相支持物表面所占据的区域的中心)周围半径200nm-300nm的范围内分布有500-10000个(例如,1000-1500个、1000-2000个、1000-3000个、1000-4000个、1000-5000个、1000-8000个、1000-10000个)所述捕获探针。In certain embodiments, there are 500-10000 (e.g., 1000-1500, 1000-2000, 1000-3000, 1000-4000, 1000-5000, 1000-8000, 1000-10000) capture probes distributed within a radius of 200nm-300nm around the center of the position occupied by each of the positioning probes on the solid support (e.g., the center of the area occupied by each of the positioning probes on the surface of the solid support).
核酸阵列制备方法Nucleic acid array preparation method
在另一方面,本申请提供了制备如上所述的核酸阵列的方法,其包括以下步骤:In another aspect, the present application provides a method for preparing a nucleic acid array as described above, comprising the following steps:
(A)提供待处理核酸阵列,所述待处理核酸阵列包括固相支持物,所述固相支持物连接有至少两种寡核苷酸分子;(A) providing a nucleic acid array to be processed, wherein the nucleic acid array to be processed comprises a solid support, and the solid support is connected with at least two oligonucleotide molecules;
每种寡核苷酸分子在所述固相支持物上占据不同位置;Each oligonucleotide molecule occupies a different position on the solid support;
所述寡核苷酸分子从5’端至3’端依次含有定位序列和第一通用序列,其中,所述定位序列具有与该种寡核苷酸分子在固相支持物的位置相对应的核苷酸序列;所述第一通用序列如上文所定义;The oligonucleotide molecule contains a positioning sequence and a first universal sequence in sequence from the 5' end to the 3' end, wherein the positioning sequence has a nucleotide sequence corresponding to the position of the oligonucleotide molecule on the solid support; the first universal sequence is as defined above;
和,and,
(B)在所述待处理核酸阵列所包含的固相支持物上连接和/或合成捕获探针,所述捕获探针如上文所定义。(B) attaching and/or synthesizing capture probes on the solid support contained in the nucleic acid array to be processed, wherein the capture probes are as defined above.
在某些实施方案中,每种寡核苷酸分子由一个或多个寡核苷酸分子组成。In certain embodiments, each oligonucleotide molecule consists of one or more oligonucleotide molecules.
在某些实施方案中,同种寡核苷酸分子含有的定位序列彼此相同,不同种寡核苷酸分子含有的定位序列彼此不同。In certain embodiments, the localization sequences contained in the same oligonucleotide molecules are identical to each other, and the localization sequences contained in different oligonucleotide molecules are different from each other.
本领域技术人员易于理解,同种寡核苷酸分子的一个或多个寡核苷酸分子具有相同的定位序列,然而,不需要每种寡核苷酸分子的每一个寡核苷酸分子都具有相同的完整序列。It is easy for a person skilled in the art to understand that one or more oligonucleotide molecules of the same oligonucleotide molecule have the same positioning sequence, however, it is not necessary for every oligonucleotide molecule of each oligonucleotide molecule to have the same complete sequence.
在某些实施方案中,所述寡核苷酸分子在所述第一通用序列的3’端不包含捕获序列,所述捕获序列如上文所定义。In certain embodiments, the oligonucleotide molecule does not comprise a capture sequence at the 3' end of the first universal sequence, the capture sequence being as defined above.
在某些实施方案中,所述寡核苷酸分子在所述第一通用序列的3’端包含捕获序列,所述捕获序列如上文所定义,并且,所述方法进一步包括步骤(B’):去除所述寡核苷酸分子所含有的所述捕获序列;In certain embodiments, the oligonucleotide molecule comprises a capture sequence at the 3′ end of the first universal sequence, the capture sequence being as defined above, and the method further comprises step (B′): removing the capture sequence contained in the oligonucleotide molecule;
其中,所述步骤(B)和(B’)可以以任意顺序进行或同时进行(例如,在同一反应体系中同时进行)。Wherein, the steps (B) and (B') can be carried out in any order or simultaneously (for example, simultaneously in the same reaction system).
在某些实施方案中,步骤(B’)中通过包括以下的步骤去除所述寡核苷酸分子所含有的所述捕获序列:In certain embodiments, step (B') removes the capture sequence contained in the oligonucleotide molecule by the steps comprising:
(i)提供封闭探针,所述封闭探针能够与(a)所述寡核苷酸分子的所述第一通用序 列或其部分序列(例如,所述第一通用序列的3’端部分序列),或者,(b)所述寡核苷酸分子中位于所述捕获序列的5’端且位于所述第一通用序列的3’端的序列,或者,(c)(a)和(b)的组合,退火;(i) providing a blocking probe, wherein the blocking probe is capable of binding to the first universal sequence of the oligonucleotide molecule of (a) (a) a sequence or a partial sequence thereof (e.g., a partial sequence at the 3' end of the first universal sequence), or (b) a sequence in the oligonucleotide molecule that is located at the 5' end of the capture sequence and at the 3' end of the first universal sequence, or (c) a combination of (a) and (b), annealing;
(ii)在适于使所述封闭探针与所述寡核苷酸分子退火的条件下,使所述封闭探针与含有所述寡核苷酸分子的所述待处理核酸阵列接触;(ii) contacting the blocking probe with the nucleic acid array to be processed containing the oligonucleotide molecules under conditions suitable for annealing the blocking probe with the oligonucleotide molecules;
(iii)在允许核酸外切酶发挥其切割活性的条件下,将核酸外切酶与步骤(ii)的产物接触;其中,所述核酸外切酶具有单链核酸3’至5’端外切活性。(iii) contacting the product of step (ii) with an exonuclease under conditions that allow the exonuclease to exert its cleavage activity; wherein the exonuclease has exonuclease activity from the 3' to 5' end of a single-stranded nucleic acid.
在某些实施方案中,所述核酸外切酶为核酸外切酶I(Exonuclease I)。In some embodiments, the exonuclease is exonuclease I.
在某些实施方案中,所述步骤(B’)进一步包括去除所述封闭探针的步骤(例如,通过使所述封闭探针与所述寡核苷酸分子解链从而去除所述封闭探针)。In certain embodiments, step (B') further comprises the step of removing the blocking probe (e.g., removing the blocking probe by unzipping the blocking probe from the oligonucleotide molecule).
在某些实施方案中,所述方法中,步骤(B)在所述步骤(B’)之后进行。In certain embodiments, in the method, step (B) is performed after step (B').
在某些实施方案中,所述寡核苷酸分子进一步包含第二通用序列,所述第二通用序列如上文所定义。In certain embodiments, the oligonucleotide molecule further comprises a second universal sequence, the second universal sequence being as defined above.
在某些实施方案中,所述第二通用序列位于所述定位序列的5’端。In certain embodiments, the second universal sequence is located 5' to the localization sequence.
在某些实施方案中,所述寡核苷酸分子不包含或者进一步包含MID序列。In certain embodiments, the oligonucleotide molecule does not comprise or further comprises a MID sequence.
在某些实施方案中,所述寡核苷酸分子进一步包含MID序列。在某些实施方案中,所述MID序列位于所述第一通用序列的5’端,和/或,所述MID序列位于所述第二通用序列的3’端。在某些实施方案中,同种寡核苷酸分子所包含的MID序列彼此不同。In certain embodiments, the oligonucleotide molecule further comprises a MID sequence. In certain embodiments, the MID sequence is located at the 5' end of the first universal sequence, and/or, the MID sequence is located at the 3' end of the second universal sequence. In certain embodiments, the MID sequences comprised by the same oligonucleotide molecule are different from each other.
在某些实施方案中,所述待处理核酸阵列中,所述寡核苷酸分子与所述固相支持物共价连接或者非共价连接。In certain embodiments, in the nucleic acid array to be processed, the oligonucleotide molecules are covalently or non-covalently linked to the solid support.
在某些实施方案中,所述寡核苷酸分子与所述固相支持物共价连接。In certain embodiments, the oligonucleotide molecule is covalently linked to the solid support.
在某些实施方案中,所述寡核苷酸分子通过点击化学反应与所述固相支持物形成连接。In certain embodiments, the oligonucleotide molecule is linked to the solid support via a click chemistry reaction.
在某些实施方案中,能够发生所述点击化学反应的分子或基团对选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇。在某些实施方案中,能够发生所述点击化学反应的分子或基团对为叠氮基/炔基。In certain embodiments, the molecule or group pair capable of the click chemistry reaction is selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol. In certain embodiments, the molecule or group pair capable of the click chemistry reaction is azido/alkynyl.
在某些实施方案中,所述固相支持物表面修饰有叠氮基团,所述寡核苷酸分子修饰有所述寡核苷酸分子通过所述叠氮基团与所述DBCO发生的点击化学反应与所述固相支持物形成连接。In certain embodiments, the surface of the solid support is modified with an azide group, and the oligonucleotide molecule is modified with The oligonucleotide molecule is connected to the solid support through a click chemistry reaction between the azide group and the DBCO.
在某些实施方案中,所述捕获探针以包含含有捕获序列的单链区域的单链形式存在,其中,所述捕获序列如上文所定义,并且,所述步骤(B)包括: In certain embodiments, the capture probe is present in a single-stranded form comprising a single-stranded region comprising a capture sequence, wherein the capture sequence is as defined above, and step (B) comprises:
(1)提供:(a)游离的所述捕获探针,其中,所述捕获探针修饰有分子或基团A’;以及,(b)所述待处理核酸阵列或者经步骤(B’)处理后的核酸阵列,所述核酸阵列的固相支持物表面修饰有分子或基团B’,所述分子或基团A’能够与所述分子或基团B’形成连接(例如,共价或非共价连接);和,(1) providing: (a) the free capture probe, wherein the capture probe is modified with a molecule or group A'; and, (b) the nucleic acid array to be treated or the nucleic acid array treated in step (B'), wherein the solid support surface of the nucleic acid array is modified with a molecule or group B', and the molecule or group A' can form a connection (e.g., covalent or non-covalent connection) with the molecule or group B'; and,
(2)在适于使所述分子或基团A’与所述分子或基团B’形成连接的条件下,将所述捕获探针与所述固相支持物接触,从而获得连接有所述捕获探针的固相支持物。(2) contacting the capture probe with the solid support under conditions suitable for forming a connection between the molecule or group A' and the molecule or group B', thereby obtaining a solid support to which the capture probe is connected.
本领域技术人员易于理解,所述捕获探针的任意部分均可用于修饰所述分子或基团A’,只要其不影响所述捕获探针行使其功能(例如,与目标序列的退火以及3’末端起始延伸反应的功能)即可。在某些实施方案中,所述捕获探针的所述捕获序列没有修饰所述分子或基团A’。在某些实施方案中,所述捕获探针的5’端(例如,5’末端)修饰有所述分子或基团A’。It is easy for a person skilled in the art to understand that any part of the capture probe can be used to modify the molecule or group A', as long as it does not affect the function of the capture probe (e.g., annealing with the target sequence and the function of the 3' end initiation extension reaction). In some embodiments, the capture sequence of the capture probe does not modify the molecule or group A'. In some embodiments, the 5' end (e.g., 5' end) of the capture probe is modified with the molecule or group A'.
在某些实施方案中,所述捕获序列位于所述捕获探针的3’末端。In certain embodiments, the capture sequence is located at the 3' end of the capture probe.
在某些实施方案中,所述捕获探针以含有包含捕获序列的单链区域的双链体形式存在,其中,所述捕获序列如上文所定义,并且,所述步骤(B)包括:In certain embodiments, the capture probe is present in a duplex form containing a single-stranded region comprising a capture sequence, wherein the capture sequence is as defined above, and step (B) comprises:
(I)(1)提供:(a)游离的所述捕获探针的第一链,所述第一链修饰有分子或基团A’;以及,(b)所述待处理核酸阵列或者经步骤(B’)处理后的核酸阵列,所述核酸阵列的固相支持物表面修饰了分子或基团B’,所述分子或基团A’能够与所述分子或基团B’之间形成连接(例如,共价和/或非共价连接);以及,(c)游离的所述捕获探针的所述第二链,所述第二链包含所述捕获序列,并能与所述第一链退火形成双链体;(I)(1) providing: (a) a free first strand of the capture probe, wherein the first strand is modified with a molecule or group A'; and, (b) the nucleic acid array to be treated or the nucleic acid array treated in step (B'), wherein the solid support surface of the nucleic acid array is modified with a molecule or group B', wherein the molecule or group A' is capable of forming a connection (e.g., covalent and/or non-covalent connection) with the molecule or group B'; and, (c) the free second strand of the capture probe, wherein the second strand comprises the capture sequence and is capable of annealing with the first strand to form a duplex;
(2)在适于使所述分子或基团A’与所述分子或基团B’形成连接的条件下,将所述第一链与所述固相支持物接触,从而获得连接有所述第一链的固相支持物;和,(2) contacting the first chain with the solid support under conditions suitable for forming a connection between the molecule or group A' and the molecule or group B', thereby obtaining a solid support to which the first chain is connected; and,
(3)在允许退火的条件下,将所述游离的第二链与步骤(2)形成的连接有所述第一链的固相支持物接触,从而获得连接有所述捕获探针的固相支持物;(3) contacting the free second strand with the solid support connected to the first strand formed in step (2) under conditions allowing annealing, thereby obtaining a solid support connected to the capture probe;
或者,or,
(II)(1)提供:(a)由所述捕获探针的第一链与第二链退火形成的双链体,所述第一链连接有分子或基团A’,所述第二链含有所述捕获序列;以及,(b)所述待处理核酸阵列或者经步骤(B’)处理后的核酸阵列,所述核酸阵列的固相支持物表面修饰了分子或基团B’,所述分子或基团A’能够与所述分子或基团B’之间形成连接(例如,共价和/或非共价连接);和,(II)(1) providing: (a) a duplex formed by annealing the first strand and the second strand of the capture probe, wherein the first strand is connected to a molecule or group A', and the second strand contains the capture sequence; and, (b) the nucleic acid array to be treated or the nucleic acid array treated in step (B'), wherein the solid support surface of the nucleic acid array is modified with a molecule or group B', and the molecule or group A' can form a connection (e.g., covalent and/or non-covalent connection) with the molecule or group B'; and,
(2)在适于使所述分子或基团A’与所述分子或基团B’形成连接的条件下,将所述双链体与所述固相支持物接触,从而获得连接有所述捕获探针的固相支持物。(2) contacting the duplex with the solid support under conditions suitable for forming a connection between the molecule or group A' and the molecule or group B', thereby obtaining a solid support connected with the capture probe.
在某些实施方案中,所述第一链的任意部分均可用于与所述固相支持物进行连 接,只要所述连接不影响所述第一链行使其功能(例如,与所述第二链退火的功能)即可。在某些实施方案中,所述第一链中未与所述第二链退火的部分修饰有所述分子或基团A’。在某些实施方案中,所述第一链的末端(例如,5’末端或3’末端)修饰有所述分子或基团A’。In certain embodiments, any portion of the first strand can be used to bind to the solid support. In some embodiments, the first strand is connected to the second strand, as long as the connection does not affect the first strand to perform its function (e.g., the function of annealing with the second strand). In some embodiments, the portion of the first strand that is not annealed to the second strand is modified with the molecule or group A'. In some embodiments, the end (e.g., 5' end or 3' end) of the first strand is modified with the molecule or group A'.
在某些实施方案中,所述第二链不与所述固相支持物直接连接。In certain embodiments, the second strand is not directly attached to the solid support.
在某些实施方案中,所述捕获探针的所述第一链和所述第二链退火形成的双链体中,所述第二链的3’末端包含所述捕获序列,并且所述捕获序列不与所述第一链发生退火。In certain embodiments, in the duplex formed by annealing of the first strand and the second strand of the capture probe, the 3' end of the second strand comprises the capture sequence, and the capture sequence does not anneal to the first strand.
在某些实施方案中,所述分子或基团A’能够与所述分子或基团B’发生点击化学反应。In certain embodiments, the molecule or group A' is capable of undergoing a click chemistry reaction with the molecule or group B'.
在某些实施方案中,所述分子或基团A’/B’选自:炔基/叠氮基、叠氮基/氰基、胺/烯、硫醇/烯、硫醇/炔、醛/1,3-二醇、酮/1,3-二醇、叠氮基/炔基、氰基/叠氮基、烯/胺、烯/硫醇、炔/硫醇、1,3-二醇/醛、1,3-二醇/酮。In certain embodiments, the molecules or groups A'/B' are selected from: alkynyl/azido, azido/cyano, amine/ene, thiol/ene, thiol/alkyne, aldehyde/1,3-diol, ketone/1,3-diol, azido/alkynyl, cyano/azido, ene/amine, ene/thiol, alkyne/thiol, 1,3-diol/aldehyde, 1,3-diol/ketone.
在某些实施方案中,所述分子或基团A’/B’为叠氮/炔基或炔基/叠氮。In certain embodiments, the molecules or groups A'/B' are azide/alkynyl or alkynyl/azide.
在某些实施方案中,所述分子或基团A’为DBCO,所述分子或基团B’为叠氮基。In certain embodiments, the molecule or group A' is DBCO and the molecule or group B' is azido.
在某些实施方案中,所述捕获探针的所述第一链修饰有所述固相支持物表面修饰有叠氮基,所述捕获探针的所述第一链和所述固相支持物通过DBCO与叠氮基发生的点击化学反应形成连接。In certain embodiments, the first strand of the capture probe is modified with The surface of the solid support is modified with an azide group, and the first chain of the capture probe and the solid support are connected through a click chemical reaction between DBCO and the azide group.
在某些实施方案中,所述固相支持物具有选自下列的一个或多个特征:In certain embodiments, the solid support has one or more characteristics selected from the group consisting of:
(1)所述固相支持物选自乳胶珠、葡聚糖珠、聚苯乙烯表面、聚丙烯表面、聚丙烯酰胺凝胶、金表面、玻璃表面、芯片、传感器、电极和硅晶片;在某些实施方案中,所述固相支持物是芯片;(1) The solid support is selected from the group consisting of latex beads, dextran beads, polystyrene surfaces, polypropylene surfaces, polyacrylamide gels, gold surfaces, glass surfaces, chips, sensors, electrodes, and silicon wafers; in certain embodiments, the solid support is a chip;
(2)所述固相支持物为平面的、球形的或多孔的;(2) The solid support is planar, spherical or porous;
(3)所述固相支持物能够用于测序平台;在某些实施方案中,所述固相支持物是用于IllμMina、MGI或Thermo Fisher测序平台的测序芯片;和(3) the solid support can be used in a sequencing platform; in certain embodiments, the solid support is a sequencing chip for use in an IllμMina, MGI or Thermo Fisher sequencing platform; and
(4)所述固相支持物能够自发地或在暴露于一种或多种刺激(例如,温度变化、pH变化、暴露于特定化学物质或相、暴露于光、还原剂等)时释放所述定位探针和/或捕获探针。(4) The solid support is capable of releasing the localization probe and/or capture probe spontaneously or upon exposure to one or more stimuli (e.g., temperature change, pH change, exposure to specific chemicals or phases, exposure to light, reducing agents, etc.).
在某些实施方案中,同种所述定位探针(也即,包含相同定位序列的定位探针)占据所述固相支持物表面同一个区域,不同种所述定位探针占据所述支持物表面的不同区域,并且,任意相邻近的两个所述区域之间的中心距离小于1μm(例如,小于900nm、小于800nm、小于700nm、小于600nm、小于500nm、小于400nm、小于300nm、 小于200nm)。In certain embodiments, the same type of positioning probes (i.e., positioning probes comprising the same positioning sequence) occupy the same area on the surface of the solid support, different types of positioning probes occupy different areas on the surface of the support, and the center distance between any two adjacent areas is less than 1 μm (e.g., less than 900 nm, less than 800 nm, less than 700 nm, less than 600 nm, less than 500 nm, less than 400 nm, less than 300 nm, less than 500 nm, less than 600 nm, less than 700 nm, less than 800 nm, less than 900 nm, less than 10 ... Less than 200nm).
在某些实施方案中,所述捕获探针和所述定位探针在所述固相支持物上的分布按照如下方式进行设置:与所述捕获探针连接的核酸分子(例如,与所述捕获探针退火的核酸分子,或者,由所述捕获探针经核酸聚合反应延伸获得的核酸分子)能够和与其邻近的定位探针相接触。In certain embodiments, the distribution of the capture probe and the positioning probe on the solid support is configured as follows: the nucleic acid molecule connected to the capture probe (for example, a nucleic acid molecule annealed to the capture probe, or a nucleic acid molecule obtained by extending the capture probe through a nucleic acid polymerization reaction) is able to contact the positioning probe adjacent to it.
在某些实施方案中,每种所述定位探针的邻近分布有至少100个所述捕获探针。例如,至少100个、至少200个、至少300个、至少400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1000个、至少1500个、至少2000个、至少2500个、至少3000个、至少3500个、至少4000个、至少4500个、至少5000个、至少5500个、至少6000个、至少6500个、至少7000个、至少7500个、至少8000个、至少8500个、至少9000个、至少9500个或至少10000个所述捕获探针。In certain embodiments, each of the positioning probes is adjacent to at least 100 capture probes. For example, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 3500, at least 4000, at least 4500, at least 5000, at least 5500, at least 6000, at least 6500, at least 7000, at least 7500, at least 8000, at least 8500, at least 9000, at least 9500 or at least 10000 capture probes.
在某些实施方案中,每种所述定位探针的邻近分布有500-10000个(例如,1000-1500个、1000-2000个、1000-3000个、1000-4000个、1000-5000个、1000-8000个、1000-10000个)所述捕获探针。In certain embodiments, there are 500-10,000 (eg, 1,000-1,500, 1,000-2,000, 1,000-3,000, 1,000-4,000, 1,000-5,000, 1,000-8,000, 1,000-10,000) capture probes distributed adjacent to each of the localization probes.
在某些实施方案中,表述“每种所述定位探针的邻近”表示固相支持物表面该种定位探针附近的区域,其中,与分布于该区域的捕获探针相连接的核酸分子(例如,与所述捕获探针退火的核酸分子,或者,由所述捕获探针经核酸聚合反应延伸获得的核酸分子)能够与该种定位探针相接触。In certain embodiments, the expression "adjacent to each of the positioning probes" refers to an area near the positioning probe on the surface of a solid support, wherein nucleic acid molecules connected to capture probes distributed in the area (e.g., nucleic acid molecules annealed to the capture probes, or nucleic acid molecules obtained by extending the capture probes through nucleic acid polymerization reaction) are able to contact the positioning probes.
在某些实施方案中,每种所述定位探针在所述固相支持物所占位置的中心(例如,每种所述定位探针在所述所述固相支持物表面所占据的区域的中心)周围半径200nm-300nm的范围内分布有至少100个所述捕获探针。例如,至少100个、至少200个、至少300个、400个、至少500个、至少600个、至少700个、至少800个、至少900个、至少1000个、至少1500个、至少2000个、至少2500个、至少3000个、至少3500个、至少4000个、至少4500个、至少5000个、至少5500个、至少6000个、至少6500个、至少7000个、至少7500个、至少8000个、至少8500个、至少9000个、至少9500个或至少10000个所述捕获探针。In certain embodiments, at least 100 capture probes are distributed within a radius of 200nm-300nm around the center of the position occupied by each of the positioning probes on the solid support (e.g., the center of the area occupied by each of the positioning probes on the surface of the solid support). For example, at least 100, at least 200, at least 300, 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 1500, at least 2000, at least 2500, at least 3000, at least 3500, at least 4000, at least 4500, at least 5000, at least 5500, at least 6000, at least 6500, at least 7000, at least 7500, at least 8000, at least 8500, at least 9000, at least 9500 or at least 10000 of the capture probes.
在某些实施方案中,每种所述定位探针在所述固相支持物所占位置的中心(例如,每种所述定位探针在所述固相支持物表面所占据的区域的中心)周围半径200nm-300nm的范围内分布有500-10000个(例如,1000-1500个、1000-2000个、1000-3000个、1000-4000个、1000-5000个、1000-8000个、1000-10000个)所述捕获探针。 In certain embodiments, there are 500-10000 (e.g., 1000-1500, 1000-2000, 1000-3000, 1000-4000, 1000-5000, 1000-8000, 1000-10000) capture probes distributed within a radius of 200nm-300nm around the center of the position occupied by each of the positioning probes on the solid support (e.g., the center of the area occupied by each of the positioning probes on the surface of the solid support).
核酸空间信息检测方法Nucleic acid spatial information detection method
在另一方面,本申请提供了一种检测样品中核酸的空间信息的方法,其包括以下步骤:In another aspect, the present application provides a method for detecting spatial information of nucleic acids in a sample, comprising the following steps:
(1)提供如上所述的核酸阵列;(1) providing a nucleic acid array as described above;
(2)在允许退火的条件下,将所述核酸阵列与待测样品接触,使得源自所述待测样品的核酸与所述核酸阵列的捕获探针中的捕获序列发生退火;(2) contacting the nucleic acid array with a sample to be tested under conditions that allow annealing, so that the nucleic acid derived from the sample to be tested anneals with the capture sequence in the capture probe of the nucleic acid array;
(3)在允许核酸聚合的条件下,进行核酸聚合反应,产生第一延伸产物,所述第一延伸产物包含捕获序列以及与所述捕获序列退火的核酸分子或其部分序列的互补序列,所述第一延伸产物通过其所包含的捕获序列与所述固相支持物相连接;(3) performing a nucleic acid polymerization reaction under conditions that allow nucleic acid polymerization to produce a first extension product, wherein the first extension product comprises a capture sequence and a complementary sequence of a nucleic acid molecule or a partial sequence thereof that anneals to the capture sequence, and the first extension product is connected to the solid support via the capture sequence contained therein;
(4)在允许退火的条件下,与所述固相支持物相连接的所述第一延伸产物与其邻近的所述核酸阵列的定位探针发生退火;(4) annealing the first extension product connected to the solid support with the adjacent positioning probe of the nucleic acid array under conditions that allow annealing;
其中,所述定位探针的第一通用序列能与所述第一延伸产物退火;并且,所述第一通用序列位于所述定位探针的定位序列的3’端;wherein the first universal sequence of the positioning probe can anneal with the first extension product; and the first universal sequence is located at the 3' end of the positioning sequence of the positioning probe;
(5)在允许核酸聚合的条件下,进行核酸聚合反应,产生第二延伸产物和/或其互补链,所述第二延伸产物包含:(a)所述定位探针的定位序列以及所述第一延伸产物或其部分序列的互补序列,或者,(b)所述第一延伸产物序列或其部分序列,以及所述定位探针的定位序列的互补序列;所述第二延伸产物中所包含的所述定位序列或其互补序列作为其空间信息标记,所述第二延伸产物的互补链所包含的所述定位序列的互补序列或所述定位序列作为其空间信息标记,从而使得源自待测样品的核酸在所述待测样品中的位置被对应至所述定位序列所对应的定位探针的位置;由此获得含有所述空间信息标记的核酸分子;和(5) performing a nucleic acid polymerization reaction under conditions that allow nucleic acid polymerization to produce a second extension product and/or its complementary chain, wherein the second extension product comprises: (a) the positioning sequence of the positioning probe and the complementary sequence of the first extension product or its partial sequence, or (b) the first extension product sequence or its partial sequence and the complementary sequence of the positioning sequence of the positioning probe; the positioning sequence or its complementary sequence contained in the second extension product serves as its spatial information marker, and the complementary sequence of the positioning sequence or the positioning sequence contained in the complementary chain of the second extension product serves as its spatial information marker, so that the position of the nucleic acid derived from the sample to be tested in the sample to be tested is mapped to the position of the positioning probe corresponding to the positioning sequence; thereby obtaining a nucleic acid molecule containing the spatial information marker; and
(6)分析:(i)步骤(5)中获得的含有所述空间信息标记的核酸分子,和/或,(ii)衍生自(i)的含有所述空间信息标记的核酸分子的序列。(6) Analyzing: (i) the nucleic acid molecule containing the spatial information marker obtained in step (5), and/or, (ii) the sequence of the nucleic acid molecule containing the spatial information marker derived from (i).
本领域技术人员易于理解,步骤(4)至(5)中,所述第一延伸产物通过与其邻近的所述定位探针发生退火并进行核酸聚合反应,产生所述第二延伸产物,由此,所述第二延伸产物中含有所述定位探针中的定位序列或其互补序列,从而使得被所述捕获探针捕获的源自待测样品的核酸在所述待测样品中的位置被对应至核酸阵列上与所述第一延伸产物退火的定位探针的位置。It is easy for a person skilled in the art to understand that in steps (4) to (5), the first extension product anneals with the adjacent positioning probe and undergoes nucleic acid polymerization reaction to produce the second extension product, whereby the second extension product contains the positioning sequence or its complementary sequence in the positioning probe, so that the position of the nucleic acid derived from the sample to be tested captured by the capture probe in the sample to be tested is corresponded to the position of the positioning probe on the nucleic acid array that anneals with the first extension product.
在某些实施方案中,所述核酸的空间信息包括核酸的定位、分布和/或丰度。In certain embodiments, the spatial information of the nucleic acid includes the location, distribution and/or abundance of the nucleic acid.
在某些实施方案中,步骤(3)中,所述核酸聚合反应以与所述捕获序列退火的核酸分子为模板,延伸所述捕获序列。In certain embodiments, in step (3), the nucleic acid polymerization reaction uses the nucleic acid molecule that anneals to the capture sequence as a template to extend the capture sequence.
在某些实施方案中,所述捕获序列位于所述捕获探针的3’末端,和/或,所述捕获序列的3’末端具有自由羟基(-OH)。 In certain embodiments, the capture sequence is located at the 3' end of the capture probe, and/or the 3' end of the capture sequence has a free hydroxyl group (-OH).
在某些实施方案中,步骤(5)中,所述聚合反应以所述第一延伸产物为模板,延伸所述定位探针;和/或,以所述定位探针为模板,延伸所述第一延伸产物。In certain embodiments, in step (5), the polymerization reaction uses the first extension product as a template to extend the positioning probe; and/or uses the positioning probe as a template to extend the first extension product.
在某些实施方案中,步骤(5)中,所述聚合反应以所述第一延伸产物为模板,延伸所述定位探针,以产生第二延伸产物和/或其互补链,所述第二延伸产物包含所述定位探针的定位序列以及所述第一延伸产物或其部分序列的互补序列。在某些实施方案中,所述定位探针的第一通用序列位于所述定位探针的3’末端,和/或,所述定位探针的第一通用序列的3’末端具有自由羟基(-OH)。In some embodiments, in step (5), the polymerization reaction uses the first extension product as a template to extend the positioning probe to produce a second extension product and/or its complementary chain, wherein the second extension product comprises the positioning sequence of the positioning probe and the complementary sequence of the first extension product or a partial sequence thereof. In some embodiments, the first universal sequence of the positioning probe is located at the 3' end of the positioning probe, and/or the 3' end of the first universal sequence of the positioning probe has a free hydroxyl group (-OH).
在某些实施方案中,步骤(5)中,所述聚合反应以所述定位探针为模板,延伸所述第一延伸产物,以产生第二延伸产物和/或其互补链,所述第二延伸产物包含所述第一延伸产物序列或其部分序列以及所述定位探针的定位序列的互补序列。在某些实施方案中,所述定位探针的第一通用序列位于或者不位于所述定位探针的3’末端,和/或,所述定位探针的第一通用序列的3’末端是封闭的或者未封闭的。In certain embodiments, in step (5), the polymerization reaction uses the positioning probe as a template to extend the first extension product to produce a second extension product and/or its complementary strand, wherein the second extension product comprises the first extension product sequence or a partial sequence thereof and a complementary sequence to the positioning sequence of the positioning probe. In certain embodiments, the first universal sequence of the positioning probe is located at or not located at the 3' end of the positioning probe, and/or the 3' end of the first universal sequence of the positioning probe is blocked or unblocked.
在某些实施方案中,步骤(5)中,所述聚合反应中,所述第一延伸产物与所述定位探针互为模板,延伸所述第一延伸产物以及所述定位探针,以产生第二延伸产物和/或其互补链,所述第二延伸产物包含所述定位探针的定位序列以及所述第一延伸产物或其部分序列的互补序列,以及,所述第二延伸产物包含所述第一延伸产物序列或其部分序列以及所述定位探针的定位序列的互补序列。在某些实施方案中,所述定位探针的第一通用序列位于所述定位探针的3’末端,和/或,所述定位探针的第一通用序列的3’末端具有自由羟基(-OH)。In certain embodiments, in step (5), in the polymerization reaction, the first extension product and the positioning probe serve as templates for each other, and the first extension product and the positioning probe are extended to produce a second extension product and/or its complementary chain, the second extension product comprises the positioning sequence of the positioning probe and the complementary sequence of the first extension product or its partial sequence, and the second extension product comprises the first extension product sequence or its partial sequence and the complementary sequence of the positioning sequence of the positioning probe. In certain embodiments, the first universal sequence of the positioning probe is located at the 3' end of the positioning probe, and/or the 3' end of the first universal sequence of the positioning probe has a free hydroxyl group (-OH).
在某些实施方案中,步骤(6)中,分析与所述核酸阵列的固相支持物连接的所述含有空间信息标记的核酸分子的序列。In certain embodiments, in step (6), the sequence of the nucleic acid molecule containing the spatial information label attached to the solid support of the nucleic acid array is analyzed.
在某些实施方案中,在步骤(6)之前,步骤(5)之后,所述方法还包括步骤pre-(6):从所述核酸阵列表面释放至少一部分所述含有空间信息标记的核酸分子。In certain embodiments, before step (6) and after step (5), the method further comprises a step pre-(6): releasing at least a portion of the nucleic acid molecules containing spatial information labels from the surface of the nucleic acid array.
在某些实施方案中,步骤(6)中,分析(i)步骤pre-(6)释放的所述含有空间信息标记的核酸分子,和/或,(ii)衍生自(i)的含有所述空间信息标记的核酸分子的序列。In certain embodiments, in step (6), (i) the nucleic acid molecule containing the spatial information marker released in step pre-(6) and/or, (ii) the sequence of the nucleic acid molecule containing the spatial information marker derived from (i) are analyzed.
在某些实施方案中,在步骤pre-(6)中,所述核酸分子通过如下方法从固相支持物表面释放:(i)核酸剪切;和/或,(ii)变性。In certain embodiments, in step pre-(6), the nucleic acid molecule is released from the solid support surface by: (i) nucleic acid shearing; and/or, (ii) denaturation.
在某些实施方案中,在步骤pre-(6)之后步骤(6)之前,所述方法还包括对所述被释放的核酸分子进行扩增的步骤。In certain embodiments, after step pre-(6) and before step (6), the method further comprises a step of amplifying the released nucleic acid molecules.
在某些实施方案中,在进行步骤(6)之前,所述方法进一步包括对被释放的核酸分子进行纯化的步骤。In certain embodiments, before performing step (6), the method further comprises a step of purifying the released nucleic acid molecules.
在某些实施方案中,所述方法具备选自以下的一项或多项特征:In certain embodiments, the method has one or more features selected from the group consisting of:
(i)步骤(1)中,通过如上所述的制备核酸阵列的方法提供所述核酸阵列; (i) in step (1), providing the nucleic acid array by the method for preparing a nucleic acid array as described above;
(ii)步骤(2)中,对所述待测样品进行处理(例如,透化处理或细胞裂解处理)以释放源自所述待测样品的核酸,使其与所述捕获序列发生退火;(ii) in step (2), the sample to be tested is treated (for example, permeabilized or cell lysed) to release the nucleic acid from the sample to be tested so that the nucleic acid anneals with the capture sequence;
(iii)在步骤(3)之后步骤(4)之前,所述方法还包括去除与所述第一延伸产物结合的模板链的步骤(例如,通过酶切或变性解链去除与所述第一延伸产物结合的模板链);(iii) after step (3) and before step (4), the method further comprises the step of removing the template strand bound to the first extension product (for example, removing the template strand bound to the first extension product by enzyme cleavage or denaturing and melting);
(iv)在步骤(3)之后步骤(4)之前,所述方法还包括洗涤核酸阵列以除去残余样品(例如,组织或细胞)的步骤;(iv) after step (3) and before step (4), the method further comprises a step of washing the nucleic acid array to remove residual sample (e.g., tissue or cells);
(v)在步骤(5)之后步骤(6)之前(例如,在步骤(5)之后步骤pre-(6)之前),所述方法还包括对含有所述空间信息标记的核酸分子进行扩增(例如,原位扩增)的步骤。在某些实施方案中,所述含有所述空间信息标记的核酸分子与所述核酸阵列的固相支持物相连接。(v) After step (5) and before step (6) (e.g., after step (5) and before step pre-(6)), the method further comprises the step of amplifying (e.g., in situ amplification) the nucleic acid molecule containing the spatial information marker. In certain embodiments, the nucleic acid molecule containing the spatial information marker is connected to the solid support of the nucleic acid array.
在某些实施方案中,所述样品是组织样品(例如,组织切片)或单细胞样品(例如,单细胞悬液)。In certain embodiments, the sample is a tissue sample (eg, a tissue section) or a single cell sample (eg, a single cell suspension).
在某些实施方案中,所述组织样品(例如,组织切片)从固定组织制备,例如,以福尔马林固定石蜡包埋(FFPE)的组织或、深度冷冻的组织或新鲜的组织。In certain embodiments, the tissue sample (eg, tissue section) is prepared from fixed tissue, eg, formalin-fixed paraffin-embedded (FFPE) tissue or deep-frozen tissue or fresh tissue.
在某些实施方案中,所述细胞是免疫细胞,例如B细胞或T细胞。在某些实施方案中,所述源自待测样品的核酸包含T细胞受体基因或其核酸产物(例如,mRNA),或B细胞受体基因或其核酸产物(例如,mRNA)。In certain embodiments, the cell is an immune cell, such as a B cell or a T cell. In certain embodiments, the nucleic acid derived from the sample to be tested comprises a T cell receptor gene or its nucleic acid product (e.g., mRNA), or a B cell receptor gene or its nucleic acid product (e.g., mRNA).
在某些实施方案中,所述源自待测样品的核酸选自:RNA分子(例如,mRNA)分子,目标靶核酸分子(例如,目标靶DNA和/或RNA),处于染色质开放区的基因组核酸片段,以及其任意组合。In certain embodiments, the nucleic acid derived from the sample to be tested is selected from: RNA molecules (e.g., mRNA) molecules, target nucleic acid molecules (e.g., target DNA and/or RNA), genomic nucleic acid fragments in open chromatin regions, and any combination thereof.
在某些实施方案中,在步骤(6)中,所述分析包括测序和/或序列特异性的PCR反应。In certain embodiments, in step (6), the analysis comprises sequencing and/or sequence-specific PCR reaction.
在某些实施方案中,在进行测序之前,所述方法还包括对步骤(5)获得的含有所述空间信息标记的核酸分子或其扩增产物进行测序文库构建的步骤。In certain embodiments, before sequencing, the method further comprises the step of constructing a sequencing library for the nucleic acid molecule containing the spatial information marker or its amplified product obtained in step (5).
在某些实施方案中,所述方法用于检测样品中细胞的RNA(例如,mRNA)的空间信息。In certain embodiments, the methods are used to detect the spatial information of RNA (eg, mRNA) of cells in a sample.
在某些实施方案中,步骤(2)中,所述源自待测样品的核酸为所述待测样品的细胞中的RNA(例如,mRNA),所述捕获序列包含poly(dT)序列或随机寡核苷酸序列。In certain embodiments, in step (2), the nucleic acid derived from the sample to be tested is RNA (eg, mRNA) in cells of the sample to be tested, and the capture sequence comprises a poly (dT) sequence or a random oligonucleotide sequence.
在某些实施方案中,所述步骤(3)包括:In certain embodiments, step (3) comprises:
(a)在允许核酸聚合的条件下,以与所述捕获序列退火的核酸分子为模板,进行核酸聚合反应,延伸所述捕获序列,生成cDNA链,所述cDNA链包含以所述捕获序列为逆转录引物形成的与所述RNA(例如,mRNA)互补的cDNA序列,以及3’末端悬突;(a) under conditions allowing nucleic acid polymerization, using a nucleic acid molecule annealed to the capture sequence as a template, performing a nucleic acid polymerization reaction, extending the capture sequence, and generating a cDNA chain, wherein the cDNA chain comprises a cDNA sequence complementary to the RNA (e.g., mRNA) formed using the capture sequence as a reverse transcription primer, and a 3' end overhang;
(b)在允许核酸聚合的条件下,将模板转换序列与(a)中生成的所述cDNA链进 行退火,并以所述模板转换序列为模板继续进行核酸聚合反应,生成所述第一延伸产物;(b) under conditions that allow nucleic acid polymerization, reacting the template switching sequence with the cDNA strand generated in (a) annealing, and continuing nucleic acid polymerization reaction with the template switching sequence as a template to generate the first extension product;
其中,所述模板转换序列包含共有序列,3’末端悬突互补序列,以及任选的MID序列;在某些实施方案中,每个模板转换序列所包含的MID序列彼此不同;Wherein, the template switching sequence comprises a consensus sequence, a 3'-terminal overhang complementary sequence, and an optional MID sequence; in certain embodiments, the MID sequences contained in each template switching sequence are different from each other;
所述第一延伸产物包含:捕获序列,所述cDNA链序列或其部分序列,以及所述共有序列的互补序列。The first extension product comprises: a capture sequence, the cDNA chain sequence or a partial sequence thereof, and a complementary sequence of the common sequence.
在某些实施方案中,所述3’末端悬突具有至少1个,至少2个,至少3个,至少4个,至少5个,至少6个,至少7个,至少8个,至少9个,至少10个,1-10个,1-5个或2-10个核苷酸的长度。在某些实施方案中,所述3’末端悬突为2-5个胞嘧啶核苷酸的悬突(例如CCC悬突)。In certain embodiments, the 3' terminal overhang has a length of at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, 1-10, 1-5, or 2-10 nucleotides. In certain embodiments, the 3' terminal overhang is an overhang of 2-5 cytosine nucleotides (e.g., a CCC overhang).
在某些实施方案中,所述方法在步骤(4)之前,还包括去除与所述第一延伸产物结合的所述模板转换序列的步骤(例如,通过酶切或变性解链去除与所述第一延伸产物结合的所述模板转换序列)。In certain embodiments, the method further comprises, before step (4), a step of removing the template switching sequence bound to the first extension product (eg, removing the template switching sequence bound to the first extension product by enzyme cleavage or denaturing melting).
在某些实施方案中,所述方法的步骤(4)中,所述定位探针的第一通用序列能够与所述共有序列的互补序列退火。In certain embodiments, in step (4) of the method, the first universal sequence of the localization probe is capable of annealing to the complement of the consensus sequence.
在某些实施方案中,所述定位探针不含有MID序列,所述模板转换序列含有MID序列;或者,所述定位探针含有MID序列,所述模板转换序列不含有MID序列,或者,所述定位探针和所述模板转换序列均含有MID序列。In certain embodiments, the positioning probe does not contain a MID sequence and the template switch sequence contains a MID sequence; alternatively, the positioning probe contains a MID sequence and the template switch sequence does not contain a MID sequence; alternatively, both the positioning probe and the template switch sequence contain a MID sequence.
在某些实施方案中,所述方法用于检测样品中细胞的目标靶核酸(例如,目标靶DNA和/或RNA)的空间信息,所述目标靶核酸包含目标核苷酸序列。In certain embodiments, the method is used to detect the spatial information of a target nucleic acid (eg, target DNA and/or RNA) of a cell in a sample, wherein the target nucleic acid comprises a target nucleotide sequence.
在某些实施方案中,步骤(2)中,所述源自待测样品的核酸包含所述目标核苷酸序列。在某些实施方案中,所述源自待测样品的核酸包含所述目标靶核酸和/或衍生自所述目标靶核酸的核酸分子,所述衍生自目标靶核酸的核酸分子包含所述靶核苷酸序列。In certain embodiments, in step (2), the nucleic acid derived from the sample to be tested comprises the target nucleotide sequence. In certain embodiments, the nucleic acid derived from the sample to be tested comprises the target nucleic acid and/or a nucleic acid molecule derived from the target nucleic acid, and the nucleic acid molecule derived from the target nucleic acid comprises the target nucleotide sequence.
在某些实施方案中,步骤(2)中,所述捕获序列包含特异性识别所述目标靶核苷酸序列的序列。In certain embodiments, in step (2), the capture sequence comprises a sequence that specifically recognizes the target nucleotide sequence.
在某些实施方案中,步骤(2)中,所述捕获序列包含特异性识别所述目标靶核苷酸序列第一区段的序列;步骤(4)中,所述定位探针的所述第一通用序列包含特异性识别所述目标靶核苷酸序列的第二区段的互补序列的序列,或者,所述第一通用序列包含随机寡核苷酸序列;在某些实施方案中,在所述目标靶核苷酸序列中,所述第一区段位于所述第二区段的3’端。In certain embodiments, in step (2), the capture sequence comprises a sequence that specifically recognizes the first segment of the target target nucleotide sequence; in step (4), the first universal sequence of the positioning probe comprises a sequence that specifically recognizes the complementary sequence of the second segment of the target target nucleotide sequence, or the first universal sequence comprises a random oligonucleotide sequence; in certain embodiments, in the target target nucleotide sequence, the first segment is located at the 3’ end of the second segment.
在某些实施方案中,步骤(2)中,所述第一区段存在于与所述捕获序列退火的源自待测样品的核酸分子的单链区域。In certain embodiments, in step (2), the first segment is present in a single-stranded region of the nucleic acid molecule derived from the sample to be tested that anneals to the capture sequence.
在某些实施方案中,步骤(2)中,与所述捕获序列退火的源自待测样品的核酸为含有包含所述第一区段的单链区域的单链核酸或双链核酸。 In certain embodiments, in step (2), the nucleic acid derived from the sample to be tested that anneals to the capture sequence is a single-stranded nucleic acid or a double-stranded nucleic acid containing a single-stranded region comprising the first segment.
在某些实施方案中,所述方法用于检测样品中细胞的RNA(例如,mRNA)和目标靶核酸(例如,目标靶DNA和/或RNA)的空间信息,其中,所述目标靶核酸包含目标靶核苷酸序列。In certain embodiments, the method is used to detect spatial information of RNA (eg, mRNA) and target nucleic acid (eg, target DNA and/or RNA) of cells in a sample, wherein the target nucleic acid comprises a target nucleotide sequence.
在某些实施方案中,所述核酸阵列含有能够捕获所述RNA(例如,mRNA)的第一捕获探针,以及,能够捕获含有所述目标靶核苷酸序列的核酸分子的第二捕获探针;所述第一捕获探针含有第一捕获序列,所述第一捕获序列包含poly(dT)序列或随机寡核苷酸序列;所述第二捕获探针含有第二捕获序列,所述第二捕获序列包含特异性识别所述目标靶核苷酸序列的序列。In certain embodiments, the nucleic acid array contains a first capture probe capable of capturing the RNA (e.g., mRNA) and a second capture probe capable of capturing a nucleic acid molecule containing the target nucleotide sequence; the first capture probe contains a first capture sequence, which comprises a poly (dT) sequence or a random oligonucleotide sequence; and the second capture probe contains a second capture sequence, which comprises a sequence that specifically recognizes the target nucleotide sequence.
本领域技术人员易于理解,除非本文另外指明或根据上下文明显矛盾,否则上文中有关捕获探针的定义/描述同样适用于所述第一捕获探针和所述第二捕获探针,上文中有关捕获序列的定义/描述同样适用于所述第一捕获序列和所述第二捕获序列。It is easy for those skilled in the art to understand that, unless otherwise specified herein or obviously contradictory according to the context, the definition/description of the capture probe above also applies to the first capture probe and the second capture probe, and the definition/description of the capture sequence above also applies to the first capture sequence and the second capture sequence.
在某些实施方案中,步骤(2)中,所述源自待测样品的核酸包含源自待测样品的RNA(例如,mRNA)以及包含目标靶核苷酸序列的核酸分子;在某些实施方案中,所述包含目标靶核苷酸序列的核酸分子包含所述目标靶核酸和/或衍生自所述目标靶核酸的核酸分子。In certain embodiments, in step (2), the nucleic acid derived from the sample to be tested comprises RNA (e.g., mRNA) derived from the sample to be tested and a nucleic acid molecule comprising a target nucleotide sequence; in certain embodiments, the nucleic acid molecule comprising a target nucleotide sequence comprises the target target nucleic acid and/or a nucleic acid molecule derived from the target target nucleic acid.
在某些实施方案中,所述步骤(3)包括:In certain embodiments, step (3) comprises:
(A)(a)在允许核酸聚合的条件下,以与所述第一捕获序列退火的核酸分子为模板,进行核酸聚合反应,延伸所述第一捕获序列,生成cDNA链,所述cDNA链包含以所述第一捕获序列为逆转录引物形成的与所述RNA(例如,mRNA)互补的cDNA序列,以及3’末端悬突;(A)(a) Under conditions that allow nucleic acid polymerization, using a nucleic acid molecule that anneals to the first capture sequence as a template, performing a nucleic acid polymerization reaction, extending the first capture sequence, and generating a cDNA chain, wherein the cDNA chain comprises a cDNA sequence complementary to the RNA (e.g., mRNA) formed using the first capture sequence as a reverse transcription primer, and a 3' end overhang;
(b)在允许核酸聚合的条件下,将模板转换序列与(a)中生成的所述cDNA链进行退火,并以所述模板转换序列为模板继续进行核酸聚合反应,生成第一延伸产物I;(b) annealing the template switching sequence with the cDNA chain generated in (a) under conditions that allow nucleic acid polymerization, and continuing nucleic acid polymerization reaction with the template switching sequence as a template to generate a first extension product I;
其中,所述模板转换序列包含共有序列,3’末端悬突互补序列,以及任选的MID序列;在某些实施方案中,每个模板转换序列所包含的MID序列彼此不同;Wherein, the template switching sequence comprises a consensus sequence, a 3'-terminal overhang complementary sequence, and an optional MID sequence; in certain embodiments, the MID sequences contained in each template switching sequence are different from each other;
以及,as well as,
(B)在允许核酸聚合的条件下,以与所述第二捕获序列退火的核酸分子为模板,进行核酸聚合反应,延伸所述第二捕获序列,产生第一延伸产物II,所述第一延伸产物II包含第二捕获序列以及与所述第二捕获序列退火的核酸分子或其部分序列的互补序列;(B) under conditions allowing nucleic acid polymerization, using the nucleic acid molecule annealed to the second capture sequence as a template, performing a nucleic acid polymerization reaction to extend the second capture sequence to generate a first extension product II, wherein the first extension product II comprises a second capture sequence and a complementary sequence to the nucleic acid molecule annealed to the second capture sequence or a partial sequence thereof;
其中,所述步骤(B)与步骤(A)以任意顺序进行;例如,步骤(B)在步骤(A)之前或之后进行,或者,步骤(B)与步骤(A)同时进行(例如,在同一反应体系中进行)。Wherein, step (B) and step (A) are performed in any order; for example, step (B) is performed before or after step (A), or step (B) and step (A) are performed simultaneously (for example, in the same reaction system).
在某些实施方案中,所述3’末端悬突具有至少1个,至少2个,至少3个,至少4个,至少5个,至少6个,至少7个,至少8个,至少9个,至少10个,1-10个,1-5个 或2-10个核苷酸的长度。在某些实施方案中,所述3’末端悬突为2-5个胞嘧啶核苷酸的悬突(例如CCC悬突)。In certain embodiments, the 3' terminal overhang has at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, 1-10, 1-5 or a length of 2-10 nucleotides. In certain embodiments, the 3'-terminal overhang is an overhang of 2-5 cytosine nucleotides (eg, a CCC overhang).
在某些实施方案中,步骤(4)中,所述定位探针含有能与所述第一延伸产物I退火的第一通用序列I,以及能与所述第一延伸产物II退火的第一通用序列II;在某些实施方案中,所述第一通用序列I和所述第一通用序列II共同存在于相同的定位探针,或者,分别存在于不同的定位探针。In certain embodiments, in step (4), the positioning probe contains a first universal sequence I that can anneal to the first extension product I, and a first universal sequence II that can anneal to the first extension product II; in certain embodiments, the first universal sequence I and the first universal sequence II are present in the same positioning probe, or, respectively, in different positioning probes.
在某些实施方案中,所述核酸阵列包含含有所述第一通用序列I的第一定位探针以及含有所述第一通用序列II的第二定位探针。In certain embodiments, the nucleic acid array comprises a first positioning probe comprising the first universal sequence I and a second positioning probe comprising the first universal sequence II.
本领域技术人员易于理解,除非本文另外指明或根据上下文明显矛盾,否则上文中有关定位探针的定义/描述同样适用于所述第一定位探针和所述第二定位探针,上文中有关第一通用序列的定义/描述同样适用于所述第一通用序列I和所述第一通用序列II。It is easy for those skilled in the art to understand that, unless otherwise specified herein or clearly contradictory according to the context, the definition/description of the positioning probe above also applies to the first positioning probe and the second positioning probe, and the definition/description of the first universal sequence above also applies to the first universal sequence I and the first universal sequence II.
在某些实施方案中,所述方法步骤(4)中,所述第一通用序列I能够与所述共有序列的互补序列退火。In certain embodiments, in step (4) of the method, the first universal sequence I is capable of annealing to the complementary sequence of the consensus sequence.
在某些实施方案中,所述方法步骤(2)中,所述第二捕获序列包含特异性识别所述目标靶核苷酸序列第一区段的序列;步骤(4)中,所述第一通用序列II包含特异性识别所述目标靶核苷酸序列的第二区段的互补序列的序列,或者,所述第一通用序列II包含随机寡核苷酸序列。在某些实施方案中,在所述目标靶核苷酸序列中,所述第一区段位于所述第二区段的3’端。In certain embodiments, in step (2) of the method, the second capture sequence comprises a sequence that specifically recognizes the first segment of the target nucleotide sequence; in step (4), the first universal sequence II comprises a sequence that specifically recognizes the complementary sequence of the second segment of the target nucleotide sequence, or the first universal sequence II comprises a random oligonucleotide sequence. In certain embodiments, in the target nucleotide sequence, the first segment is located at the 3' end of the second segment.
在某些实施方案中,所述方法在步骤(4)之前,还包括去除与所述第一延伸产物I结合的所述模板转换序列的步骤(例如,通过酶切或变性解链去除与所述第一延伸产物I结合的所述模板转换序列)。In certain embodiments, the method further comprises, before step (4), a step of removing the template switching sequence bound to the first extension product I (e.g., removing the template switching sequence bound to the first extension product I by enzyme cleavage or denaturing melting).
在某些实施方案中,步骤(2)中,所述第一区段存在于与所述第二捕获序列退火的源自待测样品的核酸分子的单链区域。In certain embodiments, in step (2), the first segment is present in a single-stranded region of the nucleic acid molecule derived from the sample to be tested that anneals to the second capture sequence.
在某些实施方案中,步骤(2)中,与所述第二捕获序列退火的源自待测样品的核酸为含有包含所述第一区段的单链区域的单链核酸或双链核酸。In certain embodiments, in step (2), the nucleic acid derived from the sample to be tested that anneals to the second capture sequence is a single-stranded nucleic acid or a double-stranded nucleic acid containing a single-stranded region comprising the first segment.
在某些实施方案中,所述第一定位探针不含有MID序列,所述模板转换序列含有MID序列;或者,所述第一定位探针含有MID序列,所述模板转换序列不含有MID序列,或者,所述第一定位探针和所述模板转换序列均含有MID序列。In certain embodiments, the first positioning probe does not contain a MID sequence and the template switch sequence contains a MID sequence; or, the first positioning probe contains a MID sequence and the template switch sequence does not contain a MID sequence, or, both the first positioning probe and the template switch sequence contain a MID sequence.
试剂盒Reagent test kit
在另一方面,本申请提供了试剂盒,其包含如上所述的核酸阵列。In another aspect, the present application provides a kit comprising the nucleic acid array as described above.
在某些实施方案中,所述试剂盒进一步包含使用说明。在某些实施方案中,所述使用说明记载了如上所述的核酸空间信息检测方法。 In some embodiments, the kit further comprises instructions for use. In some embodiments, the instructions for use record the method for detecting nucleic acid spatial information as described above.
在某些实施方案中,所述核酸的空间信息包括核酸的定位、分布和/或丰度。In certain embodiments, the spatial information of the nucleic acid includes the location, distribution and/or abundance of the nucleic acid.
用于mRNA捕获For mRNA capture
在某些实施方案中,所述核酸阵列的捕获序列包含poly(dT)序列或随机寡核苷酸序列。In certain embodiments, the capture sequence of the nucleic acid array comprises a poly(dT) sequence or a random oligonucleotide sequence.
在某些实施方案中,所述试剂盒进一步包含模板转换序列,所述模板转换序列包含共有序列,cDNA 3’末端悬突互补序列,以及任选的MID序列;在某些实施方案中,每个模板转换序列所包含的MID序列彼此不同。在某些实施方案中,所述cDNA 3’末端悬突具有至少1个,至少2个,至少3个,至少4个,至少5个,至少6个,至少7个,至少8个,至少9个,至少10个,1-10个,1-5个或2-10个核苷酸的长度。在某些实施方案中,所述cDNA 3’末端悬突为2-5个胞嘧啶核苷酸的悬突(例如CCC悬突)。In some embodiments, the kit further comprises a template switching sequence, the template switching sequence comprising a consensus sequence, a cDNA 3' terminal overhang complementary sequence, and an optional MID sequence; in some embodiments, the MID sequences contained in each template switching sequence are different from each other. In some embodiments, the cDNA 3' terminal overhang has a length of at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, 1-10, 1-5 or 2-10 nucleotides. In some embodiments, the cDNA 3' terminal overhang is an overhang of 2-5 cytosine nucleotides (e.g., CCC overhang).
在某些实施方案中,所述模板转换序列的共有序列的互补序列能与所述核酸阵列的定位探针的第一通用序列退火。In certain embodiments, the complement of the consensus sequence of the template switch sequence is capable of annealing to the first universal sequence of the positioning probe of the nucleic acid array.
在某些实施方案中,所述定位探针不含有MID序列,所述模板转换序列含有MID序列;或者,所述定位探针含有MID序列,所述模板转换序列不含有MID序列;或者,所述定位探针和所述模板转换序列均含有MID序列。In certain embodiments, the positioning probe does not contain a MID sequence and the template switch sequence contains a MID sequence; alternatively, the positioning probe contains a MID sequence and the template switch sequence does not contain a MID sequence; alternatively, both the positioning probe and the template switch sequence contain a MID sequence.
用于靶核酸捕获For target nucleic acid capture
在某些实施方案中,所述核酸阵列的捕获序列包含特异性识别目标靶核酸(例如,特定靶DNA和/或RNA)所包含的目标靶核苷酸序列的序列。In certain embodiments, the capture sequence of the nucleic acid array comprises a sequence that specifically recognizes a target nucleotide sequence comprised in a target nucleic acid (eg, a specific target DNA and/or RNA).
在某些实施方案中,所述第一通用序列包含特异性识别所述目标靶核苷酸序列的互补序列的序列。In certain embodiments, the first universal sequence comprises a sequence that specifically recognizes a complementary sequence of the target nucleotide sequence of interest.
在某些实施方案中,所述捕获序列包含特异性识别所述目标靶核苷酸序列第一区段的序列,所述核酸阵列的定位探针的所述第一通用序列包含特异性识别所述目标靶核苷酸序列第二区段的互补序列的序列,或者,所述第一通用序列包含随机寡核苷酸序列。In certain embodiments, the capture sequence comprises a sequence that specifically recognizes the first segment of the target target nucleotide sequence, the first universal sequence of the positioning probe of the nucleic acid array comprises a sequence that specifically recognizes the complementary sequence of the second segment of the target target nucleotide sequence, or the first universal sequence comprises a random oligonucleotide sequence.
在某些实施方案中,在所述目标靶核苷酸序列中,所述第一区段位于所述第二区段的3’端。In certain embodiments, in the target nucleotide sequence of interest, the first segment is located at the 3' end of the second segment.
用于mRNA和靶核酸的双重捕获For dual capture of mRNA and target nucleic acid
在某些实施方案中,所述核酸阵列含有第一捕获探针和第二捕获探针;所述第一捕获探针含有第一捕获序列,所述第一捕获序列包含poly(dT)序列或随机寡核苷酸序列;所述第二捕获探针含有第二捕获序列,所述第二捕获序列包含特异性识别目标靶核酸(例如,特定靶DNA和/或RNA)所包含的目标靶核苷酸序列的序列。 In certain embodiments, the nucleic acid array contains a first capture probe and a second capture probe; the first capture probe contains a first capture sequence, and the first capture sequence comprises a poly (dT) sequence or a random oligonucleotide sequence; the second capture probe contains a second capture sequence, and the second capture sequence comprises a sequence that specifically recognizes a target target nucleotide sequence contained in a target target nucleic acid (e.g., a specific target DNA and/or RNA).
本领域技术人员易于理解,除非本文另外指明或根据上下文明显矛盾,否则上文中有关捕获探针的定义/描述同样适用于所述第一捕获探针和所述第二捕获探针,上文中有关捕获序列的定义/描述同样适用于所述第一捕获序列和所述第二捕获序列。It is easy for those skilled in the art to understand that, unless otherwise specified herein or obviously contradictory according to the context, the definition/description of the capture probe above also applies to the first capture probe and the second capture probe, and the definition/description of the capture sequence above also applies to the first capture sequence and the second capture sequence.
在某些实施方案中,所述试剂盒进一步包含模板转换序列,所述模板转换序列包含共有序列,cDNA 3’末端悬突互补序列,以及任选的MID序列。在某些实施方案中,每个模板转换序列所包含的MID序列彼此不同。在某些实施方案中,所述cDNA 3’末端悬突具有至少1个,至少2个,至少3个,至少4个,至少5个,至少6个,至少7个,至少8个,至少9个,至少10个,1-10个,1-5个或2-10个核苷酸的长度。在某些实施方案中,所述cDNA 3’末端悬突为2-5个胞嘧啶核苷酸的悬突(例如CCC悬突)。In certain embodiments, the kit further comprises a template conversion sequence, the template conversion sequence comprising a consensus sequence, a cDNA 3' terminal overhang complementary sequence, and an optional MID sequence. In certain embodiments, the MID sequences contained in each template conversion sequence are different from each other. In certain embodiments, the cDNA 3' terminal overhang has a length of at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, 1-10, 1-5 or 2-10 nucleotides. In certain embodiments, the cDNA 3' terminal overhang is an overhang of 2-5 cytosine nucleotides (e.g., a CCC overhang).
在某些实施方案中,所述核酸阵列的定位探针包含第一通用序列I和第一通用序列II;其中,所述第一通用序列I能够与所述共有序列的互补序列退火,所述第一通用序列II包含特异性识别所述目标靶核苷酸序列的互补序列的序列,或者,所述第一通用序列II包含随机寡核苷酸序列。在某些实施方案中,所述第一通用序列I和所述第一通用序列II共同存在于相同的定位探针,或者,分别存在于不同的定位探针。In certain embodiments, the positioning probe of the nucleic acid array comprises a first universal sequence I and a first universal sequence II; wherein the first universal sequence I is capable of annealing with a complementary sequence to the consensus sequence, the first universal sequence II comprises a sequence that specifically recognizes a complementary sequence to the target nucleotide sequence, or the first universal sequence II comprises a random oligonucleotide sequence. In certain embodiments, the first universal sequence I and the first universal sequence II are present in the same positioning probe together, or are present in different positioning probes respectively.
在某些实施方案中,所述核酸阵列包含含有所述第一通用序列I的第一定位探针以及含有所述第一通用序列II的第二定位探针。在某些实施方案中,所述第一定位探针不含有MID序列,所述模板转换序列含有MID序列;或者,所述第一定位探针含有MID序列,所述模板转换序列不含有MID序列,或者,所述第一定位探针和所述模板转换序列均含有MID序列。In certain embodiments, the nucleic acid array comprises a first positioning probe comprising the first universal sequence I and a second positioning probe comprising the first universal sequence II. In certain embodiments, the first positioning probe does not contain a MID sequence, and the template switch sequence contains a MID sequence; or, the first positioning probe contains a MID sequence, and the template switch sequence does not contain a MID sequence, or, both the first positioning probe and the template switch sequence contain a MID sequence.
本领域技术人员易于理解,除非本文另外指明或根据上下文明显矛盾,否则上文中有关定位探针的定义/描述同样适用于所述第一定位探针和所述第二定位探针,上文中有关第一通用序列的定义/描述同样适用于所述第一通用序列I和所述第一通用序列II。It is easy for those skilled in the art to understand that, unless otherwise specified herein or clearly contradictory according to the context, the definition/description of the positioning probe above also applies to the first positioning probe and the second positioning probe, and the definition/description of the first universal sequence above also applies to the first universal sequence I and the first universal sequence II.
在某些实施方案中,所述第二捕获序列包含特异性识别所述目标靶核苷酸序列第一区段的序列,所述第一通用序列II包含特异性识别所述目标靶核苷酸序列的第二区段的互补序列的序列,或者,所述第一通用序列II包含随机寡核苷酸序列。在某些实施方案中,在所述目标靶核苷酸序列中,所述第一区段位于所述第二区段的3’端。In certain embodiments, the second capture sequence comprises a sequence that specifically recognizes the first segment of the target nucleotide sequence, the first universal sequence II comprises a sequence that specifically recognizes the complementary sequence of the second segment of the target nucleotide sequence, or the first universal sequence II comprises a random oligonucleotide sequence. In certain embodiments, in the target target nucleotide sequence, the first segment is located at the 3' end of the second segment.
在某些实施方案中,所述核酸阵列的定位探针所包含的第一通用序列位于或者不位于所述定位探针的3’末端,和/或,所述核酸阵列的定位探针的第一通用序列的3’末端是封闭的或者未封闭的。In certain embodiments, the first universal sequence contained in the positioning probe of the nucleic acid array is located at or not located at the 3' end of the positioning probe, and/or the 3' end of the first universal sequence of the positioning probe of the nucleic acid array is blocked or unblocked.
在某些实施方案中,所述试剂盒进一步包含:用于进行核酸杂交的试剂、用于进行核酸延伸的试剂、用于进行核酸扩增的试剂、用于回收或纯化核酸的试剂、用于构建转 录组测序文库的试剂、用于测序(例如二代测序或三代测序)的试剂、或其任何组合。In certain embodiments, the kit further comprises: reagents for nucleic acid hybridization, reagents for nucleic acid extension, reagents for nucleic acid amplification, reagents for recovering or purifying nucleic acids, reagents for constructing a transfection kit, The invention relates to reagents for sequencing a genome sequencing library, reagents for sequencing (e.g., second-generation sequencing or third-generation sequencing), or any combination thereof.
在另一方面,本申请提供了如上所述的核酸阵列或试剂盒用于构建核酸分子文库、用于进行核酸测序或用于检测样品中核酸空间信息的用途。In another aspect, the present application provides the use of the nucleic acid array or kit as described above for constructing a nucleic acid molecule library, for performing nucleic acid sequencing, or for detecting nucleic acid spatial information in a sample.
在某些实施方案中,所述核酸的空间信息包括核酸的定位、分布和/或丰度。In certain embodiments, the spatial information of the nucleic acid includes the location, distribution and/or abundance of the nucleic acid.
如本文所使用的,“目标核苷酸序列”与“目标靶核苷酸序列”具有相同含义,两者可互换使用。As used herein, "target nucleotide sequence" and "target nucleotide sequence" have the same meaning and can be used interchangeably.
发明的有益效果Advantageous Effects of the Invention
本申请提供的核酸阵列(例如,捕获芯片)的捕获区和空间信息分别存在于两个独立的分子,例如,捕获区和空间信息分别存在于捕获探针和定位探针,其中定位探针用于提供空间组学的空间位置信息,而捕获探针用于捕获细胞内的分子,从而摆脱了捕获探针数量受空间密度影响的限制,能够充分利用芯片上所有区域接枝捕获探针,从而在保证高分辨率的同时,提升细胞内分子的捕获效率。The capture area and spatial information of the nucleic acid array (e.g., capture chip) provided by the present application exist in two independent molecules, for example, the capture area and spatial information exist in the capture probe and the positioning probe, respectively, wherein the positioning probe is used to provide spatial position information of spatial omics, and the capture probe is used to capture molecules in cells, thereby getting rid of the limitation that the number of capture probes is affected by spatial density, and being able to make full use of all areas on the chip to graft capture probes, thereby improving the capture efficiency of intracellular molecules while ensuring high resolution.
下面将结合附图和实施例对本发明的实施方案进行详细描述,但是本领域技术人员将理解,下列附图和实施例仅用于说明本发明,而不是对本发明的范围的限定。根据附图和优选实施方案的下列详细描述,本发明的各种目的和有利方面对于本领域技术人员来说将变得显然。Embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings and examples, but it will be appreciated by those skilled in the art that the following drawings and examples are only used to illustrate the present invention, rather than to limit the scope of the present invention. Various objects and advantages of the present invention will become apparent to those skilled in the art based on the following detailed description of the accompanying drawings and preferred embodiments.
图1:本申请示例性捕获芯片的示意图。其中,芯片上捕获探针(包含捕获区,capture domain)和定位探针(包含空间信息,spatial barcode)分别在两条分子上,所述两条分子相互独立。Figure 1: Schematic diagram of an exemplary capture chip of the present application, wherein the capture probe (including the capture domain) and the positioning probe (including spatial information, spatial barcode) on the chip are respectively on two molecules, and the two molecules are independent of each other.
图2:本申请空间转录组学的示例性实现流程。Figure 2: Exemplary implementation process of spatial transcriptomics in this application.
图3:芯片探针检测结果。Figure 3: Chip probe detection results.
图4:cDNA生物分析仪2100检测结果。Figure 4: cDNA Bioanalyzer 2100 test results.
图5:鼠脑切片的空间表达图谱。Figure 5: Spatial expression map of mouse brain sections.
图6:鼠脑切片细胞类型。Figure 6: Cell types in mouse brain slices.
图7:捕获到的转录组的基因覆盖度分析结果。 Figure 7: Gene coverage analysis results of the captured transcriptome.
序列信息Sequence information
本申请涉及的序列的描述提供于下表中。A description of the sequences involved in this application is provided in the table below.
表1:序列信息
Table 1: Sequence information
注:V=A,C,or G;N=A,T,C,or G;“r”表示其3’相邻位置的核苷酸为核糖核苷酸;“iXNA”表示LNA锁核苷酸。Note: V=A, C, or G; N=A, T, C, or G; "r" means the nucleotide at the 3' adjacent position is a ribonucleotide; "iXNA" means LNA locked nucleotide.
现参照下列意在举例说明本发明(而非限定本发明)的实施例来描述本发明,且不意欲限制本发明所要求保护的范围。The present invention will now be described with reference to the following examples which are intended to illustrate the invention rather than to limit the invention and are not intended to limit the scope of the invention as claimed.
基于本申请提供的芯片获取样本空间转录组学信息的示例性实现流程如图2所示:An exemplary implementation process for obtaining spatial transcriptomic information of samples based on the chip provided in this application is shown in FIG2 :
(1)制备如图1所示的捕获芯片;(1) preparing a capture chip as shown in FIG1 ;
(2)mRNA捕获:组织切片贴在捕获芯片上,对组织进行处理释放mRNA分子;(2) mRNA capture: tissue sections are attached to the capture chip, and the tissue is processed to release mRNA molecules;
(3)释放的mRNA分子被芯片上的捕获探针所捕获,在具有末端转移酶活性的逆转录酶作用下原位合成cDNA分子;由于具有末端转移酶活性的逆转录酶会在新合成的cDNA分子的3'末端添加一些额外核苷酸C,与TSO(模板转换寡核苷酸)3’末端形成碱基互补配对,逆转录酶“转换”模板,以TSO为模板,继续合成cDNA分子,直至TSO的5'末端;(3) The released mRNA molecules are captured by the capture probes on the chip and synthesized into cDNA molecules in situ under the action of reverse transcriptase with terminal transferase activity; since the reverse transcriptase with terminal transferase activity will add some additional nucleotides C to the 3' end of the newly synthesized cDNA molecules, forming base complementary pairing with the 3' end of TSO (template switching oligonucleotide), the reverse transcriptase "switches" the template and continues to synthesize cDNA molecules using TSO as a template until the 5' end of TSO;
(4)cDNA分子3'端可被临近的含有位置信息的定位探针捕获,并进一步延伸,从而在cDNA分子上加上空间信息,合成带有空间信息的cDNA分子;(4) The 3' end of the cDNA molecule can be captured by the adjacent positioning probe containing position information and further extended, thereby adding spatial information to the cDNA molecule and synthesizing a cDNA molecule with spatial information;
(5)将cDNA分子从芯片上收集下来并进行cDNA扩增,或者,在芯片上直接进行PCR以扩增cDNA; (5) collecting cDNA molecules from the chip and performing cDNA amplification, or performing PCR directly on the chip to amplify cDNA;
(6)文库构建及测序;(6) Library construction and sequencing;
(7)数据分析,将空间信息和RNA表达情况还原至芯片位置。(7) Data analysis: restore the spatial information and RNA expression to the chip location.
此外,申请人期望强调的是,基于捕获区序列的不同设计,其可被设计用于靶向多种靶标序列,并不仅限于mRNA,因此,本申请的提供的芯片并不仅能用于获取空间转录组学信息,其同样可用于基因组、表观组、多组学等信息的获取,也可以用于获取特定目标分子的信息。In addition, the applicant wishes to emphasize that based on different designs of the capture zone sequence, it can be designed to target a variety of target sequences, not limited to mRNA. Therefore, the chip provided in this application can not only be used to obtain spatial transcriptomics information, but it can also be used to obtain genomic, epigenomic, multi-omics and other information, and can also be used to obtain information on specific target molecules.
1.捕获芯片制备1. Capture Chip Preparation
本申请的一个示例性捕获芯片包含定位探针和捕获探针,所述捕获探针从5’端至3’端依次包含接头序列和捕获序列,所述定位探针从5’端至3’端依次包含第一接头序列、空间位置序列(即,定位序列)和第二接头序列。An exemplary capture chip of the present application comprises a positioning probe and a capture probe, wherein the capture probe comprises a linker sequence and a capture sequence in sequence from the 5’ end to the 3’ end, and the positioning probe comprises a first linker sequence, a spatial position sequence (i.e., a positioning sequence) and a second linker sequence in sequence from the 5’ end to the 3’ end.
本实施例示出了所述捕获芯片的一个示例性制备方法,其包括:This embodiment shows an exemplary method for preparing the capture chip, which comprises:
(a)提供本领域常规使用的包含定位序列和捕获序列于同一探针的传统芯片(例如,可通过购买或自行制备的方式来提供所述传统芯片);所述传统芯片的探针从5’端至3’端依次包含第一接头序列、空间位置序列(即,定位序列)、第二接头序列和捕获序列;(a) providing a conventional chip conventionally used in the art that contains a positioning sequence and a capture sequence in the same probe (for example, the conventional chip can be provided by purchase or self-preparation); the probe of the conventional chip sequentially contains a first linker sequence, a spatial position sequence (i.e., a positioning sequence), a second linker sequence, and a capture sequence from the 5' end to the 3' end;
(b)将所述第二接头封闭,例如,使能够与所述第二接头序列退火的寡核苷酸分子与所述探针杂交,从而,在所述探针的第二接头的位置形成双链结构;(b) blocking the second linker, for example, hybridizing an oligonucleotide molecule capable of annealing to the second linker sequence with the probe, thereby forming a double-stranded structure at the position of the second linker of the probe;
(c)将核酸外切酶与(b)的产物进行孵育,所述核酸外切酶具有3’至5’端单链核酸外切酶活性;从而,使得所述传统芯片的探针的捕获序列被所述核酸外切酶切去,从而形成不含有捕获序列、含有定位序列的定位探针;(c) incubating a nuclease with the product of (b), wherein the nuclease has a 3' to 5' end single-stranded nuclease activity; thereby, the capture sequence of the probe of the conventional chip is cut off by the nuclease, thereby forming a positioning probe that does not contain a capture sequence but contains a positioning sequence;
(d)在(c)获得的含有定位探针的芯片上通过点击化学反应连接捕获探针,从而获得本申请的包含捕获探针和定位探针的一个示例性捕获芯片。(d) Connecting capture probes to the chip containing the positioning probes obtained in (c) through a click chemistry reaction, thereby obtaining an exemplary capture chip comprising capture probes and positioning probes of the present application.
具体地,捕获芯片制备步骤如下:Specifically, the capture chip preparation steps are as follows:
(1)封闭并切除常规芯片捕获区,形成定位探针:从生工合成一段oligo序列:GTCTTAGGAAGACAA(SEQ ID NO:1),使用5×SSC(saline sodium citrate)将其制备成1μM oligo溶液;取出stereo-seq transcriptomics T kit(华大,货号:111KT114)试剂盒里面的芯片(常规芯片,每一条核酸分子的结构从5’端到3’端方向依次为:接头1-空间位置序列-接头2-捕获序列poly T),吸取100μL 1μM oligo溶液至芯片上,常温杂交30min,从而封闭常规芯片上核酸分子的接头2;吸掉上述杂交液,并用0.1×SSC清洗芯片,加入50μL Exonuclease I(ThermoFisher:EN0581)反应液(5μL Exonuclease I,5μL 10x Reaction buffer,40μL ddH2O)37℃反应20min,从而切除常规芯片上核酸分子的捕获序列poly T;弃掉反应液,并用0.1×SSC清洗干净;芯片上仅含有“接头1-空间位置序列-接头2”。 (1) Block and remove the conventional chip capture area to form a positioning probe: synthesize an oligo sequence from Bio-Industry: GTCTTAGGAAGACAA (SEQ ID NO: 1), and use 5×SSC (saline sodium citrate) to prepare it into a 1 μM oligo solution; take out the chip in the stereo-seq transcriptomics T kit (BGI, catalog number: 111KT114) (conventional chip, the structure of each nucleic acid molecule from 5' to 3' end is: linker 1-spatial position sequence-linker 2-capture sequence poly T), draw 100 μL of 1 μM oligo solution onto the chip, and hybridize at room temperature for 30 min to block the linker 2 of the nucleic acid molecule on the conventional chip; draw off the above hybridization solution, wash the chip with 0.1×SSC, and add 50 μL Exonuclease I (ThermoFisher: EN0581) reaction solution (5 μL Exonuclease I, 5 μL 10x Reaction buffer, 40 μL ddH 2 O) react at 37°C for 20 minutes to remove the capture sequence poly T of the nucleic acid molecule on the conventional chip; discard the reaction solution and wash it with 0.1×SSC; the chip only contains "connector 1-spatial position sequence-connector 2".
(2)芯片表面修饰:配置0.1%多聚赖氨酸溶液(PLL,sigma P8920),加入到上述处理好的芯片上,30℃孵育3h,使得芯片表面都进行氨基化修饰,弃掉反应液,并用ddH2O清洗芯片后,干燥芯片1h。按说明书配置NHS-PEG-N3反应液(sigma JKA5088)并加到上述芯片上,37℃,反应5h,使得芯片表面修饰上叠氮基团。(2) Chip surface modification: Prepare 0.1% poly-lysine solution (PLL, sigma P8920), add it to the above-treated chip, and incubate at 30°C for 3 h to modify the chip surface with amino groups. Discard the reaction solution, wash the chip with ddH2O, and dry the chip for 1 h. Prepare NHS-PEG-N3 reaction solution (sigma JKA5088) according to the instructions and add it to the above-mentioned chip. Incubate at 37°C for 5 h to modify the chip surface with azide groups.
(3)芯片捕获探针oligo dT修饰:生工合成一段带有接头及oligo dT的捕获探针:5’-CCTCCGACTGTGTGACTTAGACTCCTGCCACCTCCTGATGTGCTTTTTTTTTTTTTTTTTTTTTTV-3’(SEQ ID NO:2),探针5’端为DBCO(二苯并环辛炔)修饰,用PBS将探针稀释成1μM,加入到芯片上37℃反应过夜,使得芯片表面的叠氮基团与DBCO修饰探针发生点击反应,进而将oligo dT探针连接在芯片上,获得含有捕获探针的芯片。(3) Oligo dT modification of chip capture probe: Bioengineering synthesized a capture probe with a linker and oligo dT: 5’-CCTCCGACTGTGTGACTTAGACTCCTGCCACCTCCTGATGTGCTTTTTTTTTTTTTTTTTTTTTTV-3’ (SEQ ID NO: 2). The 5’ end of the probe was modified with DBCO (dibenzocyclooctyne). The probe was diluted to 1 μM with PBS and added to the chip for reaction at 37°C overnight. This allowed the azide groups on the chip surface to undergo a click reaction with the DBCO-modified probe, thereby connecting the oligo dT probe to the chip to obtain a chip containing capture probes.
(4)芯片探针检测:使用3’端修饰cy5荧光的检测探针AAAAAAAAAAAAAAAAGCACA(SEQ ID NO:3)进行杂交拍照,检测芯片上捕获探针的合成情况以及步骤1中芯片上的捕获区域切除情况,结果如图3:图中①是步骤(2)获得的PLL修饰的芯片在添加检测探针后的拍照情况,图中②是步骤(1)获得的定位探针芯片的背景(未添加检测探针);图中③是步骤(1)获得的定位探针芯片在添加检测探针后的拍照情况;图中④是步骤(3)获得的连接oligo dT后的芯片在添加检测探针的荧光情况。结果显示,图中②和③显示商品化芯片上的原捕获区域poly T被成功切除,图中④说明新合成的捕获探针oligo dT被成功修饰在芯片上。(4) Chip probe detection: The detection probe AAAAAAAAAAAAAAAAGCACA (SEQ ID NO: 3) with cy5 fluorescence modified at the 3' end was used for hybridization photography to detect the synthesis of the capture probe on the chip and the excision of the capture area on the chip in step 1. The results are shown in Figure 3: Figure ① is a photograph of the PLL-modified chip obtained in step (2) after adding the detection probe, and Figure ② is the background of the positioning probe chip obtained in step (1) (no detection probe added); Figure ③ is a photograph of the positioning probe chip obtained in step (1) after adding the detection probe; Figure ④ is the fluorescence of the chip connected to oligo dT obtained in step (3) after adding the detection probe. The results show that Figures ② and ③ show that the original capture area poly T on the commercial chip was successfully excised, and Figure ④ shows that the newly synthesized capture probe oligo dT was successfully modified on the chip.
2.mRNA分子捕获2. mRNA Molecular Capture
参考stereo-seq transcriptomics T kit的说明书,将组织切片,例如鼠脑切片贴在芯片上,并对切片进行透化等处理后释放mRNA,使组织内的mRNA被芯片上的捕获探针捕获。Referring to the instruction manual of the stereo-seq transcriptomics T kit, tissue slices, such as mouse brain slices, are attached to the chip, and the slices are permeabilized to release mRNA, so that the mRNA in the tissue can be captured by the capture probes on the chip.
3.cDNA原位合成3. cDNA in situ synthesis
参考stereo-seq transcriptomics T kit的说明书,配置如下逆转录反应体系,42℃,反应5h。
Refer to the instructions of the Stereo-seq Transcriptomics T Kit and configure the following reverse transcription reaction system at 42°C for 5 hours.
4.添加空间信息4. Add spatial information
cDNA合成结束后,弃掉反应液后,按照说明书添加TR buffer(1000028507)反应30min移除掉组织;配制甲酰胺溶液,将其加入到芯片表面,并放置80℃反应3h,吸掉反应液,去除mRNA,并用ddH2O清洗芯片表面。加入5×SSC反应液至芯片表面,并于37℃反应30min;去掉反应液,并使用0.1×SSC清洗芯片表面,再加入如下反应液,37℃反应3h,延伸出空间信息。
After cDNA synthesis, discard the reaction solution, add TR buffer (1000028507) according to the instructions, react for 30 minutes, remove the tissue; prepare formamide solution, add it to the chip surface, and place it at 80℃ for 3 hours, aspirate the reaction solution, remove mRNA, and wash the chip surface with ddH 2 O. Add 5×SSC reaction solution to the chip surface and react at 37℃ for 30 minutes; remove the reaction solution, use 0.1×SSC to wash the chip surface, and then add the following reaction solution, react at 37℃ for 3 hours to extend the spatial information.
5.cDNA释放及cDNA扩增5. cDNA Release and cDNA Amplification
配置100mM的KOH溶液100μL,加至芯片55℃反应3h,收集反应上清液至PCR管,Prepare 100 μL of 100 mM KOH solution, add it to the chip and react at 55°C for 3 h. Collect the reaction supernatant into a PCR tube.
并使用50μL ddH2O清洗芯片,将清洗液与上清液合并至PCR管中,采用0.1M的HCl中和反应液至pH8.5,混匀后分成三管进行PCR反应,反应液如下,按照说明书进行PCR反应,获取的cDNA用生物分析仪2100检测,结果如图4。
The chip was cleaned with 50 μL ddH 2 O, and the cleaning solution and supernatant were combined into a PCR tube. The reaction solution was neutralized to pH 8.5 with 0.1 M HCl, and then divided into three tubes for PCR reaction. The reaction solution was as follows. The PCR reaction was carried out according to the instructions. The obtained cDNA was detected by Bioanalyzer 2100. The results are shown in Figure 4.
6.建库测序6. Library Construction and Sequencing
参考stereo-seq transcriptomics T kit说明书进行cDNA文库打断,制备成DNB。Refer to the instructions of stereo-seq transcriptomics T kit to shear the cDNA library and prepare DNB.
按照测序仪MGISEQ 2000操作说明,将DNB装载至MGISEQ2000的测序芯片上,进行测序。测序设置25bp进行空间信息解码,接着22bp暗反应,再进行5bp测序反应获取分子标签(MID),再进行50bp-100bp的测序反应可以获取cDNA 5’端转录本的信息。According to the operating instructions of the sequencer MGISEQ 2000, DNB was loaded onto the sequencing chip of MGISEQ2000 for sequencing. The sequencing was set to 25bp for spatial information decoding, followed by a 22bp dark reaction, and then a 5bp sequencing reaction to obtain the molecular tag (MID), and then a 50bp-100bp sequencing reaction to obtain the information of the cDNA 5' end transcript.
7.数据分析7. Data Analysis
(1)登录网站http://stereomap.cngb.org/Stereo-Draftsman/report/index,按照网站操作指南进行数据分析。将步骤6测序得到的read1序列(来自于一链测序)前25bp与 步骤1中的捕获芯片25bp位置信息进行比对,把能够比对到芯片上的位置信息的reads保留下来,并将它们对应到相应的芯片位置上。找出对应到芯片位置上的reads所对应的46bp起始的序列即为cDNA序列,将序列进行反向互补后,与鼠脑基因组进行比对,根据41bp-45bp获得的MID信息去掉重复的reads,获得鼠脑中每个基因表达的数目。(1) Log in to the website http://stereomap.cngb.org/Stereo-Draftsman/report/index and analyze the data according to the website operation guide. Compare the first 25 bp of the read1 sequence (from the first strand sequencing) obtained in step 6 with The 25bp position information of the capture chip in step 1 is compared, and the reads that can be compared to the position information on the chip are retained and matched to the corresponding chip positions. The sequence starting at 46bp corresponding to the reads corresponding to the chip position is found as the cDNA sequence. After reverse complementation of the sequence, it is compared with the mouse brain genome. According to the MID information obtained from 41bp-45bp, duplicate reads are removed to obtain the number of each gene expressed in the mouse brain.
(2)利用每个基因表达的数目进一步作图,获得鼠脑切片的空间表达图谱如图5,并表达量进行空间聚类分析,可以得到该鼠脑切片细胞类型如图6。(2) The number of genes expressed was further plotted to obtain the spatial expression map of the mouse brain slice as shown in Figure 5. The expression levels were subjected to spatial cluster analysis to obtain the cell types of the mouse brain slice as shown in Figure 6.
(3)5’端转录本捕获分析(3) 5’-end transcript capture analysis
将捕获到的转录组进行基因覆盖度分析,可以看到捕获到的转录组多数集中在转录起始区(TSS)区,证明该方案可以实现5’端转录本的捕获,如图7所示。The captured transcriptome was subjected to gene coverage analysis, and it can be seen that most of the captured transcriptome is concentrated in the transcription start region (TSS), proving that this scheme can achieve the capture of 5' end transcripts, as shown in Figure 7.
尽管本发明的具体实施方式已经得到详细的描述,但本领域技术人员将理解:根据已经公布的所有教导,可以对细节进行各种修改和变动,并且这些改变均在本发明的保护范围之内。本发明的全部分为由所附权利要求及其任何等同物给出。 Although the specific embodiments of the present invention have been described in detail, it will be understood by those skilled in the art that various modifications and changes may be made to the details according to all the teachings that have been published, and these changes are within the scope of protection of the present invention. The entire invention is given by the attached claims and any equivalents thereof.
Claims (44)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2023/143677 WO2025138284A1 (en) | 2023-12-29 | 2023-12-29 | Array and method for detecting spatial information of nucleic acids |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/CN2023/143677 WO2025138284A1 (en) | 2023-12-29 | 2023-12-29 | Array and method for detecting spatial information of nucleic acids |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2025138284A1 true WO2025138284A1 (en) | 2025-07-03 |
Family
ID=96216481
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2023/143677 Pending WO2025138284A1 (en) | 2023-12-29 | 2023-12-29 | Array and method for detecting spatial information of nucleic acids |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2025138284A1 (en) |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103781918A (en) * | 2011-04-13 | 2014-05-07 | 空间转录公司 | Methods and products for local or spatial detection of nucleic acids in tissue samples |
| CN110724733A (en) * | 2019-10-30 | 2020-01-24 | 东南大学 | Sequencing chip for high-resolution space transcriptome |
| CN114207105A (en) * | 2019-05-15 | 2022-03-18 | 深圳华大生命科学研究院 | Array for detecting nucleic acid spatial information and detection method |
| CN115698324A (en) * | 2020-02-21 | 2023-02-03 | 10X基因组学有限公司 | Methods and compositions for integrating in situ spatial determination |
-
2023
- 2023-12-29 WO PCT/CN2023/143677 patent/WO2025138284A1/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103781918A (en) * | 2011-04-13 | 2014-05-07 | 空间转录公司 | Methods and products for local or spatial detection of nucleic acids in tissue samples |
| CN114207105A (en) * | 2019-05-15 | 2022-03-18 | 深圳华大生命科学研究院 | Array for detecting nucleic acid spatial information and detection method |
| CN110724733A (en) * | 2019-10-30 | 2020-01-24 | 东南大学 | Sequencing chip for high-resolution space transcriptome |
| CN115698324A (en) * | 2020-02-21 | 2023-02-03 | 10X基因组学有限公司 | Methods and compositions for integrating in situ spatial determination |
Non-Patent Citations (2)
| Title |
|---|
| RAO, A. ET AL.: "Exploring Tissue Architecture Using Spatial Transcriptomics", NATURE, vol. 596, no. 7871, 11 August 2021 (2021-08-11), pages 211 - 220, XP037535469, ISSN: 0028-0836, DOI: 10.1038/s41586-021-03634-9 * |
| STAHL, P. L ET AL.: "Visualization and Analysis of Gene Expression in Tissue Sections by Spatial Transcriptomics", SCIENCE, vol. 353, no. 6294, 1 July 2016 (2016-07-01), pages 78 - 82, XP093007918, ISSN: 0036-8075 * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP3559274B1 (en) | Reagents and methods for the analysis of linked nucleic acids | |
| JP5916166B2 (en) | Methods and products for localized or spatial detection of nucleic acids in tissue samples | |
| JP2024149636A (en) | Nuclease-based RNA depletion | |
| AU2018429361B2 (en) | Methods for the analysis of circulating microparticles | |
| US20220315984A1 (en) | Reagents and Methods for the Analysis of Microparticles | |
| CN114207105B (en) | Array and detection method for detecting nucleic acid spatial information | |
| JP2023547394A (en) | Nucleic acid detection method by oligohybridization and PCR-based amplification | |
| JP2025525100A (en) | Method for preparing a normalized nucleic acid sample, kit and device for use in said method | |
| WO2025138284A1 (en) | Array and method for detecting spatial information of nucleic acids | |
| US20060228714A1 (en) | Nucleic acid representations utilizing type IIB restriction endonuclease cleavage products | |
| WO2023116373A1 (en) | Method for generating population of labeled nucleic acid molecules and kit for the method | |
| WO2023115536A1 (en) | Method for generating labeled nucleic acid molecular population and kit thereof | |
| WO2023116376A1 (en) | Labeling and analysis method for single-cell nucleic acid | |
| EP3696279A1 (en) | Methods for noninvasive prenatal testing of fetal abnormalities | |
| HK40066267A (en) | Array and method for detecting spatial information of nucleic acids | |
| HK40066267B (en) | Array and method for detecting spatial information of nucleic acids |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23962957 Country of ref document: EP Kind code of ref document: A1 |