[go: up one dir, main page]

WO2025000136A1 - Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit - Google Patents

Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit Download PDF

Info

Publication number
WO2025000136A1
WO2025000136A1 PCT/CN2023/102155 CN2023102155W WO2025000136A1 WO 2025000136 A1 WO2025000136 A1 WO 2025000136A1 CN 2023102155 W CN2023102155 W CN 2023102155W WO 2025000136 A1 WO2025000136 A1 WO 2025000136A1
Authority
WO
WIPO (PCT)
Prior art keywords
rna
reagent
reaction
preparation
base
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2023/102155
Other languages
English (en)
Chinese (zh)
Inventor
欧日晶
张艳
王新新
雷常贵
王莹莹
王业钦
殷建华
金鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BGI Shenzhen Co Ltd
Original Assignee
BGI Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BGI Shenzhen Co Ltd filed Critical BGI Shenzhen Co Ltd
Priority to PCT/CN2023/102155 priority Critical patent/WO2025000136A1/fr
Publication of WO2025000136A1 publication Critical patent/WO2025000136A1/fr
Anticipated expiration legal-status Critical
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the present invention relates to the field of biotechnology, and in particular to a method for preparing a chain-specific library for rapidly detecting multiple types of RNA and a high-throughput sequencing technology.
  • RNA Cell free RNA
  • body fluids such as blood, urine, cerebrospinal fluid, saliva, milk, pleural effusion, ascites, etc.
  • RNA in cells include ribosomal RNA (rRNA), messenger RNA (mRNA), transfer RNA (tRNA), micro non-coding RNA (miRNA), long non-coding RNA (lncRNA), etc.
  • the free RNA in body fluid samples carries real-time gene expression information from different tissues and organs, which can provide more solutions for early screening of diseases, auxiliary diagnosis, recurrence monitoring, disease prognosis, etc. Due to the non-invasive or minimally invasive sampling method and the advantages of real-time and comprehensive monitoring, free RNA may become an important detection method for precision medicine in the future.
  • RNA detection technology based on high-throughput sequencing has the advantages of single-base resolution, wide detection range and high throughput.
  • challenges in the preparation technology of free RNA high-throughput sequencing libraries including but not limited to low sample starting amount, easy RNA degradation, long experimental cycle, low efficiency, single or limited target RNA type and other problems.
  • the scientific research field mainly uses kits suitable for sequencing library preparation of conventional cell tissue sample RNA to study free RNA.
  • the commercial RNA sequencing library preparation kits on the market have relatively single or limited target RNA types, and the information that can be obtained is also limited, and it is impossible to conduct systematic research on all types of RNA.
  • RNA sequencing library preparation kits are mainly divided into kits suitable for mRNA and miRNA, which cannot capture multiple types of RNA in total RNA at the same time.
  • Various types of RNA include mRNA, lncRNA, tRNA, miRNA, etc.
  • Most commercial RNA sequencing library preparation kits have high requirements for the starting amount of total RNA.
  • commercial RNA sequencing libraries have more experimental steps, more complicated processes, and longer experimental cycles, which are not conducive to rapid and stable detection in the clinical field.
  • kits suitable for mRNA detection include Illumina's TruSeq RNA Library Prep Kit v2, TruSeq Stranded mRNA and Total RNA Library prep kits, NEB's NEBNext Ultra TM II RNA Library Prep Kit for Illumina, TAKARA's SMARTer Stranded Total RNA-Seq Kit, etc.
  • the above kit scheme uses polydeoxythymidine ribonucleotide (oligo dT) magnetic beads or primers to selectively enrich mRNA containing polyadenylic acid (polyA) tails and specifically prepare mRNA sequencing libraries.
  • oligo dT polydeoxythymidine ribonucleotide
  • polyA polyadenylic acid
  • the above kit can also capture lncRNA while capturing mRNA by adapting the rRNA removal step and then using primers containing multiple random bases for reverse transcription, but this method is difficult to capture more short fragments of miRNA and other RNA.
  • the above kits all need to first reverse transcribe into single-stranded cDNA, and then synthesize complementary double-stranded DNA to form double-stranded DNA, and then prepare it as a sequencing library on this basis, so the experimental steps are relatively complicated and the experimental cycle is relatively long.
  • kits suitable for miRNA or small RNA For example, Illumina's TruSeq small RNA Library Prep Kit, Bioo Scientific's Nextflex small RNA-Seq Kit v3, NEB's NEBNext small RNA Library Prep Set for Illumina, MGI's MGIEasy small RNA library preparation kit, and QIAGEN's QIAseq miRNA Library Kit.
  • the above kits all use RNA ligase to directly connect adapters containing specific sequences to the 3' and 5' ends of RNA molecules, then transcribe them into cDNA, and then prepare sequencing libraries through PCR amplification reactions.
  • kits are not suitable for capturing long-chain RNAs such as mRNA and lncRNA, and the experiments require RNA ligase, which is relatively costly, relatively inefficient, takes a long time to connect, has many steps, and a long experimental cycle.
  • RNA Ribonucleic acid
  • oligo dT polydeoxythymidine ribonucleotides
  • PCR is performed using the adapter sequences added at both ends to complete the preparation of the sequencing library.
  • RNA library preparation methods are not suitable for low starting amounts or degraded samples, and have a low adaptability to plasma samples.
  • Scientific research programs generally require mL-level plasma inputs, and this higher starting amount of plasma input will reduce its widespread use.
  • the technical problem to be solved by the present invention is to provide a method for preparing a chain-specific library for rapidly detecting multiple types of RNA and a high-throughput sequencing technology.
  • the present invention provides a method for preparing a chain-specific library for rapidly detecting multiple types of RNA and a high-throughput sequencing technology, which have high sensitivity, a wide range of applications, strong anti-interference ability, a simple and quick preparation method, and are suitable for high-throughput sequencing technology.
  • the present invention provides a method for preparing various types of RNA libraries, including: adding polyA to the end of an RNA sample, performing reverse transcription and U base digestion to prepare a cDNA library;
  • the RNA sample contains at least one of total RNA, rRNA, mRNA, tRNA, miRNA and/or lncRNA.
  • the present invention adds polyadenylic acid (poly A) tails to the ends of various types of RNA molecules and combines the reverse transcription method using polydeoxythymidine ribonucleotide (oligo dT) primers, so that various types of RNA molecules including non-coding RNA (lncRNA, miRNA, etc.) can be reverse transcribed in subsequent steps, thereby realizing the construction of libraries of various types of RNA.
  • poly A polyadenylic acid
  • oligo dT polydeoxythymidine ribonucleotide
  • the preparation method specifically includes: DNA digestion, RNA end modification, poly A tail addition, reverse transcription, U base digestion, cDNA end modification, denaturation, linker addition and library construction.
  • the present invention combines two or more of the above-mentioned specific reaction steps by optimizing the reaction reagents and other conditions. For example, in some embodiments, the steps of DNA digestion, RNA end modification and polyA tailing are combined. In some embodiments, the steps of DNA digestion and RNA end modification are combined, and then the steps of polyA tailing and reverse transcription are combined. In some embodiments, the steps of U base digestion, cDNA end modification and denaturation are combined. In some embodiments, the steps of U base digestion and cDNA end modification are combined. By combining the reaction steps, the reaction steps are reduced and the required time is shortened, while ensuring the efficiency of the reaction and the quality of the obtained library.
  • the DNA digestion and RNA end modification are performed in a first system
  • the reverse transcription is performed in a second system
  • the step of adding the polyA tail is performed in the first system or the second system.
  • the step of adding polyA tail is carried out in a first system, wherein:
  • the first system includes: RNA sample, PolyA polymerase reaction buffer, ATP, BSA, DNase I, T4 polynucleotide kinase, PolyA polymerase, RNase inhibitor and nuclease-free water;
  • the second system includes: the reaction product of the first system, reverse transcription primer, HiScript III reaction buffer, HiScript III reverse transcriptase, dNTP Mix, RNase inhibitor and nuclease-free water.
  • the present invention optimizes the reaction system.
  • the components in the reaction system are properly coordinated to jointly ensure the progress of the reaction.
  • the present invention optimizes the concentration of the components in each system.
  • the concentrations of the components in the first system are as follows: 14 ⁇ L RNA sample, 3 ⁇ L 10 ⁇ PolyA polymerase reaction buffer, 1 ⁇ L 10 mM ATP, 4 ⁇ L 10 mg/mL BSA, 2 ⁇ L DNase I, 0.5 ⁇ L 10 U/ ⁇ L T4 polynucleotide kinase, 1 ⁇ L 5 U/ ⁇ L PolyA polymerase, 0.5 ⁇ L 40 U/ ⁇ L RNase inhibitor, and 4 ⁇ L nuclease-free water;
  • the concentrations of the components of the second system include: 30 ⁇ L of the reaction product of the first system, 2 ⁇ L of 5 ⁇ M reverse transcription primer, 4 ⁇ L of 5 ⁇ HiScript III reaction buffer, 1 ⁇ L of 200 U/ ⁇ l HiScript III reverse transcriptase, 2 ⁇ L of 5 mM dNTP Mix, 0.5 ⁇ L of 40 U/ ⁇ l RNase inhibitor and 10.5 ⁇ L of nuclease-free water.
  • the step of adding polyA tail is carried out in the second system, wherein:
  • the first system includes: RNA sample, DNase I reaction buffer, ATP, DNase I, T4 polynucleotide kinase and RNase inhibitor;
  • the second system includes: the reaction product of the first system, HiScript III reaction buffer, BSA, PEG8000, dNTP Mix, reverse transcription primer, PolyA polymerase, RNase inhibitor, HiScript III reverse transcriptase and nuclease-free water.
  • the present invention optimizes the reaction system.
  • the components in the reaction system are properly coordinated to jointly ensure the progress of the reaction.
  • the present invention optimizes the concentration of the components in each system.
  • the concentrations of the components in the first system are: 14 ⁇ L RNA sample, 2 ⁇ L 10 ⁇ DNase I reaction buffer, 1 ⁇ L 10 mM ATP, 2 ⁇ L DNase I, 0.5 ⁇ L 10 U/ ⁇ L T4 polynucleotide kinase, and 0.5 ⁇ L 40 U/ ⁇ L RNase inhibitor.
  • the concentrations of the components of the second system are: 20 ⁇ L of the reaction product of the first system, 6 ⁇ L of 5 ⁇ HiScript III reaction buffer, 1 ⁇ L of 10 mg/mL BSA, 10 ⁇ L of 50% PEG8000, 2 ⁇ L of 5 mM dNTP Mix, 2 ⁇ L of 5 ⁇ M reverse transcription primer, 1 ⁇ L of 5 U/ ⁇ L Poly A polymerase, 0.5 ⁇ L of 40 U/ ⁇ l RNase inhibitor, 1 ⁇ L of 200 U/ ⁇ l HiScript III reverse transcriptase and 6.5 ⁇ L of nuclease-free water.
  • the cDNA end modification and denaturation are performed in a third system.
  • the third system comprises: the reaction product after U base digestion, polynucleotide kinase reaction buffer, T4 polynucleotide kinase, Tris buffer with pH 8.0, ultra-thermostable single-stranded binding protein and nuclease-free water.
  • the present invention optimizes the reaction system.
  • the components in the reaction system are properly coordinated to jointly ensure the progress of the reaction.
  • the present invention optimizes the concentration of the components in each system.
  • the concentrations of the components of the third system are: the reaction product after U base digestion, 5 ⁇ L of 10 ⁇ polynucleotide kinase reaction buffer, 1 ⁇ L of 10U/ ⁇ L T4 polynucleotide kinase, 2 ⁇ L of 220mM Tris buffer, pH 8.0, 0.6 ⁇ L of 500ng/ ⁇ L ultra-thermostable single-stranded binding protein, and 21.4 ⁇ L of nuclease-free water.
  • cDNA end modification, denaturation and U base digestion are performed in the same system, and therefore, the third system also includes reagents for U base digestion.
  • the third system includes: the reaction product of polyA tailing and reverse transcription, polynucleotide kinase reaction buffer, T4 polynucleotide kinase, Tris buffer at pH 8.0, ultra-thermostable single-stranded binding protein, uracil-specific excision reagent USER enzyme and nuclease-free water.
  • the present invention optimizes the reaction system.
  • the components in the reaction system are properly coordinated to jointly ensure the progress of the reaction.
  • the present invention optimizes the concentration of the components in each system.
  • the concentrations of the components of the third system are: the reaction product after polyA tailing and reverse transcription, 5 ⁇ L of 10 ⁇ polynucleotide kinase reaction buffer, 1 ⁇ L of 10 U/ ⁇ L T4 polynucleotide kinase, 2 ⁇ L of 220 mM Tris buffer, pH 8.0, 0.6 ⁇ L of 500 ng/ ⁇ L ultra-thermostable single-stranded binding protein, 2 ⁇ L of 10 U/ ⁇ L uracil-specific excision reagent USER enzyme, and 19.4 ⁇ L of nuclease-free water.
  • the present invention adopts a flexible method of combining multiple reaction steps into the same experimental operation step and optimizes the experimental conditions, thereby reducing the operation steps and time required for library preparation and ensuring the efficiency of the reaction, thereby obtaining better experimental results.
  • reverse transcription primer has the following nucleotide sequence: poly(T)n-UVNm;
  • n represents the number of bases T
  • m represents the number of bases N
  • n is an integer of 8 to 50
  • m is an integer of 1 to 4
  • At least one T in the poly(T)n is replaced by U; V is selected from any one of base A, base C and base G; and N is selected from any one of base A, base T, base C and base G.
  • the present invention provides a polydeoxythymidine ribonucleotide (oligo dT) reverse transcription primer containing deoxyuracil (dU) for synthesizing a single-stranded cDNA molecule by reverse transcription reaction.
  • the primer sequence comprises deoxyuracil ribonucleotide (dU), polydeoxythymidine ribonucleotide (oligo dT), and other random bases (V and N).
  • a single-stranded cDNA molecule containing a polydeoxythymidine ribonucleotide sequence of deoxyuracil (dU) can be obtained, so that a uracil-specific excision reagent is used to perform a digestion reaction on the single-stranded cDNA in the U base digestion step, thereby excising the polydeoxythymidine ribonucleotide sequence fragment containing deoxyuracil (dU) in the single-stranded cDNA, removing the influence of the low-complexity sequence artificially introduced in the previous step on the subsequent sequencing and analysis, and better realizing the preparation of the cDNA library.
  • the solution provided by the present invention is more conducive to increasing the amount of effective data in sequencing.
  • the reverse transcription primer has a nucleotide sequence as shown in SEQ ID NO: 1.
  • SEQ ID NO: 1 the reverse transcription primer shown in SEQ ID NO: 1 can achieve a better reverse transcription reaction, thereby obtaining a better experimental effect.
  • the linker includes a 5' end linker and a 3' end linker, the 5' end linker sequence has a nucleotide sequence as shown in SEQ ID NO 2 and SEQ ID NO 3, and the 3' end linker sequence has a nucleotide sequence as shown in SEQ ID NO 4 and SEQ ID NO 5.
  • the present invention provides a connector for introducing a specific target sequence at the 3' and 5' ends of a single-stranded cDNA molecule and a corresponding DNA connection scheme.
  • Both connectors contain a double-stranded region and at least one protruding single-stranded region, wherein the double-stranded region contains a universal structural sequence for PCR or sequencing, and the protruding single-stranded region contains one or more (1 to 10) random base sequences, which are used for complementary pairing with the end of the single-stranded DNA molecule, so that the connector and the single-stranded DNA can be splinted.
  • Each connector is formed by the interaction of two polynucleotide sequences to form its special structure, and the two sequences contain complementary pairing regions, which will be complementary paired to form a specific double-stranded structure and a protruding single-stranded region containing random bases after solution mixing and static treatment.
  • PCR amplification can be directly performed to obtain the final library, so as to better realize the preparation of cDNA library.
  • the present invention can connect the DNA adapters at the 3’ and 5’ ends simultaneously in one step, thereby shortening the steps and time while ensuring the connection effect.
  • the reaction system for adding a linker includes: a 5' end linker solution, a 3' end linker solution, a ligation reaction buffer, a ligation reaction enhancer and T4 DNA ligase.
  • a DNA ligase scheme is used to replace RNA ligase, thereby greatly shortening the connection time (in the scheme of adding linkers at both ends of RNA using RNA ligase, due to the relatively low connection efficiency of RNA ligase, the single-end linker connection reaction time usually takes 1 to 2 hours, and the experimental steps of connecting linkers at both ends in steps are also relatively complicated, and the total connection time of the two-end linkers is 2 to 3 hours).
  • the present invention optimizes other components and their concentrations in the reaction system, thereby further improving the connection efficiency of the linker.
  • the reaction system for adding a linker includes: a 5' end linker solution, a 3' end linker solution, a ligation reaction buffer, a hexammaminecobalt chloride solution and T4 DNA ligase.
  • the present invention also provides a sequencing method for various types of RNA libraries, and the cDNA library prepared by the above preparation method is used as a sample for on-machine sequencing.
  • the cDNA library is sequenced after PCR amplification, purification, sample mixing, and single-stranded circularization.
  • the upstream primer of the PCR amplification has a nucleotide sequence as shown in SEQ ID NO 6, and the downstream primer of the PCR amplification has a nucleotide sequence as shown in SEQ ID NO 7.
  • the downstream primers may contain a label (Barcode) sequence for sample identification.
  • Barcode a label
  • the primer sequence By using the primer sequence to amplify the library sample, library samples with different Barcode sequence labels can be obtained.
  • the Barcode sequence of the primer band can be matched with the Barcode of the universal sequencing library of the sequencing platform.
  • the structure of the PCR amplification product can be consistent with the universal library structure of the sequencing platform, so that the source of the sample library can be accurately separated by the Barcode.
  • the present invention provides library construction reagents, including reagent I, reagent II and reagent III;
  • the reagent I comprises: DNase I reaction buffer, ATP, DNase I, T4 polynucleotide kinase, and RNase inhibitor;
  • the reagent II includes: reverse transcription primer, HiScript III reaction buffer, HiScript III reverse transcriptase, dNTP Mix, and RNase inhibitor;
  • the reagent III comprises: polynucleotide kinase reaction buffer, T4 polynucleotide kinase, Tris buffer at pH 8.0, and super-thermostable single-stranded binding protein.
  • the reagent I further comprises: PolyA polymerase reaction buffer, BSA, PolyA polymerase;
  • the reagent II also includes: BSA, PEG8000, and PolyA polymerase.
  • the reagent III also includes a U base excision reagent, and the U base excision reagent includes a uracil-specific excision reagent USER enzyme.
  • linker adding reagent which includes: Tris-Hcl buffer, sodium chloride, EDTA, linkers as shown in SEQ ID NO 2 to 5, ligation reaction buffer, T4 ligase and hexamminecobalt chloride.
  • a purification reagent which includes Agencourt AMPure XP magnetic beads.
  • PCR amplification reagents which include PCR enzyme reaction solution, upstream primers of the nucleotide sequence shown in SEQ ID NO 6, and downstream primers of the nucleotide sequence shown in SEQ ID NO 7.
  • RNA extraction reagents also includes RNA extraction reagents.
  • the present invention also provides the application of the construction reagent in preparing various types of RNA libraries.
  • the present invention provides a method for preparing a strand-specific library for rapid detection of multiple types of RNA and a high-throughput sequencing technology.
  • the present invention uses polyadenylic acid polymerase to artificially add polyadenylic acid (poly A) tails to the 3' ends of multiple RNAs, and simultaneously uses polydeoxythymidine ribonucleotide primers with deoxyuracil to synthesize single-stranded cDNA under the action of reverse transcriptase.
  • the obtained single-stranded cDNA molecules are subjected to a series of reactions and finally amplified by PCR to obtain a strand-specific library of multiple types of RNA.
  • the present invention uses polyadenylic acid polymerase to artificially add polyadenylic acid (poly A) tails to the 3’ ends of various RNAs, so that various types of RNA molecules including non-coding RNA (lncRNA, miRNA, etc.) can be reverse transcribed in subsequent steps, thereby achieving library construction and sequencing of various types of RNAs;
  • Single-stranded cDNA is synthesized using poly-deoxythymidine ribonucleotide primers containing deoxyuracil under the action of reverse transcriptase.
  • the obtained single-stranded cDNA molecules can remove the artificially added polynucleotide sequences through the U base digestion reaction, avoiding the addition of low-complexity nucleotide sequences in the sequencing library, so that the sequencing quality and data volume are guaranteed;
  • the present invention adopts a single-stranded cDNA molecule library preparation method, which does not require the synthesis of complementary double-stranded DNA of cDNA molecules, thereby reducing the steps and time required for library preparation.
  • the present invention adopts a single-stranded cDNA molecule library preparation method, the strand-specific information of RNA is retained, which is more conducive to RNA analysis such as gene annotation;
  • the present invention combines some reaction steps by optimizing the reaction reagents and other conditions.
  • the three reaction steps of DNA digestion, RNA end modification and polyadenylic acid tailing reaction are combined into the same operation step; the two reaction steps of cDNA end modification and denaturation reaction are combined into one operation step.
  • the two reaction steps of DNA digestion and RNA end modification reaction are combined into one operation step; the two reaction steps of polyadenylic acid tailing and reverse transcription reaction are combined into one operation step; the three reaction steps of U base digestion, cDNA end modification and denaturation reaction are combined into the same operation step, etc.
  • the overall solution reduces the number of operation steps and time required for library preparation, and ensures the efficiency of the reaction;
  • connection reaction enhancer such as the chemical reagent hexaaminocobalt chloride
  • the present invention uses the reverse transcribed cDNA and the adapter for connection and adds the sequencing structure sequence. Since the DNA ligase scheme is used instead of the RNA ligase scheme, the DNA ligase is superior to the RNA ligase in terms of both cost and efficiency. The total connection time only takes 30 minutes or even shorter. The overall experimental process has fewer operating steps and takes less time.
  • FIG1 is a schematic diagram showing the RNA library preparation and detection principle
  • FIG2 is a schematic diagram of linker preparation, wherein P represents a phosphorylation modification group and B represents a modification group that blocks the connection;
  • FIG3 is a schematic diagram showing a method for preparing an RNA library
  • FIG4 shows the sequencing quality distribution diagram of RNA library sequencing along with the sequencing cycle number
  • FIG5 shows the number of various RNA genes detected in the samples, wherein the RNA samples in FIG5a include mRNA IncRNA and pseudogene RNA, and the RNA samples in FIG5b include miRNA, tRNA, mt-tRNA, mt-rRNA, snoRNA and snRNA;
  • FIG6 shows the number and percentage of various RNA genes detected in the samples, wherein FIG6a is a 200 ⁇ L starting plasma free RNA sample, FIG6b is a 10 ng starting amount UHRR sample, and FIG6c is a 2 ng starting amount UHRR sample;
  • FIG7 shows a scatter plot of gene expression between samples and a consistency analysis result diagram, wherein the Log2(TPM+1) value of gene expression is used for analysis, FIG7a is two technical parallels of a 200 ⁇ L starting plasma free RNA sample, FIG7b is a UHRR sample with a starting amount of 10 ng and 100 ng, and FIG7c is a UHRR sample with a starting amount of 2 ng and 10 ng;
  • FIG8 shows the Pearson correlation coefficient results of gene expression between samples, wherein FIG8a is the Pearson correlation coefficient calculated using the TPM value of the gene expression, and FIG8b is the Pearson correlation coefficient calculated using the Log2(TPM+1) value after the gene expression is converted.
  • the present invention provides a method for preparing a strand-specific library and a high-throughput sequencing technology for rapid detection of multiple types of RNA.
  • Those skilled in the art can refer to the content of this article and appropriately improve the process parameters to achieve it. It should be particularly noted that all similar substitutions and modifications are obvious to those skilled in the art, and they are all considered to be included in the present invention.
  • the methods and applications of the present invention have been described through preferred embodiments, and relevant personnel can obviously modify or appropriately change and combine the methods and applications of this article without departing from the content, spirit and scope of the present invention to implement and apply the technology of the present invention.
  • the present invention provides a chain-specific library preparation and high-throughput gene sequencing technology for rapid detection of various types of RNA molecules.
  • This technology can read various types of RNA information with a relatively low starting amount of RNA input, and is suitable for various types of usage scenarios such as tissue cells and body fluid samples.
  • RNA information can be read and analyzed using high-throughput sequencing technology.
  • the present invention provides a chain-specific library preparation method and high-throughput sequencing technology for rapid detection of multiple types of RNA, which can simultaneously detect multiple types of RNA including mRNA, lncRNA, tRNA, miRNA, etc.
  • the method provided by the present invention is suitable for high-throughput sequencing detection of multiple sample types such as tissue cell RNA and free RNA (cell-free RNA, abbreviated as cfRNA), and can be applicable to RNA samples with low starting amounts at the ng level or even the pg level.
  • the present invention can be used in the fields of free RNA molecular diagnosis in the field of liquid biopsy, RNA molecular detection in tissue cells, etc., and has broad application prospects in the directions of early disease screening, auxiliary diagnosis, recurrence monitoring, disease prognosis, etc., among which potential clinical application scenarios include but are not limited to complex diseases such as pregnancy diseases, tumors, cardiovascular and cerebrovascular diseases, infectious diseases, genetic diseases, nervous system diseases, and mental diseases.
  • the present invention can be extended to research in the fields of animals, plants, and microorganisms, and simultaneously detect various types of RNA; it can be used to study the growth and development characteristics, tissue and cell functions, gene functions, environmental adaptability, species interactions, etc.
  • the technical principle of the present invention may also be extended to single-cell or spatiotemporal omics technology, which can be used to simultaneously capture different types of RNA, including non-coding RNA, and characterize the expression of multiple types of RNA in single cells or at different spatial locations.
  • the principle of the technology of the present invention combined with long-chain PCR technology and single-molecule sequencing, may also be applied to the application research of the full-length transcriptome.
  • an optional step is to perform DNA digestion on the RNA sample to remove the influence of residual DNA molecules on subsequent steps.
  • the end modification enzyme is used to modify the end of the RNA molecule, and the 3' end of the RNA molecule is hydroxylated, so that more natural RNA molecules can achieve polyadenylic acid tailing reaction at the 3' end.
  • polyadenylic acid polymerase is used to complete polyadenylic acid tailing at the 3' end of the RNA molecule.
  • oligo dT primers containing deoxyuracil (dU) are used to complementally pair with RNA molecules, and single-stranded cDNA is synthesized using RNA as a template under the action of reverse transcriptase to obtain single-stranded cDNA molecules with deoxyuracil (dU) and polydeoxythymidine ribonucleotide (oligo dT) sequences.
  • a uracil-specific excision reagent is used to digest the cDNA, thereby excising the polydeoxythymidine ribonucleotide sequence fragment containing deoxyuracil (dU) in the cDNA, removing the influence of the low-complexity sequence artificially introduced in the previous step on subsequent sequencing and analysis. Then, the cDNA library is prepared.
  • the cDNA library preparation method provided by the present invention is to first perform high temperature denaturation treatment on cDNA to unwind the double-stranded structure that may appear in the local area of cDNA.
  • the denaturation reaction system can add single-stranded binding protein to assist in maintaining the linear structure of single-stranded cDNA molecules to avoid annealing and renaturing into complex hairpin-like structures after denaturation.
  • the cDNA after high temperature denaturation is then used through special double-stranded adapters and matching connection technology to complete the cDNA library preparation.
  • the present invention provides a single-stranded DNA library preparation technology.
  • adapters can be added to both ends of the single-stranded cDNA molecule at the same time, and a universal structural sequence for PCR amplification reaction can be introduced, and then a sequencing library can be obtained through PCR amplification.
  • the sequencing library product is subsequently processed and sequenced according to the requirements of the gene sequencer, and the purpose of detecting RNA is achieved by reading and analyzing the sequence information.
  • the label (Barcode) sequence for sample identification can be introduced through PCR primers, and the library samples with different Barcode sequences can be mixed into a sequencing sample according to the proportion requirements of the sequencing data.
  • the sequencing samples are subsequently processed and sequenced according to the requirements of the gene sequencer.
  • Each sequencing result will be accurately located in each sample after the Barcode sequence alignment and splitting, and the sequencing results of each sample will be sequenced to achieve the purpose of detecting RNA for each sample.
  • the Barcode sequence is used to achieve the mixing of multiple library samples for sequencing detection, improve the detection throughput, and reduce the detection cost.
  • the present invention completes polyadenylic acid tailing at the 3' end of various types of RNA molecules, and combines the reverse transcription method using polydeoxythymidine ribonucleotide (oligo dT) primers to achieve effective reverse transcription of various types of RNA, thereby achieving the capture of various types of RNA molecules and the construction of sequencing libraries.
  • Various types of RNA include but are not limited to mRNA, lncRNA, miRNA, tRNA, etc.
  • the polyadenylic acid tail is Poly (A) n (n represents the number of adenine A, and n is an integer between 8 and 200).
  • the present invention designs a polydeoxythymidine ribonucleotide (oligo dT) primer containing deoxyuracil (dU) for synthesizing a single-stranded cDNA molecule by reverse transcription reaction, and the primer sequence is named as the first nucleotide sequence.
  • the primer sequence contains deoxyuracil ribonucleotide (dU), polydeoxythymidine ribonucleotide (oligo dT), and other random bases (V and N), and its sequence is the nucleotide composition shown in Table 1 below.
  • the present invention can combine the multiple reaction steps mentioned in the above experimental principles into the same experimental operation step, thereby reducing the operation steps and time required for library preparation and ensuring the efficiency of the reaction.
  • the preparation method of the library of the present invention specifically includes: DNA digestion, RNA end modification, adding polyA tail, reverse transcription, U base digestion, cDNA end modification, denaturation, adding linker and library construction.
  • the DNA digestion and RNA end modification are performed in a first system; the reverse transcription is performed in a second system; and the cDNA end modification and denaturation are performed in a third system.
  • the step of adding the polyA tail is performed in the first system or the second system.
  • the step of adding the polyA tail is performed in a first system, wherein the first system is: 14 ⁇ L of RNA sample, 3 ⁇ L of 10 ⁇ PolyA polymerase reaction buffer, 1 ⁇ L of 10 mM ATP, 4 ⁇ L of 10 mg/mL BSA, 2 ⁇ L of DNase I, 0.5 ⁇ L of 10 U/ ⁇ L T4 polynucleotide kinase, 1 ⁇ L of 5 U/ ⁇ L PolyA polymerase, 0.5 ⁇ L of 40 U/ ⁇ L RNase inhibitor, and 4 ⁇ L of nuclease-free water;
  • the concentrations of the components of the second system are: 30 ⁇ L of the reaction product of the first system, 2 ⁇ L of 5 ⁇ M reverse transcription primer, 4 ⁇ L of 5 ⁇ HiScript III reaction buffer, 1 ⁇ L of 200 U/ ⁇ l HiScript III reverse transcriptase, 2 ⁇ L of 5 mM dNTP Mix, 0.5 ⁇ L of 40 U/ ⁇ l RNase inhibitor and 10.5 ⁇ L of nuclease-free water.
  • the step of adding polyA tail is performed in a second system, wherein the first system is: 14 ⁇ L RNA sample, 2 ⁇ L 10 ⁇ DNase I reaction buffer, 1 ⁇ L 10 mM ATP, 2 ⁇ L DNase I, 0.5 ⁇ L 10 U/ ⁇ L T4 polynucleotide kinase and 0.5 ⁇ L 40 U/ ⁇ L RNase inhibitor;
  • the second system consists of: 20 ⁇ L of the reaction product of the first system, 6 ⁇ L of 5 ⁇ HiScript III reaction buffer, 1 ⁇ L of 10 mg/mL BSA, 10 ⁇ L of 50% PEG8000, 2 ⁇ L of 5 mM dNTP Mix, 2 ⁇ L of 5 ⁇ M reverse transcription primer, 1 ⁇ L of 5 U/ ⁇ L Poly A polymerase, 0.5 ⁇ L of 40 U/ ⁇ l RNase inhibitor, 1 ⁇ L of 200 U/ ⁇ l HiScript III reverse transcriptase and 6.5 ⁇ L of nuclease-free water.
  • the third system is: the reaction product after U base digestion, 5 ⁇ L of 10 ⁇ polynucleotide kinase reaction buffer, 1 ⁇ L of 10U/ ⁇ L T4 polynucleotide kinase, 2 ⁇ L of 220mM Tris buffer, pH 8.0, 0.6 ⁇ L of 500ng/ ⁇ L ultra-thermostable single-stranded binding protein, and 21.4 ⁇ L of nuclease-free water.
  • the third system also includes a reagent for U base digestion.
  • the third system is: the reaction product of polyA tailing and reverse transcription, 5 ⁇ L of 10 ⁇ polynucleotide kinase reaction buffer, 1 ⁇ L of 10U/ ⁇ L T4 polynucleotide kinase, 2 ⁇ L of 220mM Tris buffer at pH 8.0, 0.6 ⁇ L of 500ng/ ⁇ L ultra-thermostable single-stranded binding protein, 2 ⁇ L of 10U/ ⁇ L uracil specific excision reagent USER enzyme, and 19.4 ⁇ L of nuclease-free water.
  • reaction steps can be selectively combined into the same experimental operation step, so as to flexibly modify the experimental steps and control the experimental time.
  • the combination of reaction steps can be adjusted according to the specific experiment, and the present invention is not limited to this.
  • the present invention provides a connector for introducing a specific target sequence at the 5' and 3' ends of a single-stranded cDNA molecule and a corresponding DNA connection scheme.
  • This scheme involves two connectors, a first connector and a second connector.
  • Both connectors contain a double-stranded region and at least one protruding single-stranded region, wherein the double-stranded region contains a universal structural sequence for PCR or sequencing, and the protruding single-stranded region contains one or more (1-10) random base sequences, which are used for complementary pairing with the end of the single-stranded DNA molecule, so that the connector and the single-stranded DNA can be splinted.
  • the protruding single-stranded region of the first connector is at the 5' end, and the protruding single-stranded region of the second connector is at the 3' end.
  • the first connector is used for complementary pairing at the 5' end of the single-stranded DNA molecule
  • the second connector is used for complementary pairing and splint connection at the 3' end of the single-stranded DNA molecule.
  • One embodiment of the first connector is that two polynucleotide sequences interact to form its special structure, and the two sequences contain complementary pairing regions, which will be complementary paired to form a specific double-stranded structure and a protruding single-stranded region containing random bases after solution mixing and static treatment.
  • One implementation of the second linker is that two polynucleotide sequences interact to form its special structure, and the two sequences contain complementary paired regions, which will complement each other to form a specific double-stranded structure and a protruding single-stranded region containing random bases after solution mixing and static treatment.
  • the schematic diagram of the preparation principle of the first linker and the second linker solution is shown in Figure 2.
  • the present invention provides a sequence scheme of a first joint and a second joint, which consists of four nucleotide sequences, and the sequences are shown in Table 2.
  • the first joint consists of two sequences, namely, the second nucleotide sequence and the third nucleotide sequence in Table 2.
  • the 5' end of the third nucleotide sequence contains a random base sequence, the number of random bases is 1-10, and the random base is base A, base T, base C or base G; the random base sequence can be complementary to the 5' end of the single-stranded cDNA molecule in the first joint structure, and is used for splint connection to improve the connection efficiency.
  • the two sequences of the second joint are the fourth nucleotide sequence and the fifth nucleotide sequence in Table 2.
  • the 3' end of the fifth nucleotide sequence contains a random base sequence, the number of random bases is 1-10, and the random base is base A, base T, base C or base G; the random base sequence can be complementary to the 3' end of the single-stranded cDNA molecule in the second joint structure, and is used for splint connection to improve the connection efficiency.
  • the 5' end of the second nucleotide sequence needs to be specially modified to block it from performing a connection reaction.
  • the 5' end of the fourth nucleotide sequence is phosphorylated; the 3' end is specially modified to block the ligation reaction.
  • the 5' end and 3' end of the third nucleotide sequence and the fifth nucleotide sequence are specially modified to block the ligation reaction.
  • both the 5' and 3' ends of the single-stranded DNA molecule can be directly connected to the linker, and then the primers designed based on the linker sequence can be used for PCR amplification reaction.
  • the present invention uses a DNA ligation reaction enhancing reagent, for example, a chemical reagent hexaamminecobalt chloride with a suitable concentration is used to improve the DNA ligation efficiency, realize the connection of the two end adapters at the 5' end and the 3' end of the single-stranded cDNA molecule in a one-step experimental operation, and shorten the reaction time of the connection.
  • a DNA ligation reaction enhancing reagent for example, a chemical reagent hexaamminecobalt chloride with a suitable concentration is used to improve the DNA ligation efficiency, realize the connection of the two end adapters at the 5' end and the 3' end of the single-stranded cDNA molecule in a one-step experimental operation, and shorten the reaction time of the connection.
  • PCR amplification can be directly performed to obtain the final library.
  • PCR primers a complete sequencing structure sequence can be introduced, including a barcode sequence that can distinguish samples, to adapt to the needs of the sequencing platform.
  • the present invention does not require an additional separate step to synthesize the complementary double-stranded DNA of the cDNA molecule, thereby reducing the steps and time required for library preparation.
  • the present invention adopts the library preparation method of the single-stranded cDNA molecule, the strand-specific information of the RNA is retained, which is more conducive to RNA analysis such as gene annotation.
  • the present invention provides a set of universal primer sequences for PCR amplification of library samples, wherein the forward universal primer is the sixth nucleic acid sequence, and the reverse primer is the seventh nucleic acid sequence.
  • the reverse primer may contain a label (Barcode) sequence for sample identification, which is named the seventh nucleic acid sequence-N, where N represents the Barcode number, and different Barcode sequences have different numbers.
  • the universal primer sequence is composed of the nucleotides in Table 3 below.
  • the Barcode sequence in the seventh nucleic acid sequence-N can be matched with the Barcode of the universal sequencing library of the sequencing platform, the structure of the PCR amplification product can be kept consistent with the universal library structure of the sequencing platform, and the source of the sample library can be accurately split by the Barcode.
  • the adapter and PCR primer sequence schemes (Table 2 and Table 3) provided by the present invention are suitable for preparing high-throughput sequencing libraries of the DNBSEQ and MGISEQ series of the MGI sequencing platform of MGI.
  • the design principle provided by the present invention can also be used to design sequence schemes suitable for other sequencing platforms to prepare compatible sequencing libraries.
  • test materials used in the present invention are all common commercial products and can be purchased on the market.
  • the present invention is further described below in conjunction with the embodiments:
  • UHRR Universal Human Reference RNA
  • Agilent Agilent, 740000
  • One tube of commercial standard 200 ⁇ g RNA, 70% ethanol and 0.1 M sodium acetate solution
  • the precipitate was washed with 70% ethanol, centrifuged again at 4°C, 12,000 ⁇ g for 15 minutes, the supernatant was carefully aspirated, the precipitate was dried at room temperature for 30 minutes, and the precipitate was re-dissolved with 1 mL of nuclease-free H 2 O to a RNA concentration of about 200 ng/ ⁇ L after re-dissolution, and the accurate concentration of the UHRR solution after re-dissolution was determined using a Qubit3.0 fluorescence quantifier (Invitrogen, Q33216).
  • the standard solution was diluted stepwise with nuclease-free H 2 O to prepare 4 samples with different RNA starting amounts, namely 100 ng, 10 ng, 2 ng and 0.2 ng, with a total volume of 14 ⁇ L in each tube.
  • a human plasma sample with a volume of 400 ⁇ L was used and divided into two portions of 200 ⁇ L plasma.
  • Serum/plasma miRNA extraction and separation kits spin column type, TIANGEN, DP503 were used for extraction, and the operation was performed strictly according to the instructions.
  • Example 1 According to the method described in Example 1, the starting amounts of 2 ng, 10 ng, and 100 ng of human universal RNA standard UHRR solutions were prepared, and the samples were named UHRR-2ng, UHRR-10ng, and UHRR-100ng.
  • DNase I RNase Free
  • NEB NEB
  • M0303L DNase Free
  • T4 polynucleotide kinase T4 PNK, NEB, M0201L
  • polyadenylic acid tailing was performed at the 3' end of RNA molecules using polyadenylic acid polymerase (NEB, E. coli Poly (A) Polymerase, M0276L)
  • the reaction system is shown in Table 7. The reaction system was incubated at 37°C for 15 minutes, inactivated at high temperature (95°C) for 5 minutes, and after the reaction, the sample was placed on ice for 2 minutes.
  • the first nucleic acid sequence primer i.e., reverse transcription primer
  • the first nucleotide sequence in this embodiment is specifically 5'-TTTTTTTUTTTTTTTUVN-3' (SEQ ID NO: 1). After adding the primer solution, shake and mix, centrifuge briefly, denature at 65°C for 5 minutes, and incubate at 30°C for 1 minute.
  • the U base digestion reaction was performed using uracil specific excision reagent (USER) enzyme (NEB, USER Enzyme, M5505L), and the digested single-stranded cDNA molecule product was obtained. 4 ⁇ L of U base digestion reaction mixture was added to the reaction system, and its composition is shown in Table 10. The reaction system was placed on a PCR instrument (Bori, TC-96) and the program in Table 11 was run.
  • U base digestion reaction was performed using uracil specific excision reagent (USER) enzyme (NEB, USER Enzyme, M5505L), and the digested single-stranded cDNA molecule product was obtained. 4 ⁇ L of U base digestion reaction mixture was added to the reaction system, and its composition is shown in Table 10. The reaction system was placed on a PCR instrument (Bori, TC-96) and the program in Table 11 was run.
  • T4 polynucleotide kinase T4 PNK, NEB, M0201L
  • high temperature is used to denature the cDNA molecule to unwind the double-stranded structure that may appear in the local area of the cDNA.
  • ET SSB NEB, M2401S
  • the reaction system is placed on a PCR instrument (Bori, TC-96) and the program in Table 13 is run.
  • the present invention uses a DNA ligation reaction enhancing reagent.
  • a chemical reagent hexaamminecobalt chloride with a suitable concentration is used to improve the DNA ligation efficiency, realize the connection of the two end adapters of the 5' end and the 3' end of the single-stranded cDNA molecule in a one-step experimental operation, and shorten the reaction time of the connection.
  • the primer of the sixth nucleic acid sequence (40 ⁇ M) is used as the forward primer
  • the primer of the seventh nucleic acid sequence-N (40 ⁇ M) is used as the reverse primer for universal PCR amplification reaction.
  • the reverse primer contains a Barcode sequence for sample identification. Each sample uses a unique Barcode sequence, and the different Barcode sequences in the reverse primers of different samples are used to distinguish different samples in the offline data.
  • the PCR enzyme reaction solution used in this embodiment is KAPA HiFi HotStart ReadyMix (2X) (Kapa Biosystems, KK2602), and the reaction composition is shown in Table 16.
  • the reaction system is placed on a PCR instrument (Bori, TC-96) to run the program in Table 17.
  • PCR product obtained by universal PCR amplification was purified using 50 ⁇ L of Agencourt AMPure XP magnetic beads (Beckman Coulter, A63881). The concentration of 35 ⁇ L of purified DNA was measured using Qubit3.0 fluorescence quantification instrument (Invitrogen, Q33216). At the same time, the library samples were mixed into sequencing library samples according to the same final concentration and shaken for use.
  • Single-chain cyclization was performed using the MGIEasy cyclization module V2.0 (MGI, 1000005260) of Shenzhen MGI Intelligent Manufacturing Technology Co., Ltd., and sequencing was performed using the MGISEQ-2000RS high-throughput sequencing reagent kit (FCL PE100) (MGI, 1000012554) of Shenzhen MGI Intelligent Manufacturing Technology Co., Ltd. All operations were performed strictly in accordance with the instructions of the kit.
  • the PE100+10 (Paired end 100+10) sequencing type was used to obtain reliable base sequence information.
  • the offline data was split and screened according to the Barcode sequence to obtain the sequencing data of each sample.
  • the human universal RNA standard UHRR was prepared with starting amounts of 0.2 ng, 2 ng, and 10 ng, respectively, and the samples were named UHRR-F-0.2 ng, UHRR-F-2 ng, and UHRR-F-10 ng.
  • Plasma-F-200uL-1 and Plasma-F-200uL-2 Plasma-F-200uL-2.
  • DNase I RNase Free
  • NEB NEB
  • M0303L DNase Free
  • T4 polynucleotide kinase T4 PNK, NEB, M0201L
  • the reaction system is shown in Table 18. The reaction system was incubated at 37°C for 15 minutes and inactivated at high temperature (95°C) for 5 minutes. After the reaction, the sample was placed on ice for 2 minutes.
  • Polyadenylic acid tailing was performed on the 3' end of the RNA molecule using polyadenylic acid polymerase (NEB, E. coli Poly (A) Polymerase, M0276L); reverse transcription was performed using the primer of the first nucleic acid sequence and 200U/ ⁇ l HiScript III Reverse transcriptase (Novozyme, R302-01).
  • the first nucleic acid sequence in this embodiment is specifically 5'-TTTTTTTUTTTTTTTUVN-3'.
  • 30 ⁇ L of polyadenylic acid tailing and reverse transcription reaction mixture was added to the reaction system, and its composition is shown in Table 19. The reaction system was placed on a PCR instrument (Bori, TC-96) and the program in Table 20 was run.
  • Table 19 Compositions of the polyadenylation tailing and reverse transcription reaction mixture.
  • the U base digestion reaction was performed using uracil specific excision reagent (USER) enzyme (NEB, USER Enzyme, M5505L), and the digested single-stranded cDNA molecule product was obtained.
  • the 5' end of the digested single-stranded cDNA molecule was phosphorylated using T4 polynucleotide kinase (T4 PNK, NEB, M0201L).
  • T4 PNK T4 polynucleotide kinase
  • ET SSB (NEB, M2401S) was added to the denaturation reaction system to assist in maintaining the linear structure of the single-stranded cDNA molecules and avoid annealing and renaturing into complex hairpin-like structures after denaturation.
  • 30 ⁇ L of the U base digestion reaction mixture was added to the reaction system, and its composition is shown in Table 21.
  • the reaction system was placed on a PCR instrument (Bori, TC-96) and the program in Table 22 was run.
  • Example 3 and Example 4 The sequencing quality of the original data sequenced in Example 3 and Example 4 was analyzed, and the sequencing quality of the sequences above Q30 reached more than 90%.
  • the quality of the sequencing chip was analyzed, and the sequencing quality did not decrease significantly during the entire sequencing cycle, and always remained at a high level, as shown in Figure 4. The above results show that high-quality sequencing data can be obtained using the method of the present invention.
  • the original sequencing data is split and screened by the Barcode sequence to obtain the sequencing data of each sample.
  • the present invention can realize the simultaneous detection of multiple samples in the same sequencing chip.
  • the raw sequencing data (raw reads) of each sample were filtered for low-quality bases, short sequences, linkers, and microbial sequences, and then ribosomal RNA (rRNA) was removed.
  • rRNA ribosomal RNA
  • the alignment rate of UHRR samples with low starting amount is relatively low, for example, the total alignment rate of UHRR-F-0.2ng is 43.63%, and the unique alignment rate is 35.27%; the total alignment rate of UHRR-F-2ng is 86.36%, and the unique alignment rate is 61.15%.
  • the total alignment rate of plasma free RNA sample starting from 200 ⁇ L is about 35%, and the unique alignment rate is about 28%.
  • the chain-specific ratios calculated for UHRR samples were all above 80%, and most of them could reach around 90% or above; the chain-specific ratios calculated for plasma samples were around 70%.
  • RNA types analyzed in this embodiment include mRNA (messenger RNA), lncRNA (long non-coding RNA), pseudogene RNA, miRNA (microRNA), tRNA (transfer RNA), mt-tRNA (mitochondrial transfer RNA), mt-rRNA (mitochondrial ribosomal RNA), snoRNA (small neclear RNA), and snRNA (small cytoplasmic RNA).
  • mRNA messenger RNA
  • lncRNA long non-coding RNA
  • pseudogene RNA miRNA
  • miRNA miRNA
  • microRNA miRNA
  • tRNA transfer RNA
  • mt-tRNA mitochondrial transfer RNA
  • mt-rRNA mitochondrial ribosomal RNA
  • snoRNA small neclear RNA
  • snRNA small cytoplasmic RNA
  • All UHRR samples can detect more than 10,000 protein-coding genes; can also detect about 4,000 lncRNA genes; can detect about 3,000 to 10,000 pseudogene RNA; can detect about 200-900 miRNA genes; can detect about 200-300 tRNA genes; can detect about 300-800 snoRNA genes and about 40-90 snRNA genes; 22 tRNAs and 2 rRNAs of mitochondria, namely mt-tRNA and mt-rRNA, can all be detected.
  • the number of genes that can be detected in free RNA is relatively small compared to UHRR. But it can also detect nearly 10,000 protein-coding genes, and can detect other types of RNA. Attached Figure 6 of this embodiment also shows the percentage of the number of various RNA genes detected in some samples.
  • RNA expression was performed using the TPM (Transcript per million) calculation method, based on the whole genome gtf (gene transfer format) file, to achieve qualitative and quantitative analysis of different types of RNA and obtain RNA expression profiles.
  • TPM Transcript per million
  • gtf gene transfer format
  • RNA expression Quantitative analysis of RNA expression was performed according to the Log 2 (TPM+1) value.
  • the fluctuation of low-expression genes had a certain negative impact on the results, and the calculated correlation coefficient was lower than the value calculated by the original TPM value.
  • the correlation coefficient of UHRR samples was at least 0.67, and most of them were above 0.70; the correlation coefficient of two technical replicates of plasma free RNA samples was 0.83.
  • the UHRR sample starting amount range implemented in the present invention is relatively wide, including 4 different starting amount ranges of 0.2ng, 2ng, 10ng, and 100ng. Among them, the lowest UHRR starting amount is as low as 0.2ng. The starting amount of plasma samples is also as low as 200 ⁇ L. The above starting amount is much lower than the RNA starting amount used in most of the current commercial RNA kits and literature. In the case of the low starting amount, under the experimental conditions of two batches of two different library preparation processes (Example 3 and Example 4), a high correlation coefficient can still be shown, indicating that the method of the present invention has high stability and low starting amount advantages.
  • the present invention is a strand-specific library preparation method and high-throughput sequencing technology for rapid detection of multiple types of RNA, and the RNA types that can be detected simultaneously include but are not limited to mRNA, lncRNA, tRNA, miRNA, etc. It solves the problems of the traditional RNA library preparation method that the RNA types captured are relatively single or limited, the experimental steps are complicated, and the time period is long.
  • RNA sequencing library prepared by the present invention does not contain artificially added low-complexity sequences, so that the sequencing quality and data volume are guaranteed.
  • the sequencing library prepared by the present invention contains RNA strand-specific information, which is beneficial to RNA analysis such as gene annotation.
  • the present invention combines multiple reaction steps mentioned in the experimental principle into the same experimental operation step by optimizing the reaction conditions. For example, polyadenylation tailing and reverse transcription reaction are combined into one step, and the two end adapters of cDNA are connected separately and combined into one step, thereby reducing the operation steps and time of library preparation, and the experimental operation is simpler and faster.
  • the present invention uses a DNA ligase scheme to replace the RNA ligase scheme in the miRNA library preparation method to add a sequencing adapter sequence through experimental design, thereby improving the efficiency of connection, shortening the time required for connection, and reducing the cost.
  • the present invention adopts PCR amplification technology to amplify the library sequence, introduces the Barcode sequence for library identification and the structural sequence required for cyclization reaction and sequencing through PCR technology, and can realize the mixing of multiple library samples for sequencing together, thereby improving the detection throughput and reducing the detection cost.
  • the method of the present invention is applicable to high-throughput sequencing detection of various sample types such as tissue cell RNA and free RNA, and can be applied to RNA samples with low starting amounts at the ng level or even the pg level.
  • the present invention can be used for free RNA detection in the field of liquid biopsy.
  • the above is only a preferred embodiment of the present invention. It should be pointed out that for ordinary technicians in this technical field, several improvements and modifications can be made without departing from the principles of the present invention. These improvements and modifications should also be regarded as the protection scope of the present invention.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Analytical Chemistry (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

La présente invention porte sur un procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'ARN, et sur un procédé de séquençage à haut débit. Une queue d'acide polyadénylique (poly A) est ajoutée artificiellement à l'extrémité 3' de divers ARN par une polymérase poly A, et un ADNc simple brin est synthétisé à l'aide d'une amorce ribonucléotidique poly-désoxythymidine avec de la désoxyuridine sous l'action d'une transcriptase inverse. La molécule d'ADNc simple brin obtenue est soumise à une série de réactions, et le produit de la réaction est finalement amplifié au moyen de la PCR pour obtenir une banque de brins spécifiques de différents types d'ARN. L'échantillon de la banque peut être soumis à un séquençage sur machine.
PCT/CN2023/102155 2023-06-25 2023-06-25 Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit Pending WO2025000136A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2023/102155 WO2025000136A1 (fr) 2023-06-25 2023-06-25 Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2023/102155 WO2025000136A1 (fr) 2023-06-25 2023-06-25 Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit

Publications (1)

Publication Number Publication Date
WO2025000136A1 true WO2025000136A1 (fr) 2025-01-02

Family

ID=93936497

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/102155 Pending WO2025000136A1 (fr) 2023-06-25 2023-06-25 Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit

Country Status (1)

Country Link
WO (1) WO2025000136A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120330309A (zh) * 2025-06-13 2025-07-18 北京明识至善生物技术有限公司 高通量测序文库按比例混合的方法及试剂盒

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104388426A (zh) * 2012-02-28 2015-03-04 盛司潼 一种Oligo dT引物及构建cDNA文库的方法
US20180030436A1 (en) * 2013-12-05 2018-02-01 New England Biolabs, Inc. Enrichment and Sequencing of RNA Species
CN111379031A (zh) * 2018-12-28 2020-07-07 深圳华大智造科技有限公司 核酸文库构建方法、得到的核酸文库及其用途
CN111961707A (zh) * 2020-10-14 2020-11-20 苏州贝康医疗器械有限公司 一种核酸文库构建方法及其在植入前胚胎染色体结构异常分析中的应用
CN114736951A (zh) * 2022-04-20 2022-07-12 深圳大学 一种小分子rna的高通量测序文库构建方法
CN115003867A (zh) * 2020-03-16 2022-09-02 深圳华大智造科技股份有限公司 一种待测样本rna的测序文库的构建方法
CN115896243A (zh) * 2022-11-25 2023-04-04 上海厦维医学检验实验室有限公司 基于混合酶体系的超灵敏反转录方法及其应用

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104388426A (zh) * 2012-02-28 2015-03-04 盛司潼 一种Oligo dT引物及构建cDNA文库的方法
US20180030436A1 (en) * 2013-12-05 2018-02-01 New England Biolabs, Inc. Enrichment and Sequencing of RNA Species
CN111379031A (zh) * 2018-12-28 2020-07-07 深圳华大智造科技有限公司 核酸文库构建方法、得到的核酸文库及其用途
CN115003867A (zh) * 2020-03-16 2022-09-02 深圳华大智造科技股份有限公司 一种待测样本rna的测序文库的构建方法
CN111961707A (zh) * 2020-10-14 2020-11-20 苏州贝康医疗器械有限公司 一种核酸文库构建方法及其在植入前胚胎染色体结构异常分析中的应用
CN114736951A (zh) * 2022-04-20 2022-07-12 深圳大学 一种小分子rna的高通量测序文库构建方法
CN115896243A (zh) * 2022-11-25 2023-04-04 上海厦维医学检验实验室有限公司 基于混合酶体系的超灵敏反转录方法及其应用

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SHI HUAJUAN, ZHOU YING, JIA ERTENG, PAN MIN, BAI YUNFEI, GE QINYU: "Bias in RNA‐seq Library Preparation: Current Challenges and Solutions", BIOMED RESEARCH INTERNATIONAL, HINDAWI PUBLISHING CORPORATION, vol. 2021, no. 1, 1 January 2021 (2021-01-01), XP093252765, ISSN: 2314-6133, DOI: 10.1155/2021/6647597 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120330309A (zh) * 2025-06-13 2025-07-18 北京明识至善生物技术有限公司 高通量测序文库按比例混合的方法及试剂盒

Similar Documents

Publication Publication Date Title
CN113166797B (zh) 基于核酸酶的rna耗尽
CN105400776B (zh) 寡核苷酸接头及其在构建核酸测序单链环状文库中的应用
US20150284769A1 (en) Reduced representation bisulfite sequencing with diversity adaptors
JP7641118B2 (ja) プローブ及びこれをハイスループットシーケンシングに適用するターゲット領域の濃縮方法
CN102732629B (zh) 利用高通量测序同时测定基因表达量和多聚腺苷酸加尾的方法
WO2016037416A1 (fr) Lieur vésiculaire et ses utilisations dans la construction et le séquençage d'une bibliothèque d'acides nucléiques
CN111808854B (zh) 带有分子条码的平衡接头及快速构建转录组文库的方法
US20030104432A1 (en) Methods of amplifying sense strand RNA
CN107893260B (zh) 高效去除核糖体rna的构建转录组测序文库的方法及试剂盒
CN114507711B (zh) 一种单细胞转录组测序方法及其应用
JP2010514452A (ja) ヘテロ二重鎖による濃縮
CN112662771B (zh) 一种肿瘤融合基因的靶向捕获探针及其应用
CN110157785A (zh) 一种单细胞rna测序文库构建方法
CN114736951B (zh) 一种小分子rna的高通量测序文库构建方法
JP7248228B2 (ja) Rnaライブラリーの構築のための方法及びキット
EP2785865A1 (fr) Procédé et kit pour la caractérisation d'arn dans une composition
CN112410331A (zh) 带分子标签和样本标签的接头及其单链建库方法
CN112941635A (zh) 一种提高文库转化率的二代测序建库试剂盒及其方法
CN112626173A (zh) Rna建库方法
CN110699425A (zh) 基因目标区域的富集方法及体系
CN110205365B (zh) 一种高效研究rna相互作用组的高通量测序方法及其应用
WO2025000136A1 (fr) Procédé de préparation d'une banque de brins spécifiques pour la détection rapide de divers types d'arn et technique de séquençage à haut débit
CN114250224B (zh) 一种用于提取或检测样本中小分子rna的核酸组合物及其试剂盒和方法
CN115874291A (zh) 一种对样本中dna和rna分子进行标记并同时检测的方法
JP2025505870A (ja) 単一細胞トランスクリプトーム配列決定法及びその適用

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23942669

Country of ref document: EP

Kind code of ref document: A1