Hawkins et al., 2018 - Google Patents
Error-correcting DNA barcodes for high-throughput sequencing
- Document ID
- 12808283182401640267
- Author
- Hawkins J
- Jones Jr S
- Finkelstein I
- Press W
- Publication year
- 2018
- Publication venue
- bioRxiv
Snippet
Many large-scale high-throughput experiments use DNA barcodes—short DNA sequences prepended to DNA libraries—for identification of individuals in pooled biomolecule populations. However, DNA synthesis and sequencing errors confound the correct …
Classifications
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
        - G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
          - G06F19/22—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
        - G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
          - G06F19/24—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
        - G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
          - G06F19/14—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for phylogeny or evolution, e.g. evolutionarily conserved regions determination or phylogenetic tree construction
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/20—Handling natural language data
          - G06F17/21—Text processing
            - G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
        - G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
          - G06F19/28—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
        - G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
          - G06F19/708—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for data visualisation, e.g. molecular structure representations, graphics generation, display of maps or networks or other visual representations
- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
          - G06F17/30943—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type
            - G06F17/30946—Information retrieval; Database structures therefor; File system structures therefor details of database functions independent of the retrieved data type indexing structures
- C—CHEMISTRY; METALLURGY
  - C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    - C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES OR MICRO-ORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
      - C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or micro-organisms; Compositions therefor; Processes of preparing such compositions
        - C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or micro-organisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
          - C12Q1/6869—Methods for sequencing
Similar Documents
| Publication | Title |
|---|---|
| Hawkins et al. | Indel-correcting DNA barcodes for high-throughput sequencing |
| Anavy et al. | Data storage in DNA with fewer synthesis cycles using composite DNA letters |
| Baaijens et al. | Computational graph pangenomics: a tutorial on data structures and their applications |
| Buschmann et al. | Levenshtein error-correcting barcodes for multiplexed DNA sequencing |
| Marsan et al. | Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification |
| Jain et al. | Duplication-correcting codes for data storage in the DNA of living organisms |
| US12146189B2 (en) | Methods, systems, computer readable media, and kits for sample identification |
| Organick et al. | Random access in large-scale DNA data storage |
| Bystrykh | Generalized DNA barcode design based on Hamming codes |
| Zakov et al. | An algorithmic approach for breakage-fusion-bridge detection in tumor genomes |
| US20180211001A1 (en) | Trace reconstruction from noisy polynucleotide sequencer reads |
| US20210098081A1 (en) | Flexible decoding in DNA data storage based on redundancy codes |
| Bhardwaj et al. | Trace reconstruction problems in computational biology |
| US20120185177A1 (en) | Harnessing high throughput sequencing for multiplexed specimen analysis |
| Yan et al. | Scaling logical density of DNA storage with enzymatically-ligated composite motifs |
| Zhang et al. | CRISPR-powered quantitative keyword search engine in DNA data storage |
| Leung et al. | IDBA-MTP: a hybrid metatranscriptomic assembler based on protein information |
| Goussarov et al. | Introduction to the principles and methods underlying the recovery of metagenome‐assembled genomes from metagenomic data |
| Milenkovic et al. | DNA-based data storage systems: A review of implementations and code constructions |
| WO2019204702A1 (en) | Error-correcting DNA barcodes |
| Altschul et al. | Sequence alignment |
| Anavy et al. | Improved DNA based storage capacity and fidelity using composite DNA letters |
| Erlich et al. | Capacity-approaching DNA storage |
| Hawkins et al. | Error-correcting DNA barcodes for high-throughput sequencing |
| Sella et al. | DNA archival storage, a bottom up approach |