DNA Sequencing using Brain-inspired Hyperdimensional Computing

Abstract

DNA sequencing has applications in many applied fields, including medical diagnosis and biotechnology. In this paper, we propose HDNA, which applies the concepts of hyperdimensional (HD) computing (computing with hypervectors) to DNA sequencing. HDNA first assigns holographic and (pseudo)random hypervectors to DNA bases. Using an encoder, it then exploits the orthogonality of these hypervectors to represent a DNA sequence by generating a class hypervector. The class hypervector retains, with high probability, the information of the individual hypervectors (i.e., the DNA bases) combined into it. HDNA uses the same encoding to map a DNA sequence with an unknown label to a query hypervector and performs classification by checking the similarity of the query hypervector against all class hypervectors. Our experimental evaluation shows that HDNA achieves 99.7% classification accuracy on the Empirical dataset, which is 5.2% higher than state-of-the-art techniques on the same dataset. Moreover, HDNA improves the execution time and energy consumption of classification by 4.32× and 2.05×, respectively, compared with prior techniques.
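
The pipeline described in the abstract (random base hypervectors, an encoder that builds class hypervectors, and similarity search for a query) can be illustrated with a minimal sketch. The abstract does not specify the encoder, so everything below is an assumption made for illustration: bipolar (+1/-1) hypervectors, an n-gram encoder that uses rotation and element-wise multiplication, cosine similarity, and the names `DIM`, `N`, `random_hv`, `rotate`, `encode`, `train`, and `classify`. It is not the HDNA implementation.

```python
import math
import random

DIM = 2000  # hypervector dimensionality (assumed; kept small for pure-Python speed)
N = 3       # n-gram length used by this illustrative encoder (assumed)


def random_hv(rng):
    """Draw a (pseudo)random bipolar hypervector with entries in {-1, +1}."""
    return [rng.choice((-1, 1)) for _ in range(DIM)]


def rotate(hv, k):
    """Cyclically shift a hypervector by k positions (encodes position in an n-gram)."""
    k %= len(hv)
    return hv[-k:] + hv[:-k] if k else hv[:]


def encode(seq, base_hv):
    """Encode a DNA string: bind each n-gram by multiplication, bundle n-grams by addition."""
    acc = [0] * DIM
    for i in range(len(seq) - N + 1):
        gram = [1] * DIM
        for j, base in enumerate(seq[i:i + N]):
            shifted = rotate(base_hv[base], N - 1 - j)
            gram = [g * s for g, s in zip(gram, shifted)]
        acc = [a + g for a, g in zip(acc, gram)]
    return acc


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def train(labelled, base_hv):
    """Build one class hypervector per label by summing the encoded training sequences."""
    classes = {}
    for seq, label in labelled:
        enc = encode(seq, base_hv)
        if label in classes:
            classes[label] = [a + b for a, b in zip(classes[label], enc)]
        else:
            classes[label] = enc
    return classes


def classify(seq, classes, base_hv):
    """Encode the query and return the label of the most similar class hypervector."""
    query = encode(seq, base_hv)
    return max(classes, key=lambda label: cosine(query, classes[label]))


if __name__ == "__main__":
    rng = random.Random(0)
    base_hv = {base: random_hv(rng) for base in "ACGT"}
    # Toy training data with made-up labels, for illustration only.
    classes = train([("ACGTACGTACGTACGT", "x"), ("TTGGCCAATTGGCCAA", "y")], base_hv)
    print(classify("ACGTACGTACGT", classes, base_hv))
```

Because independently drawn random hypervectors are nearly orthogonal in high dimensions, n-grams that do not occur in a class contribute almost nothing to its similarity score, which is the property the abstract refers to when it says the class hypervector keeps the combined information with high probability.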

Document Details

Document Type
Technical Report
Publication Date
Mar 25, 2019
Accession Number
AD1076212

Entities

People

  • Justin Morris
  • Mohsen Imani
  • Tajana Rosing
  • Tarek Nassar

Organizations

  • University of California, San Diego

Tags

Communities of Interest

  • Biomedical
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Biology
  • Coders
  • Computer Science
  • Computing System Architectures
  • Content Addressable Memory
  • Detectors
  • DNA Sequence Analysis
  • Efficiency
  • Energy Consumption
  • Energy Efficiency
  • Machine Learning
  • Measurement
  • Molecular Biology
  • Pattern Recognition
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Auditory Neuroscience/Auditory Physiology
  • Computer Programming and Software Development
  • Distributed Systems and Data Platform Development

Technology Areas

  • Biotechnology