The Mercury System: Embedding Computation into Disk Drives

Abstract

Having inexpensive data storage has enabled the amassing of vast amounts of information. At present, these data sets far exceed the capacity of modern processors, so searching them has become a serious challenge. In a recent invited talk at the High Performance Embedded Computing Workshop, John Reynders of Celera Genomics commented that, "The size of the databases we deal with is no longer measured in terabytes, but in exabytes." The Mercury system is a prototype data search engine that can be embedded within the disk drive itself. We focus on the specific problems associated with searches of unstructured, unindexed data. Three specific applications include approximate matching of text (important for text searches of specific interest to homeland security where the original alphabet is different than the Latin alphabet and transliteration is involved), genomics and proteomics searches (important biological applications), and image searches (also of significant interest for homeland security).

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 20, 2004
Accession Number
ADA428394

Entities

People

  • Mark A. Franklin
  • Roger D. Chamberlain
  • Ron K. Cytron
  • Ronald S. Indeck

Organizations

  • Washington University in St. Louis

Tags

Communities of Interest

  • Space

DTIC Thesaurus Topics

  • Computations
  • Computer Programming
  • Computer Science
  • Computer Vision
  • Computers
  • Data Sets
  • Data Storage Systems
  • Embedding
  • Field Programmable Gate Arrays
  • Homeland Security
  • Object Recognition
  • Object-Oriented Database Management Systems
  • Operating Systems
  • Recognition
  • Shift Registers
  • Software-Defined Hardware
  • Tissue Donors

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Educational Psychology
  • Parallel and Distributed Computing.

Technology Areas

  • Biotechnology