Probing the physical limits of reliable DNA data retrieval

Abstract

Synthetic DNA is gaining momentum as a potential storage medium for archival data storage. In this process, digital information is translated into sequences of nucleotides and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PCR-based random access when as few as ten copies per sequence are stored, on average. This results in density of about 17 exabytes/gram, nearly two orders of magnitude greater than prior work has shown. We successfully retrieve the same data in a complex pool of over 1010 unique sequences per microliter with no evidence that we have begun to approach complexity limits. Finally, we also investigate the effects of file size and sequencing coverage on successful file retrieval and look for systematic DNA strand drop out. These findings substantiate the robustness and high data density of the process examined here.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jan 30, 2020
Source ID
10.1038/s41467-020-14319-8

Entities

People

  • Karin Strauss
  • Lee Organick
  • Luis Ceze
  • Randolph Lopez
  • Siena Dumas Ang
  • Xiaomeng Liu
  • Yuan-Jyue Chen

Organizations

  • Microsoft
  • United States Department of Defense

Tags

Readers

  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing.
  • Molecular Genetics
  • Theoretical Analysis.