Probing the physical limits of reliable DNA data retrieval

Abstract

Synthetic DNA is gaining momentum as a potential medium for archival data storage. In this process, digital information is translated into sequences of nucleotides, and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PCR-based random access when as few as ten copies per sequence are stored, on average. This results in a density of about 17 exabytes/gram, nearly two orders of magnitude greater than prior work has shown. We successfully retrieve the same data in a complex pool of over 10^10 unique sequences per microliter, with no evidence that we have begun to approach complexity limits. Finally, we also investigate the effects of file size and sequencing coverage on successful file retrieval and look for systematic DNA strand dropout. These findings substantiate the robustness and high data density of the process examined here.

Document Details

Document Type: Pub Defense Publication
Publication Date: Jan 30, 2020
Source ID: 10.1038/s41467-020-14319-8

Entities

People

Karin Strauss
Lee Organick
Luis Ceze
Randolph Lopez
Siena Dumas Ang
Xiaomeng Liu
Yuan-Jyue Chen

Organizations

Microsoft
United States Department of Defense
