Probing the physical limits of reliable DNA data retrieval
Abstract
Synthetic DNA is gaining momentum as a potential medium for archival data storage. In this process, digital information is translated into sequences of nucleotides, and the resulting synthetic DNA strands are then stored for later retrieval. Here, we demonstrate reliable file recovery with PCR-based random access when as few as ten copies per sequence are stored, on average. This results in a density of about 17 exabytes/gram, nearly two orders of magnitude greater than prior work has shown. We successfully retrieve the same data in a complex pool of over 10^10 unique sequences per microliter, with no evidence that we have begun to approach complexity limits. Finally, we also investigate the effects of file size and sequencing coverage on successful file retrieval and look for systematic DNA strand dropout. These findings substantiate the robustness and high data density of the process examined here.
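The link between copies per sequence and density in the abstract can be illustrated with a back-of-the-envelope sketch. This is not the paper's calculation. The payload size and strand length below are hypothetical placeholders chosen for illustration, the per-nucleotide mass is a common approximation for single-stranded DNA, and the function name `density_bytes_per_gram` is invented here. The sketch counts only strand mass and ignores everything else in a real sample.

```python
AVOGADRO = 6.02214076e23   # molecules per mole (exact SI value)
NT_MASS_G_PER_MOL = 330.0  # assumed average mass of one ssDNA nucleotide (approximation)


def density_bytes_per_gram(payload_bytes, strand_nt, copies_per_sequence):
    """Estimate bytes stored per gram of DNA, counting only strand mass.

    All three arguments are illustrative inputs, not values from the paper.
    """
    if payload_bytes <= 0 or strand_nt <= 0 or copies_per_sequence <= 0:
        raise ValueError("all arguments must be positive")
    grams_per_strand = strand_nt * NT_MASS_G_PER_MOL / AVOGADRO
    grams_per_sequence = grams_per_strand * copies_per_sequence
    return payload_bytes / grams_per_sequence


# Hypothetical strand design: 14 payload bytes carried on a 150-nt strand.
for copies in (10, 100, 1000):
    d = density_bytes_per_gram(payload_bytes=14, strand_nt=150,
                               copies_per_sequence=copies)
    print(f"{copies:>5} copies/sequence -> {d / 1e18:.2f} EB/g")
```

Under these assumptions, density is inversely proportional to the number of physical copies kept per sequence, so each tenfold reduction in copy number yields a tenfold gain in bytes per gram.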
Document Details
- Document Type: Pub Defense Publication
- Publication Date: Jan 30, 2020
- Source ID: 10.1038/s41467-020-14319-8
Entities
People
- Karin Strauss
- Lee Organick
- Luis Ceze
- Randolph Lopez
- Siena Dumas Ang
- Xiaomeng Liu
- Yuan-Jyue Chen
Organizations
- Microsoft
- United States Department of Defense