EDGE COVID-19: a web platform to generate submission-ready genomes from SARS-CoV-2 sequencing efforts

Abstract

Genomics has become an essential technology for surveilling emerging infectious disease outbreaks. Laboratories worldwide use a range of technologies and strategies for pathogen genome enrichment and sequencing, together with different, and sometimes ad hoc, analytical procedures for generating genome sequences. A fully integrated analytical process from raw sequence to consensus genome determination, suited to outbreaks such as the ongoing COVID-19 pandemic, is critical to provide a solid genomic basis for epidemiological analyses and well-informed decision making. We have developed a web-based platform, EDGE COVID-19 (EC-19), and integrated bioinformatic workflows that help to provide consistent, high-quality analysis of SARS-CoV-2 sequencing data generated on either Illumina or Oxford Nanopore Technologies (ONT) platforms. Through an intuitive web-based interface, this workflow automates data quality control, SARS-CoV-2 reference-based genome variant and consensus calling, and lineage determination, and provides the ability to submit the consensus sequence and necessary metadata to GenBank, GISAID and INSDC raw data repositories. We tested workflow usability using real-world data, validated the accuracy of variant and lineage analysis using several test datasets, and further performed detailed comparisons with results from the COVID-19 Galaxy Project workflow. Our analyses indicate that EC-19 workflows generate high-quality SARS-CoV-2 genomes. Finally, we share a perspective on the patterns observed with Illumina versus ONT technologies and their impact on workflow congruence and differences.

Document Details

Document Type
Pub Defense Publication
Publication Date
Mar 24, 2022
Source ID
10.1093/bioinformatics/btac176

Entities

People

  • Adán Myers y Gutiérrez
  • Bin Hu
  • Chien-Chi Lo
  • Elais Player Jackson
  • Karen W Davenport
  • Mark Flynn
  • Migun Shakya
  • Patrick S. G. Chain
  • Po-e Li
  • Ryan Connor
  • Yan Xu

Organizations

  • Defense Threat Reduction Agency
  • Los Alamos National Laboratory
  • National Center for Biotechnology Information
  • National Institutes of Health
  • National Science Foundation
  • United States Department of Energy
  • United States National Library of Medicine

Tags

Fields of Study

  • Biology

Readers

  • Clinical Trial Research
  • Enterprise Information Systems Architecture and Joint Command Capability Interoperability Support
  • Molecular Genetics