Multi-domain translation between single-cell imaging and sequencing data using autoencoders

Abstract

The development of single-cell methods for capturing different data modalities including imaging and sequencing has revolutionized our ability to identify heterogeneous cell states. Different data modalities provide different perspectives on a population of cells, and their integration is critical for studying cellular heterogeneity and its function. While various methods have been proposed to integrate different sequencing data modalities, coupling imaging and sequencing has been an open challenge. We here present an approach for integrating vastly different modalities by learning a probabilistic coupling between the different data modalities using autoencoders to map to a shared latent space. We validate this approach by integrating single-cell RNA-seq and chromatin images to identify distinct subpopulations of human naive CD4+ T-cells that are poised for activation. Collectively, our approach provides a framework to integrate and translate between data modalities that cannot yet be measured within the same cell for diverse applications in biomedical discovery.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jan 04, 2021
Source ID
10.1038/s41467-020-20249-2

Entities

People

  • Abigail Katcoff
  • Adityanarayanan Radhakrishnan
  • Anastasiya Belyaeva
  • Caroline Uhler
  • G V Shivashankar
  • Karren Dai Yang
  • Karthik Damodaran
  • Saradha Venkatachalapathy

Organizations

  • Alfred P. Sloan Foundation
  • Ministry of Education
  • National Science Foundation
  • Nvidia
  • Office of Naval Research
  • Simons Foundation

Tags

Fields of Study

  • Biology

Readers

  • Molecular Genetics
  • Neural Network Machine Learning.
  • Systems Analysis and Design

Technology Areas

  • Biotechnology
  • Biotechnology - Cancer Biotech
  • Space