CytofIn enables integrated analysis of public mass cytometry datasets using generalized anchors

Abstract

The increasing use of mass cytometry for analyzing clinical samples offers the possibility to perform comparative analyses across public datasets. However, challenges in batch normalization and data integration limit the comparison of datasets not intended to be analyzed together. Here, we present a data integration strategy, CytofIn, using generalized anchors to integrate mass cytometry datasets from the public domain. We show that low-variance controls, such as healthy samples and stable channels, are inherently homogeneous, robust against stimulation, and can serve as generalized anchors for batch correction. Single-cell quantification comparing mass cytometry data from 989 leukemia files pre- and post normalization with CytofIn demonstrates effective batch correction while recapitulating the gold-standard bead normalization. CytofIn integration of public cancer datasets enabled the comparison of immune features across histologies and treatments. We demonstrate the ability to integrate public datasets without necessitating identical control samples or bead standards for fast and robust analysis using CytofIn.

Document Details

Document Type
Pub Defense Publication
Publication Date
Feb 17, 2022
Source ID
10.1038/s41467-022-28484-5

Entities

People

  • Astraea Jager
  • Bita Sahaf
  • Charles G. Mullighan
  • Jeffrey Waters
  • Jolanda Sarno
  • Kara Davis
  • Kathleen M. Sakamoto
  • Norman Lacayo
  • Pablo Domizi
  • Ravindra Majeti
  • Sean C Bendall
  • Timothy J. Keyes
  • Yu-Chen Lo

Organizations

  • Hyundai Hope On Wheels
  • Leukemia & Lymphoma Society
  • United States Department of Defense

Tags

Fields of Study

  • Biology

Readers

  • Immunology
  • Regression Analysis.
  • Systems Analysis and Design