CytofIn enables integrated analysis of public mass cytometry datasets using generalized anchors
Abstract
The increasing use of mass cytometry for analyzing clinical samples offers the possibility to perform comparative analyses across public datasets. However, challenges in batch normalization and data integration limit the comparison of datasets not intended to be analyzed together. Here, we present a data integration strategy, CytofIn, using generalized anchors to integrate mass cytometry datasets from the public domain. We show that low-variance controls, such as healthy samples and stable channels, are inherently homogeneous, robust against stimulation, and can serve as generalized anchors for batch correction. Single-cell quantification comparing mass cytometry data from 989 leukemia files pre- and post normalization with CytofIn demonstrates effective batch correction while recapitulating the gold-standard bead normalization. CytofIn integration of public cancer datasets enabled the comparison of immune features across histologies and treatments. We demonstrate the ability to integrate public datasets without necessitating identical control samples or bead standards for fast and robust analysis using CytofIn.
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Feb 17, 2022
- Source ID
- 10.1038/s41467-022-28484-5
Entities
People
- Astraea Jager
- Bita Sahaf
- Charles G. Mullighan
- Jeffrey Waters
- Jolanda Sarno
- Kara Davis
- Kathleen M. Sakamoto
- Norman Lacayo
- Pablo Domizi
- Ravindra Majeti
- Sean C Bendall
- Timothy J. Keyes
- Yu-Chen Lo
Organizations
- Hyundai Hope On Wheels
- Leukemia & Lymphoma Society
- United States Department of Defense