Multi-component background learning automates signal detection for spectroscopic data

Abstract

Automated experimentation has yielded data acquisition rates that supersede human processing capabilities. Artificial Intelligence offers new possibilities for automating data interpretation to generate large, high-quality datasets. Background subtraction is a long-standing challenge, particularly in settings where multiple sources of the background signal coexist, and automatic extraction of signals of interest from measured signals accelerates data interpretation. Herein, we present an unsupervised probabilistic learning approach that analyzes large data collections to identify multiple background sources and establish the probability that any given data point contains a signal of interest. The approach is demonstrated on X-ray diffraction and Raman spectroscopy data and is suitable to any type of data where the signal of interest is a positive addition to the background signals. While the model can incorporate prior knowledge, it does not require knowledge of the signals since the shapes of the background signals, the noise levels, and the signal of interest are simultaneously learned via a probabilistic matrix factorization framework. Automated identification of interpretable signals by unsupervised probabilistic learning avoids the injection of human bias and expedites signal extraction in large datasets, a transformative capability with many applications in the physical sciences and beyond.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jul 19, 2019
Source ID
10.1038/s41524-019-0213-0

Entities

People

  • Carla Gomes
  • Dan Guevarra
  • David A. Boyd
  • Helge S. Stein
  • Joel A. Haber
  • John Gregoire
  • Lan Zhou
  • Mitsutaro Umehara
  • Sebastian E. Ament

Organizations

  • Air Force Office of Scientific Research
  • National Science Foundation
  • Office of Basic Energy Sciences

Tags

Readers

  • Approximation Theory.
  • Computational Linguistics
  • Distributed Systems and Data Platform Development

Technology Areas

  • AI & ML
  • AI & ML - DoD AI Strategy
  • AI & ML - Neural Networks