Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks

Abstract

Despite the abundance of spatial gene expression data, extracting meaningful information to reveal how genes interact remains a challenge. We developed staNMF, a method that combines a powerful unsupervised learning algorithm, nonnegative matrix factorization (NMF), with a new stability criterion that selects the size of the dictionary or the set of principal patterns (PP). We demonstrate that PP give rise to a novel and concise representation of the Drosophila embryonic spatial expression patterns and they correspond to biologically meaningful regions of the Drosophila embryo. Furthermore, this new representation was used to automatically predict manual annotations, categorize gene expression patterns, and reconstruct the local gap gene network with high accuracy.

Document Details

Document Type
Pub Defense Publication
Publication Date
Apr 06, 2016
Source ID
10.1073/pnas.1521171113

Entities

People

  • Ann S. Hammonds
  • Antony Joseph
  • Bin Yu
  • Erwin Frise
  • Siqi Wu
  • Susan E. Celniker

Organizations

  • Air Force Office of Scientific Research
  • Army Research Office
  • Lawrence Berkeley National Laboratory
  • National Human Genome Research Institute
  • National Institutes of Health
  • National Science Foundation
  • Statistics New Zealand
  • Walmart Labs

Tags

Fields of Study

  • Biology

Readers

  • Computer Vision.
  • Linear Algebra
  • Molecular Biology and Genetics

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms