Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks
Abstract
Despite the abundance of spatial gene expression data, extracting meaningful information to reveal how genes interact remains a challenge. We developed staNMF, a method that combines a powerful unsupervised learning algorithm, nonnegative matrix factorization (NMF), with a new stability criterion that selects the size of the dictionary or the set of principal patterns (PP). We demonstrate that PP give rise to a novel and concise representation of the Drosophila embryonic spatial expression patterns and they correspond to biologically meaningful regions of the Drosophila embryo. Furthermore, this new representation was used to automatically predict manual annotations, categorize gene expression patterns, and reconstruct the local gap gene network with high accuracy.
Document Details
- Document Type
- Pub Defense Publication
- Publication Date
- Apr 06, 2016
- Source ID
- 10.1073/pnas.1521171113
Entities
People
- Ann S. Hammonds
- Antony Joseph
- Bin Yu
- Erwin Frise
- Siqi Wu
- Susan E. Celniker
Organizations
- Air Force Office of Scientific Research
- Army Research Office
- Lawrence Berkeley National Laboratory
- National Human Genome Research Institute
- National Institutes of Health
- National Science Foundation
- Statistics New Zealand
- Walmart Labs