Statistical Inferences from the Topology of Complex Networks

Abstract

Topological data analysis provides machinery for summarizing the topology of complex data. The main goal of this project was to develop a new summary compatible with statistics and machine learning. This goal was met with the development of a new summary, the "persistence landscape." This summary is stable, does not lose any information, has continuous and discrete versions, and obeys a strong law of large numbers and a central limit theorem. All of the standard tools in statistics and machine learning are available for subsequent analysis. For example, one can easily calculate averages and differences, apply principal component analysis and support vector machines, or feed these results into a neural network. The secondary goal of the project was to help place Topological Data Analysis on a firmer mathematical foundation, strengthening its connections to mathematics and making it easier for researchers to leverage mathematical results for analyzing complex data and complex networks. This goal has been met with the development a very general framework for topological data analysis. Both of these developments have led to much work by other researchers.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 04, 2016
Accession Number
AD1018673

Entities

People

  • John B Holcomb
  • Peter Bubenik

Organizations

  • Cleveland State University

Tags

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Algorithms
  • Computational Science
  • Computations
  • Contracts
  • Data Analysis
  • Data Mining
  • Electronic Mail
  • Factor Analysis
  • Information Science
  • Machine Learning
  • Mathematics
  • Statistical Analysis
  • Statistical Inference
  • Statistics
  • Supervised Machine Learning
  • Topology

Readers

  • Mathematical Modeling and Probability Theory.
  • Neural Network Machine Learning.
  • Theoretical Analysis.

Technology Areas

  • AI & ML