Embracing Statistical Challenges in the Information Technology Age

Abstract

Information Technology is creating an exciting time for statistics. In this article, we review the diverse sources of IT data in three clusters: IT core, IT systems, and IT fringe. The new data forms, huge data volumes, and high data speeds of IT are contrasted against the constraints on storage, transmission and computation to point to the challenges and opportunities. In particular, we describe the impacts of IT on a typical statistical investigation of data collection, data visualization, and model fitting, with an emphasis on computation and feature selection. Moreover, two research projects on network tomography and arctic cloud detection are used throughout the paper to bring the discussions to a concrete level.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA446888

Entities

People

  • Bin Yu

Organizations

  • University of California, Berkeley

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies
  • Sensors

DTIC Thesaurus Topics

  • Computational Science
  • Computer Languages
  • Computer Programming
  • Computer Science
  • Computer Vision
  • Data Analysis
  • Dimensionality Reduction
  • Feature Extraction
  • Information Science
  • Information Systems
  • Information Theory
  • Machine Learning
  • Natural Language Processing
  • Network Science
  • Statistical Analysis
  • Supervised Machine Learning
  • Teamwork

Readers

  • Business Analytics
  • Joint Military Operations and Doctrine.
  • Neural Network Machine Learning.