Compact Information Representations

Abstract

Numerous modern applications in the context of network traffic, information retrieval, and databases are faced with very large, inherently high-dimensional, or naturally streaming datasets. This proposal aims at developing mathematically rigorous and general purpose statistical methods based on stable random projections, to achieve compact information representations, for solving very large-scale engineering problems in data stream computations, real-time network monitoring and anomaly detections (e.g., DDoSattacks), machine learning, databases, and search. Fundamentally, compact data representations are highly beneficial because they could substantially reduce memory or disk storage, facilitate efficient data transmission over the networks, accomplish time critical missions, improve experience in user-facing applications, reduce energy consumptions, etc. The proposed research topics largely fall into three categories: (i) Data steam algorithms for network anomaly detections; (ii) Probabilistic quantization for compact information storage, indexing and search; and (iii) Effective sparse recovery from (quantized) stable random projections.The proposed research is highly interdisciplinary, across statistics, theoretical and applied computer science, and applied math. Withinthe scope of this proposal, the focus is preliminarily on the fundamental, theoretical research which lies in the mission of AFOSR.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 02, 2016
Accession Number
AD1013975

Entities

People

  • Li Ping
  • Martin Wells

Organizations

  • Cornell University

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Algorithms
  • Anomaly Detection
  • Applied Computer Science
  • Artificial Intelligence
  • Change Detection
  • Compressed Sensing
  • Computer Science
  • Data Mining
  • Databases
  • Information Processing
  • Information Retrieval
  • Information Science
  • Machine Learning
  • Network Science
  • Signal Processing
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Energy Conservation and Renewable Energy Engineering.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms