Simplifying Data Analysis For Subject Matter Experts

Abstract

In todays data-intensive world, the power to analyze huge amounts of data is critical to the success of any organization, including the military. Many data analysis tools have been developed in the past decade along with the high-performance machine learning algorithms. At present, many of these tools unfortunately are out of reach of the target audiencesubject matter expertsbecause one must master some of the advanced computer science concepts to use these tools effectively. This thesis proposes to build a prototype data analysis platform that will hide the underlying complexity of the tools from the subject matter experts. Using the platform, the end users can analyze data through a simple, menu-driven interface. The prototype will be built using the programming language Python and the open-source, distributed data processing engine Apache Spark 2.0. Different components of Spark 2.0 will be studied and evaluated to determine the best approach for building the prototype. The effectiveness of the prototype will be examined using the ADSB (Automatic Dependent Surveillance - Broadcast) unfiltered flight data. The thesis concludes with the review of the prototype developed for ADSB and the recommendation on possible ways of extending the prototype.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 2018
Accession Number
AD1069768

Entities

People

  • Timberon C. Vanzant

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Air Platforms
  • Autonomy
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Big Data
  • Computer Languages
  • Computer Programming
  • Computer Science
  • Computers
  • Data Analysis
  • Data Mining
  • Data Processing
  • Domain Specific Programming Languages
  • Information Science
  • Machine Learning
  • Military Aircraft
  • Network Science
  • Operating Systems
  • Programming Languages
  • Supervised Machine Learning

Fields of Study

  • Computer science
  • Engineering

Readers

  • Distributed Systems and Data Platform Development
  • Theoretical Analysis.

Technology Areas

  • AI & ML