Parameter Management: Solved

Abstract

The current explosion of test and evaluation data being collected from various systems has exposed a strong need for low-cost digital infrastructure to facilitate scalable analytics across all available data. Private industry and academic research have built such systems utilizing Open-Source Software (OSS) with tremendous success. The 309th Software Engineering Group (SWEG) developed OPAL (Open Platform for Advanced Learning) platform is a government owned and developed solution to address this gap and provide data discovery, analytics, and warehousing all license free. OPAL leverages best-in-breed Open-Source Software including JupyterLab (Python analysis environment), MinIO (S3-compliant, redundant and object-versioning data backend), Postgres (Data cataloging), and Dask (scalable compute), among others. In addition to Open-Source tooling, custom integration and software piping are used to further lower the analysts' barrier to available data: custom Chapter 10 parsing and translating at high speed (10GB/min) into Apache Parquet format, a web-based data catalog for discovery, and lightweight arbitrary object storage organization. This paper will delineate design choices, our DevOps paradigm, benchmarking numbers, and results against a publicly available commercial flight dataset.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 18, 2022
Accession Number
AD1183294

Entities

People

  • Daniel Lowe
  • Isaac J Myers
  • Kenneth Call
  • Zyad L Ma

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Attachment
  • Cloud Computing
  • Computer Access Control
  • Computer Programming
  • Computer Programs
  • Computing-Related Activities
  • Department Of Defense
  • Digital Information
  • Engineering
  • Environment
  • Explosions
  • Governments
  • Information Operations
  • Infrastructure
  • Learning
  • Lightweight
  • Open Source Software
  • Platforms
  • Software Development
  • Standards
  • Test And Evaluation

Fields of Study

  • Computer science
  • Engineering

Readers

  • Database Systems and Applications
  • Distributed Systems and Data Platform Development
  • Software Engineering.