Training Logit and Random Forest Models to Predict IT Spending

Abstract

The Air Force must modernize, but the distribution of funds for technology remains as tight as ever. To this end, the Air Force Audit Agency is looking to utilize machine learning techniques to enhance their capabilities. This research explores Logistic Regression and Random Forest modeling to streamline data collection and cost classification. The final Logistic Regression model identified 4 significant attributes out of the 36 given and was 85 accurate in predicting whether a purchase amount was over or under $10,000. To expand beyond binary classification, a six-category classification Random Forest model was developed. It identified 6 significant attributes and was 34 accurate in in predicting whethera purchase was in 1 of 6 amount categories. Due to the class imbalance of the given data, it was necessary to use a class weighting and over-sampling technique to enhance the Random Forest model. The final class balanced model identifiedthe same 6 significant attributes but was 78 accurate in predicting whether a purchase was in 1 of 6 amount categories. No models were able to predict whether a purchase should be classified as an information technology purchase of not.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 24, 2022
Accession Number
AD1170680

Entities

People

  • Jacob P. Batt

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Autonomy
  • Cyber

DTIC Thesaurus Topics

  • Abstracts
  • Air Force
  • Algorithms
  • Artificial Intelligence
  • Computer Programming
  • Computer Programs
  • Computer Science
  • Computers
  • Contracts
  • Data Mining
  • Data Science
  • Dimensionality Reduction
  • Governments
  • Information Science
  • Information Systems
  • Literature Surveys
  • Machine Learning
  • Sampling
  • Supervised Machine Learning
  • United States
  • United States Government

Readers

  • Naval Personnel Management
  • Neural Network Machine Learning.
  • Research Science/Academic Research

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks