Conducting an Analysis of a Qualitative Dataset Using the Waikato Environment for Knowledge Analysis (WEKA)

Abstract

The purpose of this technical report is to provide an exemplar for conducting an analysis of a qualitative dataset using machine learning techniques. Qualitative data are measured or expressed as a natural language description (e.g., category or attribute) rather than numbers as in quantitative datasets. It is often difficult to evaluate the relationship between qualitative variables of interest and outcomes (dependent variables), but machine learning techniques offer simplified methods to classify these outcomes. WEKA, the Waikato Environment for Knowledge Analysis, is a popular set of machine learning algorithms developed at the University of Waikato in New Zealand, which can be used to analyze both qualitative and quantitative data. To illustrate the use of WEKA on a qualitative dataset, we selected a known set of primate species with the desire to classify them into 1 of 3 classes (prosimians, monkeys, and apes) based on 7 qualitative attributes. When an existing dataset is used with known relationships, it allows us to evaluate a large number of WEKA algorithms in a relatively short time and validate their accuracy with the goal of identifying best practices for analyzing qualitative data.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Feb 01, 2015
Accession Number
ADA613659

Entities

People

  • Robert A. Sottilare

Organizations

  • United States Army Research Laboratory

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Artificial Intelligence
  • Classification
  • Data Mining
  • Data Sets
  • Environment
  • Estimators
  • Information Science
  • Learning
  • Machine Learning
  • Military Research
  • New Zealand
  • Precision
  • Probability
  • Test Sets
  • Universities

Readers

  • Computational Modeling and Simulation
  • Military Logistics and Supply Chain Management
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks