Detecting Food Safety Risks and Human Trafficking Using Interpretable Machine Learning Methods

Abstract

Black box machine learning models have allowed researchers to design accurate models using large amounts of data, at the cost of interpretability. Model interpretability not only improves user buy-in, but in many cases provides users with important information. Especially in the case of the classification problems addressed in this thesis, the ideal model should not only provide accurate predictions, but should also inform users of how features affect the results. My research goal is to solve real-world problems and compare how different classification models affect the outcomes and interpretability. To this end, this thesis is divided into two parts: food safety risk analysis and human trafficking detection. The first half analyzes the characteristics of supermarket suppliers in China that indicate a high risk of food safety violations. Contrary to expectations, supply chain dispersion, internal inspections, and quality certification systems are not found to be predictive of food safety risk in our data. The second half focuses on identifying human trafficking advertisements, specifically sex trafficking, hidden amongst online classified escort service advertisements. We propose a novel but interpretable keyword detection and modeling pipeline that is more accurate and actionable than current neural network approaches. The algorithms and applications presented in this thesis succeed in providing users with not just classifications but also the characteristics that indicate food safety risk and human trafficking.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 2019
Accession Number
AD1092388

Entities

People

  • Jessica H Zhu

Organizations

  • MIT Lincoln Laboratory

Tags

Communities of Interest

  • Human Systems

DTIC Thesaurus Topics

  • Business Administration
  • Computational Science
  • Data Mining
  • Dimensionality Reduction
  • Food Safety
  • Human Trafficking
  • Information Science
  • Knowledge Management
  • Law Enforcement Officers
  • Machine Learning
  • Maximum Likelihood Estimation
  • Natural Language Processing
  • Neural Networks
  • Ontologies
  • Operations Research
  • Supervised Machine Learning
  • United States

Fields of Study

  • Computer science
  • Engineering

Readers

  • Industrial Economics
  • Neural Network Machine Learning.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - DoD AI Strategy
  • AI & ML - Neural Networks