Identity Preserve Transform: Understand What Activity Classification Models Have Learnt

Abstract

Activity classification has observed great success recently. The performance on small dataset is almost saturated and people are moving towards larger datasets. What leads to the performance gain on the model and what the model has learnt? In this paper we propose identity preserve transform (IPT) to study this problem. IPT manipulates the nuisance factors (background, viewpoint, etc.) of the data while keeping those factors related to the task (human motion) unchanged. To our surprise, we found popular models are using highly correlated information (background, object) to achieve high classification accuracy, rather than using the essential information (human motion). This can explain why an activity classification model usually fails to generalize to datasets it is not trained on. We implement IPT in two forms, i.e. image-space transform and 3D transform, using synthetic images. The tool will be made open-source to help study model and dataset design.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 14, 2020
Accession Number
AD1152569

Entities

People

  • Alan Yuille
  • Jialing Lyu
  • Weichao Qiu

Organizations

  • Johns Hopkins University
  • University of Science and Technology of China

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Accuracy
  • Artificial Intelligence Software
  • Computer Graphics
  • Computer Programs
  • Computer Stereo Vision
  • Computer Vision
  • Computers
  • Elevation
  • Graphics
  • Image Processing
  • Image Recognition
  • Machine Learning
  • Motion Capture
  • Neural Networks
  • Object Recognition
  • Test Sets
  • Three Dimensional
  • Visualizations

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Neural Network Machine Learning.
  • Space/Atmospheric Physics.

Technology Areas

  • Space
  • Space - Space Objects