Augmented Annotation Phase 3

Abstract

Automated visual object detection is an important capability in reducing the burden on human operators in many DoD applications. To train modern deep learning algorithms to recognize desired objects, the algorithms must be "fed" more than 1000 labeled images (for 55%85%accuracy according to project Maven - Oct 2017 O6, Working Group slide 27) of each particular object. The task of labeling training data for use in machine learning algorithms is human intensive, requires special software, and takes a great deal of time. Estimates from ImageNet, a widely used and publicly available visual object detection dataset, indicate that humans generated four annotations per minute in the overall production of ImageNet annotations. DoD's need is to reduce direct object-by-object human labeling particularly in the video domain where data quantity can be significant. The Augmented Annotations System addresses this need by leveraging a small amount of human annotation effort to propagate human initiated annotations through video to build an initial labeled dataset for training an object detector, and utilizing an automated object detector in an iterative loop to assist humans in pre-annotating new datasets.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 09, 2020
Accession Number
AD1104050

Entities

People

  • L. Lee

Organizations

  • MIT Lincoln Laboratory

Tags

Communities of Interest

  • Autonomy

DTIC Thesaurus Topics

  • Accuracy
  • Air Force
  • Algorithms
  • Computer Vision
  • Computers
  • Data Sets
  • Data Storage Systems
  • Deep Learning
  • Department Of Defense
  • Detection
  • Detectors
  • Drone Targeting
  • Learning
  • Lessons Learned
  • Machine Learning
  • Training
  • User Interface

Fields of Study

  • Computer science

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks