Adaptive Feature Aggregation for Video Object Detection

Abstract

Object detection, as a fundamental research topic of computer vision, is facing the challenges of video-related tasks. Objects in videos tend to be blurred, occluded, or out of focus more frequently. Existing works adopt feature aggregation and enhancement to design video-based object detectors. However, most of them do not consider the diversity of object movements and the quality of aggregated context features. Thus, they can not generate comparable results given blurred or crowded videos. In this paper, we propose an adaptive feature aggregation method for video object detection to deal with these problems. We introduce an adaptive quality-similarity weight, with a sparse and dense temporal aggregation policy, into our model. Compared with both image-based and video-based baselines on ImageNet and VIRAT datasets, our work consistently demonstrates better performance.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Mar 01, 2020
Accession Number: AD1154807

Entities

People

Alexander G. Hauptmann
Guoliang Kang
Lijun Yu
Wenhe Liu
Yijun Qian

Organizations

Carnegie Mellon University

Adaptive Feature Aggregation for Video Object Detection

Abstract

Document Details

Entities

People

Organizations

Tags

DTIC Thesaurus Topics

Fields of Study

Readers

Technology Areas