Picking Deep Filter Responses for Fine-Grained Image Recognition

Abstract

Recognizing fine-grained sub-categories such as birds and dogs is extremely challenging due to the highly localized and subtle differences in some specific parts. Most previous works rely on object / part level annotations to build part-based representation, which is demanding in practical applications. This paper proposes an automatic fine-grained recognition approach which is free of any object / part annotation at both training and testing stages. Our method explores a unified framework based on two steps of deep filter response picking. The first picking step is to find distinctive filters which respond to specific patterns significantly and consistently, and learn a set of part detectors via iteratively alternating between new positive sample mining and part model retraining. The second picking step is to pool deep filter responses via spatially weighted combination of Fisher Vectors. We conditionally pick deep filter responses to encode them into the final representation, which considers the importance of filter responses themselves. Integrating all these techniques produces a much more powerful framework, and experiments conducted on CUB-200-2011 and Stanford Dogs demonstrate the superiority of our proposed algorithm over the existing methods.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 27, 2016
Accession Number
AD1048548

Entities

People

  • Hongkai Xiong
  • Qi Tian
  • Weiyao Lin
  • Wengang Zhou
  • Xiaopeng Zhang

Organizations

  • Shanghai Jiao Tong University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Awards
  • Birds
  • Computer Vision
  • Detection
  • Detectors
  • Feature Extraction
  • Filters
  • Image Recognition
  • Images
  • Information Science
  • Iterations
  • Layers
  • Learning
  • Recognition
  • Training

Fields of Study

  • Computer science

Readers

  • Adaptive Control and Estimation with Uncertainty in Dynamic Systems.
  • Computer Vision.
  • Educational Psychology

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks