Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time (Open Access)

Abstract

We present a weakly-supervised visual data mining approach that discovers connections between recurring mid-level visual elements in historic (temporal) and geographic (spatial) image collections, and attempts to capture the underlying visual style. In contrast to existing discovery methods that mine for patterns that remain visually consistent throughout the dataset, our goal is to discover visual elements whose appearance changes due to change in time or location; i.e., exhibit consistent stylistic variations across the label space (date or geo-location). To discover these elements, we first identify groups of patches that are style-sensitive. We then incrementally build correspondences to find the same element across the entire dataset. Finally, we train style-aware regressors that model each element's range of stylistic differences. We apply our approach to date and geo-location prediction and show substantial improvement over several baselines that do not model visual style. We also demonstrate the method's effectiveness on the related task of fine-grained classification.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 03, 2014
Accession Number
AD1039805

Entities

People

  • Alexei A. Efros
  • Martial Hebert
  • Yong Jae Lee

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Accuracy
  • Air Force Research Laboratories
  • Birds
  • Computer Vision
  • Data Mining
  • Detection
  • Detectors
  • Geographic Regions
  • Object Recognition
  • Recognition
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Sensor Fusion and Tracking Systems.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks
  • Space