Recognizing Activities via Bag of Words for Attribute Dynamics (Open Access)

Abstract

In this work, we propose a novel video representation for activity recognition that models video dynamics with attributes of activities. A video sequence is decomposed into short-term segments, which are characterized by the dynamics of their attributes. These segments are modeled by a dictionary of attribute dynamics templates, which are implemented by a recently introduced generative model, the binary dynamic system~(BDS). We propose methods for learning a dictionary of BDSs from a training corpus, and for quantizing attribute sequences extracted from videos into these BDS code words. This procedure produces a representation of the video as a histogram of BDS code words, which is denoted the bag-of-words for attribute dynamics (BoWAD). An extensive experimental evaluation reveals that this representation outperforms other state-of-the-art approaches in temporal structure modeling for complex activity recognition.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 03, 2013
Accession Number
AD1037277

Entities

People

  • Harpreet Sawhney
  • Nuno Vasconcelos
  • Qian Yu
  • Weixin Li

Organizations

  • Sarnoff Corporation

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Artificial Intelligence Software
  • Bayesian Networks
  • Bernoulli Distribution
  • Computer Vision
  • Detection
  • Dictionaries
  • Event Detection
  • Hidden Markov Models
  • Image Classification
  • Information Science
  • Machine Learning
  • Markov Processes
  • Models
  • Probability
  • Recognition
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Artificial Intelligence
  • Computational Fluid Dynamics (CFD)
  • Computer Vision.

Technology Areas

  • AI & ML