Smart Room: Participant and Speaker Localization and Identification

Abstract

Our long-term objective is to create Smart Room Technologies that are aware of the users presence and their behavior and can become an active, but not an intrusive, part of the interaction. In this work, we present a multimodal approach for estimating and tracking the location and identity of the participants including the active speaker. Our smart room design contains three user-monitoring systems: four CCD cameras, an omnidirectional camera and a 16channel microphone array. The various sensory modalities are processed both individually and jointly and it is shown that the multimodal approach results in significantly improved performance in spatial localization, identification and speech activity detection of the participants.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2005
Accession Number
AD1171494

Entities

People

  • Carlos Busso
  • Chi-wei Chu
  • Isaac Cohen
  • Panayiotis G. Georgiou
  • Sergi Hernanz
  • Shrikanth Narayanan
  • Soon-il Kwon
  • Sung Lee

Organizations

  • Department of Computer Science, University of Oxford
  • University of Southern California

Tags

Communities of Interest

  • Materials and Manufacturing Processes
  • Sensors

DTIC Thesaurus Topics

  • Acoustic Signals
  • Algorithms
  • Computer Science
  • Computer Vision
  • Computers
  • Detection
  • Electrical Engineering
  • Engineering
  • Event Detection
  • Far Field
  • Gaussian Distributions
  • Identification
  • Image Processing
  • Information Processing
  • Pattern Recognition
  • Probability
  • Recognition
  • Signal Processing
  • Silhouettes
  • Three Dimensional

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Sensor Fusion and Tracking Systems.
  • Speech Processing/Speech Recognition.