Computational Auditory Scene Analysis Based Perceptual and Neural Principles

Abstract

A remarkable feat of the auditory system is its ability to disentangle the acoustic mixture and group the acoustic energy from the same event. This fundamental process of auditory perception is called auditory scene analysis. of particular importance in auditory scene analysis is the separation of speech from interfering sounds, or speech segregation. Consistent with specified objectives, this project made major advances along the following three directions. First, the problem of multipitch tracking was investigated in the context of multiple sound sources, and a robust algorithm for multipitch tracking of noisy speech was developed. The second advance is in monaural separation of voiced speech, where a new system was proposed that employs different strategies in the low- and the high-frequency range. A key element of the system is amplitude modulation analysis in the high-frequency range. Third, the problem of location-based separation was studied in the joint feature space of interaural time difference and interaural intensity difference, and a novel classification approach was introduced to optimally determine whether a target sound dominates in local time-frequency units. All of the three models were comprehensively evaluated and shown to be substantially superior to existing approaches.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Mar 01, 2004
Accession Number
ADA421188

Entities

People

  • DeLiang Wang

Organizations

  • Ohio State University

Tags

Communities of Interest

  • Air Platforms
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Amplitude
  • Amplitude Modulation
  • Artificial Intelligence
  • Automated Speech Recognition
  • Cognitive Science
  • Frequency
  • Frequency Bands
  • Hidden Markov Models
  • Information Processing
  • Information Science
  • Information Systems
  • Modulation
  • Neural Networks
  • Recognition
  • Repetition Rate
  • Signal Processing

Readers

  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design

Technology Areas

  • Space