Co-Channel Speaker Separation

Abstract

In the co-channel speaker separation problem, the goal is to recover two separate speech signals from a monaural channel which contains the sum of the two speech signals. A new methodology is developed that if given that a segment of co-channel speech is separated into a stronger and weaker segment, the correct assignment of these separated segments to the appropriate talker can be made using a Linear Predictive Coding (LPC) based minimum-prediction residual computation. The uniqueness of the developed technique is that no a priori information is required of the co-channel speech signal. The information needed to appropriately assign these separated segments from the co-channel speech signal are clean speech that is separate from the co-channel speech signal that are used to compute model LPC vectors. This clean speech is derived from the same channel that the co-channel speech signal is derived from. This technique has shown the ability to correctly assign the given stronger and weaker segments to the appropriate talker at signal-to-signal ratios down to equal power levels. The resulting separated speech is clearly understandable, and the interfering talker's speech signal is effectively eliminated. Co-Channel, Speaker Separation, Speech Processing, LPC, Itakura Minimum Prediction Residual.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 1992
Accession Number
ADA256443

Entities

People

  • Thomas S. Andrews

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Air Force
  • Classification
  • Computations
  • Computer Programming
  • Computers
  • Data Science
  • Data Sets
  • Databases
  • Detectors
  • Digital Signal Processing
  • Electrical Engineering
  • Information Science
  • Neural Networks
  • Recognition
  • Signal Processing
  • Sine Waves
  • Standards

Readers

  • Speech Processing/Speech Recognition.