Separation of Singing Voice from Music Accompaniment for Monaural Recordings

Abstract

Separating singing voice from music accompaniment is useful in many applications, such as lyrics recognition and alignment, singer identification, and music information retrieval. Although speech separation has been studied extensively for decades, singing voice separation has received little attention. We propose a system to separate singing voice from music accompaniment for monaural recordings. Our system consists of three stages. The singing voice detection stage partitions an input and classifies it into vocal and non-vocal portions. For vocal portions, the predominant pitch detection stage detects the pitch of the singing voice, and the separation stage then uses the detected pitch to group the time-frequency segments of the singing voice. Quantitative results show that the system performs the separation task successfully.
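
The role of the detected pitch in the final stage can be illustrated with a minimal sketch. It is not the report's method: it assumes a per-frame pitch track is already available (with 0 marking frames classified as non-vocal) and swaps in a simple harmonic binary mask over STFT bins as a stand-in for grouping time-frequency segments. The function name `harmonic_mask` and the parameters `sample_rate`, `n_fft`, and `tol_hz` are hypothetical.

```python
import numpy as np


def harmonic_mask(pitch_hz, sample_rate=16000, n_fft=512, tol_hz=20.0):
    """Build a binary time-frequency mask from a per-frame pitch track.

    pitch_hz: 1-D sequence, one value per frame; 0 marks a non-vocal frame
              (an assumed convention, not taken from the report).
    Returns a boolean array of shape (n_frames, n_fft // 2 + 1) that is True
    where a frequency bin lies within tol_hz of a harmonic of the pitch.
    """
    pitch_hz = np.asarray(pitch_hz, dtype=float)
    # Center frequency of each non-negative STFT bin.
    bin_freqs = np.arange(n_fft // 2 + 1) * sample_rate / n_fft
    mask = np.zeros((pitch_hz.size, bin_freqs.size), dtype=bool)
    for t, f0 in enumerate(pitch_hz):
        if f0 <= 0:
            continue  # non-vocal frame: assign nothing to the voice
        # Index of the harmonic nearest to each bin (at least the first).
        harmonic_number = np.maximum(np.round(bin_freqs / f0), 1)
        mask[t] = np.abs(bin_freqs - harmonic_number * f0) <= tol_hz
    return mask


# Two frames: one non-vocal, one vocal with a 200 Hz pitch.
mask = harmonic_mask([0.0, 200.0])
```

Multiplying such a mask element-wise with the mixture's STFT magnitude would keep only the bins attributed to the voice; a real system would need a more careful grouping rule than a fixed tolerance in hertz.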

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2005
Accession Number
AD1001211

Entities

People

  • DeLiang Wang
  • Yipeng Li

Organizations

  • Ohio State University

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Algorithms
  • Change Detection
  • Cognitive Science
  • Computer Science
  • Databases
  • Detection
  • Detectors
  • Filtration
  • Frequency
  • Hidden Markov Models
  • Information Retrieval
  • Information Science
  • Machine Learning
  • Modulation
  • Network Science
  • Probability
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Computer Vision
  • Speech Processing/Speech Recognition

Technology Areas

  • AI & ML