Speaker-dependent Multipitch Tracking Using Deep Neural Networks

Abstract

Multipitch tracking is important for speech and signal processing. However, it is challenging to design an algorithm that achieves accurate pitch estimation and correct speaker assignment at the same time. In this paper, we use deep neural networks (DNNs) to model the probabilistic pitch states of two simultaneous speakers. To capture speaker-dependent information, we propose two types of DNN with different training strategies. Thefirst is trained for each speaker enrolled in the system (speaker-dependent DNN), and the second is trained for each speaker pair (speaker-pair-dependent DNN). Several extensions, including gender-pair-dependent DNNs, speaker adaptation of gender-pair-dependent DNNs and multi-ratio training, are introduced later to relax constraints. A factorial hidden Markov model (FHMM) then integrates pitch probabilities and generates the most likely pitch tracks with a junction tree algorithm. Experiments show that the proposed methods substantially outperform other speaker-independent and speaker-dependent multipitch trackers on two-speaker mixtures. With multi-ratio training, our methods achieve consistent performance at various energies ratios of the two speakers in a mixture.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2015
Accession Number
AD1007863

Entities

People

  • DeLiang Wang
  • Yuzhou Liu

Organizations

  • Ohio State University

Tags

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Artificial Intelligence Software
  • Automated Speech Recognition
  • Computational Complexity
  • Computer Science
  • Computers
  • Databases
  • Engineering
  • Hidden Markov Models
  • Markov Chains
  • Neural Networks
  • Probabilistic Models
  • Probability
  • Probability Distributions
  • Recognition
  • Test Sets

Fields of Study

  • Computer science

Readers

  • Applied Combinatorial Optimization and Logic Circuit Design.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Neural Networks