Real-Time Speech/Music Classification With a Hierarchical Oblique Decision Tree

Abstract

In the problem of classification of audio signals, the requirements of low-complexity, high-accuracy and short delay are crucial for some practical scenarios. This paper proposes a method of real-time speech/music classification with a hierarchical oblique decision tree. A set of discrimination features in frequency domain are selected together with a proposed simple harmonic structure stability feature, which is based on a rough estimation of the harmonic structure. A feature subset selection tool is used to select a subset of short and long term features to feed into a hierarchical oblique decision tree classifier. The method is evaluated and compared with the open loop selection mode in AMR-WB+. Experiments show the proposed approach gives a better performance (98.3%) compared to other prevailing approaches. In particular, it comes with promising short delay of 10 ms and low complexity of 1 wmops.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 2008
Accession Number
ADA505115

Entities

People

  • Haojiang Deng
  • Jun Wang
  • Qin Yan
  • Qiong Wu

Organizations

  • Chinese Academy of Sciences

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Algorithms
  • Amplitude
  • Classification
  • Coefficients
  • Computing-Related Activities
  • Discrimination
  • Feature Selection
  • Frequency
  • Frequency Bands
  • Frequency Domain
  • Machine Learning
  • Sequences
  • Signal Processing
  • Time Domain
  • Training

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Marine Hydrodynamics
  • Systems Analysis and Design