Emotion Recognition From Speech Signals via a Probabilistic Echo-State Network
Abstract
The paper presents a probabilistic echo-state network (pi-ESN) for density estimation over variable-length sequences of multivariate random vectors. The pi-ESN stems from the combination of the reservoir of an ESN and a parametric density model based on radial basis functions. A constrained maximum likelihood training algorithm is introduced, suitable for sequence classification. Extensions of the algorithm to unsupervised clustering and semi-supervised learning (SSL) of sequences are proposed. Experiments in emotion recognition from speech signals are conducted on the WaSeP (C) dataset. Compared with established techniques, the pi-ESN yields the highest recognition accuracies, and shows interesting clustering and SSL capabilities
Document Details
- Document Type
- Technical Report
- Publication Date
- Nov 15, 2015
- Accession Number
- AD1170911
Entities
People
- Edmondo Trentin
- Friedhelm Schwenker
- Stefan Scherer
Organizations
- Ulm University
- University of Southern California