Combination of generative models and SVM based classifier for speech emotion recognition

C. Chandra Sekhar

doi:10.1109/IJCNN.2009.5178777

Published in: 2009

Pages: 497-502

Abstract

Modeling time series data of varying length is important in different domains. There are two paradigms for modeling varying-length sequential data. Tasks such as speech recognition require modeling the temporal dynamics and the correlations among the features; hidden Markov models (HMMs) are used for these tasks. In tasks such as speaker recognition, audio classification and speech emotion recognition, modeling the temporal dynamics is not critical, and Gaussian mixture models (GMMs) are commonly used. Generative models such as HMMs and GMMs focus on estimating the density of the data and are not well suited to classifying data from confusable classes. Discriminative classifiers such as support vector machines (SVMs) are suited to fixed-dimensional patterns. In this paper, we propose a hybrid framework in which a generative front end represents the varying-length time series data and a discriminative model then performs classification. A score-based approach and a segment-modeling-based approach are proposed in this framework. Both approaches are applied to speech emotion recognition. Their performance is compared with that of an SVM classifier that uses different statistical features and with that of GMM classifiers that use the maximum likelihood method and the variational Bayes method for parameter estimation. Both proposed approaches outperform the methods used for comparison. ©2009 IEEE.
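
The idea of a generative front end that turns a varying-length sequence into a fixed-dimensional pattern can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a single diagonal-covariance Gaussian per class as a stand-in for a GMM, synthetic two-dimensional frames, and hypothetical emotion labels ("calm", "angry"). Every function name here is invented for illustration. Each sequence, whatever its length, is mapped to one average log-likelihood score per class model.

```python
import math
import random


def fit_diag_gaussian(frames):
    """Maximum-likelihood mean and variance per feature dimension."""
    n = len(frames)
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Floor the variance so the log-likelihood stays finite.
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, 1e-6)
           for d in range(dim)]
    return mean, var


def avg_log_likelihood(sequence, model):
    """Average per-frame log-likelihood of a sequence under one class model."""
    mean, var = model
    total = 0.0
    for frame in sequence:
        for x, m, v in zip(frame, mean, var):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(sequence)


def score_vector(sequence, models):
    """Map a sequence of any length to one score per class (sorted by label)."""
    return [avg_log_likelihood(sequence, models[label])
            for label in sorted(models)]


# Synthetic data: hypothetical labels and feature values, for illustration only.
rng = random.Random(0)


def make_sequence(center, length):
    return [[rng.gauss(center, 1.0), rng.gauss(-center, 1.0)]
            for _ in range(length)]


train = {
    "calm": [make_sequence(0.0, n) for n in (20, 35, 50)],
    "angry": [make_sequence(3.0, n) for n in (25, 40, 60)],
}

# One generative model per class, fitted on all frames of that class.
models = {label: fit_diag_gaussian([f for seq in seqs for f in seq])
          for label, seqs in train.items()}

# Sequences of different lengths yield score vectors of the same dimension.
short_scores = score_vector(make_sequence(3.0, 15), models)
long_scores = score_vector(make_sequence(3.0, 80), models)
print(len(short_scores), len(long_scores))
```

Because every score vector has one entry per class model, vectors like these could serve as input to any classifier that expects fixed-dimensional patterns, such as an SVM. The SVM stage is omitted here to keep the sketch free of third-party dependencies.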

Topics: Generative model (62%), Speaker recognition (61%), Discriminative model (58%), Hidden Markov model (58%) and Support vector machine (54%)

About the journal

Journal: Proceedings of the International Joint Conference on Neural Networks
Open Access: No

Concepts (33)

Audio classification
Different domains
Discriminative classifiers
DISCRIMINATIVE MODELS
Front end
GAUSSIAN MIXTURE MODEL
Generative model
Hybrid framework
Maximum likelihood methods
SEGMENT MODELING
SEQUENTIAL DATA
SPEAKER RECOGNITION
SPEECH EMOTION RECOGNITION
STATISTICAL FEATURES
SVM classifiers
SVM-BASED CLASSIFIERS
Temporal dynamics
Time-series data
VARIATIONAL-BAYES METHOD
Audio acoustics
Blind source separation
Classifiers
Face recognition
Hidden Markov models
Image retrieval
MAGNETOSTRICTIVE DEVICES
Maximum likelihood estimation
Neural networks
Object recognition
Parameter estimation
Support vector machines
Time series
Speech recognition
