Combination of generative models and SVM based classifier for speech emotion recognition
C. Chandra Sekhar
Published in 2009
Pages: 497 - 502
Abstract
Modeling time series data of varying length is important in different domains. There are two paradigms for modeling the varying length sequential data. Tasks such as speech recognition need modeling the temporal dynamics and the correlations among the features. Hidden Markov models (HMM) are used for these tasks. In tasks such as speaker recognition, audio classification and speech emotion recognition, modeling the temporal dynamics is not critical. Gaussian mixture models (GMM) are commonly used for these tasks. Generative models such as HMMs and GMMs focus on estimating the density of the data and are not suitable for classifying the data of confusable classes. Discriminative classifiers such as support vector machines (SVM) are suitable for the fixed dimensional patterns. In this paper, we propose a hybrid framework where a generative front end is used for representing the varying length time series data and then a discriminative model is used for classification. A score based approach and a segment modeling based approach are proposed in this framework. Both the approaches are applied for speech emotion recognition. The performance is compared with that of an SVM classifier that uses different statistical features and also with that of the GMM classifiers that use maximum likelihood method and the variational Bayes method for parameter estimation. Both the proposed approaches outperform the methods used for comparison. ©2009 IEEE.
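The hybrid idea in the abstract (a generative front end that turns a varying-length sequence into a fixed-dimensional pattern, followed by a discriminative classifier) can be sketched on toy data. This is only one plausible reading of the score based approach, not the authors' implementation: the abstract does not specify the features, the model sizes, or the SVM kernel. The sketch assumes a single diagonal Gaussian per class as a stand-in for a GMM, two classes instead of a full emotion set, synthetic frames instead of speech features, and a bias-free linear SVM trained with Pegasos-style subgradient steps. All function names are hypothetical.

```python
import math
import random

def fit_gaussian(frames):
    """Fit a diagonal Gaussian (a one-component stand-in for a GMM)."""
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, 1e-6)
           for d in range(dim)]
    return mean, var

def avg_loglik(seq, model):
    """Average per-frame log-likelihood of a sequence under one class model."""
    mean, var = model
    total = 0.0
    for frame in seq:
        for x, m, v in zip(frame, mean, var):
            total -= 0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(seq)

def score_vector(seq, models):
    """Generative front end: any-length sequence -> one score per class."""
    return [avg_loglik(seq, m) for m in models]

def train_linear_svm(X, y, lam=0.01, epochs=20, seed=0):
    """Bias-free linear SVM via Pegasos-style subgradient steps; y in {+1, -1}."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    order = list(range(len(X)))
    t = 0
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            t += 1
            eta = 1.0 / (lam * t)
            margin = y[i] * sum(wj * xj for wj, xj in zip(w, X[i]))
            w = [(1.0 - eta * lam) * wj for wj in w]  # regularisation shrink
            if margin < 1.0:                          # hinge loss is active
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
    return w

def make_sequence(rng, centre, dim=2):
    """Hypothetical toy utterance: 20 to 40 frames of noise around `centre`."""
    return [[rng.gauss(centre, 1.0) for _ in range(dim)]
            for _ in range(rng.randint(20, 40))]

rng = random.Random(1)
# Toy data (assumption): class 0 frames centred at 0.0, class 1 at 3.0.
train = [(make_sequence(rng, 3.0 * c), c) for c in (0, 1) for _ in range(20)]

# One generative model per class, fitted on that class's pooled frames.
models = [fit_gaussian([f for seq, c in train if c == k for f in seq])
          for k in (0, 1)]

# Fixed-length score vectors, centred so a bias-free SVM is adequate.
scores = [score_vector(seq, models) for seq, _ in train]
mu = [sum(s[d] for s in scores) / len(scores) for d in range(2)]
X = [[s[d] - mu[d] for d in range(2)] for s in scores]
y = [1 if c == 0 else -1 for _, c in train]
w = train_linear_svm(X, y)

def predict(seq):
    """Classify a new sequence from its centred score vector."""
    s = score_vector(seq, models)
    return 0 if sum(wj * (sj - mj) for wj, sj, mj in zip(w, s, mu)) > 0 else 1

accuracy = sum(predict(seq) == c for seq, c in train) / len(train)
```

The point of the design is that `score_vector` always returns one value per class model, whatever the sequence length, so the SVM only ever sees fixed-dimensional inputs. A multi-class version would need one model per emotion and a multi-class SVM scheme, neither of which is described in the abstract.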
About the journal
Journal: Proceedings of the International Joint Conference on Neural Networks
Open Access: No
Concepts (33)
  •  Audio classification
  •  Different domains
  •  Discriminative classifiers
  •  Discriminative models
  •  Front end
  •  Gaussian mixture model
  •  Generative model
  •  Hybrid framework
  •  Maximum likelihood methods
  •  Segment modeling
  •  Sequential data
  •  Speaker recognition
  •  Speech emotion recognition
  •  Statistical features
  •  SVM classifiers
  •  SVM-based classifiers
  •  Temporal dynamics
  •  Time-series data
  •  Variational-Bayes method
  •  Audio acoustics
  •  Blind source separation
  •  Classifiers
  •  Face recognition
  •  Hidden Markov models
  •  Image retrieval
  •  Magnetostrictive devices
  •  Maximum likelihood estimation
  •  Neural networks
  •  Object recognition
  •  Parameter estimation
  •  Support vector machines
  •  Time series
  •  Speech recognition