Text and language-independent speaker recognition using suprasegmental features and support vector machines
Anvita Bajpai
Published in 2009
Volume: 40
Pages: 307-317
Abstract
In this paper, the presence of speaker-specific suprasegmental information in the Linear Prediction (LP) residual signal is demonstrated. The LP residual signal is obtained by removing the predictable part of the speech signal. This information, if added to existing speaker recognition systems based on segmental and subsegmental features, can result in a better-performing combined system. The speaker-specific suprasegmental information can not only be perceived by listening to the residual, but can also be seen in the form of excitation peaks in the residual waveform. However, the challenge lies in capturing this information from the residual signal. Higher-order correlations among samples of the residual are not known to be captured by standard signal processing and statistical techniques. The Hilbert envelope of the residual is shown to further enhance the excitation peaks present in the residual signal. A speaker-specific pattern is also observed in the autocorrelation sequence of the Hilbert envelope, and further in the statistics of this autocorrelation sequence. This indicates the presence of speaker-specific suprasegmental information in the residual signal. In this work, no distinction is made between voiced and unvoiced sounds when extracting these features. A Support Vector Machine (SVM) is used to classify the patterns in the variance of the autocorrelation sequence for the speaker recognition task. © 2009 Springer Berlin Heidelberg.
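The feature chain described in the abstract (LP residual, Hilbert envelope, autocorrelation, variance of the autocorrelation) can be illustrated with a rough sketch. This is not the paper's implementation: the abstract gives no frame length, LP order, lag range, or exact definition of the variance, so the values below (240-sample frames, 120-sample hop, order 10, 40 lags, variance taken per lag across frames) and all function names are assumptions chosen for illustration. The sketch uses only the Python standard library, with a direct O(N^2) DFT for the analytic signal, so it is suited to short frames only.

```python
import cmath
import math


def autocorr(x, max_lag):
    """Unnormalised autocorrelation sums r[0..max_lag] of a real sequence."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def lp_coefficients(frame, order):
    """Levinson-Durbin recursion; returns a[1..order] with x[n] ~ sum_k a[k] x[n-k]."""
    r = autocorr(frame, order)
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0:  # silent or degenerate frame: stop, remaining coefficients stay zero
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1 - k * k
    return a[1:]


def lp_residual(frame, order=10):
    """Prediction error: the frame minus its linearly predictable part."""
    a = lp_coefficients(frame, order)
    res = []
    for n in range(len(frame)):
        pred = sum(a[k] * frame[n - 1 - k] for k in range(min(order, n)))
        res.append(frame[n] - pred)
    return res


def hilbert_envelope(x):
    """Magnitude of the analytic signal, computed with a direct DFT."""
    n = len(x)
    w = [cmath.exp(-2j * math.pi * k / n) for k in range(n)]
    spec = [sum(x[t] * w[(k * t) % n] for t in range(n)) for k in range(n)]
    gain = [0.0] * n  # keep DC, double positive frequencies, zero negative ones
    gain[0] = 1.0
    if n % 2 == 0:
        gain[n // 2] = 1.0
        upper = n // 2
    else:
        upper = (n + 1) // 2
    for k in range(1, upper):
        gain[k] = 2.0
    return [abs(sum(spec[k] * gain[k] * w[(k * t) % n].conjugate()
                    for k in range(n)) / n)
            for t in range(n)]


def envelope_autocorr(frame, order=10, max_lag=40):
    """Autocorrelation of the residual's Hilbert envelope, normalised by lag 0."""
    env = hilbert_envelope(lp_residual(frame, order))
    r = autocorr(env, max_lag)
    return [v / r[0] for v in r] if r[0] > 0 else [0.0] * (max_lag + 1)


def variance_features(signal, frame_len=240, hop=120, order=10, max_lag=40):
    """Per-lag variance, across frames, of the envelope autocorrelation (assumed definition)."""
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    rows = [envelope_autocorr(signal[s:s + frame_len], order, max_lag)
            for s in range(0, len(signal) - frame_len + 1, hop)]
    feats = []
    for lag in range(1, max_lag + 1):
        col = [row[lag] for row in rows]
        mean = sum(col) / len(col)
        feats.append(sum((v - mean) ** 2 for v in col) / len(col))
    return feats
```

In a full system along the lines of the abstract, one such feature vector per utterance (or per block of frames) would be passed to an SVM classifier trained on labelled speakers. That step is left out here because it needs a third-party library. Note that every frame is processed the same way, consistent with the abstract's statement that voiced and unvoiced sounds are not distinguished.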
About the journal
Journal: Communications in Computer and Information Science
ISSN: 1865-0929
Open Access: No
Concepts (15)
  •  HILBERT ENVELOPE
  •  LINEAR PREDICTION
  •  SPEAKER RECOGNITION
  •  SUPPORT VECTOR
  •  SUPRASEGMENTAL FEATURES
  •  Autocorrelation
  •  Character recognition
  •  Feature extraction
  •  Forecasting
  •  Gears
  •  Microphones
  •  Signal processing
  •  Support vector machines
  •  Vectors
  •  Speech recognition