Modified cepstral mean normalization - Transforming to utterance specific non-zero mean
Published by the International Speech Communication Association
2013
Pages: 881 - 885
Abstract
Cepstral Mean Normalization (CMN) is a widely used technique for channel compensation and noise robustness. CMN compensates for noise by transforming both train and test utterances to zero mean, thus matching the first-order moments of the train and test conditions. Since all utterances are normalized to zero mean, CMN can lead to a loss of discriminative speech information, especially for short utterances. In this paper, we modify CMN to reduce this loss by transforming every noisy test utterance to an estimate of its clean utterance mean (the mean the utterance would have if noise were not present) rather than to zero mean. A look-up-table-based approach is proposed to estimate the clean mean of the noisy utterance. The proposed method is particularly relevant for IVR-based applications, where utterances are usually short and noisy. In such cases, techniques like Histogram Equalization (HEQ) do not perform well, and a simple approach like CMN leads to a loss of discrimination. We obtain a 12% relative improvement in word error rate (WER) over CMN on the Aurora-2 database; when we analyze only short utterances, we obtain relative improvements of 5% and 25% in WER over CMN and HEQ, respectively. Copyright © 2013 ISCA.
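The contrast between standard CMN and the modified version described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how the look-up table is built or queried, so the nearest-neighbour look-up below, along with the names `estimate_clean_mean`, `modified_cmn`, `table_noisy_means`, and `table_clean_means`, is a hypothetical stand-in for illustration and not the authors' implementation.

```python
import numpy as np


def cmn(features):
    """Standard CMN: subtract the utterance mean from every frame.

    features: array of shape (num_frames, num_cepstral_coeffs).
    """
    features = np.asarray(features, dtype=float)
    return features - features.mean(axis=0, keepdims=True)


def estimate_clean_mean(noisy_mean, table_noisy_means, table_clean_means):
    """Hypothetical look-up: return the clean mean paired with the stored
    noisy mean closest (Euclidean distance) to the observed noisy mean.
    The paper's actual table design is not given in the abstract.
    """
    distances = np.linalg.norm(table_noisy_means - noisy_mean, axis=1)
    return table_clean_means[np.argmin(distances)]


def modified_cmn(features, table_noisy_means, table_clean_means):
    """Shift the utterance to an estimated clean mean instead of zero."""
    features = np.asarray(features, dtype=float)
    noisy_mean = features.mean(axis=0)
    clean_mean = estimate_clean_mean(
        noisy_mean, table_noisy_means, table_clean_means
    )
    # Remove the noisy mean, then add back the estimated clean mean.
    return features - noisy_mean + clean_mean
```

After `cmn`, every utterance has zero mean in each cepstral dimension; after `modified_cmn`, the utterance mean equals the looked-up clean-mean estimate, so utterance-specific information carried by the mean is retained to the extent the estimate is accurate.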
About the journal
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publisher: International Speech Communication Association
ISSN: 2308-457X
Open Access: No
Concepts (10)
  •  Table lookup
  •  CEPSTRAL MEAN NORMALIZATION
  •  CHANNEL COMPENSATION
  •  CMN
  •  CMVN
  •  FIRST-ORDER MOMENTS
  •  HEQ
  •  HISTOGRAM EQUALIZATIONS
  •  ROBUST SPEECH RECOGNITION
  •  Speech recognition