Header menu link for other important links
X
Sub-band based histogram equalization in cepstral domain for speech recognition
Raghavendra R. Bilgi,
Published in Elsevier
2015
Volume: 69
   
Pages: 46 - 65
Abstract
This paper describes a novel framework to sub-band based Histogram Equalization (HEQ) applied to robust speech recognition. We propose a frequency band specific equalization to compensate the noise distortion on the individual frequency bands. The proposed equalization framework is a two step process. In the first step, conventional histogram equalization is done. By analyzing the histograms of equalized cepstra, we show that the first stage of conventional HEQ approach does not compensate the sub-band specific noise distortion, even though the overall histogram is normalized. Hence, in the second stage, sub-band specific histogram equalization is done. Every frame of cepstral coefficients is decomposed into low-frequency (LF) cepstra and high-frequency (HF) cepstra. Separate equalization is done on LF and HF cepstra to compensate LF and HF specific noise distortion. The cepstra corresponding to the LF and HF bands are obtained by using simple averaging and differencing filters on the cepstral components within a particular frame. The proposed approach is referred to as Sub-band Histogram Equalization (S-HEQ). Using histogram analysis, we show that the S-HEQ approach is able to compensate for the sub-band specific noise distortion. S-HEQ approach shows a consistent improvement over the conventional HEQ approach with a relative improvement of 12% and 22.10% over conventional HEQ in WER on Aurora-2 and Aurora-4 databases respectively. Proposed equalization approach can also be used with the deep neural network based systems and has shown a consistent improvement in the recognition accuracies over conventional HEQ. Finally, the efficacy of the proposed S-HEQ approach for embedded real-time speech applications is shown by comparing the performance and computational complexity trade-off with other state-of-the-art noise compensation methods. © 2015 Elsevier B.V. All rights reserved.
About the journal
JournalData powered by TypesetSpeech Communication
PublisherData powered by TypesetElsevier
ISSN01676393
Open AccessNo
Concepts (14)
  •  related image
    Complex networks
  •  related image
    Economic and social effects
  •  related image
    Frequency bands
  •  related image
    Graphic methods
  •  related image
    Statistical methods
  •  related image
    CEPSTRAL COEFFICIENTS
  •  related image
    Deep neural networks
  •  related image
    Histogram analysis
  •  related image
    HISTOGRAM EQUALIZATIONS
  •  related image
    Recognition accuracy
  •  related image
    ROBUST SPEECH RECOGNITION
  •  related image
    S-HEQ
  •  related image
    SPEECH APPLICATIONS
  •  related image
    Speech recognition