Scale-invariant speech analysis via joint time-frequency-scale processing

Srinivasan Umesh; Leon Cohen; Nenad M. Marinovich; Douglas J. Nelson


Other


Published in: 1995

Volume: 2569

Issue: 2/-

Pages: 522 - 537

Abstract

We argue that an important aspect of the human speech signal is scaling in the frequency domain. We discuss the two physical mechanisms responsible for this scaling. The first is that when the fundamental of a harmonic signal is frequency modulated, the spectrum is a sum of scaled functions. The second arises because different speakers have vocal tracts of very different sizes (for example, an adult and a child), yet they nonetheless produce speech that is similar in some sense. We argue, and present evidence to show, that these speaker differences result in scaling in the frequency domain. We further discuss how scale processing can be handled.
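
As an illustration of why frequency scaling can be factored out, the following is a minimal sketch, not the authors' method. It assumes a simplified Mellin-type transform: the spectrum is sampled on a logarithmic frequency axis, where scaling by a factor alpha becomes a shift by ln(alpha), and the magnitude of a Fourier transform along that axis is then insensitive to the shift. The Gaussian "formant" envelope, the grid limits, and the names `formant_envelope`, `log_axis_samples`, and `alpha` are all hypothetical choices made for this example.

```python
import numpy as np

def formant_envelope(f, centre=700.0, width=0.3):
    """Hypothetical spectral envelope: a Gaussian bump in log-frequency."""
    return np.exp(-0.5 * (np.log(f / centre) / width) ** 2)

def log_axis_samples(envelope, f_min=50.0, f_max=8000.0, n=1024):
    """Sample envelope(f) on a logarithmically spaced frequency grid."""
    log_f = np.linspace(np.log(f_min), np.log(f_max), n)
    return envelope(np.exp(log_f))

def scale_magnitude(samples):
    """Magnitude of the Fourier transform taken along the log-frequency axis."""
    return np.abs(np.fft.rfft(samples))

alpha = 1.3  # assumed scale factor between two speakers

ref_samples = log_axis_samples(formant_envelope)
scaled_samples = log_axis_samples(lambda f: formant_envelope(f / alpha))

# The envelopes themselves differ noticeably...
envelope_diff = float(np.max(np.abs(ref_samples - scaled_samples)))

# ...but their magnitudes along the log-frequency axis nearly coincide,
# because scaling only shifts the samples and a shift only changes phase.
magnitude_diff = float(
    np.max(np.abs(scale_magnitude(ref_samples) - scale_magnitude(scaled_samples)))
)

print(f"envelope difference:  {envelope_diff:.3e}")
print(f"magnitude difference: {magnitude_diff:.3e}")
```

The invariance holds here only because the envelope decays to nearly zero well inside the sampled band; an envelope truncated by the grid edges would not shift cleanly.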

About the journal

Journal	Proceedings of SPIE - The International Society for Optical Engineering
ISSN	0277-786X
Open Access	No

Authors (1)

Srinivasan Umesh
- Department of Electrical Engineering
