Non-uniform speaker normalization using affine-transformation

Srinivasan Umesh; Ravi Kumar; Nandan Kumar Sinha

Profiles Research Units Publications

Other

Non-uniform speaker normalization using affine-transformation

, ,

Published in

2004

Volume: 1

Pages: 121 - 124

Abstract

In this paper, we propose a mathematical model to describe the relation between the formant frequencies of speakers and show that with the proposed affine model, speaker differences separate out as translation factors when a "mel-like" warping is performed. Using speech data we estimate the parameters of this warping function and show that it is close to the usual mel-formula. This model is motivated by Rohit et al.'s shift-based non-uniform speaker-normalization method, which provides improvement over the conventional maximum-likelihood based speaker normalization methods. We therefore provide a unified framework that relates the relationship between formants of speakers and method of removing speakers difference (which involves mel-warping) in a neat mathematical framework which is substantiated by our recognition experiments.

About the journal

Journal	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN	15206149
Open Access	No

Authors (3)

Srinivasan Umesh
- Department of Electrical Engineering
Ravi Kumar
- Department of Metallurgical and Materials Engineering
Nandan Kumar Sinha
- Department of Aerospace Engineering

ABOUT IIT MADRAS

R & D

RANKINGS & ACHIEVEMENTS

QUICK FIND