Microsegment-based connected digit recognition

C. S. Ramalingam; John J. Godfrey; Rangarajan Aravind; Joseph Picone


Other


Published in IEEE, Piscataway, NJ, United States

1997

Volume: 3

Pages: 1755 - 1758

Abstract

By building acoustic phonetic models which explicitly represent as much knowledge of pronunciation in a small domain (the digits) as possible, we can create a recognition system which not only performs well but allows for meaningful error analysis and improvement. An HMM-based recognizer for the digits and a few associated words was constructed in accord with these principles. About 65 phonetic models were trained on 140 carefully labeled utterances, then iteratively trained on unlabeled data under orthographic supervision. The basic system achieved less than 3% word error rate on digit strings of unknown length from unseen test speakers, and 1.4% on 7-digit strings of known length. This is competitive with word-based models using the same HMM engine and similar parameter settings. As an R&D system, it allows meaningful analysis of errors and relatively straightforward means of improvement.

About the journal

Journal	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Publisher	IEEE, Piscataway, NJ, United States
ISSN	0736-7791
Open Access	No

Authors (2)

C. S. Ramalingam
- Department of Electrical Engineering
Rangarajan Aravind
- Department of Electrical Engineering
