Header menu link for other important links
X
Subspace based for Indian languages
Published in
2012
Pages: 35 - 39
Abstract
The interest in this paper is in efficient configuration of automatic speech recognition (ASR) systems for use by under-served speaker populations. A task domain involving Indian farmers accessing information on agricultural commodities through a spoken dialog system in multiple languages is presented. To facilitate the development of ASR system for this domain, a speech corpus was collected in rural areas from speakers of four languages over wireless cellular channels. This paper investigates the problem of ASR acoustic modelling for this task domain. Continuous density hidden Markov model (CDHMM) and subspace Gaussian mixture model (SGMM) [1] based techniques are used to train acoustic models in four languages: Assamese, Bengali, Hindi and Marathi. Issues relating to limited linguistic resources with their impact on ASR word accuracy for these languages are addressed. © 2012 IEEE.
About the journal
Journal2012 11th International Conference on Information Science, Signal Processing and their Applications, ISSPA 2012
Open AccessNo
Concepts (19)
  •  related image
    Acoustic model
  •  related image
    ACOUSTIC MODELLING
  •  related image
    AGRICULTURAL COMMODITIES
  •  related image
    Automatic speech recognition system
  •  related image
    CONTINUOUS DENSITY HIDDEN MARKOV MODELS
  •  related image
    GAUSSIAN MIXTURE MODEL
  •  related image
    Indian languages
  •  related image
    LINGUISTIC RESOURCES
  •  related image
    Multiple languages
  •  related image
    SPEECH CORPORA
  •  related image
    SPOKEN DIALOG SYSTEMS
  •  related image
    SUBSPACE BASED
  •  related image
    TASK DOMAIN
  •  related image
    WORD ACCURACIES
  •  related image
    Agriculture
  •  related image
    Information science
  •  related image
    Rural areas
  •  related image
    Signal processing
  •  related image
    Speech recognition