Header menu link for other important links
X
Two-pass IB based speaker diarization system using meeting-specific ANN based features
, , Nauman Dawalatabad, Srikanth Madikeri
Published in International Speech and Communication Association
2016
Volume: 08-12-September-2016
   
Pages: 2199 - 2203
Abstract
In this paper, we present a two-pass Information Bottleneck (IB) based system for speaker diarization which uses meetingspecific artificial neural network (ANN) based features. We first use IB based speaker diarization system to get the labelled speaker segments. These segments are re-segmented using Kullback-Leibler Hidden Markov Model (KL-HMM) based re-segmentation. The multi-layer ANN is then trained to discriminate these speakers using the re-segmented output labels and the spectral features. We then extract the bottleneck features from the trained ANN and perform principal component analysis (PCA) on these features. After performing PCA, these bottleneck features are used along with the different spectral features in the second pass using the same IB based system with KL-HMM re-segmentation. Our experiments on NIST RT and AMI datasets show that the proposed system performs better than the baseline IB system in terms of speaker error rate (SER) with a best case relative improvement of 28.6% amongst AMI datasets and 27.1% on NIST RT04eval dataset. Copyright © 2016 ISCA.
About the journal
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
PublisherInternational Speech and Communication Association
ISSN2308457X
Open AccessYes
Concepts (12)
  •  related image
    Hidden markov models
  •  related image
    Markov processes
  •  related image
    Neural networks
  •  related image
    Speech communication
  •  related image
    Speech processing
  •  related image
    BOTTLENECK FEATURES
  •  related image
    ERROR RATE
  •  related image
    INFORMATION BOTTLENECK
  •  related image
    KULLBACK-LEIBLER
  •  related image
    SPEAKER DIARIZATION
  •  related image
    Spectral feature
  •  related image
    Principal component analysis