Multi-variability speech database for robust speaker recognition

B.C. Haris; G. Pradhan; A. Misra; S. Shukla; R. Sinha; Prasanna S

doi:10.1109/NCC.2011.5734775

Profiles Research Units Publications

Conferences

Multi-variability speech database for robust speaker recognition

B.C. Haris, G. Pradhan, A. Misra, S. Shukla, R. Sinha,

Published in

2011

DOI: 10.1109/NCC.2011.5734775

Abstract

In this paper, we present our initial study with the recently collected speech database for developing robust speaker recognition systems in Indian context. The database contains the speech data collected across different sensors, languages, speaking styles, and environments, from 200 speakers. The speech data is collected across five different sensors in parallel, in English and multiple Indian languages, in reading and conversational speaking styles, and in office and uncontrolled environments such as laboratories, hostel rooms and corridors etc. The collected database is evaluated using adapted Gaussian mixture model based speaker verification system following the NIST 2003 speaker recognition evaluation protocol and gives comparable performance to those obtained using NIST data sets. Our initial study exploring the impact of mismatch in training and test conditions with collected data finds that the mismatch in sensor, speaking style, and environment result in significant degradation in performance compared to the matched case whereas for language mismatch case the degradation is found to be relatively smaller. © 2011 IEEE.