Robust optimal sub-band wavelet cepstral coefficient method for speech recognition
Published in Inderscience Publishers
Volume: 11
Issue: 2
Pages: 163 - 173
This paper proposes a robust feature extraction technique for speech recognition systems that remains reliable in adverse environments. The efficacy of a speech recognition system depends on its feature extraction method. The paper proposes auditory-scale-like filter banks built by optimal sub-band tree structuring based on the wavelet transform. The optimised wavelet filter banks, together with energy, logarithm, discrete cosine transform and cepstral mean normalisation blocks, form a robust feature extraction method. The method is validated on a hidden Markov model (HMM)-based single-Gaussian isolated word recognition system under additive white Gaussian noise, street noise and airport noise at different noise levels. Compared with Fourier transform-based methods such as the mel-frequency cepstral coefficient (MFCC) and perceptual linear predictive (PLP) methods, the wavelet transform-based method yielded significant improvement across all noise levels. Experiments were also performed with higher-dimensional MFCC features that include delta and acceleration features (MFCC-D-A). The study shows that the wavelet transform-based method increases recognition accuracy by 13% over MFCC-D-A for non-stationary noises. © 2019 Inderscience Enterprises Ltd.
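The chain of blocks named in the abstract (wavelet sub-band filter bank, energy, logarithm, discrete cosine transform, cepstral mean normalisation) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Haar wavelet, a full wavelet packet tree of fixed depth 3 (the paper instead optimises the sub-band tree to approximate an auditory scale), the frame length, the hop size, and all function names.

```python
import math

def haar_split(x):
    """One Haar analysis step: returns (approximation, detail) at half length."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    return approx, detail

def wavelet_packet_bands(frame, depth=3):
    """Full wavelet packet tree (assumed, not the paper's optimal tree).
    Returns 2**depth sub-band signals in natural tree order.
    len(frame) should be divisible by 2**depth."""
    bands = [list(frame)]
    for _ in range(depth):
        next_bands = []
        for band in bands:
            approx, detail = haar_split(band)
            next_bands.extend([approx, detail])
        bands = next_bands
    return bands

def dct2(v):
    """Unnormalised DCT-II of a list of values."""
    n = len(v)
    return [sum(v[k] * math.cos(math.pi * (k + 0.5) * j / n) for k in range(n))
            for j in range(n)]

def frame_cepstrum(frame, depth=3, n_ceps=6, eps=1e-10):
    """Sub-band energies -> logarithm -> DCT, keeping the first n_ceps terms."""
    bands = wavelet_packet_bands(frame, depth)
    log_energy = [math.log(sum(c * c for c in band) + eps) for band in bands]
    return dct2(log_energy)[:n_ceps]

def cepstral_mean_normalise(frames):
    """Subtract the per-coefficient mean taken over all frames."""
    n, dim = len(frames), len(frames[0])
    means = [sum(f[j] for f in frames) / n for j in range(dim)]
    return [[f[j] - means[j] for j in range(dim)] for f in frames]

def extract_features(signal, frame_len=64, hop=32, depth=3, n_ceps=6):
    """Frame the signal, compute wavelet cepstra, then apply CMN."""
    frames = [signal[i:i + frame_len]
              for i in range(0, len(signal) - frame_len + 1, hop)]
    return cepstral_mean_normalise(
        [frame_cepstrum(f, depth, n_ceps) for f in frames])

# Usage on a synthetic two-tone signal (illustrative data only).
signal = [math.sin(0.3 * n) + 0.5 * math.sin(1.7 * n) for n in range(256)]
features = extract_features(signal)
```

Because the Haar filters are orthonormal, the sub-band energies sum to the frame energy, so the filter bank only redistributes energy across bands. A recogniser such as the HMM system described above would take the resulting per-frame vectors as its observations.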
About the journal
Journal: International Journal of Computer Aided Engineering and Technology
Publisher: Inderscience Publishers
Open Access: No