Analysis of spurious vowel-like regions (VLRs) detected by excitation source information
B. Dev Sarma,
This work treats vowels and semivowels as vowel-like regions (VLRs). It analyzes the spurious VLRs detected by a signal processing based method that uses excitation source information, and discusses the limitation of excitation information in rejecting some nasals and voiced consonants as non-VLRs. An attempt is made to reduce the spurious VLRs relative to the existing signal processing based method for VLR detection [1]. A multi-class statistical phone classifier is trained to classify speech into broad vowel, consonant and silence categories. The classifier outputs are suitably combined to obtain evidence for vowel-like regions, for different broad categories of consonants, and for silence regions. The output of the existing signal processing method is then compared with the evidence from the statistical method, and the spurious VLRs are eliminated using that evidence. Experimental studies conducted on the TIMIT and in-house databases demonstrate a significant reduction in spurious VLRs with a small loss in VLR detection performance. A net gain of 4.21% and 7.71% in frame error rate is achieved for the TIMIT and in-house databases, respectively. © 2013 IEEE.
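The abstract does not specify how the classifier evidence is combined with the signal processing output, so the following is only a minimal sketch of one plausible reading: a detected VLR is kept when the mean per-frame vowel-like evidence inside it reaches a threshold. The function names, the mean-over-segment rule, the 0.5 threshold and the toy numbers are all assumptions made for illustration; none of them come from the paper.

```python
from typing import List, Tuple

# (start_frame, end_frame) with end exclusive; a hypothetical representation.
Segment = Tuple[int, int]


def filter_spurious_vlrs(
    segments: List[Segment],
    vlr_evidence: List[float],
    threshold: float = 0.5,  # assumed value, not taken from the paper
) -> List[Segment]:
    """Keep a detected VLR only if its mean classifier evidence is high enough."""
    kept = []
    for start, end in segments:
        frames = vlr_evidence[start:end]
        if frames and sum(frames) / len(frames) >= threshold:
            kept.append((start, end))
    return kept


def segments_to_labels(segments: List[Segment], n_frames: int) -> List[bool]:
    """Expand segments into one VLR / non-VLR decision per frame."""
    labels = [False] * n_frames
    for start, end in segments:
        for i in range(start, min(end, n_frames)):
            labels[i] = True
    return labels


def frame_error_rate(reference: List[bool], hypothesis: List[bool]) -> float:
    """Percentage of frames whose VLR / non-VLR decision differs from the reference."""
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same length")
    if not reference:
        return 0.0
    errors = sum(r != h for r, h in zip(reference, hypothesis))
    return 100.0 * errors / len(reference)


# Toy example: the middle segment has low vowel-like evidence and is dropped.
evidence = [0.9, 0.8, 0.9, 0.1, 0.2, 0.1, 0.7, 0.9]
detected = [(0, 3), (3, 6), (6, 8)]
reference = [True, True, True, False, False, False, True, True]

kept = filter_spurious_vlrs(detected, evidence)
fer_before = frame_error_rate(reference, segments_to_labels(detected, 8))
fer_after = frame_error_rate(reference, segments_to_labels(kept, 8))
```

In this toy case, removing the low-evidence segment lowers the frame error rate. The trade-off described in the abstract would show up when a genuine VLR with weak evidence is also discarded, which is why the threshold would need tuning on held-out data.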
About the journal
Journal: 2013 Annual IEEE India Conference, INDICON 2013