Header menu link for other important links
X
Analysis of lombard and angry speech using Gaussian Mixture Models and KL divergence
S. Mittal, S. Vyas,
Published in Institute of Electrical and Electronics Engineers Inc.
2013
Abstract
Recognition of expressions from speech has emerged as an important research area in the recent past. However, the scientific community still faces problems in differentiating between angry and lombard speech. The objective of this work is to analyze the differences between the Lombard and angry speech using the features representing the excitation source of speech production. The instantaneous fundamental frequency, the strength of excitation and loudness measure, reflecting the sharpness of the impulse-like excitation around the epochs are used as excitation source features. The distributions curves of these three parameters are next plotted. We employ the concept of Gaussian Mixture Models (GMMs) and KL divergence (a measure of relative entropy) to calculate an exact measure of difference between angry, lombard and neutral speech with context to the aforementioned parameters and successfully show differences among the Lombard and angry speech signals at the excitation source level. © 2013 IEEE.
About the journal
JournalData powered by Typeset2013 National Conference on Communications, NCC 2013
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN15503607