Header menu link for other important links
X
Shouted/normal speech classification using speech-specific features
S. Baghel, B.K. Khonglah, , P. Guha
Published in Institute of Electrical and Electronics Engineers Inc.
2017
Pages: 1655 - 1659
Abstract
This work explores the classification of shouted speech from normal speech which are typical scenarios encountered in news debates. The shouted and normal speech are studied in terms of the characteristics of the excitation source, vocal tract system and long term information. The different behavior of these speech characteristics in the shouted and normal class motivates to define features which provide good classification for the two classes. The Peak to side lobe ratio (PSR) of Hilbert envelope (HE) of Linear Prediction (LP) residual represents the source feature and the sum of ten largest peaks of the spectrum represents the vocal tract shape feature. The modulation spectrum energy represents the long term feature which indicates the slowly varying temporal envelope components in speech. These features are non-linearly mapped and combined for threshold based classification. It is observed that the sum of ten largest spectral peaks performed the classification task with highest F-score for almost all thresholds. © 2016 IEEE.
About the journal
JournalData powered by TypesetIEEE Region 10 Annual International Conference, Proceedings/TENCON
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN21593442