Speech synthesis for glottal activity region processing
N. Adiga,
Published in Springer New York LLC
2019
Volume: 22
Issue: 1
Pages: 79 - 91
Abstract
The objective of this paper is to demonstrate the significance of combining the different features present in the glottal activity region for statistical parametric speech synthesis (SPSS). These features are broadly categorized as F0, system, and source features, which together represent the quality of speech. The F0 feature is computed using a zero-frequency filter, and the system feature is computed using a 2-D Riesz transform. The source features comprise an aperiodicity component and a phase component. The aperiodicity component, which represents the amount of aperiodicity present in a frame, is computed from the Riesz transform, whereas the phase component is computed by modeling the integrated linear prediction residual. The combined features yielded better quality than STRAIGHT-based SPSS in both objective and subjective evaluations. Further, the proposed method is extended to two Indian languages, Assamese and Manipuri, which show a similar improvement in quality. © 2018, Springer Science+Business Media, LLC, part of Springer Nature.
About the journal
Journal: International Journal of Speech Technology
Publisher: Springer New York LLC
ISSN: 1381-2416