Header menu link for other important links
X
Automatic pause marking for speech synthesis
L.G. Singh, N. Adiga, B. Sharma, S.R. Singh,
Published in Institute of Electrical and Electronics Engineers Inc.
2017
Volume: 2017-December
   
Pages: 1790 - 1794
Abstract
Accurate detection of pause boundary plays a major role in the segmentation of the speech corpus and improving the quality of speech synthesis. For pause modelling, we need to have pause tags in the training sentences. Manual tagging of pause is accurate but have the possibilities of missing out due to human error, and it is time-consuming. In this work, an automatic approach for marking the pause in the training corpus is proposed. During the training phase, after every word explicit pause (PAU) tags are added to represent a pause. Then, models for all phones including PAU are trained and re-alignment is performed. During re-alignment, each PAU boundary is corrected using three speech specific features namely, modulation spectrum energy, spectral peaks energy, and strength of excitation. The proposed approach gives a better result as compared to manual pause marking with less time complexity. It also improves the overall segmentation accuracy. The tagged label files are used for developing text-to-speech synthesis system using Hidden Markov Model based speech synthesis framework. Subjective evaluation is performed for various approaches used in tagging the pause. The experimental evaluation shows that accurate pause marking plays an important factor for improving the quality of synthesized speech in terms of naturalness and intelligibility. © 2017 IEEE.
About the journal
JournalData powered by TypesetIEEE Region 10 Annual International Conference, Proceedings/TENCON
PublisherData powered by TypesetInstitute of Electrical and Electronics Engineers Inc.
ISSN21593442