Phoneme classification in reconstructed phase space with convolutional neural networks

R. John Wesley; Nayeemulla Khan; A. Shahina

doi:10.1016/j.patrec.2020.05.002

Profiles Research Units Publications

Articles

Phoneme classification in reconstructed phase space with convolutional neural networks

R. John Wesley, , A. Shahina

Published in Elsevier B.V.

2020

DOI: 10.1016/j.patrec.2020.05.002

Volume: 135

Pages: 299 - 306

Abstract

In this paper, we analyse segmented speech phonemes with Convolutional filters, after embedding them in Reconstructed Phase Space (RPS). These feature extracting Convolutional filters are trained on the embedded speech data from scratch and are also fine-tuned from networks trained with other data. Reconstruction of Phase Space portrays the dynamics of an observed system as a geometric representation. We present a study highlighting the discriminative capacity of the features extracted through Convolutional Neural Network (CNN) from the textural pattern and shape of this geometric representation. CNNs are heavily used in image-related tasks, but have not seen application on phase space portraits, possibly due to the higher dimensionality of the embedding. However, we find that the application of CNN on restricted bi-dimensional RPS, characterizes the space well than prior methods on high dimensional embeddings. We show experimental results supporting the use of RPS with CNN (RPS-CNN) for phoneme classification. The results affirm that essential signal characteristics are automatically quantified from the phase portraits of speech and can be used in place of conventional techniques involving frequency domain transformations. © 2020

About the journal

Journal	Data powered by TypesetPattern Recognition Letters
Publisher	Data powered by TypesetElsevier B.V.
ISSN	01678655

Authors (1)

Nayeemulla Khan

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT