Header menu link for other important links
Wavenet based speech recognition system on an embedded board
Published in Blue Eyes Intelligence Engineering and Sciences Publication
Volume: 7
Issue: 6
Pages: 923 - 927
Speech Recognition is a vital and indispensable part of many modern-day systems and has subsequently become ubiquitous, finding diverse applications in fields such as automation, voice control, security, robotics etc. This paper aims to demonstrate implementation of an isolated spoken word recognition based on WAVENETs on an embedded board using an open-source numerical computing software called GNU Octave. WaveNet is an Artificial Neural Network (ANN) with wavelet function as an activation function. In this work, Gaussian wavelet is chosen as an activation function. The speech recognition involves the use of Mel-Frequency Cepstral Coefficients (MFCC) features which are calculated from the speech signal and fed as input to the NN. The Multi-Layer Perceptron (MLP) Feed Forward Neural Network is configured to process speech signal and is trained using back-propagation algorithm in MATLAB. The trained weights are then fed into and implemented using GNU Octave on Raspberry Pi. Texas Instruments’ TIDIGITS Corpus is used for training and testing the neural network. © BEIESP.
About the journal
JournalInternational Journal of Recent Technology and Engineering
PublisherBlue Eyes Intelligence Engineering and Sciences Publication