A Survey on Visual Speech Recognition Approaches

N. Radha; A. Shahina; Nayeemulla Khan

doi:10.1109/ICAIS50930.2021.9395878

Profiles Research Units Publications

Conferences

A Survey on Visual Speech Recognition Approaches

N. Radha, A. Shahina,

Published in Institute of Electrical and Electronics Engineers Inc.

2021

DOI: 10.1109/ICAIS50930.2021.9395878

Pages: 934 - 939

Abstract

The robustness of automatic speech recognition (ASR) systems degrade due to the factors such as environmental noises, speaker variability, and channel distortion, among others. The approaches such as speech signal processing, model adaptation, hybrid techniques and integration of multiple sources are used for ASR system development. This paper focuses on building a robust ASR system by combining the complementary evidence present is the multiple modalities through which speech is expressed. Speech sounds are produced with lip radiation accompanied lip movements called Visual Speech Recognition (VSR). VSR system converts lip movement into spoken words. This system consists of lip region detection, visual speech feature extraction method and modeling techniques. Robust feature extraction from visual lip movement is a challenging task in VSR system. Hence, this paper reviews the feature extraction methods and existing databases used for VSR system. The fusion of visual lip movements with ASR system at different levels is also presented. © 2021 IEEE.

Topics: Feature extraction (52)%

View more info for "A Survey on Visual Speech Recognition Approaches"

About the journal

Journal	Data powered by TypesetProceedings - International Conference on Artificial Intelligence and Smart Systems, ICAIS 2021
Publisher	Data powered by TypesetInstitute of Electrical and Electronics Engineers Inc.

Authors (1)

Nayeemulla Khan

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT