An Extensive Analysis of the Vision-based Deep Learning Techniques for Action Recognition

R. Manasa; R. Shukla; Saranya  D

doi:10.14569/IJACSA.2021.0120276

Profiles Research Units Publications

Articles

An Extensive Analysis of the Vision-based Deep Learning Techniques for Action Recognition

R. Manasa, R. Shukla,

Published in Science and Information Organization

2021

DOI: 10.14569/IJACSA.2021.0120276

Volume: 12

Issue: 2

Pages: 604 - 611

Abstract

Action recognition involves the idea of localizing and classifying actions in a video over a sequence of frames. It can be thought of as an image classification task extended temporally. The information obtained over the multitude of frames is aggregated to comprehend the action classification output. Applications of action recognition systems range from assistance for healthcare systems to human-machine interaction. Action recognition has proven to be a challenging task as it poses many impediments including high computation cost, capturing extended context, designing complex architectures, and lack of benchmark datasets. Increasing the efficiency of algorithms in human action recognition can significantly improve the probability of implementing it in real-world scenarios. This paper has summarized the evolution of various action localization, classification, and detection algorithms applied to data from vision-based sensors. We have also reviewed the datasets that have been used for the action classification, localization, and detection process. We have further explored the areas of action classification, temporal and spatiotemporal action detection, which use convolution neural networks, recurrent neural networks, or a combination of both. © 2021. All Rights Reserved.