A Light Implementation of a 3D Convolutional Network for Online Gesture Recognition

Authors

  • Fabio Brandolt Baldissera PUCRS
  • Fabian Vargas Catholic University (PUCRS)

Keywords:

Gesture recognition, Online Classification, 3DCNN

Abstract

With the advancement of machine learning techniques and the increased accessibility to computing power, Artificial Neural Networks (ANNs) have achieved state-of-the-art results in image classification and, most recently, in video classification. The possibility of gesture recognition from a video source enables a more natural non-contact human-machine interaction, immersion when interacting in virtual reality environments and can even lead to sign language translation in the near future. However, the techniques utilized in video classification are usually computationally expensive, being prohibitive to conventional hardware. This work aims to study and analyze the applicability of continuous online gesture recognition techniques for embedded systems. This goal is achieved by proposing a new model based on 2D and 3D CNNs able to perform online gesture recognition, i.e. yielding a label while the video frames are still being processed, in a predictive manner, before having access to future frames of the video. This technique is of paramount interest to applications in which the video is being acquired concomitantly to the classification process and the issuing of the labels has a strict deadline. The proposed model was tested against three representative gesture datasets found in the literature. The obtained results suggest the proposed technique improves the state-of-the-art by yielding a quick gesture recognition process while presenting a high accuracy, which is fundamental for the applicability of embedded systems.

Downloads

Download data is not yet available.

Author Biography

Fabian Vargas, Catholic University (PUCRS)

Vargas obtained his Ph.D. Degree in Microelectronics from the Institut National Polytechnique de Grenoble (INPG), France, in 1995. At present, he is Full Professor at the Catholic University (PUCRS), Brazil.
His main research domains involve the HW-SW co-design of system-on-chip (SoC) and embedded systems having in mind test, fault-tolerance and reliability considerations for critical applications. Prof. Vargas is researcher of the Brazilian National Science Foundation (CNPq) since 1996. He received the Meritorious Service Award of the IEEE Computer Society for co-founding and chairing the IEEE Latin American Regional TTTC Group and the IEEE Latin American Test Symposium (LATS). Prof. Vargas is Senior Member of IEEE and Golden Core Member of the IEEE Computer Society.

Published

2020-03-21

How to Cite

Brandolt Baldissera, F., & Vargas, F. (2020). A Light Implementation of a 3D Convolutional Network for Online Gesture Recognition. IEEE Latin America Transactions, 18(2), 319–326. Retrieved from https://latamt.ieeer9.org/index.php/transactions/article/view/2920

Issue

Section

Special Isssue on Embedded Systems