PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY

 

e-ISSN 2231-8526
ISSN 0128-7680

Home / Regular Issue / JST Vol. 33 (S2) 2025 / JST(S)-0677-2024

 

Dynamic Hand Gesture Recognition by Hand Landmark Classification using Long Short-Term Memory

Khawaritzmi Abdallah Ahmad, Takahiro Higashi and Kaori Yoshida

Pertanika Journal of Science & Technology, Volume 33, Issue S2, December 2025

DOI: https://doi.org/10.47836/pjst.33.S2.05

Keywords: Classification, dynamic hand gesture, human-computer interaction, long short-term memory

Published on: 2025-02-25

Hand gestures are a valuable modality for human-computer interaction, conveying information that can be used as input. Dynamic hand gestures, prevalent in real-world scenarios, necessitate considering temporal factors such as gesture initiation, termination, and frame sequence. A Long Short-Term Memory (LSTM) based recognition model was proposed to address this challenge. Data availability for dynamic hand gesture research is a significant hurdle. The dataset introduced by Fronteddu et al. provides 27 classes of dynamic hand gestures, serving as a suitable training resource. MediaPipe Hands, a computer vision framework, was leveraged to extract keypoints from each frame, capturing spatial features fed into the LSTM model. Experiments were conducted to determine the optimal dropout rate for the LSTM model. Results indicated that a dropout rate of 70% yielded the highest accuracy, achieving up to 98.53% validation accuracy and 99.71% test accuracy. These findings demonstrate the effectiveness of the proposed LSTM-based recognition model for dynamic hand gestures. Future research could explore integrating other deep learning techniques, such as attention mechanisms, to enhance the accuracy and robustness of dynamic hand gesture recognition systems. Additionally, investigating the application of the proposed model in real-world scenarios, such as virtual and augmented reality, would be valuable in assessing its practical utility.

  • Ahmad, K. A., Silpani, D. C., & Yoshida, K. (2022). Hand gesture recognition by hand landmark classification. International Symposium on Affective Science and Engineering, 2022, Article 8. https://doi.org/10.5057/isase.2022-C000026

    Ahmad, K. A., Silpani, D. C., & Yoshida, K. (2023). The impact of large sample datasets on hand gesture recognition by hand landmark classification. International Journal of Affective Engineering, 22(3), 253–259. https://doi.org/10.5057/ijae.ijae-d-22-00022

    Camargo, V. P. D., Balancieri, R., Teixeira, H. M., & Guerino, G. C. (2021, October 18-22). Touchless modalities of human-computer interaction in Hospitals. [Paper presentation]. Proceedings of the XX Brazilian Symposium on Human Factors in Computing Systems, Sao Paulo, Brazil. https://doi.org/10.1145/3472301.3484328

    Fang, Y., Wang, K., Cheng, J., & Lu, H. (2007, July 2-5). A real-time hand gesture recognition method. [Paper presentation]. IEEE International Conference on Multimedia and Expo, Beijing, China. https://doi.org/10.1109/icme.2007.4284820

    Fronteddu, G., Porcu, S., Floris, A., & Atzori, L. (2022). A dynamic hand gesture recognition dataset for human-computer interfaces. Computer Networks, 205, Article 108781. https://doi.org/10.1016/j.comnet.2022.108781

    Hakim, N. L., Shih, T. K., Kasthuri Arachchi, S. P., Aditya, W., Chen, Y. C., & Lin, C. Y. (2019). Dynamic hand gesture recognition using 3DCNN and LSTM with FSM context-aware model. Sensors, 19(24), Article 5429. https://doi.org/10.3390/s19245429

    Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735

    Sharma, R. P., & Verma, G. K. (2015). Human computer interaction using hand gesture. Procedia Computer Science, 54, 721–727. https://doi.org/10.1016/j.procs.2015.06.085

    Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1), 1929-1958.

    Rehman, M. U., Ahmed, F., Attique Khan, M., Tariq, U., Alfouzan, F. A., Alzahrani, N. M., & Ahmad, J. (2021). Dynamic hand gesture recognition using 3D-CNN and LSTM networks. Computers, Materials & Continua, 70(3), Article 4676. https://doi.org/10.32604/cmc.2022.019586

    Zhang, F., Bazarevsky, V., Tkachenka, A., Sung, G., Chang, C. L., & Grundmann, M. (2020). Media pipe hands: On-device real-time hand tracking. arXiv. https://doi.org/10.48550/arXiv.2006.10214

ISSN 0128-7680

e-ISSN 2231-8526

Article ID

JST(S)-0677-2024

Download Full Article PDF

Share this article

Related Articles