Vision-Based Pattern Recognition Models for Intelligent Human Robot Interaction in Smart Spaces

Authors

DOI:

https://doi.org/10.33050/italic.v4i2.1101

Keywords:

Pattern Recognition, Human Robot Interaction, Smart Spaces, CNN Transformer, Gesture Recognition systems

Abstract

The rapid expansion of smart spaces has increased the need for robotic systems capable of interpreting visual cues, recognizing human behavior, and responding safely in real time. However, existing vision-based models often struggle with occlusion, lighting variation, latency constraints, and limited contextual understanding in dynamic human-centered environments. This study develops a hybrid vision-based pattern recognition framework that integrates Convolutional Neural Networks (CNNs), Transformer-based attention mechanisms, multi-scale feature fusion, supervised learning, and reinforcement learning. The model is trained and validated using publicly available human–robot interaction datasets and simulated smart space scenarios involving gesture recognition, object detection, activity recognition, and intention prediction. The objective is to enhance intelligent human–robot interaction by improving visual perception accuracy, contextual interpretation, adaptive decision-making, and real-time responsiveness in smart environments. The proposed framework achieves stronger performance than baseline CNN-only and Vision Transformer models, with improved accuracy in gesture recognition, object detection, activity recognition, and intention prediction while maintaining low-latency inference suitable for real-time robotic interaction. The model also demonstrates better adaptability under dynamic lighting, occlusion, and multi-person interaction scenarios. This study concludes that combining CNN-based local feature extraction, Transformer-based global attention, and reinforcement learning-based policy optimization provides a reliable, adaptive, and context-aware framework for intelligent robotic systems. The findings support safer and more efficient human–robot collaboration in healthcare, smart homes, collaborative workplaces, and smart city environments.

References

[1] B. Liu, L. Yu, C. Che, Q. Lin, H. Hu, and X. Zhao, “Integration and performance analysis of artificial intelligence and computer vision based on deep learning algorithms,” arXiv preprint arXiv:2312.12872, 2023.

[2] T. L. Anita, R. G. Freedy, N. Silawati, and M. Sunengsih, “Mathematical logic as a foundation for ai driven decision-making systems,” Sundara Advanced Research on Artificial Intelligence, vol. 2, no. 1, pp. 14–25, 2026.

[3] F. Liu, Z. Lu, and X. Lin, “Vision-based environmental perception for autonomous driving,” Proceedings of the Institution of Mechanical Engineers, Part D: Journal of Automobile Engineering, vol. 239, no. 1, pp. 39–69, 2025.

[4] K. Karnawati, D. N. Ramadhan, T. L. Anita, R. Nurmala, and L. Maria, “Designing inclusive companion robots to mitigate bias and enhance empathy in ai-driven care systems,” Journal of Orange Technology, vol. 2, no. 2, pp. 83–92, 2026.

[5] M. R. Sadik, R. I. Sony, N. N. I. Prova, Y. Mahanandi, A. Al Maruf, S. H. Fahim, and M. S. Islam, “Computer vision based bangla sign language recognition using transfer learning,” in 2024 Second International Conference on Data Science and Information System (ICDSIS). IEEE, 2024, pp. 1–7.

[6] Q. H. Hidayah, R. P. Laksana, A. Prabowo, and D. S. Simatupang, “Iot-based home security system: Esp32-cam integration and real-time notification,” International Transactions on Education Technology (ITEE), vol. 4, no. 2, pp. 149–161, 2026.

[7] A. Ismail, A. R. Ismail, N. A. Shaharuddin, M. A. Ara, A. A. Puzi, S. Awang, and R. Ramli, “Vision based vehicle classification for smart city,” APTISI Transactions on Technopreneurship, vol. 7, no. 2, pp. 441–453, 2025.

[8] N. Hussain and G. A. Pangilinan, “Robotics and automation with artificial intelligence: improving efficiency and quality,” Aptisi Transactions on Technopreneurship (ATT), vol. 5, no. 2, pp. 176–189, 2023.

[9] P. Wei, D. Ahmedt-Aristizabal, H. Gammulle, S. Denman, and M. A. Armin, “Vision-based activity recognition in children with autism-related behaviors,” Heliyon, vol. 9, no. 6, 2023.

[10] P. Costa, J. Ferdiansyah, and H. D. Ariessanti, “Integrating artificial intelligence for autonomous navigation in robotics,” International Transactions on Artificial Intelligence, vol. 3, no. 1, pp. 64–75, 2024.

[11] S. A. Singh, A. S. Kumar, and K. Desai, “Comparative assessment of common pre-trained cnns for vision based surface defect detection of machined components,” Expert Systems with Applications, vol. 218, p. 119623, 2023.

[12] E. Baydeniz, “The age of artificial intelligence: The effect of technology-organization-environment on intention to adopt robotic technologies in hotel businesses,” Indonesian Journal of Sustainability Accounting and Management, vol. 7, no. 2, pp. 380–396, 2023.

[13] J. Wang, P. Gao, J. Zhang, C. Lu, and B. Shen, “Knowledge augmented broad learning system for computer vision based mixed-type defect detection in semiconductor manufacturing,” Robotics and Computer-Integrated Manufacturing, vol. 81, p. 102513, 2023.

[14] D. Y. Kristiyanto, H. D. Purnomo, G. P. Cesna, and N. Ani, “The strategic role of orange technologyin cultivating innovation and well-being,” IAIC Transactions on Sustainable Digital Innovation (ITSDI), vol. 7, no. 1, pp. 27–37, 2025.

[15] S. McCall, S. S. Kolawole, A. Naz, L. Gong, S. W. Ahmed, P. S. Prasad, M. Yu, J. Wingate, and S. P. Ardakani, “Computer vision based transfer learning-aided transformer model for fall detection and prediction,” IEEE Access, vol. 12, pp. 28 798–28 809, 2024.

[16] D. Jonas, E. Maria, I. R. Widiasari, U. Rahardja, T. Wellem et al., “Design of a tam framework with emotional variables in the acceptance of health-based iot in indonesia,” ADI Journal on Recent Innovation, vol. 5, no. 2, pp. 146–154, 2024.

[17] W. Yu and M. Nishio, “Multilevel structural components detection and segmentation toward computer vision-based bridge inspection,” Sensors, vol. 22, no. 9, p. 3502, 2022.

[18] A. Rizky, N. Lutfiani, W. S. Mariyati, A. A. Sari, and K. R. Febrianto, “Decentralization of information using blockchain technology on mobile apps e-journal,” Blockchain Frontier Technology, vol. 1, no. 2, pp. 1–10, 2022.

[19] S. B. Alotaibi and S. Manimurugan, “Humanoid robotic system for social interaction using deep imitation learning in a smart city environment,” Frontiers in Sustainable Cities, vol. 4, p. 1076101, 2022.

[20] U. Rahardja, Q. Aini, A. S. Bist, S. Maulana, and S. Millah, “Examining the interplay of technology readiness and behavioural intentions in health detection safe entry station,” JDM (Jurnal Dinamika Manajemen), vol. 15, no. 1, pp. 125–143, 2024.

[21] T. Wang, P. Zheng, S. Li, and L. Wang, “Multimodal human–robot interaction for human-centric smart manufacturing: a survey,” Advanced Intelligent Systems, vol. 6, no. 3, p. 2300359, 2024.

[22] N. P. L. Santoso, B. Rawat, S. R. Ratri, D. Danang, D. F. C. Kumoro, R. Supriati, and E. A. Natalia, “Transformation of indonesian language in social media using ai expert systems and machine learning,” International Transactions on Artificial Intelligence, vol. 3, no. 2, pp. 130–139, 2025.

[23] H.-H. Wei, Y. Zhang, X. Sun, J. Chen, and S. Li, “Intelligent robots and human-robot collaboration in the construction industry: A review,” Journal of Intelligent Construction, vol. 1, no. 1, pp. 1–12, 2023.

[24] R. Raffik, R. R. Sathya, V. Vaishali, S. Balavedhaa et al., “Industry 5.0: Enhancing human-robot collaboration through collaborative robots–a review,” in 2023 2nd international conference on advancements in electrical, electronics, communication, computing and automation (ICAECA). IEEE, 2023, pp. 1–6.

[25] O. Jayanagara and A. Patricia, “Analyzing healthpreneur determinants for low-socioeconomic ethnic families,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 3, pp. 738–750, 2025.

[26] K. Madzharova-Atanasova and N. Shakev, “Intelligence in human-robot collaboration–overview, challenges and directions,” in 2023 International Conference Automatics and Informatics (ICAI). IEEE, 2023, pp. 190–194.

[27] D. Septyawati, S. Suroso, S. Bhupathiraju, C. T. Hua, and A. Fitriani, “Blockchain technology integration for enhancing security and reliability in modern information systems,” International Transactions on Artificial Intelligence, vol. 4, no. 1, pp. 95–104, 2025.

[28] Q. Zhu, S. Huang, G. Wang, S. K. Moghaddam, Y. Lu, and Y. Yan, “Dynamic reconfiguration optimization of intelligent manufacturing system with human-robot collaboration based on digital twin,” Journal of Manufacturing Systems, vol. 65, pp. 330–338, 2022.

[29] S. Baltasar and T. Marbun, “The role of artificial intelligence in human capital management: A review at pt. pos indonesia,” International Journal of Cyber and IT Service Management (IJCITSM), vol. 5, no. 1, pp. 31–44, 2025.

[30] Government of the Republic of Indonesia, “Presidential Regulation of the Republic of Indonesia Number 132 of 2022 concerning the National Architecture of Electronic-Based Government Systems,” Jakarta, Indonesia, 2022, national architecture for electronic-based government systems and digital service integration. [Online]. Available: https://peraturan.bpk.go.id/Details/233483/perpres-no-132-tahun-2022

[31] L. Larisang, S. Sanusi, M. A. Bora, and A. Hamid, “Practicality and effectiveness of new technopreneurship incubator model in the digitalization era,” Aptisi Transactions on Technopreneurship (ATT), vol. 7, no. 2, pp. 318–333, 2025.

[32] Government of the Republic of Indonesia, “Presidential Regulation of the Republic of Indonesia Number 82 of 2023 concerning the Acceleration of Digital Transformation and Integration of National Digital Services,” Jakarta, Indonesia, 2023, regulation supporting national digital transformation and integrated digital public services. [Online]. Available: https://peraturan.bpk.go.id/Details/273981/perpres-no-82-tahun-2023

Downloads

Published

2026-05-28

Issue

Section

Articles