Journal of Machine and Computing


Video Face Tracking for IoT Big Data using Improved Swin Transformer based CSA Model



Journal of Machine and Computing

Received On : 15 May 2023

Revised On : 26 September 2023

Accepted On : 07 January 2024

Published On : 05 April 2024

Volume 04, Issue 02

Pages :308-316


Abstract


Even though Convolutional Neural Networks (CNNs) have greatly improved face-related algorithms, it is still difficult to keep both accuracy and efficiency in real-world applications. The most cutting-edge approaches use deeper networks to improve performance, but the increased computing complexity and number of parameters make them impractical for usage in mobile applications. To tackle these issues, this article presents a model for object detection that combines Deeplabv3+ with Swin transformer, which incorporates GLTB and Swin-Conv-Dspp (SCD). To start with, in order to lessen the impact of the hole phenomena and the loss of fine-grained data, we employ the SCD component, which is capable of efficiently extracting feature information from objects at various sizes. Secondly, in order to properly address the issue of challenging object recognition due to occlusion, the study builds a GLTB with a spatial pyramid pooling shuffle module. This module allows for the extraction of important detail information from the few noticeable pixels of the blocked objects. Crocodile search algorithm (CSA) enhances classification accuracy by properly selecting the model's fine-tuning. On a benchmark dataset known as WFLW, the study experimentally validates the suggested model. Compared to other light models, the experimental findings show that it delivers higher performance with significantly fewer parameters and reduced computing complexity.


Keywords


Convolutional Neural Networks; Crocodile Search Algorithm; Global Local Transformer Block; Face Tracking; Spatial Pyramid Pooling Shuffle Module.


  1. X. Liu., “Collaborative Edge Computing With FPGA-Based CNN Accelerators for Energy-Efficient and Time-Aware Face Tracking System,” IEEE Transactions on Computational Social Systems, vol. 9, no. 1, pp. 252–266, Feb. 2022, doi: 10.1109/tcss.2021.3059318.
  2. M. Kumar, K. S. Raju, D. Kumar, N. Goyal, S. Verma, and A. Singh, “An efficient framework using visual recognition for IoT based smart city surveillance,” Multimedia Tools and Applications, vol. 80, no. 20, pp. 31277–31295, Jan. 2021, doi: 10.1007/s11042-020-10471-x.
  3. S. Jha, C. Seo, E. Yang, and G. P. Joshi, “Real time object detection and trackingsystem for video surveillance system,” Multimedia Tools and Applications, vol. 80, no. 3, pp. 3981–3996, Sep. 2020, doi: 10.1007/s11042-020-09749-x.
  4. A. K. Biswal, D. Singh, B. K. Pattanayak, D. Samanta, and M.-H. Yang, “IoT-Based Smart Alert System for Drowsy Driver Detection,” Wireless Communications and Mobile Computing, vol. 2021, pp. 1–13, Mar. 2021, doi: 10.1155/2021/6627217.
  5. S. Meivel et al., “Mask Detection and Social Distance Identification Using Internet of Things and Faster R-CNN Algorithm,” Computational Intelligence and Neuroscience, vol. 2022, pp. 1–13, Feb. 2022, doi: 10.1155/2022/2103975.
  6. M. F. Alotaibi, M. Omri, S. Abdel-Khalek, E. Khalil, and R. F. Mansour, “Computational Intelligence-Based Harmony Search Algorithm for Real-Time Object Detection and Tracking in Video Surveillance Systems,” Mathematics, vol. 10, no. 5, p. 733, Feb. 2022, doi: 10.3390/math10050733.
  7. T. A. Kumar, R. Rajmohan, M. Pavithra, S. A. Ajagbe, R. Hodhod, and T. Gaber, “Automatic Face Mask Detection System in Public Transportation in Smart Cities Using IoT and Deep Learning,” Electronics, vol. 11, no. 6, p. 904, Mar. 2022, doi: 10.3390/electronics11060904.
  8. S. Liu, X. Liu, S. Wang, and K. Muhammad, “Fuzzy-aided solution for out-of-view challenge in visual tracking under IoT-assisted complex environment,” Neural Computing and Applications, vol. 33, no. 4, pp. 1055–1065, May 2020, doi: 10.1007/s00521-020-05021-3.
  9. B. Varshini, H. Yogesh, S. D. Pasha, M. Suhail, V. Madhumitha, and A. Sasi, “IoT-Enabled smart doors for monitoring body temperature and face mask detection,” Global Transitions Proceedings, vol. 2, no. 2, pp. 246–254, Nov. 2021, doi: 10.1016/j.gltp.2021.08.071.
  10. M. Geetha, R. S. Latha, S. K. Nivetha, S. Hariprasath, S. Gowtham, and C. S. Deepak, “Design of face detection and recognition system to monitor students during online examinations using Machine Learning algorithms,” 2021 International Conference on Computer Communication and Informatics (ICCCI), Jan. 2021, doi: 10.1109/iccci50826.2021.9402553.
  11. X. Zhou, X. Xu, W. Liang, Z. Zeng, and Z. Yan, “Deep-Learning-Enhanced Multitarget Detection for End–Edge–Cloud Surveillance in Smart IoT,” IEEE Internet of Things Journal, vol. 8, no. 16, pp. 12588–12596, Aug. 2021, doi: 10.1109/jiot.2021.3077449.
  12. A. F. Klaib, N. O. Alsrehin, W. Y. Melhem, H. O. Bashtawi, and A. A. Magableh, “Eye tracking algorithms, techniques, tools, and applications with an emphasis on machine learning and Internet of Things technologies,” Expert Systems with Applications, vol. 166, p. 114037, Mar. 2021, doi: 10.1016/j.eswa.2020.114037.
  13. R. Ullah et al., “A Real-Time Framework for Human Face Detection and Recognition in CCTV Images,” Mathematical Problems in Engineering, vol. 2022, pp. 1–12, Mar. 2022, doi: 10.1155/2022/3276704.
  14. M. K. Hasan, Md. S. Ahsan, Abdullah-Al-Mamun, S. H. S. Newaz, and G. M. Lee, “Human Face Detection Techniques: A Comprehensive Review and Future Research Directions,” Electronics, vol. 10, no. 19, p. 2354, Sep. 2021, doi: 10.3390/electronics10192354.
  15. M. B. Satrio, A. G. Putrada, and M. Abdurohman, “Evaluation of Face Detection and Recognition Methods in Smart Mirror Implementation,” Lecture Notes in Networks and Systems, pp. 449–457, Sep. 2021, doi: 10.1007/978-981-16-2380-6_39.
  16. B. B. . Reddy, “Classification Approach for Face Spoof Detection in Artificial Neural Network Based on IoT Concepts”, Int J Intell Syst Appl Eng, vol. 12, no. 13s, pp. 79–91, Jan. 2024.
  17. A. Medjdoubi, M. Meddeber, and K. Yahyaoui, “Smart City Surveillance: Edge Technology Face Recognition Robot Deep Learning Based,” International Journal of Engineering, vol. 37, no. 1, pp. 25–36, 2024, doi: 10.5829/ije.2024.37.01a.03.
  18. M. Ali, A. Diwan, and D. Kumar, “Attendance System Optimization through Deep Learning Face Recognition,” International Journal of Computing and Digital Systems, vol. 15, no. 1, pp. 1527–1540, Apr. 2024, doi: 10.12785/ijcds/1501108.
  19. S. Biswas, T. Saha, P. Banerjee, and S. Datta, “A Novel Facial Emotion Recognition Technique using Convolution Neural Network,” Heterogenous Computational Intelligence in Internet of Things, pp. 175–195, Sep. 2023, doi: 10.1201/9781003363606-12.
  20. Jayabharathi Ponnurathinam and Sripriya Pradabadattan, “A Novel Approach for Human Face Extraction and Detection using SAE-AFB-RFCN Framework,” Journal of Advanced Research in Applied Sciences and Engineering Technology, vol. 34, no. 1, pp. 51–62, Nov. 2023, doi: 10.37934/araset.34.1.5162.
  21. M. D. R, A. Thirumalraj, and R. T, “An Improved ARO Model for Task Offloading in Vehicular Cloud Computing in VANET,” Aug. 2023, doi: 10.21203/rs.3.rs-3291507/v1.
  22. A. Thirumalraj, A. K, R. V, and P. K. Balasubramanian, “Designing a Modified Grey Wolf Optimizer Based Cyclegan Model for Eeg Mi Classification in Bci,” 2023, doi: 10.2139/ssrn.4642989.
  23. W. Wu, C. Qian, S. Yang, Q. Wang, Y. Cai, and Q. Zhou, “Look at Boundary: A Boundary-Aware Face Alignment Algorithm,” 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun. 2018, doi: 10.1109/cvpr.2018.00227.

Acknowledgements


Authors thank Reviewers for taking the time and effort necessary to review the manuscript.


Funding


No funding was received to assist with the preparation of this manuscript.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


Data sharing is not applicable to this article as no new data were created or analysed in this study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Anbumani K, Cuddapah Anitha, Achuta Rao S V, Praveen Kumar K, Meganathan Ramasamy and Mahaveerakannan R, “Video Face Tracking for IoT Big Data using Improved Swin Transformer based CSA Model”, Journal of Machine and Computing, pp. 308-316, April 2024. doi: 10.53759/7669/jmc202404029.


Copyright


© 2024 Anbumani K, Cuddapah Anitha, Achuta Rao S V, Praveen Kumar K, Meganathan Ramasamy and Mahaveerakannan R. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.