Journal of Machine and Computing


Combined Feature Set with Logistic Regression Model to Detect Credit Card Frauds in Real Time Applications



Journal of Machine and Computing

Received On : 25 December 2023

Revised On : 27 April 2024

Accepted On : 28 June 2024

Published On : 05 July 2024

Volume 04, Issue 03

Pages : 804-812


Abstract


Online payment methods are gaining popularity and are widely used, both in-store and online. Because to the Internet and smart mobile devices, conducting such transactions is quick, simple, and stress-free. However, online payment fraud is common due to the open nature of the internet, which allows criminals to use techniques such as eavesdropping, phishing, infiltration, denial-of-service, database theft, and man-in-the-middle assault. Online payment fraud is on the rise, and it is a big contributor to global economic losses. Financial services, healthcare, insurance, and other industries have long been plagued by fraud. Online fraud has developed in tandem with the use of digital payment systems such as credit/debit cards, PhonePe, Gpay, and Paytm. Furthermore, fraudsters and criminals are adept at evasion strategies, allowing them to steal more. Developing a secure system for client authentication and fraud protection is tough since there is always a workaround. This means that fraud detection systems play an important role in preventing financial crimes. Over time, victims of internet transaction fraud have incurred tremendous financial losses. The growth of cutting-edge technologies and global connection has led to a surge in online fraud. To reduce these expenses, it is critical to develop effective fraud detection systems. Machine learning and statistical tools make detecting dishonest money deals much easier. The scarcity of data, the sensitive nature of the data, and the uneven class distributions make it challenging to implement efficient fraud detection models. Given the delicate nature of the information, it is difficult to draw conclusions and construct more accurate models. This study offers a Linked Feature Set with Combined Feature Set with Logistic Regression (CFS-LoR) Model for accurate detection of online payment frauds. In comparison to extant models, the proposed model exhibits a highly accurate detection capability.


Keywords


Logistic Regression, Online Payment Fraud, Machine Learning, Feature Subset, Detection.


  1. S. K. Hashemi, S. L. Mirtaheri and S. Greco, "Fraud Detection in Banking Data by Machine Learning Techniques," in IEEE Access, vol. 11, pp. 3034-3043, 2023, doi: 10.1109/ACCESS.2022.3232287.
  2. M. Grossi et al., "Mixed Quantum–Classical Method for Fraud Detection With Quantum Feature Selection," in IEEE Transactions on Quantum Engineering, vol. 3, pp. 1-12, 2022, Art no. 3102812, doi: 10.1109/TQE.2022.3213474.
  3. H. Wang, W. Wang, Y. Liu and B. Alidaee, "Integrating Machine Learning Algorithms With Quantum Annealing Solvers for Online Fraud Detection," in IEEE Access, vol. 10, pp. 75908-75917, 2022, doi: 10.1109/ACCESS.2022.3190897.
  4. E. Ileberi, Y. Sun and Z. Wang, "Performance Evaluation of Machine Learning Methods for Credit Card Fraud Detection Using SMOTE and AdaBoost," in IEEE Access, vol. 9, pp. 165286-165294, 2021, doi: 10.1109/ACCESS.2021.3134330.
  5. S. A. Ebiaredoh-Mienye, E. Esenogho, and T. G. Swart, “Artificial neural network technique for improving prediction of credit card default: A stacked sparse autoencoder approach,” International Journal of Electrical and Computer Engineering (IJECE), vol. 11, no. 5, p. 4392, Oct. 2021, doi: 10.11591/ijece. v11i5.pp4392-4402.
  6. E. U. Savona and M. Riccardi, “Assessing the risk of money laundering: research challenges and implications for practitioners,” European Journal on Criminal Policy and Research, vol. 25, no. 1, pp. 1–4, Mar. 2019, doi: 10.1007/s10610-019-09409-3.
  7. H. Zhu, G. Liu, M. Zhou, Y. Xie, A. Abusorrah, and Q. Kang, “Optimizing Weighted Extreme Learning Machines for imbalanced classification and application to credit card fraud detection,” Neurocomputing, vol. 407, pp. 50–62, Sep. 2020, doi: 10.1016/j.neucom.2020.04.078.
  8. T. Zhang, K. Zhu, and D. Niyato, “A Generative Adversarial Learning-Based Approach for Cell Outage Detection in Self-Organizing Cellular Networks,” IEEE Wireless Communications Letters, vol. 9, no. 2, pp. 171–174, Feb. 2020, doi: 10.1109/lwc.2019.2947041.
  9. P. Zhang, S. Shu, and M. Zhou, “An online fault detection model and strategies based on SVM-grid in clouds,” IEEE/CAA Journal of Automatica Sinica, vol. 5, no. 2, pp. 445–456, Mar. 2018, doi: 10.1109/jas.2017.7510817.
  10. H. Liu, M. Zhou, and Q. Liu, “An embedded feature selection method for imbalanced data classification,” IEEE/CAA Journal of Automatica Sinica, vol. 6, no. 3, pp. 703–715, May 2019, doi: 10.1109/jas.2019.1911447.
  11. Q. Kang, L. Shi, M. Zhou, X. Wang, Q. Wu, and Z. Wei, “A Distance-Based Weighted Undersampling Scheme for Support Vector Machines and its Application to Imbalanced Classification,” IEEE Transactions on Neural Networks and Learning Systems, vol. 29, no. 9, pp. 4152–4165, Sep. 2018, doi: 10.1109/tnnls.2017.2755595.
  12. Arora, R. S. Leekha, K. Lee, and A. Kataria, “Facilitating User Authorization from Imbalanced Data Logs of Credit Cards Using Artificial Intelligence,” Mobile Information Systems, vol. 2020, pp. 1–13, Oct. 2020, doi: 10.1155/2020/8885269.
  13. J. Błaszczyński, A. T. de Almeida Filho, A. Matuszyk, M. Szeląg, and R. Słowiński, “Auto loan fraud detection using dominance-based rough set approach versus machine learning methods,” Expert Systems with Applications, vol. 163, p. 113740, Jan. 2021, doi: 10.1016/j.eswa.2020.113740.
  14. B. Branco, P. Abreu, A. S. Gomes, M. S. C. Almeida, J. T. Ascensão, and P. Bizarro, “Interleaved Sequence RNNs for Fraud Detection,” Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Aug. 2020, doi: 10.1145/3394486.3403361.
  15. F. Cartella, O. Anunciacao, Y. Funabiki, D. Yamaguchi, T. Akishita and O. Elshocht, "Adversarial attacks for tabular data: Application to fraud detection and imbalanced data",2021, arXiv:2101.08030.
  16. Benchaji, S. Douzi, and B. E. Ouahidi, “Credit Card Fraud Detection Model Based on LSTM Recurrent Neural Networks,” Journal of Advances in Information Technology, vol. 12, no. 2, pp. 113–118, 2021, doi: 10.12720/jait.12.2.113-118.
  17. Y. Fang, Y. Zhang, and C. Huang, “Credit Card Fraud Detection Based on Machine Learning,” Computers, Materials & Continua, vol. 61, no. 1, pp. 185–195, 2019, doi: 10.32604/cmc.2019.06144.
  18. J. Forough and S. Momtazi, “Ensemble of deep sequential models for credit card fraud detection,” Applied Soft Computing, vol. 99, p. 106883, Feb. 2021, doi: 10.1016/j.asoc.2020.106883.
  19. B. Baesens, S. Höppner, and T. Verdonck, “Data engineering for fraud detection,” Decision Support Systems, vol. 150, p. 113492, Nov. 2021, doi: 10.1016/j.dss.2021.113492.
  20. X. Zhang, Y. Han, W. Xu, and Q. Wang, “HOBA: A novel feature engineering methodology for credit card fraud detection with a deep learning architecture,” Information Sciences, vol. 557, pp. 302–316, May 2021, doi: 10.1016/j.ins.2019.05.023.
  21. Y. Xie, G. Liu, R. Cao, Z. Li, C. Yan, and C. Jiang, “A Feature Extraction Method for Credit Card Fraud Detection,” 2019 2nd International Conference on Intelligent Autonomous Systems (ICoIAS), Feb. 2019, doi: 10.1109/icoias.2019.00019.
  22. Y. Y. Hsin, T. S. Dai, Y. W. Ti and M. C. Huang, "Interpretable electronic transfer fraud detection with expert feature constructions", Proc. CIKM Workshops, pp. 1-11, 2021.
  23. D. Cheng, S. Xiang, C. Shang, Y. Zhang, F. Yang, and L. Zhang, “Spatio-Temporal Attention-Based Neural Network for Credit Card Fraud Detection,” Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 01, pp. 362–369, Apr. 2020, doi: 10.1609/aaai. v34i01.5371.
  24. Y. Lucas et al., “Towards automated feature engineering for credit card fraud detection using multi-perspective HMMs,” Future Generation Computer Systems, vol. 102, pp. 393–402, Jan. 2020, doi: 10.1016/j.future.2019.08.029.
  25. V. N. Dornadula and S. Geetha, “Credit Card Fraud Detection using Machine Learning Algorithms,” Procedia Computer Science, vol. 165, pp. 631–641, 2019, doi: 10.1016/j.procs.2020.01.057.
  26. K. Ashok, M. Ashraf, J. Thimmia Raja, M. Z. Hussain, D. K. Singh, and A. Haldorai, “Collaborative analysis of audio-visual speech synthesis with sensor measurements for regulating human–robot interaction,” International Journal of System Assurance Engineering and Management, Aug. 2022, doi: 10.1007/s13198-022-01709-y.

Acknowledgements


Author(s) thanks to Dr.Nedunchelian R for this research completion and support.


Funding


No funding was received to assist with the preparation of this manuscript.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


Data sharing is not applicable to this article as no new data were created or analysed in this study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Prabhakaran N and Nedunchelian R, “Combined Feature Set with Logistic Regression Model to Detect Credit Card Frauds in Real Time Applications”, Journal of Machine and Computing, pp. 804-812, July 2024. doi: 10.53759/7669/jmc202404074.


Copyright


© 2024 Prabhakaran N and Nedunchelian R. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.