Journal of Machine and Computing


An Efficient Filter and Wrapper based Selection Methods along With Random Forest and Support Vector Machines Classification Technique in Health Care System



Journal of Machine and Computing

Received On : 02 May 2023

Revised On : 10 August 2023

Accepted On : 06 September 2023

Published On : 05 October 2023

Volume 03, Issue 04

Pages : 566-581


Abstract


Health care Management System (HMS) is a key to successful management of any health care industry. Health care management systems have so many research dimensions such as identifying disease and diagnostic, drug discovery manufacturing, Bioinformatics’ problem, personalized treatments, Patient image analysis and so on. Heart Disease Prediction (HDP) is a process of identifying heart disease in advance and recognizes patient health condition by applying techniques on patient heart related symptoms. Now a day’s the problem of identifying heart diseases is solved by machine learning techniques. In this paper we construct a heart disease prediction method using combined feature selection and classification machine learning techniques. According to the existing study the one of the main difficult in heart disease prediction system is that the available data in open sources are not properly recorded the necessary characteristics and there is some lagging in finding the useful features from the available features. The process of removing inappropriate features from an available feature set while preserving sufficient classification accuracy is known as feature selection. A methodology is proposed in this paper that consists of two phases: Phase one employs two broad categories of feature selection techniques to identify the efficient feature sets and it is given to the input of our second phase such as classification. In this work we will concentrate on filter-based method for feature selection such as Chi-square, Fast Correlation Based Filter (FCBF), Gini Index (GI), RelifeF, and wrapper-based method for feature selection such as Backward Feature Elimination (BFE), Exhaustive Feature Selection (EFS), Forward Feature Selection (FFS), and Recursive Feature Elimination (RFE). The UCI heart disease data set is used to evaluate the output in this study. Finally, the proposed system's performance is validated by various experiments setups.


Keywords


Health Care Management System, Heart Disease Prediction, Machine Learning Techniques, Feature Selection Techniques, Classification, Filter FS, Wrapper FS.


  1. S. Mohan, C. Thirumalai, and G. Srivastava, “Effective Heart Disease Prediction Using Hybrid Machine Learning Techniques,” IEEE Access, vol. 7, pp. 81542–81554, 2019, doi: 10.1109/access.2019.2923707.
  2. J. Jacob et al., “Predicting outcomes in rheumatoid arthritis related interstitial lung disease,” European Respiratory Journal, vol. 53, no. 1, p. 1800869, Nov. 2018, doi: 10.1183/13993003.00869-2018.
  3. A. H. Chen, S. Y. Huang, P. S. Hong, C. H. Cheng and E. J. Lin, "HDPS: Heart disease prediction system," 2011 Computing in Cardiology, Hangzhou, China, 2011, pp. 557-560.
  4. Nidhi Bhatla and Kiran Jyoti, "An analysis of heart disease prediction using different data mining techniques," International Journal of Engineering Research & Technology (IJERT), Vol. 1, no. 8, 2012.
  5. S. Palaniappan and R. Awang, “Intelligent heart disease prediction system using data mining techniques,” 2008 IEEE/ACS International Conference on Computer Systems and Applications, Mar. 2008, doi: 10.1109/aiccsa.2008.4493524.
  6. M. Kavitha, G. Gnaneswar, R. Dinesh, Y. R. Sai, and R. S. Suraj, “Heart Disease Prediction using Hybrid machine Learning Model,” 2021 6th International Conference on Inventive Computation Technologies (ICICT), Jan. 2021, doi: 10.1109/icict50816.2021.9358597.
  7. H. Turabieh, “A Hybrid ANN-GWO Algorithm for Prediction of Heart Disease,” American Journal of Operations Research, vol. 06, no. 02, pp. 136–146, 2016, doi: 10.4236/ajor.2016.62016.
  8. S. Uddin, A. Khan, M. E. Hossain, and M. A. Moni, “Comparing different supervised machine learning algorithms for disease prediction,” BMC Medical Informatics and Decision Making, vol. 19, no. 1, Dec. 2019, doi: 10.1186/s12911-019-1004-8.
  9. G. Sliwoski, S. Kothiwale, J. Meiler, and E. W. Lowe, “Computational Methods in Drug Discovery,” Pharmacological Reviews, vol. 66, no. 1, pp. 334–395, Dec. 2013, doi: 10.1124/pr.112.007336.
  10. U. López de Heredia and J. L. Vázquez-Poletti, “RNA-seq analysis in forest tree species: bioinformatic problems and solutions,” Tree Genetics & Genomes, vol. 12, no. 2, Mar. 2016, doi: 10.1007/s11295-016-0995-x.
  11. J. Saez‐Rodriguez and N. Blüthgen, “Personalized signaling models for personalized treatments,” Molecular Systems Biology, vol. 16, no. 1, Jan. 2020, doi: 10.15252/msb.20199042.
  12. Ophir Gozes, et al, "Rapid ai development cycle for the coronavirus (covid-19) pandemic: Initial results for automated detection & patient monitoring using deep learning ct image analysis," 2020, doi: 10.48550/arXiv.2003.05037.
  13. J. Monsi et al., “XRAY AI: Lung Disease Prediction Using Machine Learning,” International Journal of Information Systems and Computer Sciences, vol. 8, no. 2, pp. 51–54, Apr. 2019, doi: 10.30534/ijiscs/2019/12822019.
  14. Jaymin Patel, Dr TejalUpadhyay, and Samir Patel, "heart disease prediction using machine learning and data mining technique," IJCSC, Vol. 7, no. 1pp. 129-137, 2015, doi: 10.090592/IJCSC.2016.018.
  15. Sathya priya, "Chronic Kidney Disease Prediction Using Machine Learning," International Journal of Computer Science and Information Security (IJCSIS), Vol. 16, no.4, 2018.
  16. R. Mathur, V. Pathak, and D. Bandil, “Parkinson Disease Prediction Using Machine Learning Algorithm,” Emerging Trends in Expert Applications and Security, pp. 357–363, Nov. 2018, doi: 10.1007/978-981-13-2285-3_42.
  17. G. T. Reddy and N. Khare, “An Efficient System for Heart Disease Prediction Using Hybrid OFBAT with Rule-Based Fuzzy Logic Model,” Journal of Circuits, Systems and Computers, vol. 26, no. 04, p. 1750061, Dec. 2016, doi: 10.1142/s021812661750061x.
  18. I. Yekkala and S. Dixit, “Prediction of Heart Disease Using Random Forest and Rough Set Based Feature Selection,” International Journal of Big Data and Analytics in Healthcare, vol. 3, no. 1, pp. 1–12, Jan. 2018, doi: 10.4018/ijbdah.2018010101.
  19. Janosi,Andras, Steinbrunn,William, Pfisterer,Matthias, and Detrano,Robert, "Heart Disease," UCI Machine Learning Repository, 1988. doi: 10.24432/C52P4X.
  20. S. Bashir, Z. S. Khan, F. Hassan Khan, A. Anjum, and K. Bashir, “Improving Heart Disease Prediction Using Feature Selection Approaches,” 2019 16th International Bhurban Conference on Applied Sciences and Technology (IBCAST), Jan. 2019, doi: 10.1109/ibcast.2019.8667106.
  21. Peter, T. John, and K. Somasundaram. "Study and development of novel feature selection framework for heart disease prediction." International Journal of Scientific and Research Publications, 2012.
  22. A. M. Usman, U. K. Yusof, and S. Naim, “Cuckoo inspired algorithms for feature selection in heart disease prediction,” International Journal of Advances in Intelligent Informatics, vol. 4, no. 2, p. 95, Jul. 2018, doi: 10.26555/ijain.v4i2.245.
  23. X. Jin, A. Xu, R. Bie, and P. Guo, “Machine Learning Techniques and Chi-Square Feature Selection for Cancer Classification Using SAGE Gene Expression Profiles,” Data Mining for Biomedical Applications, pp. 106–115, 2006, doi: 10.1007/11691730_11.
  24. Yu, Lei, and Huan Liu, "Feature selection for high-dimensional data: A fast correlation-based filter solution," Proceedings of the 20th international conference on machine learning (ICML-03), 2003.
  25. N. S. Chandra Reddy, S. Shue Nee, L. Zhi Min, and C. Xin Ying, “Classification and Feature Selection Approaches by Machine Learning Techniques: Heart Disease Prediction,” International Journal of Innovative Computing, vol. 9, no. 1, May 2019, doi: 10.11113/ijic.v9n1.210.
  26. N. Spolaor, E. A. Cherman, M. C. Monard, and H. D. Lee, “ReliefF for Multi-label Feature Selection,” 2013 Brazilian Conference on Intelligent Systems, Oct. 2013, doi: 10.1109/bracis.2013.10.
  27. D. Kostrzewa and R. Brzeski, “The Data Dimensionality Reduction in the Classification Process Through Greedy Backward Feature Elimination,” Man-Machine Interactions 5, pp. 397–407, Sep. 2017, doi: 10.1007/978-3-319-67792-7_39.
  28. J. Ren, Z. Qiu, W. Fan, H. Cheng, and P. S. Yu, “Forward Semi-supervised Feature Selection,” Lecture Notes in Computer Science, pp. 970–976, doi: 10.1007/978-3-540-68125-0_101.
  29. C.-Y. Lee and B.-S. Chen, “Mutually-exclusive-and-collectively-exhaustive feature selection scheme,” Applied Soft Computing, vol. 68, pp. 961–971, Jul. 2018, doi: 10.1016/j.asoc.2017.04.055.
  30. K. Yan and D. Zhang, “Feature selection and analysis on correlated gas sensor data with recursive feature elimination,” Sensors and Actuators B: Chemical, vol. 212, pp. 353–363, Jun. 2015, doi: 10.1016/j.snb.2015.02.025.
  31. D. Shah, S. Patel, and S. K. Bharti, “Heart Disease Prediction using Machine Learning Techniques,” SN Computer Science, vol. 1, no. 6, Oct. 2020, doi: 10.1007/s42979-020-00365-y.

Acknowledgements


Author(s) thanks to Dr.Nithyanandam S for this research completion and support.


Funding


No funding was received to assist with the preparation of this manuscript.


Ethics declarations


Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.


Availability of data and materials


No data available for above study.


Author information


Contributions

All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.


Corresponding author


Rights and permissions


Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/


Cite this article


Keerthika N and Nithyanandam S, “An Efficient Filter and Wrapper based Selection Methods along With Random Forest and Support Vector Machines Classification Technique in Health Care System”, Journal of Machine and Computing, vol.3, no.4, pp. 566-581, October 2023. doi: 10.53759/7669/jmc202303048.


Copyright


© 2023 Keerthika N and Nithyanandam S. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.