An Efficient Filter and Wrapper based Selection Methods along With Random Forest and Support Vector Machines Classification Technique in Health Care System
Keerthika N
Department of Computer Science and Engineering, Ponnaiyah Ramajayam Institute of Science and Technology (PRIST) Deemed to be University, Thanjavur, India.
A Health care Management System (HMS) is key to the successful management of any health care organization. Health
care management systems span many research dimensions, including disease identification and diagnosis, drug discovery
and manufacturing, bioinformatics problems, personalized treatments, and patient image analysis. Heart Disease Prediction
(HDP) is the process of identifying heart disease in advance and assessing a patient's health condition by applying
analytical techniques to the patient's heart-related symptoms. Nowadays, heart disease identification is commonly addressed
with machine learning techniques. In this paper we construct a heart disease prediction method that combines feature
selection with machine learning classification. According to existing studies, one of the main difficulties in heart disease
prediction is that openly available data sets do not properly record the necessary characteristics, and methods for finding
the useful features among those available lag behind. Feature selection is the process of removing irrelevant features from
an available feature set while preserving sufficient classification accuracy. The proposed methodology consists of two
phases: phase one employs two broad categories of feature selection techniques to identify efficient feature sets, which are
then passed as input to phase two, classification. We concentrate on filter-based feature selection methods, namely
Chi-square, Fast Correlation Based Filter (FCBF), Gini Index (GI), and ReliefF, and on wrapper-based feature selection
methods, namely Backward Feature Elimination (BFE), Exhaustive Feature Selection (EFS), Forward Feature Selection (FFS),
and Recursive Feature Elimination (RFE). The UCI heart disease data set is used to evaluate the results of this study.
Finally, the proposed system's performance is validated under various experimental setups.
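To make the difference between the two feature selection categories in the abstract concrete, the following is a minimal, dependency-free sketch, not the authors' implementation. It shows one filter method (Chi-square ranking) and one wrapper method (Forward Feature Selection). Everything named here is an assumption made for illustration: the function names, the toy data, and the leave-one-out 1-nearest-neighbour scorer, which stands in for the Random Forest and SVM classifiers named in the title so that the sketch needs no external library.

```python
from collections import Counter


def chi_square_score(feature, labels):
    """Chi-square statistic between one categorical feature and the class label."""
    n = len(labels)
    observed = Counter(zip(feature, labels))
    feature_totals = Counter(feature)
    label_totals = Counter(labels)
    score = 0.0
    for f_val, f_count in feature_totals.items():
        for l_val, l_count in label_totals.items():
            expected = f_count * l_count / n
            score += (observed[(f_val, l_val)] - expected) ** 2 / expected
    return score


def filter_select(rows, labels, k):
    """Filter method: rank features by chi-square score, keep the top k indices."""
    n_features = len(rows[0])
    scores = [chi_square_score([r[j] for r in rows], labels)
              for j in range(n_features)]
    ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


def loo_accuracy(rows, labels, subset):
    """Leave-one-out accuracy of a 1-NN classifier (Hamming distance) on `subset`.

    Stand-in scorer (assumption): the paper's classifiers are RF and SVM.
    """
    correct = 0
    for i, row in enumerate(rows):
        best_dist, best_label = None, None
        for j, other in enumerate(rows):
            if i == j:
                continue
            dist = sum(row[f] != other[f] for f in subset)
            if best_dist is None or dist < best_dist:
                best_dist, best_label = dist, labels[j]
        correct += best_label == labels[i]
    return correct / len(rows)


def forward_select(rows, labels, k):
    """Wrapper method: greedily add the feature that most improves the scorer."""
    selected = []
    remaining = list(range(len(rows[0])))
    while len(selected) < k and remaining:
        best = max(remaining,
                   key=lambda f: loo_accuracy(rows, labels, selected + [f]))
        selected.append(best)
        remaining.remove(best)
    return sorted(selected)


# Hypothetical toy data: 3 binary features per patient, binary diagnosis label.
# Feature 0 matches the label, feature 1 is noise, feature 2 is weakly related.
ROWS = [(1, 0, 1), (1, 1, 1), (1, 0, 0), (1, 1, 1),
        (0, 0, 0), (0, 1, 0), (0, 0, 1), (0, 1, 0)]
LABELS = [1, 1, 1, 1, 0, 0, 0, 0]

print("filter (chi-square), top 2:", filter_select(ROWS, LABELS, 2))
print("wrapper (forward), top 1:", forward_select(ROWS, LABELS, 1))
```

The filter scores each feature independently of any classifier, so it is cheap to compute. The wrapper retrains and re-evaluates the classifier for every candidate subset, which is costlier but tailors the subset to the classifier that will use it in the second phase.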
Keywords
Health Care Management System, Heart Disease Prediction, Machine Learning Techniques, Feature Selection
Techniques, Classification, Filter FS, Wrapper FS.
Acknowledgements
The author(s) thank Dr. Nithyanandam S for supporting the completion of this research.
Funding
No funding was received to assist with the preparation of this manuscript.
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Availability of data and materials
No data are available for the above study.
Author information
Contributions
All authors contributed equally to the paper, and all authors have read and agreed to the published version of the manuscript.
Corresponding author
Keerthika N
Department of Computer Science and Engineering, Ponnaiyah Ramajayam Institute of Science and Technology (PRIST) Deemed to be University, Thanjavur, India.
Open Access. This article is licensed under a Creative Commons Attribution NoDerivs license, a more restrictive license. It allows you to redistribute the material commercially or non-commercially, but you cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/
Cite this article
Keerthika N and Nithyanandam S, “An Efficient Filter and Wrapper based Selection Methods along With Random Forest and Support Vector Machines Classification Technique in Health Care System”, Journal of Machine and Computing, vol. 3, no. 4, pp. 566-581, October 2023. doi: 10.53759/7669/jmc202303048.