Deprivation of Critical Care systems are a major cause of fatality worldwide, highlighting it’s need for saving human lives. This study proposes a novel hybrid ensemble model, which integrates Random Forests, Gradient Boosting Machines (GBM), and Neural Networks to enhance the predictive accuracy diagnostics. The methodology combines data pre-processing, feature selection, and ensemble learning, ensuring robust and reliable predictions. Comprehensive data pre-processing includes K-Nearest Neighbours (KNN) imputation for missing values, Z-Score normalization for scaling, and Polynomial Feature Generation for non-linear feature interactions. Feature selection performed using Recursive Feature Elimination (RFE) and Mutual Information relevant variable retention. The proposed model produces 98.55% accuracy, very surpassing nine baseline models, that includes XGBoost, Random Forests, and Neural Networks. Additional metrics such as precision (97.80%), recall (98.12%), F1-Score (98.00%), and ROC-AUC (99.12%) further validate the model's robustness. This framework not only demonstrates superior accuracy but also ensures computational efficiency, making it viable for deployment in real-world healthcare settings.
Keywords
Early Detection, AI-Powered Framework, Ensemble Learning, Random Forests, Gradient Boosting Machines, Neural Networks, Machine Learning, Predictive Model, Feature Selection.
J. Mishra and M. Tiwari, “IoT-enabled ECG-based heart disease prediction using three-layer deep learning and meta-heuristic approach,” Signal, Image and Video Processing, vol. 18, no. 1, pp. 361–367, Sep. 2023, doi: 10.1007/s11760-023-02743-4.
M. Mandava and S. Reddy vinta, “MDensNet201-IDRSRNet: Efficient cardiovascular disease prediction system using hybrid deep learning,” Biomedical Signal Processing and Control, vol. 93, p. 106147, Jul. 2024, doi: 10.1016/j.bspc.2024.106147.
A. Yashudas, D. Gupta, G. C. Prashant, A. Dua, D. AlQahtani, and A. S. K. Reddy, “DEEP-CARDIO: Recommendation System for Cardiovascular Disease Prediction Using IoT Network,” IEEE Sensors Journal, vol. 24, no. 9, pp. 14539–14547, May 2024, doi: 10.1109/jsen.2024.3373429.
N. Chandrasekhar and S. Peddakrishna, “Enhancing Heart Disease Prediction Accuracy through Machine Learning Techniques and Optimization,” Processes, vol. 11, no. 4, p. 1210, Apr. 2023, doi: 10.3390/pr11041210.
E. K. Oikonomou and R. Khera, “Machine learning in precision diabetes care and cardiovascular risk prediction,” Cardiovascular Diabetology, vol. 22, no. 1, Sep. 2023, doi: 10.1186/s12933-023-01985-3.
E. Dritsas and M. Trigka, “Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction,” Sensors, vol. 23, no. 3, p. 1161, Jan. 2023, doi: 10.3390/s23031161.
G. Ramkumar, J. Seetha, R. Priyadarshini, M. Gopila, and G. Saranya, “IoT-based patient monitoring system for predicting heart disease using deep learning,” Measurement, vol. 218, p. 113235, Aug. 2023, doi: 10.1016/j.measurement.2023.113235.
G. Abdulsalam, S. Meshoul, and H. Shaiba, “Explainable Heart Disease Prediction Using Ensemble-Quantum Machine Learning Approach,” Intelligent Automation & Soft Computing, vol. 36, no. 1, pp. 761–779, 2023, doi: 10.32604/iasc.2023.032262.
F. Li, P. Wu, H. H. Ong, J. F. Peterson, W.-Q. Wei, and J. Zhao, “Evaluating and mitigating bias in machine learning models for cardiovascular disease prediction,” Journal of Biomedical Informatics, vol. 138, p. 104294, Feb. 2023, doi: 10.1016/j.jbi.2023.104294.
N. A. Baghdadi, S. M. Farghaly Abdelaliem, A. Malki, I. Gad, A. Ewis, and E. Atlam, “Advanced machine learning techniques for cardiovascular disease early detection and diagnosis,” Journal of Big Data, vol. 10, no. 1, Sep. 2023, doi: 10.1186/s40537-023-00817-1.
S. Mohammad Ganie, P. Kanti Dutta Pramanik, M. Bashir Malik, A. Nayyar, and K. Sup Kwak, “An Improved Ensemble Learning Approach for Heart Disease Prediction Using Boosting Algorithms,” Computer Systems Science and Engineering, vol. 46, no. 3, pp. 3993–4006, 2023, doi: 10.32604/csse.2023.035244.
O. Taylan, A. Alkabaa, H. Alqabbaa, E. Pamukçu, and V. Leiva, “Early Prediction in Classification of Cardiovascular Diseases with Machine Learning, Neuro-Fuzzy and Statistical Methods,” Biology, vol. 12, no. 1, p. 117, Jan. 2023, doi: 10.3390/biology12010117.
J. Yu et al., “Incorporating longitudinal history of risk factors into atherosclerotic cardiovascular disease risk prediction using deep learning,” Scientific Reports, vol. 14, no. 1, Jan. 2024, doi: 10.1038/s41598-024-51685-5.
W. DeGroat, H. Abdelhalim, K. Patel, D. Mendhe, S. Zeeshan, and Z. Ahmed, “Discovering biomarkers associated and predicting cardiovascular disease with high accuracy using a novel nexus of machine learning techniques for precision medicine,” Scientific Reports, vol. 14, no. 1, Jan. 2024, doi: 10.1038/s41598-023-50600-8.
C.-Y. Ma et al., “Predicting coronary heart disease in Chinese diabetics using machine learning,” Computers in Biology and Medicine, vol. 169, p. 107952, Feb. 2024, doi: 10.1016/j.compbiomed.2024.107952.
M. T. García-Ordás, M. Bayón-Gutiérrez, C. Benavides, J. Aveleira-Mata, and J. A. Benítez-Andrades, “Heart disease risk prediction using deep learning techniques with feature augmentation,” Multimedia Tools and Applications, vol. 82, no. 20, pp. 31759–31773, Mar. 2023, doi: 10.1007/s11042-023-14817-z.
A. Ogunpola, F. Saeed, S. Basurra, A. M. Albarrak, and S. N. Qasem, “Machine Learning-Based Predictive Models for Detection of Cardiovascular Diseases,” Diagnostics, vol. 14, no. 2, p. 144, Jan. 2024, doi: 10.3390/diagnostics14020144.
M. Trigka and E. Dritsas, “Long-Term Coronary Artery Disease Risk Prediction with Machine Learning Models,” Sensors, vol. 23, no. 3, p. 1193, Jan. 2023, doi: 10.3390/s23031193.
C. M. Bhatt, P. Patel, T. Ghetia, and P. L. Mazzeo, “Effective Heart Disease Prediction Using Machine Learning Techniques,” Algorithms, vol. 16, no. 2, p. 88, Feb. 2023, doi: 10.3390/a16020088.
A. A. Almazroi, E. A. Aldhahri, S. Bashir, and S. Ashfaq, “A Clinical Decision Support System for Heart Disease Prediction Using Deep Learning,” IEEE Access, vol. 11, pp. 61646–61659, 2023, doi: 10.1109/access.2023.3285247.
https://www.kaggle.com/datasets/sulianova/cardiovascular-disease-dataset/data accessed on 15th June 2024.
A. G, B. Ganesh, A. Ganesh, C. Srinivas, Dhanraj, and K. Mensinkal, “Logistic regression technique for prediction of cardiovascular disease,” Global Transitions Proceedings, vol. 3, no. 1, pp. 127–130, Jun. 2022, doi: 10.1016/j.gltp.2022.04.008.
T. A. Assegie, “Heart disease prediction model with k-nearest neighbor algorithm,” International Journal of Informatics and Communication Technology (IJ-ICT), vol. 10, no. 3, p. 225, Dec. 2021, doi: 10.11591/ijict.v10i3.pp225-230.
F. Asadi, R. Homayounfar, Y. Mehrali, C. Masci, S. Talebi, and F. Zayeri, “Detection of cardiovascular disease cases using advanced tree-based machine learning algorithms,” Scientific Reports, vol. 14, no. 1, Sep. 2024, doi: 10.1038/s41598-024-72819-9.
Y. Wu, “Heart Disease Prediction Using Gradient Boosting Decision Trees,” Proceedings of the 1st International Conference on Engineering Management, Information Technology and Intelligence, pp. 527–535, 2024, doi: 10.5220/0012958300004508.
B. Xia, N. Innab, V. Kandasamy, A. Ahmadian, and M. Ferrara, “Intelligent cardiovascular disease diagnosis using deep learning enhanced neural network with ant colony optimization,” Scientific Reports, vol. 14, no. 1, Sep. 2024, doi: 10.1038/s41598-024-71932-z.
A. K. Yadav, et al., "Early Stage Prediction of Heart Disease Features using AdaBoost Ensemble Algorithm and Tree Algorithms", IJISAE, 12(3), 545–551, 2024, https://www.ijisae.org/index.php/IJISAE/article/view/5285.
S. Sharma and A. Singhal, “A Novel Heart Disease Prediction System Using XGBoost Classifier Coupled With ADASYN SMOTE,” 2023 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS), pp. 76–81, Nov. 2023, doi: 10.1109/icccis60361.2023.10425095.
CRediT Author Statement
The authors confirm contribution to the paper as follows:
Conceptualization: Vishnu Priyan S, Vijayalakshmi N, Suresh G and Rajesh K;
Methodology: Vishnu Priyan S and Vijayalakshmi N;
Software: Suresh G and Rajesh K;
Data Curation: Vishnu Priyan S and Vijayalakshmi N;
Writing- Original Draft Preparation: Vishnu Priyan S, Vijayalakshmi N, Suresh G and Rajesh K;
Visualization: Vishnu Priyan S and Vijayalakshmi N;
Investigation: Suresh G and Rajesh K;
Supervision: Vishnu Priyan S and Vijayalakshmi N;
Validation: Suresh G and Rajesh K;
Writing- Reviewing and Editing: Vishnu Priyan S, Vijayalakshmi N, Suresh G and Rajesh K;
All authors reviewed the results and approved the final version of the manuscript.
Acknowledgements
The authors would like to thank to the reviewers for nice comments on the manuscript.
Funding
No funding was received to assist with the preparation of this manuscript.
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare that are relevant to the content of this article.
Availability of data and materials
Data sharing is not applicable to this article as no new data were created or analysed in this study.
Author information
Contributions
All authors have equal contribution in the paper and all authors have read and agreed to the published version of the manuscript.
Corresponding author
Vishnu Priyan S
Department of Biomedical Engineering, Kings Engineering College, Chennai, Tamil Nadu, India.
Open Access This article is licensed under a Creative Commons Attribution NoDerivs is a more restrictive license. It allows you to redistribute the material commercially or non-commercially but the user cannot make any changes whatsoever to the original, i.e. no derivatives of the original work. To view a copy of this license, visit https://creativecommons.org/licenses/by-nc-nd/4.0/
Cite this article
Vishnu Priyan S, Vijayalakshmi N, Suresh G and Rajesh K, “Advancing Health Diagnostics: AI-Powered CVD-REF Framework for Precise and Early Risk Assessment”, Journal of Machine and Computing, pp. 1248-1264, April 2025, doi: 10.53759/7669/jmc202505098.