International Journal of Scientific & Technology Research



IJSTR >> Volume 9 - Issue 3, March 2020 Edition


Website: http://www.ijstr.org

ISSN 2277-8616





Soreen Ameen Fattah, Hussein Attya Lafta, Sura Z. Alrashid



Machine Learning, Ensemble Learning, Bagging, Medical Diagnosis, Diabetes, Heart Diseases, Dataset Splitting



Advances in medical information systems have facilitated the development of automated diagnosis systems. Several Artificial Intelligence (AI) techniques have been implemented and studied in recent research in search of the most suitable and accurate medical diagnosis system. Bagging is one of these techniques, and several studies have shown it to be a powerful and convenient tool for such systems. In this research, a bagging algorithm is used to build a diagnosis system for two of the most common diseases: diabetes and heart disease. The algorithm uses verified datasets of attributes, combined with values for the same attributes submitted by the patient through a dedicated interface. Testing the system and comparing it with other prediction systems showed that it is efficient and predicts accurately.
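
The bagging idea named in the abstract can be illustrated with a minimal sketch; this is not the authors' implementation. Each base model is trained on a bootstrap resample of the training set, and the ensemble predicts by majority vote. The decision-stump base learner, the synthetic two-attribute data (hypothetical glucose and BMI values with an arbitrary labeling threshold of 125), the 150/50 split, and all function names are assumptions made for illustration only; the abstract does not state the base classifier or the split that the paper used.

```python
import random
from collections import Counter


def train_stump(rows, labels):
    """Fit a one-attribute threshold classifier by exhaustive search."""
    best = None
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            for hi_label in (0, 1):
                preds = [hi_label if r[f] > t else 1 - hi_label for r in rows]
                errors = sum(p != y for p, y in zip(preds, labels))
                if best is None or errors < best[0]:
                    best = (errors, f, t, hi_label)
    _, bf, bt, bl = best
    return lambda row: bl if row[bf] > bt else 1 - bl


def bagging_fit(rows, labels, n_models=25, seed=0):
    """Train n_models stumps, each on a bootstrap resample (with replacement)."""
    rng = random.Random(seed)
    n = len(rows)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(train_stump([rows[i] for i in idx],
                                  [labels[i] for i in idx]))
    return models


def bagging_predict(models, row):
    """Return the majority vote of the base models for one patient record."""
    votes = Counter(m(row) for m in models)
    return votes.most_common(1)[0][0]


# Synthetic, illustrative data only: [glucose, bmi] with label 1 if glucose > 125.
data_rng = random.Random(42)
rows = [[data_rng.uniform(70, 200), data_rng.uniform(18, 45)] for _ in range(200)]
labels = [1 if r[0] > 125 else 0 for r in rows]

# Dataset splitting: first 150 records for training, last 50 held out for testing.
train_rows, test_rows = rows[:150], rows[150:]
train_labels, test_labels = labels[:150], labels[150:]

models = bagging_fit(train_rows, train_labels)
correct = sum(bagging_predict(models, r) == y
              for r, y in zip(test_rows, test_labels))
accuracy = correct / len(test_rows)
```

In a system like the one described, the record passed to `bagging_predict` would hold the attribute values a patient enters through the interface, in the same attribute order as the training data.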


