International Journal of Scientific & Technology Research

Home About Us Scope Editorial Board Blog/Latest News Contact Us
10th percentile
Powered by  Scopus
Scopus coverage:
Nov 2018 to May 2020


IJSTR >> Volume 8 - Issue 8, August 2019 Edition

International Journal of Scientific & Technology Research  
International Journal of Scientific & Technology Research

Website: http://www.ijstr.org

ISSN 2277-8616

Comparison Of Datamining Techniques For Prediction Of Breast Cancer

[Full Text]



Deneshkumar V, Manoprabha M, Senthamarai Kannan K



Breast cancer, Data mining, Prediction, Feature Selection, Gini Index, Information Gain and ROC Curve.



Breast cancer is one of the most challenging deadly diseases. Correct and in-time prediction of such disease is very important. Wisconsin breast cancer dataset with 569 patients and 32 features were included in this study. The Information Gain and Gini Index were used to determine the effectiveness of features on breast cancer. The performance comparisons of the most commonly used statistical methods were also studied to find the best predictive model. The main objective of this manuscript is to make use of the advanced technologies to develop a best predictive model for breast cancer. All performance assessments were carried out using Rapid Miner Studio software.



[1] H. A. Abbass, “An Evolutionary Artificial Neural Networks Approach for Breast Cancer Diagnosis,” Artificial Intelligence in Medicine, vol. 25, pp. 223-232, July. 2002.
[2] R. Alizadehsani, J. Habibi, M. J. Hosseini, H. Mashayekhi, R. Boghrati, A. Ghandeharioun, B. Bahadorian, and Z. A. Sani, “A Data Mining Approach for Diagnosis of Coronary artery Disease,” Computer Methods and Programs in Biomedicine, COMM- 3519, pp. 1-10, July. 2013.
[3] A. N. Arbain, and Y. P. Balakrishnan, “A Comparison of Data Mining Algorithms for Liver Disease Prediction on Imbalanced Data,” International Journal of Data Science and Advanced Analytics, vol. 1, no. 1, pp. 1-11, Feb. 2019.
[4] H. Ayatollahi, L. Gholamhosseini, and M. Salehi, “Predicting Coronary artery Disease: a Comparison between Two Data Mining Algorithms,” BMC Public Health, vol. 19, no. 448, pp. 1-9, Apr. 2019.
[5] A. Bellaachia, and E. Guven, “Predicting Breast Cancer Survivability using Data Mining Techniques,” In: Scientific Data Mining Workshop (in conjunction with the 2006 SIAM conference on data mining), Bethesda, Maryland, pp. 20-22, 2006.
[6] V. Chaurasia, and S. Pal, “Data Mining Approach to Detect Heart Diseases,” International Journal of Advanced Computer Science and Information Technology, vol. 2, no. 4, pp. 56-66, Jan. 2014.
[7] V. Chaurasia, and S. Pal, “Performance Analysis of Data Mining Algorithms for Diagnosis and Prediction of Heart and Breast Cancer Disease,” International Journal of Innovative Computing, Information & Control, vol. 3, no. 8, pp. 1-13, May. 2014.
[8] V. Chaurasia, and S. Pal, and B.B. Tiwari, “Prediction of Benign and Malignant Breast Cancer using Data Mining Techniques,” Journal of Algorithms & Computational Technology, vol. 12, no. 2, pp. 119-126, Jan. 2018.
[9] T. L. Daniel, “Data Mining Methods and Models,” A John Wiley & Sons, INC Publication, Hoboken, New Jersey, 2006.
[10] S. K. Dehkordi, and H. Sajedi, “Prediction of disease based on Prescription using Data Mining Methods,” Health and Technology, pp. 1-8, July. 2018.
[11] J. Han, M. Kamber, and J. Pei, “Data Mining Concepts and Techniques,” 3rd edition, Morgan Kaufmann Publishers is an imprint of Elsevier, Waltham, MA, USA, 2012.
[12] D. S. Jacob, R. Viswan, V. Manju, L. PadmaSuresh, S. Raj, “A Survey on Breast Cancer Prediction Using Data Mining Techniques,” Proc. IEEE Conference on Emerging Devices and Smart Systems, pp. 256-258, Mar. 2018.
[13] L. Jena, and N.K. Kamila, “Distributed Data Mining Classification Algorithms for Prediction of Chronic-Kidney-disease,” International Journal of Emerging Research in Management & Technology, vol. 4, no. 11, pp. 110-118, Nov. 2015.
[14] V. Kunwar, K. Chandel, A. S. Sabitha, and A. Bansal, “Chronic Kidney Disease Analysis using Data Mining Classification Techniques,” 6th International conference – Cloud System and Big Data Engineering (Confluence), pp. 300-305, Jan. 2016.
[15] A. K. Mishra, and B. K. Ratha, ”Study of Random Tree and Random Forest Data Mining Algorithms for Microarray Data Analysis,” International Journal on Advanced Electrical and Computer Engineering, vol. 3,no. 4, pp. 5-7, 2016.
[16] J. Padmavathi, “Logistic regression in Feature Selection in Data Mining,” International Journal of Scientific & Engineering Research, vol. 3, no. 8, pp. 1-4, Aug. 2012.
[17] A. Sanjay, H. V. Nair, S. Murali, and K. S. Krishnaveni, “A Data Mining Model To Predict Breast Cancer Using Improved Feature Selection Method on Real Time Data,” International Conference on Advances in Computing, Communications and Informatics, pp. 2437-2440, Sep. 2018.
[18] M. Shouman, T. Turner, and R. Stocker, “Using Data Mining Techniques in Heart Disease Diagnosis and Treatment,” 2012 Japan-Egypt Conference on Electronics, Communications and Computers, pp. 173-177, Mar. 2012.
[19] K. Srinivas, B. K. Rani, and A. Govrdhan, “Application of Data Mining Techniques in Healthcare and Prediction of Heart Attacks,” International Journal on Computer Science and Engineering, vol. 2, no. 2, pp. 250-255, Mar. 2010.
[20] L. Ya-Qin, W. Cheng, and Z. Lu, “Decision Tree Based Predictive Models for Breast Cancer Survivability on Imbalanced Data,” 3rd International Conference on Bioinformatics and Biomedical Engineering, Beijing, China, pp. 11-13, July. 2009.