Breast Cancer Prediction via Grid Search Hyperparameter Optimization

Authors

  • Ashima Aggarwal
  • Anurag Sharma

Keywords:

Breast Cancer, early detection, machine learning, hyperparameter optimization, logistic regression, support vector machine, k-nearest neighbor, naive bayes, decision tree, random forest, OWBCD, WDBC, Coimbra, BRCA, Haberman, SEER.

Abstract

Breast Cancer is a serious health issue worldwide, and early detection is crucial in preventing deaths. Machine learning can help identify tumors efficiently, and this paper introduces the Grid Search Hyperparameter Optimization (GSHPO) method to optimize the parameters of six existing models, including Logistic Regression, Support Vector Machine, K-Nearest Neighbor, Naïve Bayes, Decision Tree, and Random Forest. The best parameters were applied to predict outcomes in six datasets, including OWBCD, WDBC, Coimbra, BRCA, Haberman, and SEER. The results show that tuning the hyperparameters of models has a significant positive impact on prediction accuracy. 

References

Amrane, S. Oukid, I. Gagaoua, and T. Ensari, “Breast cancer classification using machine learning,” 2018 Electr. Electron. Comput. Sci. Biomed. Eng. Meet. EBBT 2018, pp. 1–4, 2018, doi: 10.1109/EBBT.2018.8391453.

A. Bharat, N. Pooja, and R. A. Reddy, “Using Machine Learning algorithms for breast cancer risk prediction and diagnosis,” 2018 IEEE 3rd Int. Conf. Circuits, Control. Commun. Comput. I4C 2018, no. x, pp. 1–4, 2018, doi: 10.1109/CIMCA.2018.8739696

A. Bazila Banu and P. Thirumalaikolundusubramanian, “Comparison of bayes classifiers for breast cancer classification,” Asian Pacific J. Cancer Prev., vol. 19, no. 10, pp. 2917–2920, 2018, doi: 10.22034/APJCP.2018.19.10.2917.

A. Paulin, F, and Santhakumaran, "Classification of Breast cancer by comparing Backpropagation training algorithms,Int. J. Comput. Sci. Eng., vol. 3, no. 1, pp. 327–332, 2011.

C.P. Utomo, A.Kardiana, and R.Yuliwulandari, "Breast Cancer Diagnosis using Artificial Neural Networks with Extreme Learning Techniques," International Journal of Advanced Research in Artificial Intelligence, Vol.3, No.7, 2014.

C. Nguyen, Y. Wang, and H. N. Nguyen, “Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic,” J. Biomed. Sci. Eng., vol. 06, no. 05, pp. 551–560, 2013, doi: 10.4236/jbise.2013.65070.

D. Bazazeh and R. Shubair, "Comparative study of machine learning algorithms for breast cancer detection and diagnosis," 2016 5th International Conference on Electronic Devices, Systems and Applications (ICEDSA), 2016, pp. 1-4, doi: 10.1109/ICEDSA.2016.7818560.

E. A. Bayrak, P. Kirci, and T. Ensari, “Comparison of machine learning methods for breast cancer diagnosis,” 2019 Sci. Meet. Electr. Biomed. Eng. Comput. Sci. EBBT 2019, pp. 4–6, 2019, doi: 10.1109/EBBT.2019.8741990.

F. M. Agarap, “On breast cancer detection: An application of machine learning algorithms on the Wisconsin diagnostic dataset,” ACM Int. Conf. Proceeding Ser., no. 1, pp. 5–9, 2018, doi: 10.1145/3184066.3184080

G. I. Salama, M. B. Abdelhalim, and M. A. Zeid, “Experimental comparison of classifiers for breast cancer diagnosis Experimental Comparison of Classifiers for Breast Cancer Diagnosis,” no. November, 2012, doi: 10.1109/ICCES.2012.6408508.

H. Asri, H. Mousannif, H. A. Moatassime, T. Noel, “Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis,” Procedia Computer Science, Volume 83, 2016, Pages 1064-1069, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2016.04.224.

L. Rodrigues, “Analysis of the Wisconsin Breast Cancer Dataset and Machine Learning for Analysis of the Wisconsin Breast Cancer Dataset and Machine Learning for Breast Cancer Detection,” no. December, 2016.

M. M. Islam, H. Iqbal, M. R. Haque, and M. K. Hasan, “Prediction of breast cancer using support vector machine and K-Nearest neighbors,” 5th IEEE Reg. 10 Humanit. Technol. Conf. 2017, R10-HTC 2017, vol. 2018-January, pp. 226–229, 2018, doi: 10.1109/R10-HTC.2017.8288944.

O. I. Obaid, M. A. Mohammed, M. K. Abd Ghani, S. A. ostafa, and F. T. Al-Dhief, “Evaluating the performance of machine learning techniques in the classification of Wisconsin Breast Cancer,” Int. J. Eng. Technol., vol. 7, no. 4.36 Special Issue 36, pp. 160–166, 2018, doi: 10.14419/ijet.v7i4.36.23737.

P. Gupta and S. Garg, “Breast Cancer Prediction using varying Parameters of Machine Learning Models,” Procedia Comput. Sci., vol. 171, pp. 593–601, 2020, doi: 10.1016/j.procs.2020.04.064.

S. A. Mohammed, S. Darrab, S.A. Noaman and G.Saake, Data Mining and Big Data Book, In: Tan Y., Shi Y., Tuba M. (eds) Data Mining and Big Data. DMBD 2020. Communications in Computer and Information Science, vol 1234. Springer, Singapore. https://doi.org/10.1007/978-981-15-7205-0_10

S. Sharma, A. Aggarwal and T.Choudary, "Breast Cancer Detection Using Machine Learning Algorithms," 2018 International Conference on Computational Techniques, Electronics and Mechanical Systems (CTEMS), 2018, pp. 114-118, doi: 10.1109/CTEMS.2018.8769187.

S.Sharma and S.Deshpande, “Breast Cancer Classification Using Machine Learning Algorithms,” In: Joshi A., Khosravy M., Gupta N. (eds), Machine Learning for Predictive Analysis. Lecture Notes in Networks and Systems, vol 141, Springer, Singapore. https://doi.org/10.1007/978-981-15-71060_56.

Downloads

Published

2024-02-28

How to Cite

Ashima Aggarwal, & Anurag Sharma. (2024). Breast Cancer Prediction via Grid Search Hyperparameter Optimization . Journal Punjab Academy of Sciences, 23, 183–193. Retrieved from http://jpas.in/index.php/home/article/view/67