Machine learning models selection under uncertainty: application in cancer prediction

Lamrani Alaoui, Y.; Benmir, M.; Aboulaich, R.

dc.citation.epage	238
dc.citation.issue	1
dc.citation.journalTitle	Mathematical Modeling and Computing
dc.citation.spage	230
dc.citation.volume	11
dc.contributor.affiliation	Mohammed V University in Rabat
dc.contributor.author	Lamrani Alaoui, Y.
dc.contributor.author	Benmir, M.
dc.contributor.author	Aboulaich, R.
dc.coverage.placename	Lviv
dc.date.accessioned	2025-10-20T07:44:16Z
dc.date.created	2024-02-24
dc.date.issued	2024-02-24
dc.description.abstract	Cancer is a leading cause of death worldwide, with millions of new cases diagnosed each year. Many research papers have discussed the potential benefits of machine learning (ML) in cancer prediction, including improved early detection and personalized treatment options. The literature also highlights the challenges facing the field, such as the need for large, diverse datasets and for interpretable models with high performance. This paper proposes a new approach for selecting ML models and assessing their generalization performance in cancer prediction, particularly for datasets of limited size. Estimates of generalization performance are influenced by numerous factors throughout training and testing, including the training–testing ratio and the random selection of the training and testing subsets.
dc.format.extent	230-238
dc.format.pages	9
dc.identifier.citation	Lamrani Alaoui Y. Machine learning models selection under uncertainty: application in cancer prediction / Y. Lamrani Alaoui, M. Benmir, R. Aboulaich // Mathematical Modeling and Computing. – Lviv : Lviv Polytechnic Publishing House, 2024. – Vol. 11, No. 1. – P. 230–238.
dc.identifier.doi	10.23939/mmc2024.01.230
dc.identifier.uri	https://ena.lpnu.ua/handle/ntb/113783
dc.language.iso	en
dc.publisher	Lviv Polytechnic Publishing House
dc.relation.ispartof	Mathematical Modeling and Computing, 11 (1), 2024
dc.rights.holder	© Lviv Polytechnic National University, 2024
dc.subject	cancer prediction
dc.subject	machine learning
dc.subject	hesitant fuzzy logic
dc.subject	MCDM
dc.title	Machine learning models selection under uncertainty: application in cancer prediction
dc.title.alternative	Вибір моделей машинного навчання в умовах невизначеності: застосування в прогнозуванні раку
dc.type	Article

Collections

Mathematical Modeling And Computing. – 2024. – Vol. 11, No. 1