Рекомендаційний алгоритм із використанням кластеризації даних

Simple item page

dc.citation.epage	24
dc.citation.issue	2
dc.citation.journalTitle	Український журнал інформаційних технологій
dc.citation.spage	18
dc.citation.volume	4
dc.contributor.affiliation	Національний університет “Львівська політехніка”
dc.contributor.affiliation	Lviv Polytechnic National University
dc.contributor.author	Левус, Є. В.
dc.contributor.author	Василюк, Р. Б.
dc.contributor.author	Levus, Ye. V.
dc.contributor.author	Vasyliuk, R. B.
dc.coverage.placename	Львів
dc.coverage.placename	Lviv
dc.date.accessioned	2024-03-27T08:56:55Z
dc.date.available	2024-03-27T08:56:55Z
dc.date.created	2022-02-28
dc.date.issued	2022-02-28
dc.description.abstract	Розроблено рекомендаційний алгоритм для підвищення якості надання рекомендацій та врахування проблем розрідженості даних і холодного старту, який враховує удосконалення відомих методів колаборативної фільтрації із використанням кластеризації. З'ясовано, що рекомендаційні системи створюють для швидкого знаходження товарів чи послуг в мережі Інтернет, надаючи пропозиції, які точно відповідають інтересам користувачів. Незважаючи на інтенсивний розвиток алгоритмів рекомендаційних систем та незалежно від доменів їх використання (електронна комерція, розваги, послуги, новини, соціальні мережі тощо), актуальними залишаються питання підвищення якості наданих рекомендацій, збільшення швидкодії їх вироблення, масштабованості, забезпечення стійкості в умовах розрідженості даних, холодного старту. Модифіковано алгоритм колаборативної фільтрації, який можна використати для вироблення рекомендацій користувачам системами закупівлі товарів чи надання послуг. Виявлено, що завдання пошуку схожих користувачів за вподобанням вирішують з використанням кластеризації. Поділ користувачів на кластери відбувається за допомогою алгоритму k-середніх із динамічним пошуком оптимальної кількості кластерів і початкових центроїдів. Запропонований рекомендаційний алгоритм надає релевантні рекомендації та працює ефективно за різної кількості вхідних даних. Кластеризація дає змогу алгоритму бути масштабованим і працювати із великою кількістю користувачів системи. Практична реалізація модифікованого рекомендаційного алгоритму здійснена для системи підбору кінофільмів. Наукова новизна отриманих результатів дослідження полягає у розвитку методу колаборативної фільтрації на підставі використання кластеризації із динамічним визначенням кількості кластерів і початкових центроїдів для ідентифікації груп подібних користувачів. Для верифікації результатів модифікований алгоритм було порівняно із іншими наявними імплементаціями – з алгоритмом, заснованим на пам'яті, та алгоритмом, заснованим на сусідстві. Запропонований алгоритм має кращі результати на 25-40 % для проведених тестів. Модифікований рекомендаційний алгоритм не є прив'язаним до певної предметної області, тому його можна інтегрувати в програмні системи різних доменів.
dc.description.abstract	Recommender systems play a vital role in the marketing of various goods and services. Despite the intensive growth of the theory of recommendation algorithms and a large number of their implementations, many issues remain unresolved; in particular, scalability, quality of recommendations in conditions of sparse data, and cold start. A modified collaborative filtering algorithm based on data clustering with the dynamic determination of the number of clusters and initial centroids has been developed. Data clustering is performed using the k-means method and is applied to group similar users aimed at increase of the quality of the recommendation results. The number of clusters is calculated dynamically using the silhouette method, the determination of the initial centroids is not random, but relies on the number of clusters. This approach increases the performance of the recommender system and increases the accuracy of recommendations since the search for recommendations will be carried out within one cluster where all elements are already similar. Recommendation algorithms are software-implemented for the movie recommendation system. The software implementation of various methods that allow the user to receive a recommendation for a movie meeting their preferences is carried out: a modified algorithm, memory and neighborhood-based collaborative filtering methods. The results obtained for input data of 100, 500 and 2500 users under typical conditions, data sparsity and cold start were analyzed. The modified algorithm shows the best results – from 35 to 80 percent of recommendations that meet the user's expectations. The drop in the quality of recommendations for the modified algorithm is less than 10 per cent when the number of users increases from 100 to 2500, which indicates a good level of scalability of the developed solution. In the case of sparse data (40 percent of information is missing), the quality of recommendations is 60 percent. A low quality (35 percent) of recommendations was obtained in the case of a cold start – this case needs further investigation. Constructed algorithms can be used in rating recommender systems with the ability to calculate averaged scores for certain attributes. The modified recommendation algorithm is not tied to this subject area and can be integrated into other software systems.
dc.format.extent	18-24
dc.format.pages	7
dc.identifier.citation	Левус Є. В. Рекомендаційний алгоритм із використанням кластеризації даних / Є. В. Левус, Р. Б. Василюк // Український журнал інформаційних технологій. — Львів : Видавництво Львівської політехніки, 2022. — Том 4. — № 2. — С. 18–24.
dc.identifier.citationen	Levus Ye. V. Recommendation algorithm using data clustering / Ye. V. Levus, R. B. Vasyliuk // Ukrainian Journal of Information Technology. — Lviv : Lviv Politechnic Publishing House, 2022. — Vol 4. — No 2. — P. 18–24.
dc.identifier.issn	2707-1898
dc.identifier.uri	https://ena.lpnu.ua/handle/ntb/61551
dc.language.iso	uk
dc.publisher	Видавництво Львівської політехніки
dc.publisher	Lviv Politechnic Publishing House
dc.relation.ispartof	Український журнал інформаційних технологій, 2 (4), 2022
dc.relation.ispartof	Ukrainian Journal of Information Technology, 2 (4), 2022
dc.relation.references	[1] Ahuja, R., Chug, A., Gupta, S., Ahuja, P., & Kohli, S. (2020). Classification and Clustering Algorithms of Machine Learning with their Applications. In: Yang, XS., He, XS. (eds) Nature-Inspired Computation in Data Mining and Machine Learning. Studies in Computational Intelligence, 855. Springer, Cham. https://doi.org/10.1007/978-3-030-28553-1_11
dc.relation.references	[2] Bansal, S., & Baliyan, N. (2019). A Study of Recent Recommender System Techniques. International Journal of Knowledge and Systems Science (IJKSS), 10(2), 13-41. http://doi.org/10.4018/IJKSS.2019040102
dc.relation.references	[3] Brinton, C., & Chiang, M. (2019). Netflix Recommendation System. Retrieved from: https://www.coursera.org/lecture/networks-illustrated/netflixrecommendation-system-TYOZV
dc.relation.references	[4] Capóa, M., Péreza, A., & Lozano, J. A. (2017). An efficient approximation to the K-means clustering for massive data. Knowledge-Based Systems, 117, 56-69. https://doi.org/10.1016/j.knosys.2016.06.031
dc.relation.references	[5] Das, J., Mukherjee, P., Majumder, S., & Gupta, Pr. (2014). Clustering-Based Recommender System Using Principles of Voting Theory. Proceedings of 2014 International Conference on Contemporary Computing and Informatics, IC3I 2014. http://doi.org/10.1109/IC3I.2014.7019655
dc.relation.references	[6] Dinh, D. T., Fujinami, T., & Huynh, V. N. (2019). Estimating the Optimal Number of Clusters in Categorical Data Clustering by Silhouette Coefficient. In: Chen, J., Huynh, V., Nguyen, GN., Tang, X. (Eds) Knowledge and Systems Sciences. KSS 2019. Communications in Computer and Information Science, 1103. Springer, Singapore. https://doi.org/10.1007/978-981-15-1209-4_1
dc.relation.references	[7] Gope, J., & Jain, S. K. (2017). A survey on solving cold start problem in recommender systems. 2017 International Conference on Computing. Communication and Automation (ICCCA). https://doi.org/10.1109/ ccaa.2017.8229786
dc.relation.references	[8] Hongzhi, Y., Qinyong, W., Kai, Zh., Zhixu, Li, & Xiaofang, Zh. (2019). Overcoming Data Sparsity in Group Recommendation. IEEE Transactions On Knowledge And Data Engineering, 87-120.
dc.relation.references	[9] How Netflixs Recommendations System Works. Retrieved from: https://help.netflix.com/uk/node/100639
dc.relation.references	[10] Huang, Z., Chung, W., & Chen, H. (2003). A Graph Model for E-Commerce Recommender Systems. Journal of the American Society for Information Science & Technology, 3–21. https://doi.org/10.1002/asi.10372
dc.relation.references	[11] Ko, H., Lee, S., Park, Y., & Choi, A. (2022) A Survey of Recommendation Systems: Recommendation Models, Techniques, and Application Fields. Electronics, 22(11), 141. https://doi.org/10.3390/electronics11010141
dc.relation.references	[12] Koutrika G. (2018). Modern Recommender Systems: from Computing Matrices to Thinking with Neurons. Proceedings of the Management of Data, 1651–1654. https://doi.org/10.1145/3183713.3197389
dc.relation.references	[13] Levus, Ye. V., & Polianska, A. O. (2020). Algorithm for developing a complex recommendation through the example of the tourism industry. Scientific Bulletin of UNFU, 30(5), 122–127. https://doi.org/10.36930/40300520
dc.relation.references	[14] Lika, B., Kolomvatsos, K., Hadjiefthymiades, St. (2014). Facing the cold start problem in recommender systems. Expert Systems with Applications, 2065–2073. https://doi.org/10.1016/j.eswa.2013.09.005
dc.relation.references	[15] Lobur, M., Shvarts, M., & Stekh, Y. (2018). Application of recommender systems in the design of complex microsystem devices. International Journal of Advanced Research in Computer Engineering & Technology, 7(9), 709–714.
dc.relation.references	[16] Lytvyn, V., Vysotska, V., Shatskykh, V., Kohut, I., Petruchenko, O., Dzyubyk, L., Bobrivetc, V., Panasyuk, V., Sachenko, S., & Komar, M. (2019). Design of a recommendation system based on collaborative filtering and machine learning considering personal needs of the user. Eastern-European Journal of Enterprise Technologies, 4(2)(100), 6–28. https://doi.org/10.15587/1729-4061.2019.175507
dc.relation.references	[17] Miranda, L., Viterbo, J., Bernardini, F. (2020). Towards the Use of Clustering Algorithms in Recommender Systems. AMCIS 2020 Proceedings. 21. Retrieved from: https://aisel.aisnet.org/amcis2020/ai_semantic_for_intelligent_info_systems/ai_semantic_for_intelligent_info_systems/21
dc.relation.references	[18] Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65. https://doi.org/10.1016/0377-0427(87)90125-7
dc.relation.references	[19] Schwarz, M., Lobur, M., Stekh, Y. (2017). Analysis of the effectiveness of similarity measures for recommender systems. The experience of designing and application of CAD systems in microelectronics (CADSM): proceedings 14th International conference, 275-277.
dc.relation.references	[20] Shutaywi, M., & Kachouie, N. N. (2021). Silhouette Analysis for Performance Evaluation in Machine Learning with Applications to Clustering. Entropy, 23, 759. https://doi.org/10.3390/e23060759
dc.relation.references	[21] Syakur1, M. A., Khotimah1, B. K., Rochman1, E. M., Satoto, B. D. (2018). Integration K-Means Clustering Method and Elbow Method For Identification of the Best Customer Profile Cluster. IOP Conference Series Materials Science and Engineering, 2–5. https://doi.org/10.1088/1757-899X/336/1/012017
dc.relation.references	[22] Zhang, Q., Lu, J., & Jin, Y. (2021). Artificial intelligence in recommender systems. Complex Intell. Syst., 7, 439–457. https://doi.org/10.1007/s40747-020-00212-w
dc.relation.referencesen	[1] Ahuja, R., Chug, A., Gupta, S., Ahuja, P., & Kohli, S. (2020). Classification and Clustering Algorithms of Machine Learning with their Applications. In: Yang, XS., He, XS. (eds) Nature-Inspired Computation in Data Mining and Machine Learning. Studies in Computational Intelligence, 855. Springer, Cham. https://doi.org/10.1007/978-3-030-28553-1_11
dc.relation.referencesen	[2] Bansal, S., & Baliyan, N. (2019). A Study of Recent Recommender System Techniques. International Journal of Knowledge and Systems Science (IJKSS), 10(2), 13-41. http://doi.org/10.4018/IJKSS.2019040102
dc.relation.referencesen	[3] Brinton, C., & Chiang, M. (2019). Netflix Recommendation System. Retrieved from: https://www.coursera.org/lecture/networks-illustrated/netflixrecommendation-system-TYOZV
dc.relation.referencesen	[4] Capóa, M., Péreza, A., & Lozano, J. A. (2017). An efficient approximation to the K-means clustering for massive data. Knowledge-Based Systems, 117, 56-69. https://doi.org/10.1016/j.knosys.2016.06.031
dc.relation.referencesen	[5] Das, J., Mukherjee, P., Majumder, S., & Gupta, Pr. (2014). Clustering-Based Recommender System Using Principles of Voting Theory. Proceedings of 2014 International Conference on Contemporary Computing and Informatics, IC3I 2014. http://doi.org/10.1109/IC3I.2014.7019655
dc.relation.referencesen	[6] Dinh, D. T., Fujinami, T., & Huynh, V. N. (2019). Estimating the Optimal Number of Clusters in Categorical Data Clustering by Silhouette Coefficient. In: Chen, J., Huynh, V., Nguyen, GN., Tang, X. (Eds) Knowledge and Systems Sciences. KSS 2019. Communications in Computer and Information Science, 1103. Springer, Singapore. https://doi.org/10.1007/978-981-15-1209-4_1
dc.relation.referencesen	[7] Gope, J., & Jain, S. K. (2017). A survey on solving cold start problem in recommender systems. 2017 International Conference on Computing. Communication and Automation (ICCCA). https://doi.org/10.1109/ ccaa.2017.8229786
dc.relation.referencesen	[8] Hongzhi, Y., Qinyong, W., Kai, Zh., Zhixu, Li, & Xiaofang, Zh. (2019). Overcoming Data Sparsity in Group Recommendation. IEEE Transactions On Knowledge And Data Engineering, 87-120.
dc.relation.referencesen	[9] How Netflixs Recommendations System Works. Retrieved from: https://help.netflix.com/uk/node/100639
dc.relation.referencesen	[10] Huang, Z., Chung, W., & Chen, H. (2003). A Graph Model for E-Commerce Recommender Systems. Journal of the American Society for Information Science & Technology, 3–21. https://doi.org/10.1002/asi.10372
dc.relation.referencesen	[11] Ko, H., Lee, S., Park, Y., & Choi, A. (2022) A Survey of Recommendation Systems: Recommendation Models, Techniques, and Application Fields. Electronics, 22(11), 141. https://doi.org/10.3390/electronics11010141
dc.relation.referencesen	[12] Koutrika G. (2018). Modern Recommender Systems: from Computing Matrices to Thinking with Neurons. Proceedings of the Management of Data, 1651–1654. https://doi.org/10.1145/3183713.3197389
dc.relation.referencesen	[13] Levus, Ye. V., & Polianska, A. O. (2020). Algorithm for developing a complex recommendation through the example of the tourism industry. Scientific Bulletin of UNFU, 30(5), 122–127. https://doi.org/10.36930/40300520
dc.relation.referencesen	[14] Lika, B., Kolomvatsos, K., Hadjiefthymiades, St. (2014). Facing the cold start problem in recommender systems. Expert Systems with Applications, 2065–2073. https://doi.org/10.1016/j.eswa.2013.09.005
dc.relation.referencesen	[15] Lobur, M., Shvarts, M., & Stekh, Y. (2018). Application of recommender systems in the design of complex microsystem devices. International Journal of Advanced Research in Computer Engineering & Technology, 7(9), 709–714.
dc.relation.referencesen	[16] Lytvyn, V., Vysotska, V., Shatskykh, V., Kohut, I., Petruchenko, O., Dzyubyk, L., Bobrivetc, V., Panasyuk, V., Sachenko, S., & Komar, M. (2019). Design of a recommendation system based on collaborative filtering and machine learning considering personal needs of the user. Eastern-European Journal of Enterprise Technologies, 4(2)(100), 6–28. https://doi.org/10.15587/1729-4061.2019.175507
dc.relation.referencesen	[17] Miranda, L., Viterbo, J., Bernardini, F. (2020). Towards the Use of Clustering Algorithms in Recommender Systems. AMCIS 2020 Proceedings. 21. Retrieved from: https://aisel.aisnet.org/amcis2020/ai_semantic_for_intelligent_info_systems/ai_semantic_for_intelligent_info_systems/21
dc.relation.referencesen	[18] Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20, 53–65. https://doi.org/10.1016/0377-0427(87)90125-7
dc.relation.referencesen	[19] Schwarz, M., Lobur, M., Stekh, Y. (2017). Analysis of the effectiveness of similarity measures for recommender systems. The experience of designing and application of CAD systems in microelectronics (CADSM): proceedings 14th International conference, 275-277.
dc.relation.referencesen	[20] Shutaywi, M., & Kachouie, N. N. (2021). Silhouette Analysis for Performance Evaluation in Machine Learning with Applications to Clustering. Entropy, 23, 759. https://doi.org/10.3390/e23060759
dc.relation.referencesen	[21] Syakur1, M. A., Khotimah1, B. K., Rochman1, E. M., Satoto, B. D. (2018). Integration K-Means Clustering Method and Elbow Method For Identification of the Best Customer Profile Cluster. IOP Conference Series Materials Science and Engineering, 2–5. https://doi.org/10.1088/1757-899X/336/1/012017
dc.relation.referencesen	[22] Zhang, Q., Lu, J., & Jin, Y. (2021). Artificial intelligence in recommender systems. Complex Intell. Syst., 7, 439–457. https://doi.org/10.1007/s40747-020-00212-w
dc.relation.uri	https://doi.org/10.1007/978-3-030-28553-1_11
dc.relation.uri	http://doi.org/10.4018/IJKSS.2019040102
dc.relation.uri	https://www.coursera.org/lecture/networks-illustrated/netflixrecommendation-system-TYOZV
dc.relation.uri	https://doi.org/10.1016/j.knosys.2016.06.031
dc.relation.uri	http://doi.org/10.1109/IC3I.2014.7019655
dc.relation.uri	https://doi.org/10.1007/978-981-15-1209-4_1
dc.relation.uri	https://doi.org/10.1109/
dc.relation.uri	https://help.netflix.com/uk/node/100639
dc.relation.uri	https://doi.org/10.1002/asi.10372
dc.relation.uri	https://doi.org/10.3390/electronics11010141
dc.relation.uri	https://doi.org/10.1145/3183713.3197389
dc.relation.uri	https://doi.org/10.36930/40300520
dc.relation.uri	https://doi.org/10.1016/j.eswa.2013.09.005
dc.relation.uri	https://doi.org/10.15587/1729-4061.2019.175507
dc.relation.uri	https://aisel.aisnet.org/amcis2020/ai_semantic_for_intelligent_info_systems/ai_semantic_for_intelligent_info_systems/21
dc.relation.uri	https://doi.org/10.1016/0377-0427(87)90125-7
dc.relation.uri	https://doi.org/10.3390/e23060759
dc.relation.uri	https://doi.org/10.1088/1757-899X/336/1/012017
dc.relation.uri	https://doi.org/10.1007/s40747-020-00212-w
dc.rights.holder	© Національний університет “Львівська політехніка”, 2022
dc.subject	центроїди
dc.subject	кластери
dc.subject	коефіцієнт подібності
dc.subject	розрідженість даних
dc.subject	холодний старт
dc.subject	similarity coefficient
dc.subject	centroid
dc.subject	cluster
dc.subject	data sparsity
dc.subject	cold start
dc.title	Рекомендаційний алгоритм із використанням кластеризації даних
dc.title.alternative	Recommendation algorithm using data clustering
dc.type	Article

Files

Original bundle

Now showing 1 - 2 of 2

Name:: 2022v4n2_Levus_Ye_V-Recommendation_algorithm_18-24.pdf
Size:: 1.82 MB
Format:: Adobe Portable Document Format

Download

Name:: 2022v4n2_Levus_Ye_V-Recommendation_algorithm_18-24__COVER.png
Size:: 1.88 MB
Format:: Portable Network Graphics

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.8 KB
Format:: Plain Text
Description:

Download

Collections

Ukrainian Journal of Information Technology. – 2022. – Vol. 4, No. 2