Аналіз рішень та підходів кластеризації геопросторових даних для оптимізації продуктивності веб-карти та взаємодії користувача

Арзубов, М. В.; Батюк, А. Є.; Arzubov, M. V.; Batiuk, A. Y.

Аналіз рішень та підходів кластеризації геопросторових даних для оптимізації продуктивності веб-карти та взаємодії користувача

dc.citation.epage	96
dc.citation.issue	2
dc.citation.journalTitle	Український журнал інформаційних технологій
dc.citation.spage	88
dc.citation.volume	5
dc.contributor.affiliation	Національний університет “Львівська політехніка”
dc.contributor.affiliation	Lviv Polytechnic National University
dc.contributor.author	Арзубов, М. В.
dc.contributor.author	Батюк, А. Є.
dc.contributor.author	Arzubov, M. V.
dc.contributor.author	Batiuk, A. Y.
dc.coverage.placename	Львів
dc.coverage.placename	Lviv
dc.date.accessioned	2024-04-01T11:06:05Z
dc.date.available	2024-04-01T11:06:05Z
dc.date.created	2023-02-28
dc.date.issued	2023-02-28
dc.description.abstract	У сучасну епоху, управління та візуалізація геопросторової інформації у веб-браузерах набули більшого значення. Веб-карти є незамінними інструментами в різних областях, таких як туризм, доставка товарів чи екологія. Важливим є також широка підтримка веб-браузерів на різних пристроях, що робить використання геоданих у веб більш доступним для різних користувачів. Але постійне збільшення геопросторової інформації створює нові виклики у ефективному відображенні даних та навігації по цих даних на веб-картах. Тому при роботі з геоданими важливою є їх кластеризація. Різні методи кластеризації можуть по різному впливати на продуктивність чи візуальну зрозумілість веб-карт. В даній роботі проведено ґрунтовний огляд типів даних та методів кластеризації. Проаналізовано інструменти та бібліотеки, які спеціалізуються на кластеризації геоданих у веб-картах. Також досліджено різні типи геоданих і підходи при роботі з ними. Описано таке поняття як напівстатичні дані, і яке місце вони займають разом з статичними і динамічними типами даних. Під час аналізу виявлено у яких випадках краще використовувати певні методи кластеризації або коли варто застосовувати підхід з кластеризацією на серверній стороні. Також зроблено висновок про те який підхід варто обрати при роботі з великим обсягом статичних чи напівстатичних геоданих, а саме використання кластеризації на серверній стороні з кешуванням. В підсумку, проаналізовано різні підходи кластеризації у веб-картах як на клієнтській стороні, так і на серверній. Також описано переваги і недоліки обох підходів і коли краще той чи інший метод використовувати. Виявлено відсутність чітких підходів у кластеризації великих геоданих для зображення на веб-картах, що зумовлює актуальність досліджень у цьому напрямку.
dc.description.abstract	In the contemporary epoch, the management and visualization of geospatial information in web browsers have gained substantial importance. Web maps are indispensable tools across various tourism, goods delivery, and ecology sectors. Furthermore, the extensive support of web browsers on diverse devices enhances the accessibility of geospatial data on the web for various users. However, the incessant increment of geospatial information poses new challenges in efficiently displaying data and navigation through these data on web maps. Therefore, the clustering of geospatial data is crucial in dealing with them. Different clustering methods may affect the performance or visual clarity of web maps. To improve the user experience and optimize the use of computing resources, geodata clustering becomes a necessary tool for processing large volumes of markers on the map. Despite significant progress in the development of geodata clustering solutions in web maps, there are some challenges that developers and users may encounter. In this article, challenges with scaling, dynamic cluster data, and heterogeneity of data are described. Existing problems in geodata clustering in web maps require additional research and development. Understanding these issues will help developers and researchers improve existing solutions and create new methods and approaches for efficient clustering of geodata in web maps. The urgency of solving the problem lies in the search for effective clustering solutions that provide an opportunity to ensure convenient interactivity and fast processing of geodata in web maps. This study provides a comprehensive review of data types and clustering methods. Tools and libraries for geodata clustering in web maps are analyzed. Different types of geodata and approaches to working with them were also studied. Concepts such as semi-static data and their positions alongside static and dynamic data types are elucidated. Through the analysis, optimal scenarios for applying specific clustering methods or the utilization of server-side clustering approaches have been identified. Conclusions have also been drawn on the preferred approach when handling extensive volumes of static or semi-static geospatial data, particularly advocating for the application of server-side clustering with caching. In conclusion, various clustering approaches in web maps, both client-side and server-side, have been scrutinized. The advantages and disadvantages of both approaches, along with recommendations on when to apply each method, have been delineated. A noticeable absence of explicit approaches in clustering vast geospatial data for web map representation underpins the relevance and necessity of research in this direction.
dc.format.extent	88-96
dc.format.pages	9
dc.identifier.citation	Арзубов М. В. Аналіз рішень та підходів кластеризації геопросторових даних для оптимізації продуктивності веб-карти та взаємодії користувача / М. В. Арзубов, А. Є. Батюк // Український журнал інформаційних технологій. — Львів : Видавництво Львівської політехніки, 2023. — Том 5. — № 2. — С. 88–96.
dc.identifier.citationen	Arzubov M. V. Solutions and approaches analysis for geospatial data clustering to optimize performance and user experience of web maps / M. V. Arzubov, A. Y. Batiuk // Ukrainian Journal of Information Technology. — Lviv : Lviv Politechnic Publishing House, 2023. — Vol 5. — No 2. — P. 88–96.
dc.identifier.doi	doi.org/10.23939/ujit2023.02.088
dc.identifier.issn	2707-1898
dc.identifier.uri	https://ena.lpnu.ua/handle/ntb/61595
dc.language.iso	uk
dc.publisher	Видавництво Львівської політехніки
dc.publisher	Lviv Politechnic Publishing House
dc.relation.ispartof	Український журнал інформаційних технологій, 2 (5), 2023
dc.relation.ispartof	Ukrainian Journal of Information Technology, 2 (5), 2023
dc.relation.references	[1] Agarwal, S., & Rajan, K. S. (2016). Performance analysis of MongoDB versus PostGIS/PostGreSQL databases for line intersection and point containment spatial queries. Spat. Inf. Res., 24, 671 677. https://doi.org/10.1007/s41324-016-0059-1
dc.relation.references	[2] Amini, A., Wah, T. Y., Saybani, M. R., & Yazdi, S. R. A. S. (2011, July). A study of density-grid based clustering algorithms on data streams. In 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), vol. 3, 1652-1656. IEEE. https://doi.org/10.1109/FSKD.2011.6019867
dc.relation.references	[3] Ankerst, M., Breunig, M. M., Kriegel, H. P., & Sander, J. (1999). OPTICS: Ordering points to identify the clustering structure. ACM Sigmod record, 28(2), 49-60. https://doi.org/10.1145/304181.304187
dc.relation.references	[4] Cekule, M., Mitrofanovs, I., & Cabs, K. (2023). Information technology for real-time monitoring and visualization of load in urban public open spaces based on spatial and statistical data analyses of human behaviour. International Multidisciplinary Scientific GeoConference: SGEM, 23(2.1), 89-96. https://doi.org/10.5593/sgem2023/2.1/s07.12
dc.relation.references	[5] Choi, S., & Bae, B. (2015). The Real-Time Monitoring System of Social Big Data for Disaster Management. In: Park, J., Stojmenovic, I., Jeong, H., Yi, G. (eds) Computer Science and its Applications. Lecture Notes in Electrical Engineering, vol 330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45402-2_115
dc.relation.references	[6] Doroshenko A. (2020). Analysis of the Distribution of COVID-19 in Italy Using Clustering Algorithms, 2020 IEEE Third International Conference on Data Stream Mining & Processing (DSMP), Lviv, Ukraine, pp. 325-328. https://doi.org/10.1109/DSMP47368.2020.9204202
dc.relation.references	[7] ElHaj, K., Alshamsi, D. & Aldahan, A. (2023). GeoZ: a Region-Based Visualization of Clustering Algorithms. J geovis spat anal, 7, 15. https://doi.org/10.1007/s41651-023-00146-0
dc.relation.references	[8] Guo, D., & Onstein, E. (2020). State-of-the-Art Geospatial Information Processing in NoSQL Databases. ISPRS Int. J. Geo-Inf., 9, 331. https://doi.org/10.3390/ijgi9050331
dc.relation.references	[9] Hartigan, J. A., & Wong, M. A. (1979). Algorithm AS 136: A K-Means Clustering Algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), 28(1), 100 108. https://doi.org/10.2307/2346830
dc.relation.references	[10] Kramer, O. (2016). Scikit-Learn. In: Machine Learning for Evolution Strategies. Studies in Big Data, vol 20. Springer, Cham. https://doi.org/10.1007/978-3-319-33383-0_5
dc.relation.references	[11] Kulawiak, M., Dawidowicz, A., & Pacholczyk, M. E. (2019). Analysis of server-side and client-side Web-GIS data processing methods on the example of JTS and JSTS using open data from OSM and geoportal. Computers & Geosciences, 129, 26-37. 1 https://doi.org/10.1016/j.cageo.2019.04.011
dc.relation.references	[12] Laasasenaho, K., Lensu, A., Lauhanen, R., & Rintala, J. (2019). GIS-data related route optimization, hierarchical clustering, location optimization, and kernel density methods are useful for promoting distributed bioenergy plant planning in rural areas. Sustainable Energy Technologies and Assessments, 32, 47-57. https://doi.org/10.1016/j.seta.2019.01.006
dc.relation.references	[13] Levus, Ye. V., & Vasyliuk, R. B. (2022). Recommendation algorithm using data clustering. Ukrainian Journal of Information Technology, 4(2), 18-24. https://doi.org/10.23939/ujit2022.02.018
dc.relation.references	[14] Lytvyn, V., Uhryn, D., Ushenko, Y., Masikevych, A., & Bairachnyi, V. (2023). The Method of Clustering Geoinformation Data for Stationary Sectoral Geoinformation Systems Using Swarm Intelligence Methods. In: Cioboată, D.D. (eds) International Conference on Reliable Systems Engineering (ICoRSE) - 2023. ICoRSE 2023. Lecture Notes in Networks and Systems, vol 762. Springer, Cham. https://doi.org/10.1007/978-3-031-40628-7_44
dc.relation.references	[15] Muenchow, J., Schäfer, S., & Krüger, E. (2019). Reviewing qualitative GIS research-Toward a wider usage of open‐source GIS and reproducible research practices. Geography Compass, 13(6), e11. https://doi.org/10.1111/gec3.1
dc.relation.references	[16] Murtagh, F., & Contreras, P. (2012). Algorithms for hierarchical clustering: an overview. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2(1), 86-97. https://doi.org/10.1002/widm.53
dc.relation.references	[17] Netek, R., Brus, J., & Tomecka, O. (2019). Performance Testing on Marker Clustering and Heatmap Visualization Techniques: A Comparative Study on JavaScript Mapping Libraries. ISPRS Int. J. Geo-Inf., 8, 348. https://doi.org/10.3390/ijgi8080348
dc.relation.references	[18] Praene, J. P., Malet-Damour, B., Radanielina, M. H., Fontaine, L., & Riviere, G. (2019). GIS-based approach to identify climatic zoning: A hierarchical clustering on principal component analysis. Building and Environment, 164, 106330. https://doi.org/10.1016/j.buildenv.2019.106330
dc.relation.references	[19] Rezaei, M., & Franti, P. (2018). Real-time clustering of large Geo-referenced data for visualizing on map. Adv. Electr. Comput. En., 18(4), 63-74, Nov. 2018. https://doi.org/10.4316/AECE.2018.04008
dc.relation.references	[20] Schubert, E., Sander, J., Ester, M., Kriegel, H. P., & Xu, X. (2017). DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Transactions on Database Systems (TODS), 42(3), 1-21. https://doi.org/10.1145/3068335
dc.relation.references	[21] Yu, J., Wu, J., Sarwat, M. (2015). GeoSpark: a cluster computing framework for processing large-scale spatial data. In: Proceedings of the ACM SIGSPATIAL GIS, USA. https://doi.org/10.1145/2820783.2820860
dc.relation.referencesen	[1] Agarwal, S., & Rajan, K. S. (2016). Performance analysis of MongoDB versus PostGIS/PostGreSQL databases for line intersection and point containment spatial queries. Spat. Inf. Res., 24, 671 677. https://doi.org/10.1007/s41324-016-0059-1
dc.relation.referencesen	[2] Amini, A., Wah, T. Y., Saybani, M. R., & Yazdi, S. R. A. S. (2011, July). A study of density-grid based clustering algorithms on data streams. In 2011 Eighth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), vol. 3, 1652-1656. IEEE. https://doi.org/10.1109/FSKD.2011.6019867
dc.relation.referencesen	[3] Ankerst, M., Breunig, M. M., Kriegel, H. P., & Sander, J. (1999). OPTICS: Ordering points to identify the clustering structure. ACM Sigmod record, 28(2), 49-60. https://doi.org/10.1145/304181.304187
dc.relation.referencesen	[4] Cekule, M., Mitrofanovs, I., & Cabs, K. (2023). Information technology for real-time monitoring and visualization of load in urban public open spaces based on spatial and statistical data analyses of human behaviour. International Multidisciplinary Scientific GeoConference: SGEM, 23(2.1), 89-96. https://doi.org/10.5593/sgem2023/2.1/s07.12
dc.relation.referencesen	[5] Choi, S., & Bae, B. (2015). The Real-Time Monitoring System of Social Big Data for Disaster Management. In: Park, J., Stojmenovic, I., Jeong, H., Yi, G. (eds) Computer Science and its Applications. Lecture Notes in Electrical Engineering, vol 330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45402-2_115
dc.relation.referencesen	[6] Doroshenko A. (2020). Analysis of the Distribution of COVID-19 in Italy Using Clustering Algorithms, 2020 IEEE Third International Conference on Data Stream Mining & Processing (DSMP), Lviv, Ukraine, pp. 325-328. https://doi.org/10.1109/DSMP47368.2020.9204202
dc.relation.referencesen	[7] ElHaj, K., Alshamsi, D. & Aldahan, A. (2023). GeoZ: a Region-Based Visualization of Clustering Algorithms. J geovis spat anal, 7, 15. https://doi.org/10.1007/s41651-023-00146-0
dc.relation.referencesen	[8] Guo, D., & Onstein, E. (2020). State-of-the-Art Geospatial Information Processing in NoSQL Databases. ISPRS Int. J. Geo-Inf., 9, 331. https://doi.org/10.3390/ijgi9050331
dc.relation.referencesen	[9] Hartigan, J. A., & Wong, M. A. (1979). Algorithm AS 136: A K-Means Clustering Algorithm. Journal of the Royal Statistical Society. Series C (Applied Statistics), 28(1), 100 108. https://doi.org/10.2307/2346830
dc.relation.referencesen	[10] Kramer, O. (2016). Scikit-Learn. In: Machine Learning for Evolution Strategies. Studies in Big Data, vol 20. Springer, Cham. https://doi.org/10.1007/978-3-319-33383-0_5
dc.relation.referencesen	[11] Kulawiak, M., Dawidowicz, A., & Pacholczyk, M. E. (2019). Analysis of server-side and client-side Web-GIS data processing methods on the example of JTS and JSTS using open data from OSM and geoportal. Computers & Geosciences, 129, 26-37. 1 https://doi.org/10.1016/j.cageo.2019.04.011
dc.relation.referencesen	[12] Laasasenaho, K., Lensu, A., Lauhanen, R., & Rintala, J. (2019). GIS-data related route optimization, hierarchical clustering, location optimization, and kernel density methods are useful for promoting distributed bioenergy plant planning in rural areas. Sustainable Energy Technologies and Assessments, 32, 47-57. https://doi.org/10.1016/j.seta.2019.01.006
dc.relation.referencesen	[13] Levus, Ye. V., & Vasyliuk, R. B. (2022). Recommendation algorithm using data clustering. Ukrainian Journal of Information Technology, 4(2), 18-24. https://doi.org/10.23939/ujit2022.02.018
dc.relation.referencesen	[14] Lytvyn, V., Uhryn, D., Ushenko, Y., Masikevych, A., & Bairachnyi, V. (2023). The Method of Clustering Geoinformation Data for Stationary Sectoral Geoinformation Systems Using Swarm Intelligence Methods. In: Cioboată, D.D. (eds) International Conference on Reliable Systems Engineering (ICoRSE) - 2023. ICoRSE 2023. Lecture Notes in Networks and Systems, vol 762. Springer, Cham. https://doi.org/10.1007/978-3-031-40628-7_44
dc.relation.referencesen	[15] Muenchow, J., Schäfer, S., & Krüger, E. (2019). Reviewing qualitative GIS research-Toward a wider usage of open‐source GIS and reproducible research practices. Geography Compass, 13(6), e11. https://doi.org/10.1111/gec3.1
dc.relation.referencesen	[16] Murtagh, F., & Contreras, P. (2012). Algorithms for hierarchical clustering: an overview. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2(1), 86-97. https://doi.org/10.1002/widm.53
dc.relation.referencesen	[17] Netek, R., Brus, J., & Tomecka, O. (2019). Performance Testing on Marker Clustering and Heatmap Visualization Techniques: A Comparative Study on JavaScript Mapping Libraries. ISPRS Int. J. Geo-Inf., 8, 348. https://doi.org/10.3390/ijgi8080348
dc.relation.referencesen	[18] Praene, J. P., Malet-Damour, B., Radanielina, M. H., Fontaine, L., & Riviere, G. (2019). GIS-based approach to identify climatic zoning: A hierarchical clustering on principal component analysis. Building and Environment, 164, 106330. https://doi.org/10.1016/j.buildenv.2019.106330
dc.relation.referencesen	[19] Rezaei, M., & Franti, P. (2018). Real-time clustering of large Geo-referenced data for visualizing on map. Adv. Electr. Comput. En., 18(4), 63-74, Nov. 2018. https://doi.org/10.4316/AECE.2018.04008
dc.relation.referencesen	[20] Schubert, E., Sander, J., Ester, M., Kriegel, H. P., & Xu, X. (2017). DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Transactions on Database Systems (TODS), 42(3), 1-21. https://doi.org/10.1145/3068335
dc.relation.referencesen	[21] Yu, J., Wu, J., Sarwat, M. (2015). GeoSpark: a cluster computing framework for processing large-scale spatial data. In: Proceedings of the ACM SIGSPATIAL GIS, USA. https://doi.org/10.1145/2820783.2820860
dc.relation.uri	https://doi.org/10.1007/s41324-016-0059-1
dc.relation.uri	https://doi.org/10.1109/FSKD.2011.6019867
dc.relation.uri	https://doi.org/10.1145/304181.304187
dc.relation.uri	https://doi.org/10.5593/sgem2023/2.1/s07.12
dc.relation.uri	https://doi.org/10.1007/978-3-662-45402-2_115
dc.relation.uri	https://doi.org/10.1109/DSMP47368.2020.9204202
dc.relation.uri	https://doi.org/10.1007/s41651-023-00146-0
dc.relation.uri	https://doi.org/10.3390/ijgi9050331
dc.relation.uri	https://doi.org/10.2307/2346830
dc.relation.uri	https://doi.org/10.1007/978-3-319-33383-0_5
dc.relation.uri	https://doi.org/10.1016/j.cageo.2019.04.011
dc.relation.uri	https://doi.org/10.1016/j.seta.2019.01.006
dc.relation.uri	https://doi.org/10.23939/ujit2022.02.018
dc.relation.uri	https://doi.org/10.1007/978-3-031-40628-7_44
dc.relation.uri	https://doi.org/10.1111/gec3.1
dc.relation.uri	https://doi.org/10.1002/widm.53
dc.relation.uri	https://doi.org/10.3390/ijgi8080348
dc.relation.uri	https://doi.org/10.1016/j.buildenv.2019.106330
dc.relation.uri	https://doi.org/10.4316/AECE.2018.04008
dc.relation.uri	https://doi.org/10.1145/3068335
dc.relation.uri	https://doi.org/10.1145/2820783.2820860
dc.rights.holder	© Національний університет “Львівська політехніка”, 2023
dc.subject	геодані
dc.subject	геопросторові
dc.subject	великі геодані
dc.subject	статичні геодані
dc.subject	напівстатичні геодані
dc.subject	динамічні геодані
dc.subject	кластеризація
dc.subject	вебкарти
dc.subject	geodata
dc.subject	geospatial
dc.subject	big geodata
dc.subject	static geodata
dc.subject	semi-static geodata
dc.subject	dynamic geodata
dc.subject	clustering
dc.subject	web map
dc.subject.udc	004.75/.62
dc.title	Аналіз рішень та підходів кластеризації геопросторових даних для оптимізації продуктивності веб-карти та взаємодії користувача
dc.title.alternative	Solutions and approaches analysis for geospatial data clustering to optimize performance and user experience of web maps
dc.type	Article

Files

Original bundle

Now showing 1 - 2 of 2

Name:: 2023v5n2_Arzubov_M_V-Solutions_and_approaches_88-96.pdf
Size:: 2.22 MB
Format:: Adobe Portable Document Format

Download

Name:: 2023v5n2_Arzubov_M_V-Solutions_and_approaches_88-96__COVER.png
Size:: 1.63 MB
Format:: Portable Network Graphics

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.8 KB
Format:: Plain Text
Description:

Download

Collections

Ukrainian Journal of Information Technology. – 2023. – Vol. 5, No. 2