Client-server system for parsing data from web pages

dc.citation.epage14
dc.citation.issue1
dc.citation.spage8
dc.contributor.affiliationLviv Polytechnic National University
dc.contributor.affiliationMajmaah University
dc.contributor.affiliationLviv State University of Life Safety
dc.contributor.authorBritvin, Artur
dc.contributor.authorAlrawashdeh, Jawad Hammad
dc.contributor.authorTkachuck, Rostyslav
dc.coverage.placenameЛьвів
dc.coverage.placenameLviv
dc.date.accessioned2023-04-21T08:27:15Z
dc.date.available2023-04-21T08:27:15Z
dc.date.created2022-06-06
dc.date.issued2022-06-06
dc.description.abstractAn overview of the basic principles and approaches for extracting information and processing information from web pages has been conducted. A methodology for developing a client-server system based on a tool for automation of work in Selenium web browsers based on the analyzed information about data parsing has been created. A third-party API as a user interface to simplify and speed up system development has been used. User access without downloading additional software has been enabled. Data from web pages have been received and processed. Development has been based on this methodology of its own client-server system, which is used to parse and collect the information presented on web pages. Analysis of cloud technology services for further deployment of data collection system from web pages has been carried out. Assessment and analysis of the viability of the system in an autonomous state have been deployed in the cloud service during long-term operation.
dc.format.extent8-14
dc.format.pages7
dc.identifier.citationBritvin A. Client-server system for parsing data from web pages / Artur Britvin, Jawad Hammad Alrawashdeh, Rostyslav Tkachuck // Advances in Cyber-Physical Systems. — Lviv : Lviv Politechnic Publishing House, 2022. — Vol 7. — No 1. — P. 8–14.
dc.identifier.citationenBritvin A., Alrawashdeh J. H., Tkachuck R. (2022) Client-server system for parsing data from web pages. Advances in Cyber-Physical Systems (Lviv), vol. 7, no 1, pp. 8-14.
dc.identifier.doihttps://doi.org/10.23939/acps2022.01.008
dc.identifier.urihttps://ena.lpnu.ua/handle/ntb/57967
dc.language.isoen
dc.publisherВидавництво Львівської політехніки
dc.publisherLviv Politechnic Publishing House
dc.relation.ispartofAdvances in Cyber-Physical Systems, 1 (7), 2022
dc.relation.references[1] Cukier, K. (2017). Big data : a revolution that will transform how we live, work and think. London: John Murray, 280 p.
dc.relation.references[2] O’neil, C. and Schutt, R. (2013). Doing data science. Beijing. Cambridge: O’reilly, 510 p.
dc.relation.references[3] Sweigart, A. (2020). Automate the boring stuff with Python : practical programming for total beginners. San Francisco, Calif. No Starch Press, 357 p.
dc.relation.references[4] Selenium. (n.d.). The Selenium Browser Automation Project. [online]. Available at: https://selenium.dev/documentation/.
dc.relation.references[5] Espinosa-Leal, L. (2018). Special issue of Big Data Research Journal on “Big Data and Neural Networks.”. Big Data Research, 11, pp. 120–130.
dc.relation.references[6] Williamson, E.P. (2017). Fetching and Parsing Data from the Web with OpenRefine. The Programming Historian, pp. 6–15.
dc.relation.references[7] Holden, G. (2016). Big Data and R&D Management. ResearchTechnology Management, 59(5), pp. 22–26. DOI:10.1080/08956308.2016.1208044
dc.relation.references[8] Kumar, S. and Singh, M. (2019). Big data analytics for healthcare industry: impact, applications, and tools. Big Data Mining and Analytics, 2(1), pp. 48–57.
dc.relation.references[9] Gardner, F.M. (1998). HTML Sourcebook: A Complete Guide To HTML 3.2 And HTML Extensions [Book Reviews]. IEEE Communications Magazine, 36(6), pp. 26–28. DOI: 10.1109/MCOM.1998.685344
dc.relation.references[10] Varshith, K. (2020). Software Virtualization using Containers in Google Cloud Platform. International Journal of Innovative Technology and Exploring Engineering, 9(4), pp. 802–804.
dc.relation.references[11] Itglobal.com. (n.d.). Amazon Web Services (AWS): platform responsibilities. [online] Available at: https://itglobal.com/ruru/company/glossary/amazon-web-services [Accessed 7 Nov. 2021].
dc.relation.references[12] Hamed, P.K. and Preece, A.S. (2020). Google Cloud Platform Adoption for Teaching in HEIs: A Qualitative Approach. OALib, 07(11), pp. 1–23. DOI: 10.4236/oalib.1106819
dc.relation.referencesen[1] Cukier, K. (2017). Big data : a revolution that will transform how we live, work and think. London: John Murray, 280 p.
dc.relation.referencesen[2] O’neil, C. and Schutt, R. (2013). Doing data science. Beijing. Cambridge: O’reilly, 510 p.
dc.relation.referencesen[3] Sweigart, A. (2020). Automate the boring stuff with Python : practical programming for total beginners. San Francisco, Calif. No Starch Press, 357 p.
dc.relation.referencesen[4] Selenium. (n.d.). The Selenium Browser Automation Project. [online]. Available at: https://selenium.dev/documentation/.
dc.relation.referencesen[5] Espinosa-Leal, L. (2018). Special issue of Big Data Research Journal on "Big Data and Neural Networks.". Big Data Research, 11, pp. 120–130.
dc.relation.referencesen[6] Williamson, E.P. (2017). Fetching and Parsing Data from the Web with OpenRefine. The Programming Historian, pp. 6–15.
dc.relation.referencesen[7] Holden, G. (2016). Big Data and R&D Management. ResearchTechnology Management, 59(5), pp. 22–26. DOI:10.1080/08956308.2016.1208044
dc.relation.referencesen[8] Kumar, S. and Singh, M. (2019). Big data analytics for healthcare industry: impact, applications, and tools. Big Data Mining and Analytics, 2(1), pp. 48–57.
dc.relation.referencesen[9] Gardner, F.M. (1998). HTML Sourcebook: A Complete Guide To HTML 3.2 And HTML Extensions [Book Reviews]. IEEE Communications Magazine, 36(6), pp. 26–28. DOI: 10.1109/MCOM.1998.685344
dc.relation.referencesen[10] Varshith, K. (2020). Software Virtualization using Containers in Google Cloud Platform. International Journal of Innovative Technology and Exploring Engineering, 9(4), pp. 802–804.
dc.relation.referencesen[11] Itglobal.com. (n.d.). Amazon Web Services (AWS): platform responsibilities. [online] Available at: https://itglobal.com/ruru/company/glossary/amazon-web-services [Accessed 7 Nov. 2021].
dc.relation.referencesen[12] Hamed, P.K. and Preece, A.S. (2020). Google Cloud Platform Adoption for Teaching in HEIs: A Qualitative Approach. OALib, 07(11), pp. 1–23. DOI: 10.4236/oalib.1106819
dc.relation.urihttps://selenium.dev/documentation/
dc.relation.urihttps://itglobal.com/ruru/company/glossary/amazon-web-services
dc.rights.holder© Національний університет „Львівська політехніка“, 2022
dc.rights.holder© Britvin A., Alrawashdeh J. H., Tkachuk R., 2022
dc.subjectweb page analysis
dc.subjectbig data
dc.subjectclient-server model
dc.subjectSelenium
dc.subjectPython
dc.titleClient-server system for parsing data from web pages
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
2022v7n1_Britvin_A-Client_server_system_for_parsing_8-14.pdf
Size:
269.44 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.77 KB
Format:
Plain Text
Description: