Client-server system for parsing data from web pages
Loading...
Date
2022-06-06
Journal Title
Journal ISSN
Volume Title
Publisher
Видавництво Львівської політехніки
Lviv Politechnic Publishing House
Lviv Politechnic Publishing House
Abstract
An overview of the basic principles and
approaches for extracting information and processing
information from web pages has been conducted. A
methodology for developing a client-server system based on
a tool for automation of work in Selenium web browsers
based on the analyzed information about data parsing has
been created. A third-party API as a user interface to
simplify and speed up system development has been used.
User access without downloading additional software has
been enabled. Data from web pages have been received and
processed. Development has been based on this methodology
of its own client-server system, which is used to parse and
collect the information presented on web pages. Analysis of
cloud technology services for further deployment of data
collection system from web pages has been carried out.
Assessment and analysis of the viability of the system in an
autonomous state have been deployed in the cloud service
during long-term operation.
Description
Keywords
web page analysis, big data, client-server model, Selenium, Python
Citation
Britvin A. Client-server system for parsing data from web pages / Artur Britvin, Jawad Hammad Alrawashdeh, Rostyslav Tkachuck // Advances in Cyber-Physical Systems. — Lviv : Lviv Politechnic Publishing House, 2022. — Vol 7. — No 1. — P. 8–14.