Client-server system for parsing data from web pages

Date

2022-06-06

Publisher

Видавництво Львівської політехніки
Lviv Politechnic Publishing House

Abstract

The basic principles and approaches for extracting and processing information from web pages have been reviewed. Based on this analysis of data parsing, a methodology has been created for developing a client-server system built on Selenium, a tool for automating work in web browsers. A third-party API has been used as the user interface, which simplifies and speeds up development and gives users access without downloading additional software. Following this methodology, the authors have developed their own client-server system, which retrieves, parses, and collects the information presented on web pages. Cloud technology services have been analyzed as targets for further deployment of the data collection system. The viability of the system, deployed in a cloud service and running autonomously, has been assessed and analyzed over long-term operation.
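The server side of such a system can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `LinkTextParser`, `fetch_with_selenium`, and `handle_request` are hypothetical, headless Chrome is assumed as the browser, and extracting link text stands in for whatever data the real system collects. The third-party API used as the client front end is not named in the abstract, so it is left out.

```python
from html.parser import HTMLParser
from typing import Callable, Dict, List


class LinkTextParser(HTMLParser):
    """Collects the visible text of every <a> element (illustrative target)."""

    def __init__(self) -> None:
        super().__init__()
        self._in_link = False
        self._buf: List[str] = []
        self.links: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._in_link = True
            self._buf = []

    def handle_data(self, data):
        if self._in_link:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._in_link:
            self.links.append("".join(self._buf).strip())
            self._in_link = False


def fetch_with_selenium(url: str) -> str:
    """Render the page in headless Chrome and return its HTML source.

    Assumes the selenium package and a Chrome installation are available.
    """
    from selenium import webdriver  # imported lazily so parsing works without it

    options = webdriver.ChromeOptions()
    options.add_argument("--headless")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        return driver.page_source
    finally:
        driver.quit()  # always release the browser process


def handle_request(url: str,
                   fetch: Callable[[str], str] = fetch_with_selenium) -> Dict:
    """Hypothetical server entry point: fetch a page, parse it, return data."""
    parser = LinkTextParser()
    parser.feed(fetch(url))
    parser.close()
    return {"url": url, "links": parser.links}
```

Passing the fetch function as a parameter keeps the parsing step independent of the browser, so the same handler can be exercised with stored HTML instead of a live Selenium session.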

Keywords

web page analysis, big data, client-server model, Selenium, Python

Citation

Britvin A. Client-server system for parsing data from web pages / Artur Britvin, Jawad Hammad Alrawashdeh, Rostyslav Tkachuck // Advances in Cyber-Physical Systems. — Lviv : Lviv Politechnic Publishing House, 2022. — Vol 7. — No 1. — P. 8–14.
