Extracting and classification the semi-structured data of web-systems

Abstract

The extraction and classification of semi-structured data from web-systems are described. A definition of semi-structured data is given and its main characteristics are identified. The variety of text-processing tasks is grouped into eleven broad classes related to the analysis of text data. Traditional models of knowledge representation are considered. An algorithm is proposed for creating an integrating ontological model of the web sources from which data will be obtained. The process of extracting data by applying a query language to markup-language elements is characterized.

Keywords

semi-structured data, web-system, extracting, classification, ontology

Citation

Pelekh I. Extracting and classification the semi-structured data of web-systems / Irina Pelekh // Computational linguistics and intelligent systems, 25-27 June 2018. — Lviv : Lviv Polytechnic National University, 2018. — Vol 2 : Workshop. — P. 139–145. — (Section II. Intelligent Systems).