Extracting and classification the semi-structured data of web-systems

dc.citation.epage145
dc.citation.spage139
dc.contributor.affiliationLviv Polytechnic National University, Lviv, Ukraine
dc.contributor.authorPelekh, Irina
dc.coverage.placenameLviv
dc.coverage.temporal25-27 June 2018
dc.date.accessioned2018-09-03T11:41:10Z
dc.date.available2018-09-03T11:41:10Z
dc.date.created2018-06-25
dc.date.issued2018-06-25
dc.description.abstractThe extracting and classification of semi-structured data of websystems is described. The definition of semi-structured data is given and the main characteristics are defined. The variety of tasks text information processing is grouped into the eleven large classes related to the analysis of text data. The traditional models of knowledge representation are considered. An algorithm for the web-sources, from which data will to be obtained, ontological model integrating creating is proposed. The process of data extracting using the query language to the markup language elements is characterized.
dc.format.extent139-145
dc.format.pages7
dc.identifier.citationPelekh I. Extracting and classification the semi-structured data of web-systems / Irina Pelekh // Computational linguistics and intelligent systems, 25-27 June 2018. — Lviv : Lviv Polytechnic National University, 2018. — Vol 2 : Workshop. — P. 139–145. — (Section II. Intelligent Systems).
dc.identifier.citationenPelekh I. Extracting and classification the semi-structured data of web-systems / Irina Pelekh // Computational linguistics and intelligent systems, 25-27 June 2018. — Lviv : Lviv Polytechnic National University, 2018. — Vol 2 : Workshop. — P. 139–145. — (Section II. Intelligent Systems).
dc.identifier.issn2523-4013
dc.identifier.urihttps://ena.lpnu.ua/handle/ntb/42560
dc.language.isoen
dc.publisherLviv Polytechnic National University
dc.relation.ispartofComputational linguistics and intelligent systems (2), 2018
dc.relation.references1. Bondarenko M.F., Shabanov-Kushnarenko Yu.P.: Theory of intelligence. Textbook, X.: Izdvo SMIT, 576 p. (2007).
dc.relation.references2. Buileaar P., Eigner T.: Topic extraction from scientific literature for competency management. Іn The 7th International Semantic Web Conference PICKME 2008, Karlsruhe, Germany, 55-67. (2008)
dc.relation.references3. Kolada A.S., Gogunsky V.D.: Automation of information extraction from the sciencecomputer databases, Management of the development of complex systems, No. 16 (2013)
dc.relation.references4. Kushniretska I., Kushniretska О., Berko A.: Designing of Structural Ontological Data Systems Model for Mash-UP Integration Process, Applied Computer Science, 11(1) (2015)
dc.relation.references5. Kushniretska I., Kushniretska О., Berko A.: The ontological model of knowledge of scientific and technical information system, Computer Science and Information Technologies (CSIT'2014): proc. of the IX-th Intern. Scientific and Techn. Conf., Lviv, Ukraine / Min. of Education and Science of Ukraine (2014)
dc.relation.references6. Kushniretska I.: Semi-structured data dynamic integration Mashup system, Computer Science and Information Technologies (CSIT'2016): proc. of the XI-th Intern. Scientific and Techn. Conf., Lviv, Ukraine, Min. of Education and Science of Ukraine, 220-221 (2016)
dc.relation.references7. Manning C., Raghavan P., Schütze H.: Introduction to Information Retrieval, Cambridge University Press, ISBN 0-521-86571-9, (2008).
dc.relation.references8. Lytvyn, V., Pukach, P., Bobyk, І., Vysotska, V.: The method of formation of the status of personality understanding based on the content analysis. In: Eastern-European Journal of Enterprise Technologies, 5/2(83), 4-12 (2016)
dc.relation.references9. Kravets, P.: The game method for orthonormal systems construction. In: The Experience of Designing and Application of CAD Systems in Microelectronics (2007).
dc.relation.references10. Lytvyn, V., Vysotska, V, Veres, O., Rishnyak, I., Rishnyak, H.: Content linguistic analysis methods for textual documents classification. In: Computer Science and Information Technologies, Proc. of the XI-th Int. Conf. CSIT’2016, 190-192 (2016)
dc.relation.references11. Zhao Li, Wee Keong Ng, Aixin Sun: Web data extraction based on structural similarity, Journal Knowledge and Information Systems archive, Vol. 8, Issue 4, 438-461 (2005)
dc.relation.references12. Zhou L.: Ontology Learning: State of the Art, Information Technology and Management, 8 (3), 241-252 (2007)
dc.relation.references13. Chen, J., Dosyn, D., Lytvyn, V., Sachenko, A.: Smart Data Integration by Goal Driven Ontology Learning. In: Advances in Big Data. Advances in Intelligent Systems and Computing. – Springer International Publishing AG 2017. P. 283-292 (2017).
dc.relation.references14. Su, J., Vysotska, V., Sachenko, A., Lytvyn, V., Burov, Y.: Information resources processing using linguistic analysis of textual content. In: Intelligent Data Acquisition and Advanced Computing Systems Technology and Applications, Romania, 573-578, (2017)
dc.relation.references15. Vysotska, V., Chyrun, L., Chyrun, L.: Information Technology of Processing Information Resources in Electronic Content Commerce Systems, CSIT, 212–222 (2016)
dc.relation.references16. Vysotska, V., Hasko, R., Kuchkovskiy, V.: Process analysis in electronic content commerce system. In: Proceedings of the International Conference on Computer Sciences and Information Technologies, CSIT 2015, 120-123 (2015)
dc.relation.references17. Vysotska, V.: Linguistic Analysis of Textual Commercial Content for Information Resources Processing. In: Modern Problems of Radio Engineering, Telecommunications and Computer Science, TCSET’2016, 709–713 (2016)
dc.relation.references18. Basyuk, T.: The Popularization Problem of Websites and Analysis of Competitors. Advances in Intelligent Systems and Computing II. CSIT 2017. Advances in Intelligent Systems and Computing, vol 689. Springer, Cham pp. 54-65 (2017)
dc.relation.references19. Vysotska, V., Chyrun, L., Lytvyn, V.: Methods based on ontologies for information resources processing. Germany: LAP LAMBERT Academic Publishing (2016).
dc.relation.references20. Vysotska, V.: Tekhnolohiyi elektronnoyi komertsiyi ta Internet-marketynhu. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.references21. Vysotska, V., Lytvyn, V.: Web resources processing based on ontologies. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.references22. Vysotska, V., Shakhovska, N.: Information technologies of gamification for training and recruitment. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.references23. Vysotska, V.: Internet systems design and development based on Web Mining and NLP. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.references24. Vysotska, V.: Computer linguistics for online marketing in information technology : Monograph. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.references25. Lytvyn, V., Vysotska, V., Chyrun, L., Smolarz, A., Naum O.: Intelligent System Structure for Web Resources Processing and Analysis. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 56-74 (2017)
dc.relation.references26. Lytvyn, V., Vysotska, V., Wojcik, W., Dosyn, D.: A Method of Construction of Automated Basic Ontology. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 75-83 (2017)
dc.relation.references27. Lytvynenko, V., Lurie, I., Radetska, S., Voronenko, M., Kornilovska, N., Partenjucha, D.: Content analysis of some social media of the occupied territories of Ukraine. In: 1st Inter. Conference Computational Linguistics and Intelligent Systems, COLINS, 84–94 (2017)
dc.relation.references28. Shepelev, G., Khairova, N.: Methods of comparing interval objects in intelligent computer systems. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, (2017)
dc.relation.references29. Orobinska, O., Chauchat, J.-H., Sharonova, N.: Methods and models of automatic ontology construction for specialized domains (case of the Radiation Security). In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 95–99 (2017)
dc.relation.references30. Hamon, T., Grabar, N.: Unsupervised acquisition of morphological resources for Ukrainian. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 20–30 (2017)
dc.relation.references31. Grabar, N., Hamon, T.: Creation of a multilingual aligned corpus with Ukrainian as the target language and its exploitation. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 10–19 (2017)
dc.relation.references32. Hamon, T.: Biomedical text mining. In: Computational Linguistics and Intelligent Systems, colins.in.ua/wp-content/uploads/2017/04/2017COLINS-THAMON-keynote.pdf
dc.relation.references33. Lande, D., Andrushchenko, V., Balagura, I.: An index of authors’ popularity for Internet encyclopedia. In: Computational Linguistics and Intelligent Systems, COLINS, (2017)
dc.relation.references34. Lande, D.: Creation of subject domain models on the basis of monitoring of network information resources. In: 1st International Conference Computational Linguistics and Intelligent Systems, http://colins.in.ua/wp-content/uploads/2017/04/Lande.pdf (2017)
dc.relation.references35. Protsenko, Y.: Intuition on modern deep learning approaches in computer vision. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wp-content/uploads/2017/04/protsenko.pdf (2017)
dc.relation.references36. Kolbasin, V.: AI trends, or brief highlights of NIPS 2016. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wpcontent/ uploads/2017/04/CoLlnS_TuS.pdf (2017)
dc.relation.references37. Kersten, W.: The Digital Transformation of the Industry – the Logistics Example. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wp-content/uploads/2017/04/CoLlnS_TuS.pdf (2017)
dc.relation.references38. Shalimov, V.: Big Data – Revolution in Data Storage and Processing. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wp-content/uploads/2017/04/BigData_eng.pdf (2017)
dc.relation.references39. Hnot, T.: Qualitative content analysis: expertise and case study. In: 1st Inter. Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wpcontent/ uploads/2017/04/Qualitative-content-analysis_expertise-and-case-study.pdf (2017)
dc.relation.references40. Romanyshyn, M.: Grammatical Error Correction: why commas matter. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wpcontent/ uploads/2017/04/Grammatical-Error-Correction-why-commas-matter.pdf. (2017)
dc.relation.references41. Yukhno, K., Chubar, E.: Gamification: today and tomorrow. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 139–140 (2017)
dc.relation.references42. Pidpruzhnikov, V., Ilchenko, M.: Search optimization and localization of the website of Department of Applied Linguistics. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 137–138 (2017)
dc.relation.references43. Olifenko, I., Borysova, N.: Analysis of existing German Corpora. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 135–136 (2017)
dc.relation.references44. Kolesnik, A., Khairova, N.: Use of linguistic criteria for estimating of wikipedia articles quality. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, (2017)
dc.relation.references45. Kirkin, S., Melnyk, K.: Intelligent data processing in creating targeted advertising. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, COLINS, 131–132 (2017)
dc.relation.references46. Hordienko, H., Ilchenko, M.: Development and computerization of an English term system in the fields of drilling and drilling rigs. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 129–130 (2017)
dc.relation.referencesen1. Bondarenko M.F., Shabanov-Kushnarenko Yu.P., Theory of intelligence. Textbook, X., Izdvo SMIT, 576 p. (2007).
dc.relation.referencesen2. Buileaar P., Eigner T., Topic extraction from scientific literature for competency management. In The 7th International Semantic Web Conference PICKME 2008, Karlsruhe, Germany, 55-67. (2008)
dc.relation.referencesen3. Kolada A.S., Gogunsky V.D., Automation of information extraction from the sciencecomputer databases, Management of the development of complex systems, No. 16 (2013)
dc.relation.referencesen4. Kushniretska I., Kushniretska O., Berko A., Designing of Structural Ontological Data Systems Model for Mash-UP Integration Process, Applied Computer Science, 11(1) (2015)
dc.relation.referencesen5. Kushniretska I., Kushniretska O., Berko A., The ontological model of knowledge of scientific and technical information system, Computer Science and Information Technologies (CSIT'2014): proc. of the IX-th Intern. Scientific and Techn. Conf., Lviv, Ukraine, Min. of Education and Science of Ukraine (2014)
dc.relation.referencesen6. Kushniretska I., Semi-structured data dynamic integration Mashup system, Computer Science and Information Technologies (CSIT'2016): proc. of the XI-th Intern. Scientific and Techn. Conf., Lviv, Ukraine, Min. of Education and Science of Ukraine, 220-221 (2016)
dc.relation.referencesen7. Manning C., Raghavan P., Schütze H., Introduction to Information Retrieval, Cambridge University Press, ISBN 0-521-86571-9, (2008).
dc.relation.referencesen8. Lytvyn, V., Pukach, P., Bobyk, I., Vysotska, V., The method of formation of the status of personality understanding based on the content analysis. In: Eastern-European Journal of Enterprise Technologies, 5/2(83), 4-12 (2016)
dc.relation.referencesen9. Kravets, P., The game method for orthonormal systems construction. In: The Experience of Designing and Application of CAD Systems in Microelectronics (2007).
dc.relation.referencesen10. Lytvyn, V., Vysotska, V, Veres, O., Rishnyak, I., Rishnyak, H., Content linguistic analysis methods for textual documents classification. In: Computer Science and Information Technologies, Proc. of the XI-th Int. Conf. CSIT’2016, 190-192 (2016)
dc.relation.referencesen11. Zhao Li, Wee Keong Ng, Aixin Sun: Web data extraction based on structural similarity, Journal Knowledge and Information Systems archive, Vol. 8, Issue 4, 438-461 (2005)
dc.relation.referencesen12. Zhou L., Ontology Learning: State of the Art, Information Technology and Management, 8 (3), 241-252 (2007)
dc.relation.referencesen13. Chen, J., Dosyn, D., Lytvyn, V., Sachenko, A., Smart Data Integration by Goal Driven Ontology Learning. In: Advances in Big Data. Advances in Intelligent Systems and Computing, Springer International Publishing AG 2017. P. 283-292 (2017).
dc.relation.referencesen14. Su, J., Vysotska, V., Sachenko, A., Lytvyn, V., Burov, Y., Information resources processing using linguistic analysis of textual content. In: Intelligent Data Acquisition and Advanced Computing Systems Technology and Applications, Romania, 573-578, (2017)
dc.relation.referencesen15. Vysotska, V., Chyrun, L., Chyrun, L., Information Technology of Processing Information Resources in Electronic Content Commerce Systems, CSIT, 212–222 (2016)
dc.relation.referencesen16. Vysotska, V., Hasko, R., Kuchkovskiy, V., Process analysis in electronic content commerce system. In: Proceedings of the International Conference on Computer Sciences and Information Technologies, CSIT 2015, 120-123 (2015)
dc.relation.referencesen17. Vysotska, V., Linguistic Analysis of Textual Commercial Content for Information Resources Processing. In: Modern Problems of Radio Engineering, Telecommunications and Computer Science, TCSET’2016, 709–713 (2016)
dc.relation.referencesen18. Basyuk, T., The Popularization Problem of Websites and Analysis of Competitors. Advances in Intelligent Systems and Computing II. CSIT 2017. Advances in Intelligent Systems and Computing, vol 689. Springer, Cham pp. 54-65 (2017)
dc.relation.referencesen19. Vysotska, V., Chyrun, L., Lytvyn, V., Methods based on ontologies for information resources processing. Germany: LAP LAMBERT Academic Publishing (2016).
dc.relation.referencesen20. Vysotska, V., Tekhnolohiyi elektronnoyi komertsiyi ta Internet-marketynhu. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.referencesen21. Vysotska, V., Lytvyn, V., Web resources processing based on ontologies. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.referencesen22. Vysotska, V., Shakhovska, N., Information technologies of gamification for training and recruitment. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.referencesen23. Vysotska, V., Internet systems design and development based on Web Mining and NLP. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.referencesen24. Vysotska, V., Computer linguistics for online marketing in information technology : Monograph. Saarbrücken, Germany: LAP LAMBERT Academic Publishing (2018)
dc.relation.referencesen25. Lytvyn, V., Vysotska, V., Chyrun, L., Smolarz, A., Naum O., Intelligent System Structure for Web Resources Processing and Analysis. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 56-74 (2017)
dc.relation.referencesen26. Lytvyn, V., Vysotska, V., Wojcik, W., Dosyn, D., A Method of Construction of Automated Basic Ontology. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 75-83 (2017)
dc.relation.referencesen27. Lytvynenko, V., Lurie, I., Radetska, S., Voronenko, M., Kornilovska, N., Partenjucha, D., Content analysis of some social media of the occupied territories of Ukraine. In: 1st Inter. Conference Computational Linguistics and Intelligent Systems, COLINS, 84–94 (2017)
dc.relation.referencesen28. Shepelev, G., Khairova, N., Methods of comparing interval objects in intelligent computer systems. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, (2017)
dc.relation.referencesen29. Orobinska, O., Chauchat, J.-H., Sharonova, N., Methods and models of automatic ontology construction for specialized domains (case of the Radiation Security). In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 95–99 (2017)
dc.relation.referencesen30. Hamon, T., Grabar, N., Unsupervised acquisition of morphological resources for Ukrainian. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 20–30 (2017)
dc.relation.referencesen31. Grabar, N., Hamon, T., Creation of a multilingual aligned corpus with Ukrainian as the target language and its exploitation. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 10–19 (2017)
dc.relation.referencesen32. Hamon, T., Biomedical text mining. In: Computational Linguistics and Intelligent Systems, colins.in.ua/wp-content/uploads/2017/04/2017COLINS-THAMON-keynote.pdf
dc.relation.referencesen33. Lande, D., Andrushchenko, V., Balagura, I., An index of authors’ popularity for Internet encyclopedia. In: Computational Linguistics and Intelligent Systems, COLINS, (2017)
dc.relation.referencesen34. Lande, D., Creation of subject domain models on the basis of monitoring of network information resources. In: 1st International Conference Computational Linguistics and Intelligent Systems, http://colins.in.ua/wp-content/uploads/2017/04/Lande.pdf (2017)
dc.relation.referencesen35. Protsenko, Y., Intuition on modern deep learning approaches in computer vision. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wp-content/uploads/2017/04/protsenko.pdf (2017)
dc.relation.referencesen36. Kolbasin, V., AI trends, or brief highlights of NIPS 2016. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wpcontent/ uploads/2017/04/CoLlnS_TuS.pdf (2017)
dc.relation.referencesen37. Kersten, W., The Digital Transformation of the Industry – the Logistics Example. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wp-content/uploads/2017/04/CoLlnS_TuS.pdf (2017)
dc.relation.referencesen38. Shalimov, V., Big Data – Revolution in Data Storage and Processing. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wp-content/uploads/2017/04/BigData_eng.pdf (2017)
dc.relation.referencesen39. Hnot, T., Qualitative content analysis: expertise and case study. In: 1st Inter. Conference Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wpcontent/ uploads/2017/04/Qualitative-content-analysis_expertise-and-case-study.pdf (2017)
dc.relation.referencesen40. Romanyshyn, M., Grammatical Error Correction: why commas matter. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, COLINS, http://colins.in.ua/wpcontent/ uploads/2017/04/Grammatical-Error-Correction-why-commas-matter.pdf. (2017)
dc.relation.referencesen41. Yukhno, K., Chubar, E., Gamification: today and tomorrow. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 139–140 (2017)
dc.relation.referencesen42. Pidpruzhnikov, V., Ilchenko, M., Search optimization and localization of the website of Department of Applied Linguistics. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 137–138 (2017)
dc.relation.referencesen43. Olifenko, I., Borysova, N., Analysis of existing German Corpora. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 135–136 (2017)
dc.relation.referencesen44. Kolesnik, A., Khairova, N., Use of linguistic criteria for estimating of wikipedia articles quality. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, (2017)
dc.relation.referencesen45. Kirkin, S., Melnyk, K., Intelligent data processing in creating targeted advertising. In: 1st Inter. Conf. Computational Linguistics and Intelligent Systems, COLINS, 131–132 (2017)
dc.relation.referencesen46. Hordienko, H., Ilchenko, M., Development and computerization of an English term system in the fields of drilling and drilling rigs. In: 1st International Conference Computational Linguistics and Intelligent Systems, COLINS, 129–130 (2017)
dc.relation.urihttp://colins.in.ua/wp-content/uploads/2017/04/Lande.pdf
dc.relation.urihttp://colins.in.ua/wp-content/uploads/2017/04/protsenko.pdf
dc.relation.urihttp://colins.in.ua/wpcontent/
dc.relation.urihttp://colins.in.ua/wp-content/uploads/2017/04/CoLlnS_TuS.pdf
dc.relation.urihttp://colins.in.ua/wp-content/uploads/2017/04/BigData_eng.pdf
dc.rights.holder© 2018 for the individual papers by the papers’ authors. Copying permitted only for private and academic purposes. This volume is published and copyrighted by its editors.
dc.subjectsemi-structured data
dc.subjectweb-system
dc.subjectextracting
dc.subjectclassification
dc.subjectontology
dc.titleExtracting and classification the semi-structured data of web-systems
dc.typeConference Abstract

Files

Original bundle

Now showing 1 - 2 of 2
Thumbnail Image
Name:
COLINS_2018_2018v2_Pelekh_I-Extracting_and_classification_139-145.pdf
Size:
1.94 MB
Format:
Adobe Portable Document Format
Thumbnail Image
Name:
COLINS_2018_2018v2_Pelekh_I-Extracting_and_classification_139-145__COVER.png
Size:
250.75 KB
Format:
Portable Network Graphics

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.92 KB
Format:
Plain Text
Description: