Method for paraphrase extraction from the news text corpus

dc.citation.epage70
dc.citation.journalTitleComputational Linguistics and Intelligent Systems
dc.citation.spage69
dc.citation.volume2 : Proceedings of the 3rd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019
dc.contributor.affiliationNational Technical University "Kharkiv Polytechnic Institute"
dc.contributor.authorManuilov, Illia
dc.contributor.authorPetrasova, Svitlana
dc.coverage.placenameLviv
dc.date.accessioned2019-10-31T13:21:02Z
dc.date.available2019-10-31T13:21:02Z
dc.date.created2019-04-18
dc.date.issued2019-04-18
dc.description.abstractThe paper discusses the automatic extraction of paraphrases used in rewriting. The authors propose a method for extracting paraphrases from English news text corpora. The method combines syntactic rules developed to identify phrases with synsets used to identify synonymous words, and it is applied to a purpose-built text corpus of BBC news. The method is implemented with the Natural Language Toolkit, a Universal Dependencies parser, and WordNet.
dc.format.extent69-70
dc.format.pages2
dc.identifier.citationManuilov I. Method for paraphrase extraction from the news text corpus / Illia Manuilov, Svitlana Petrasova // Computational Linguistics and Intelligent Systems. — Lviv : Lviv Politechnic Publishing House, 2019. — Vol 2 : Proceedings of the 3rd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019. — P. 69–70. — (Student section).
dc.identifier.citationenManuilov I. Method for paraphrase extraction from the news text corpus / Illia Manuilov, Svitlana Petrasova // Computational Linguistics and Intelligent Systems. — Lviv Politechnic Publishing House, 2019. — Vol 2 : Proceedings of the 3rd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019. — P. 69–70. — (Student section).
dc.identifier.issn2523-4013
dc.identifier.urihttps://ena.lpnu.ua/handle/ntb/45486
dc.language.isoen
dc.publisherLviv Politechnic Publishing House
dc.relation.ispartofComputational Linguistics and Intelligent Systems (2), 2019
dc.relation.referencesen1. Koloiev, A.S.: Rewrite as a new phenomenon in modern journalism. In: SPU Bulletin. Philology, vol. 1, 221-226 (2012)
dc.relation.referencesen2. Bolshakov, I.A.: Two methods of synonymous paraphrasing in linguistic steganography. In: Proceedings of the International Conference Dialogue-2004, http://www.dialog-21.ru/media/2496/bolshakov.pdf, last accessed 2019/02/10.
dc.relation.referencesen3. Petrasova, S., Khairova, N., Lewoniewski, W.: Building the semantic similarity model for social network data streams. In: Data Stream Mining & Processing, Proceedings of the 2018 IEEE Second International Conference (DSMP), 21-24 (2018)
dc.relation.referencesen4. WordNet: https://wordnet.princeton.edu, last accessed 2019/02/10.
dc.relation.referencesen5. BBC: https://www.bbc.com/news, last accessed 2019/02/10.
dc.relation.urihttp://www.dialog-21.ru/media/2496/bolshakov.pdf
dc.relation.urihttps://wordnet.princeton.edu
dc.relation.urihttps://www.bbc.com/news
dc.rights.holder© 2019 for the individual papers by the papers’ authors. Copying permitted only for private and academic purposes. This volume is published and copyrighted by its editors.
dc.subjectparaphrase extraction
dc.subjectnews text corpus
dc.subjectsyntactic rules
dc.subjectsynsets
dc.subjectUniversal Dependencies
dc.subjectWordNet
dc.titleMethod for paraphrase extraction from the news text corpus
dc.typeArticle
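
The abstract describes matching phrases by syntactic rules and then checking word-level synonymy through synsets. The record does not give the authors' rules or code, so what follows is only a minimal sketch of that idea under stated assumptions: the rule label `VERB+OBJ`, the toy synset table, and all function names are hypothetical, and the hand-written table stands in for real WordNet lookups and parser output.

```python
from itertools import combinations

# Hypothetical miniature synset table standing in for WordNet lookups.
# The synset identifiers are illustrative, not taken from the paper.
TOY_SYNSETS = {
    "begin": {"start.v.01"},
    "start": {"start.v.01"},
    "talks": {"negotiation.n.01"},
    "negotiations": {"negotiation.n.01"},
    "end": {"end.v.01"},
}


def synonymous(a, b):
    """Return True if the words are identical or share at least one synset id."""
    if a == b:
        return True
    return bool(TOY_SYNSETS.get(a, set()) & TOY_SYNSETS.get(b, set()))


def is_paraphrase(p, q):
    """Compare two phrases given as (pattern, words) pairs.

    `pattern` is a hypothetical label for the syntactic rule that matched
    the phrase; `words` is a tuple of lemmas in rule order.
    """
    pattern_p, words_p = p
    pattern_q, words_q = q
    if pattern_p != pattern_q or len(words_p) != len(words_q):
        return False
    if words_p == words_q:
        return False  # an identical phrase is a repeat, not a paraphrase
    return all(synonymous(a, b) for a, b in zip(words_p, words_q))


def extract_paraphrases(phrases):
    """Return every unordered pair of phrases that passes is_paraphrase."""
    return [(p, q) for p, q in combinations(phrases, 2) if is_paraphrase(p, q)]


phrases = [
    ("VERB+OBJ", ("begin", "talks")),
    ("VERB+OBJ", ("start", "negotiations")),
    ("VERB+OBJ", ("end", "talks")),
]
pairs = extract_paraphrases(phrases)
```

In a fuller version, the phrase list would presumably come from a dependency parse of the news corpus and `synonymous` would query WordNet instead of the toy table; both substitutions are assumptions made here for the sake of a self-contained example.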
