WikiWars-UA: Ukrainian corpus annotated with temporal expressions

dc.citation.epage31
dc.citation.journalTitleComputational Linguistics and Intelligent Systems
dc.citation.spage22
dc.citation.volume2 : Proceedings of the 3nd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019
dc.contributor.affiliationCNRS, Univ. Lille, UMR 81G3 - STL - Savoirs Textes Langage, F-59000 Lille, France
dc.contributor.affiliationLIMSI, CNRS, Université Paris-Saclay. F-91405 Orsay, France
dc.contributor.affiliationUniversité Paris 13. Sorbonne Paris Cité. F-93430 Villetaneuse. France
dc.contributor.authorGrabar, Natalia
dc.contributor.authorHamon, Thierry
dc.coverage.placenameLviv
dc.date.accessioned2019-10-31T13:21:05Z
dc.date.available2019-10-31T13:21:05Z
dc.date.created2019-04-18
dc.date.issued2019-04-18
dc.description.abstractReliability of tools and reproducibility of study results are important features of modern Natural Language Processing (NLP) tools and methods. The scientific research is indeed increasingly coming under criticism for the lack of reproducibility of results. First step towards the reproducibility is related to the availability of freely usable tools and corpora. In our work, we are interested in automatic processing of unstructured documents for the extraction of temporal information. Our main objective is to create reference annotated corpus with temporal information related to dates (absolute and relative), periods, time, etc. in Ukrainian, and to their normalization. The approach relies on the adaptation of existing application, automatic pre-annotation of WikiWars corpus in Ukrainian and its manual correction. The reference corpus permits to reliably evaluate the current version of the automatic temporal annotator and to prepare future work on these topics.
dc.format.extent22-31
dc.format.pages10
dc.identifier.citationGrabar N. WikiWars-UA: Ukrainian corpus annotated with temporal expressions / Natalia Grabar, Thierry Hamon // Computational Linguistics and Intelligent Systems. — Lviv : Lviv Politechnic Publishing House, 2019. — Vol 2 : Proceedings of the 3nd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019. — P. 22–31. — (Paper presentations).
dc.identifier.citationenGrabar N. WikiWars-UA: Ukrainian corpus annotated with temporal expressions / Natalia Grabar, Thierry Hamon // Computational Linguistics and Intelligent Systems. — Lviv Politechnic Publishing House, 2019. — Vol 2 : Proceedings of the 3nd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019. — P. 22–31. — (Paper presentations).
dc.identifier.issn2523-4013
dc.identifier.urihttps://ena.lpnu.ua/handle/ntb/45492
dc.language.isoen
dc.publisherLviv Politechnic Publishing House
dc.relation.ispartofComputational Linguistics and Intelligent Systems (2), 2019
dc.relation.referencesen1. ACE challenge: The ACE 2004 evaluation plan. evaluation of the recognition of ace entities, ace relations and ace events. Tech. rep., ACE challenge (2004). http: //www.itl.nist.gov/iad/mig/tests/ace/2004
dc.relation.referencesen2. Batal, I., Sacchi, L., Bellazzi, R., Hauskrecht, M.: A temporal abstraction framework for classifying clinical temporal data. In: Ann Symp Am Med Inform Assoc (AMIA). pp. 29-33(2009)
dc.relation.referencesen3. Bethard, S., Savova, G.,Palmer, M., Pustejovsky, J.: Semeval-2017 task 12: Clinical tempeval. In: Int Workshop on Semantic Evaluation (SemEval-2017). pp. 565-572. Association for Computational Linguistics, Vancouver, Canada (August 2017)
dc.relation.referencesen4. Chang, A.X., Manning, C.D.: SUTIME: A library for recognizing and normalizing time expressions. In: LREC. pp. 3735-3740 (2012)
dc.relation.referencesen5. Chapman, W.W., Nadkarni, P.M., Hirschman, L., D'Avolio, L.W., Savova, G.K., Uzuner, O.: Overcoming barriers to nlp for clinical text: the role of shared tasks and the need for additional creative solutions. J Am Med Inform Assoc 18(5), 540-543(2011)
dc.relation.referencesen6. Cohen, K.B., Xia, J., Roeder, C., Hunter, L.E.: Reproducibility in natural language processing: A case study of two R libraries for mining PubMed/MEDLINE. In: LREC Int Conf Lang Resour Eval. pp. 6-12 (2016)
dc.relation.referencesen7. Collins, F., Tabak, L.: Nih plans to enhance reproducibility. Nature 505, 612-613 (2014)
dc.relation.referencesen8. Grabar, N., Hamon, T.: Automatic detection of temporal information in ukrainian general-languagetexts. In: COLINS 2018. pp. 1-11 (2018)
dc.relation.referencesen9. Grouin, C., Grabar, N., Hamon, T., Rosset, S., Tannier, X., Zweigenbaum, P.: Hybrid approaches to represent the clinical patient's timeline. J Am Med Inform Assoc 20(5), 820-7(2013)
dc.relation.referencesen10.Jeong, Y.S., Joo, W.T., Do, H.W., Lim, C.G., Choi, K.S., Choi, H.J.: Korean timeml and korean timebank. In: Chair), N.C.C., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. (eds.) Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association (ELRA), Paris, France (may 2016)
dc.relation.referencesen11.Kessler, R., Tannier, X., Hagege, C., Moriceau, V., Bittar, A.: Finding salient dates for building thematic timelines. In: Annual Meeting of the Association for Computational Linguistics. pp. 730-739 (2012)
dc.relation.referencesen12.Mazur, P., Dale, R.: WikiWars: A new corpus for research on temporal expressions. In: Int Conf on Empirical Methods in Natural Language Processing. pp. 913-922 (2010)
dc.relation.referencesen13.Moskovitch, R., Shahar, Y.: Medical temporal-knowledge discovery via temporal abstraction. In: Ann Symp Am Med Inform Assoc (AMIA). pp. 452-456 (2009)
dc.relation.referencesen14.Pustejovsky, J., Lee, K., Bunt, H., Romary, L.: ISO-TimeML: An international standard for semantic annotation. In: Chair), N.C.C., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Int Conf Language Resources and Evaluation (LREC'10). European Language Resources Association (ELRA), Valletta, Malta (may 2010)
dc.relation.referencesen15.Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1-47 (2002)
dc.relation.referencesen16.Strotgen, J., Gertz, M.: Wikiwarsde: A german corpus of narratives annotatedwith temporal expressions. In: Conf of the German Society for Comp Linguistics and Language Technology (GSCL 2011). pp. 129-134. Hamburg, Germany (September 2011)
dc.relation.referencesen17.Strotgen, J., Gertz, M.: Temporal tagging on di erent domains: Challenges, strategies, and gold standards. In: Int Conf on Language Resources and Evaluation (LREC'12). pp. 3746-3753. ELRA (2012)
dc.relation.referencesen18.Strotgen, J., Gertz, M.: A baseline temporal tagger for all languages. In: Int Conf on Empirical Methods in Natural Language Processing. pp. 541-547. ACL (2015)
dc.relation.referencesen19.Strotgen, J., Armiti, A., Canh, T.V., Zell, J., Gertz, M.: Time for more languages: Temporal tagging of Arabic, Italian, Spanish, and Vietnamese. ACM Transactions on Asian Language Information Processing 13(1), 1-21 (2014)
dc.relation.referencesen20.Sun, W., Rumshisky, A., Uzuner, O.: Evaluating temporal relations in clinical text: 2012 i2b2 challenge. JAMIA 20(5), 806-813 (2013)
dc.relation.referencesen21.UzZaman, N., Llorens, H., Derczynski, L., Allen, J., Verhagen, M., Pustejovsky, J.: Semeval-2013 task 1: Tempeval-3: Evaluating time expressions, events, and temporal relations. In: Int Workshop on Semantic Evaluation (SemEval 2013). pp. 19. Atlanta, Georgia, USA (June 2013), http://www.aclweb.org/anthology/ S13-2001
dc.relation.referencesen22.Verhagen, M., Gaizauskas, R., Schilder, F., Hepple, M., Katz, G., Pustejovsky, J.: Semeval-2007 task 15: Tempeval temporal relation identication. In: Int Workshop on Semantic Evaluations (SemEval-2007). pp. 75-80. Prague, Czech Republic (June 2007), http://www.aclweb.org/anthology/S/S07/S07-1014
dc.relation.referencesen23.Verhagen, M., Sauri, R., Caselli, T., Pustejovsky, J.: Semeval-2010 task 13: Tempeval-2. In: Int Workshop on Semantic Evaluation. pp. 57-62. Uppsala, Sweden (July 2010), http://www.aclweb.org/anthology/S10-1010
dc.relation.urihttp://www.aclweb.org/anthology/
dc.relation.urihttp://www.aclweb.org/anthology/S/S07/S07-1014
dc.relation.urihttp://www.aclweb.org/anthology/S10-1010
dc.rights.holder© 2019 for the individual papers by the papers’ authors. Copying permitted only for private and academic purposes. This volume is published and copyrighted by its editors.
dc.subjectTemporality
dc.subjectInformation Extraction
dc.subjectUkrainian
dc.subjectWikiWars
dc.subjectHeidelTime
dc.subjectReference Corpus
dc.titleWikiWars-UA: Ukrainian corpus annotated with temporal expressions
dc.typeArticle

Files

Original bundle

Now showing 1 - 2 of 2
Thumbnail Image
Name:
2019v2___Proceedings_of_the_3nd_International_conference_COLINS_2019_Workshop_Kharkiv_Ukraine_April_18-19_2019_Grabar_N-WikiWars_UA_Ukrainian_corpus_22-31.pdf
Size:
1.52 MB
Format:
Adobe Portable Document Format
Thumbnail Image
Name:
2019v2___Proceedings_of_the_3nd_International_conference_COLINS_2019_Workshop_Kharkiv_Ukraine_April_18-19_2019_Grabar_N-WikiWars_UA_Ukrainian_corpus_22-31__COVER.png
Size:
258.34 KB
Format:
Portable Network Graphics

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.96 KB
Format:
Plain Text
Description: