WikiWars-UA: Ukrainian corpus annotated with temporal expressions
dc.citation.epage | 31 | |
dc.citation.journalTitle | Computational Linguistics and Intelligent Systems | |
dc.citation.spage | 22 | |
dc.citation.volume | 2 : Proceedings of the 3rd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019 | |
dc.contributor.affiliation | CNRS, Univ. Lille, UMR 8163 - STL - Savoirs Textes Langage, F-59000 Lille, France | |
dc.contributor.affiliation | LIMSI, CNRS, Université Paris-Saclay, F-91405 Orsay, France | |
dc.contributor.affiliation | Université Paris 13, Sorbonne Paris Cité, F-93430 Villetaneuse, France | |
dc.contributor.author | Grabar, Natalia | |
dc.contributor.author | Hamon, Thierry | |
dc.coverage.placename | Lviv | |
dc.date.accessioned | 2019-10-31T13:21:05Z | |
dc.date.available | 2019-10-31T13:21:05Z | |
dc.date.created | 2019-04-18 | |
dc.date.issued | 2019-04-18 | |
dc.description.abstract | Reliability of tools and reproducibility of study results are important features of modern Natural Language Processing (NLP) tools and methods. Scientific research is increasingly coming under criticism for the lack of reproducibility of results. A first step towards reproducibility is the availability of freely usable tools and corpora. In our work, we are interested in the automatic processing of unstructured documents for the extraction of temporal information. Our main objective is to create a reference corpus in Ukrainian annotated with temporal information related to dates (absolute and relative), periods, time, etc., together with their normalization. The approach relies on the adaptation of an existing application, automatic pre-annotation of the WikiWars corpus in Ukrainian, and its manual correction. The reference corpus makes it possible to reliably evaluate the current version of the automatic temporal annotator and to prepare future work on these topics. | |
dc.format.extent | 22-31 | |
dc.format.pages | 10 | |
dc.identifier.citation | Grabar N. WikiWars-UA: Ukrainian corpus annotated with temporal expressions / Natalia Grabar, Thierry Hamon // Computational Linguistics and Intelligent Systems. — Lviv : Lviv Politechnic Publishing House, 2019. — Vol 2 : Proceedings of the 3rd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019. — P. 22–31. — (Paper presentations). | |
dc.identifier.citationen | Grabar N. WikiWars-UA: Ukrainian corpus annotated with temporal expressions / Natalia Grabar, Thierry Hamon // Computational Linguistics and Intelligent Systems. — Lviv Politechnic Publishing House, 2019. — Vol 2 : Proceedings of the 3rd International conference, COLINS 2019. Workshop, Kharkiv, Ukraine, April 18-19, 2019. — P. 22–31. — (Paper presentations). | |
dc.identifier.issn | 2523-4013 | |
dc.identifier.uri | https://ena.lpnu.ua/handle/ntb/45492 | |
dc.language.iso | en | |
dc.publisher | Lviv Politechnic Publishing House | |
dc.relation.ispartof | Computational Linguistics and Intelligent Systems (2), 2019 | |
dc.relation.referencesen | 1. ACE challenge: The ACE 2004 evaluation plan. Evaluation of the recognition of ACE entities, ACE relations and ACE events. Tech. rep., ACE challenge (2004). http://www.itl.nist.gov/iad/mig/tests/ace/2004 | |
dc.relation.referencesen | 2. Batal, I., Sacchi, L., Bellazzi, R., Hauskrecht, M.: A temporal abstraction framework for classifying clinical temporal data. In: Ann Symp Am Med Inform Assoc (AMIA). pp. 29-33 (2009) | |
dc.relation.referencesen | 3. Bethard, S., Savova, G., Palmer, M., Pustejovsky, J.: Semeval-2017 task 12: Clinical tempeval. In: Int Workshop on Semantic Evaluation (SemEval-2017). pp. 565-572. Association for Computational Linguistics, Vancouver, Canada (August 2017) | |
dc.relation.referencesen | 4. Chang, A.X., Manning, C.D.: SUTIME: A library for recognizing and normalizing time expressions. In: LREC. pp. 3735-3740 (2012) | |
dc.relation.referencesen | 5. Chapman, W.W., Nadkarni, P.M., Hirschman, L., D'Avolio, L.W., Savova, G.K., Uzuner, O.: Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions. J Am Med Inform Assoc 18(5), 540-543 (2011) | |
dc.relation.referencesen | 6. Cohen, K.B., Xia, J., Roeder, C., Hunter, L.E.: Reproducibility in natural language processing: A case study of two R libraries for mining PubMed/MEDLINE. In: LREC Int Conf Lang Resour Eval. pp. 6-12 (2016) | |
dc.relation.referencesen | 7. Collins, F., Tabak, L.: NIH plans to enhance reproducibility. Nature 505, 612-613 (2014) | |
dc.relation.referencesen | 8. Grabar, N., Hamon, T.: Automatic detection of temporal information in Ukrainian general-language texts. In: COLINS 2018. pp. 1-11 (2018) | |
dc.relation.referencesen | 9. Grouin, C., Grabar, N., Hamon, T., Rosset, S., Tannier, X., Zweigenbaum, P.: Hybrid approaches to represent the clinical patient's timeline. J Am Med Inform Assoc 20(5), 820-7 (2013) | |
dc.relation.referencesen | 10. Jeong, Y.S., Joo, W.T., Do, H.W., Lim, C.G., Choi, K.S., Choi, H.J.: Korean TimeML and Korean TimeBank. In: Calzolari, N., Choukri, K., Declerck, T., Goggi, S., Grobelnik, M., Maegaard, B., Mariani, J., Mazo, H., Moreno, A., Odijk, J., Piperidis, S. (eds.) Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016). European Language Resources Association (ELRA), Paris, France (May 2016) | |
dc.relation.referencesen | 11. Kessler, R., Tannier, X., Hagège, C., Moriceau, V., Bittar, A.: Finding salient dates for building thematic timelines. In: Annual Meeting of the Association for Computational Linguistics. pp. 730-739 (2012) | |
dc.relation.referencesen | 12. Mazur, P., Dale, R.: WikiWars: A new corpus for research on temporal expressions. In: Int Conf on Empirical Methods in Natural Language Processing. pp. 913-922 (2010) | |
dc.relation.referencesen | 13. Moskovitch, R., Shahar, Y.: Medical temporal-knowledge discovery via temporal abstraction. In: Ann Symp Am Med Inform Assoc (AMIA). pp. 452-456 (2009) | |
dc.relation.referencesen | 14. Pustejovsky, J., Lee, K., Bunt, H., Romary, L.: ISO-TimeML: An international standard for semantic annotation. In: Calzolari, N., Choukri, K., Maegaard, B., Mariani, J., Odijk, J., Piperidis, S., Rosner, M., Tapias, D. (eds.) Int Conf Language Resources and Evaluation (LREC'10). European Language Resources Association (ELRA), Valletta, Malta (May 2010) | |
dc.relation.referencesen | 15. Sebastiani, F.: Machine learning in automated text categorization. ACM Computing Surveys 34(1), 1-47 (2002) | |
dc.relation.referencesen | 16. Strötgen, J., Gertz, M.: WikiWarsDE: A German corpus of narratives annotated with temporal expressions. In: Conf of the German Society for Comp Linguistics and Language Technology (GSCL 2011). pp. 129-134. Hamburg, Germany (September 2011) | |
dc.relation.referencesen | 17. Strötgen, J., Gertz, M.: Temporal tagging on different domains: Challenges, strategies, and gold standards. In: Int Conf on Language Resources and Evaluation (LREC'12). pp. 3746-3753. ELRA (2012) | |
dc.relation.referencesen | 18. Strötgen, J., Gertz, M.: A baseline temporal tagger for all languages. In: Int Conf on Empirical Methods in Natural Language Processing. pp. 541-547. ACL (2015) | |
dc.relation.referencesen | 19. Strötgen, J., Armiti, A., Canh, T.V., Zell, J., Gertz, M.: Time for more languages: Temporal tagging of Arabic, Italian, Spanish, and Vietnamese. ACM Transactions on Asian Language Information Processing 13(1), 1-21 (2014) | |
dc.relation.referencesen | 20. Sun, W., Rumshisky, A., Uzuner, O.: Evaluating temporal relations in clinical text: 2012 i2b2 challenge. JAMIA 20(5), 806-813 (2013) | |
dc.relation.referencesen | 21. UzZaman, N., Llorens, H., Derczynski, L., Allen, J., Verhagen, M., Pustejovsky, J.: Semeval-2013 task 1: Tempeval-3: Evaluating time expressions, events, and temporal relations. In: Int Workshop on Semantic Evaluation (SemEval 2013). pp. 1-9. Atlanta, Georgia, USA (June 2013), http://www.aclweb.org/anthology/S13-2001 | |
dc.relation.referencesen | 22. Verhagen, M., Gaizauskas, R., Schilder, F., Hepple, M., Katz, G., Pustejovsky, J.: Semeval-2007 task 15: Tempeval temporal relation identification. In: Int Workshop on Semantic Evaluations (SemEval-2007). pp. 75-80. Prague, Czech Republic (June 2007), http://www.aclweb.org/anthology/S/S07/S07-1014 | |
dc.relation.referencesen | 23. Verhagen, M., Saurí, R., Caselli, T., Pustejovsky, J.: Semeval-2010 task 13: Tempeval-2. In: Int Workshop on Semantic Evaluation. pp. 57-62. Uppsala, Sweden (July 2010), http://www.aclweb.org/anthology/S10-1010 | |
dc.relation.uri | http://www.aclweb.org/anthology/S13-2001 | |
dc.relation.uri | http://www.aclweb.org/anthology/S/S07/S07-1014 | |
dc.relation.uri | http://www.aclweb.org/anthology/S10-1010 | |
dc.rights.holder | © 2019 for the individual papers by the papers’ authors. Copying permitted only for private and academic purposes. This volume is published and copyrighted by its editors. | |
dc.subject | Temporality | |
dc.subject | Information Extraction | |
dc.subject | Ukrainian | |
dc.subject | WikiWars | |
dc.subject | HeidelTime | |
dc.subject | Reference Corpus | |
dc.title | WikiWars-UA: Ukrainian corpus annotated with temporal expressions | |
dc.type | Article |
Files
Original bundle
- Name: 2019v2___Proceedings_of_the_3nd_International_conference_COLINS_2019_Workshop_Kharkiv_Ukraine_April_18-19_2019_Grabar_N-WikiWars_UA_Ukrainian_corpus_22-31.pdf
  Size: 1.52 MB
  Format: Adobe Portable Document Format
- Name: 2019v2___Proceedings_of_the_3nd_International_conference_COLINS_2019_Workshop_Kharkiv_Ukraine_April_18-19_2019_Grabar_N-WikiWars_UA_Ukrainian_corpus_22-31__COVER.png
  Size: 258.34 KB
  Format: Portable Network Graphics