Publications

Abdollahi, S., Gottschalk, S. & Demidova, E. 2020. EventKG+Click: A Dataset of Language-specific Event-centric User Interaction Traces, CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC. 2611. https://arxiv.org/abs/2010.12370

Abdollahi, S., Gottschalk, S., & Demidova, E. 2023. LaSER: Language-specific Event Recommendation. Journal of Web Semantics. https://arxiv.org/abs/2303.04712

Abdohalli, S. 2022. User Access Models to Event-Centric Information. Companion Proceedings of the Web Conference 2022. https://zenodo.org/record/7870371#.ZEo_IHZBw2w

Alves, D., Kuculo, T., Amaral, G., Thakkar, G. & Tadić, M. 2020. UNER: Universal Named-Entity Recognition Framework, CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC. 2611. https://arxiv.org/abs/2010.12406

Alves, D., Thakkar, G. & Tadić, M. 2020. Evaluating Language Tools for Fifteen EU-official Under-resourced Languages, LREC2020. 1859-1866. https://arxiv.org/abs/2010.12428

Alves, D., Thakkar, G. & Tadić, M. 2020. Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-Resourced Languages,     LREC2020 SLTU-CCURL Workshop. 153-158. https://arxiv.org/abs/2010.12433

Alves, D., Salimbajevs, A., & Pinnis, M. 2020. Data augmentation for pipeline-based speech translation, Human Language Technologies – The Baltic Perspective – Proceedings of the Ninth International Conference Baltic HLT 2020, vol. 328, 73-79. https://hal.inria.fr/hal-02907053

Alves, D., Bekavac, B. & Tadić, M. 2020. Optimization of Portuguese Named Entity Recognition and Classification by Combining Local Grammars and Conditional Random Fields Trained with Parsed Corpora, NooJ2020 conference, vol. 1389, 196-205. https://zenodo.org/record/7947846

Alves, D., Bekavac, B., & Tadić, M. 2021. Typological Approach to Improve Dependency Parsing for Croatian Language, Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021), 1-11. https://zenodo.org/record/7947866#.ZGX6_3ZBxPY

Alves, D., Thakkar, G., & Tadić, M. 2021. Building and Evaluating Universal Named-Entity Recognition English corpus, CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, vol. 2829, 2-16. http://arxiv.org/abs/2212.07162

Alves, D., Thakkar, G., Amaral, G., Kuculo, T., & Tadić, M. 2021. Building Multilingual Corpora for a Complex Named Entity Recognition and Classification Hierarchy using Wikipedia and DBpedia, Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), vol. 2836.
https://arxiv.org/abs/2212.07429

Alves, D., Bekavac, B., & Tadić, M. 2022. Multilingual Comparative Analysis of Deep-Learning Dependency Parsing Results Using Parallel Corpora. Workshop on Building and Using Comparable Corpora (BUCC 2022) @LREC2022. https://zenodo.org/record/7947908#.ZGX9y3ZBxPY

Alves, D., Bekavac, B., Zeman, D., & Tadić, M. 2023. Analysis of Corpus-based Word-Order Typological Methods, Sixth Workshop on Universal Dependencies. https://zenodo.org/record/7947920

Alves, D., Bekavac, B., Zeman, D., & Tadić, M. 2023. Corpus-based Syntactic Typological Methods for Dependency Parsing Improvement. 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP 2023). https://zenodo.org/record/7947944#.ZGYAmnZBxPY

Amaral, G., Piscopo, A., Kaffee, L.-A., Rodrigues, O., & Simperl, E. 2021. Assessing the Quality of Sources in Wikidata Across Languages: A Hybrid Approach. Journal of Data and Information Quality. http://arxiv.org/abs/2109.09405

Armitage, J., Thakur, S., Tripathi, R., Maleshkova, M. & Lehmann, J. 2020. Training Multimodal Systems for Classification with Multiple Objectives, CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC. 2611. http://arxiv.org/abs/2008.11450

Armitage, J., Kacupaj, E., Tahmasebzadeh, G., Suman, S., Maleshkova, M., Ewerth R. & Lehmann, J. 2020. MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and Modalities https://arxiv.org/abs/2008.06376

Cheema, G. S., Hakimov, S. & Ewerth, R. 2020. Check_square at CheckThat! 2020: Claim Detection in Social Media via Fusion of Transformer and Syntactic Features, Proceedings of the 11th International Conference of the CLEF Association, CLEF 2020. https://arxiv.org/pdf/2007.10534.pdf

Cheema, G. S., Hakimov, S. & Ewerth, R. 2020. TIB’s Visual Analytics Group at MediaEval ’20: Detecting Fake News on Corona Virus and 5G Conspiracy, Proceedings of MediaEval 2020 Workshophttps://arxiv.org/pdf/2101.03529.pdf

Cheema, G. S., Hakimov, S., Müller-Budack, E., & Ewerth, R. 2021. On the Role of Images for Analyzing Claims in Social Media, CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, vol. 2829, 32-46. http://arxiv.org/abs/2103.09602

Cheema, G. S., Hakimov, S., Müller-Budack, E., & Ewerth, R. 2021. A Fair and Comprehensive Comparison of Multimodal Tweet Sentiment Analysis Methods, Workshop on Multi-Modal Pre-Training for Multimedia Understanding (MMPT 2021), co-located with ICMR 2021. http://arxiv.org/abs/2106.08829

Cheema, G. S., Hakimov, S., Sittar, A., Müller-Budack, E., Otto, C., & Ewerth, R. 2022. MM-Claims: A Dataset for Multimodal Claim Detection in Social Media, NAACL Findings. https://arxiv.org/abs/2205.01989

Gottschalk, S., Kacuaj, E., Abdollahi, S., Alves, D., Amaral, G., Koutsiana, E., Kuculo, T., Major, D., Mello, C., Cheema, G. S., Sittar, A., Swati, Tahmasebzadeh, G., & Thakkar, G. 2021. OEKG: The Open Event Knowledge Graph, CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, vol. 2829, 61-75. https://arxiv.org/abs/2302.14688

Hakimov, S., Cheema, G. S., & Ewerth, R. 2022. TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes, SemEval Task 5 co-located with NAACL 2022. https://arxiv.org/abs/2204.06299

Kacupaj, E., Zafar, H., Lehmann, J. & Maleshkova, M. 2020. VQuAnDa: Verbalization QUestion ANswering Dataset, ESWC 2020, LNCS vol. 12123. https://figshare.com/projects/VQuAnDa/72488

Kacupaj, E., Banerjee, B., Singh, K., Lehmann, J. 2021. ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation. ESWC. http://arxiv.org/abs/2103.07771

Kuculo, T. 2022. Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing. 2022. Companion Proceedings of the Web Conference 2022. https://arxiv.org/abs/2303.04794

Kuculo, T., Gottschalk, S., Demidova, E. 2022. QuoteKG: A Multilingual Knowledge Graph of Quotes. Proceedings of the 19th European Semantic Web Conference. https://arxiv.org/abs/2207.09562

Mello, C., Cheema, G. S., & Thakkar, G. 2022. Combining Sentiment Analysis Classifiers to Explore Multilingual News Articles Covering London 2012 and Rio 2016 Olympics. International Journal of Digital Humanities. https://github.com/caiocmello/sentiment-annotation-olympic-news

Plepi, J., Kacupaj, E., Singh, K., Thakka, H., & Lehmann, J. 2021. Context Transformer with Stacked Pointer Networks for Conversational Question Answering over Knowledge Graphs, ESWC. https://arxiv.org/pdf/2103.07766.pdf

Sarajlić, J., Thakkar, G., Alves, D., & Preradovic, N. M. 2021. Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study, Proceedings of the Conference on Digital Curation Technologies (Qurator 2021), vol. 2836.
https://arxiv.org/abs/2212.07172

Sittar, A., Mladenić, D., & Erjavec, T. 2020. A Dataset for Information Spreading over the News, SiKDD 2020, vol. c, 5-8.
https://zenodo.org/record/4679725#.Y5mX4X3MJZc

Sittar, A., Mladenić, D., & Grobelnik, M. 2021. Analysis of information cascading and propagation barriers across distinctive news events, Journal of Intelligent Information Systems.
https://arxiv.org/abs/2212.07742

Sittar, A. & Mladenić, D. 2021. Classification of Cross-Cultural News Events, SiKDD 2021. https://arxiv.org/abs/2301.05543

Sittar, A. & Mladenić, D. 2021. Using the Profile of Publishers to Predict Barriers Across News Articles, CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, vol. 2829. https://arxiv.org/abs/2301.05535

Sittar, A., Major, D., Mello, C., Mladenić, D., & Grobelnik, M. 2022. Political and Economic Patterns in COVID-19 News: From Lockdown to Vaccination. IEEE Access. https://arxiv.org/abs/2212.13875

Swati, Erjavec, E., & Mladenić, D. 2020. EveOut: Reproducible Event Dataset for Studying and Analyzing the Complex Event-Outlet Relationship, SiKDD 2020, vol. c, 17-20.
https://zenodo.org/record/7244613

Swati, & Mladenić, D. 2020. Are You Following the Right News-Outlet? A Machine Learning based approach to outlet prediction, SiKDD 2020, vol. c, 33-36.
https://zenodo.org/record/7244705

Swati, & Mladenić, D. 2021. Understanding the Impact of Geographical Bias on News Sentiment: A Case Study on London and Rio Olympics, SiKDD 2021.
https://doi.org/10.5281/zenodo.7244273

Swati, Mladenić, D., & Erjavec, T. 2021. EveOut: an Event-centric News Dataset to Analyze an Outlet’s Event Selection Patterns. Informatica – Slovenia. https://zenodo.org/record/7244874

Thakkar, G., & Pinnis, M. 2020. Pretraining and Fine-Tuning Strategies for Sentiment Analysis of Latvian Tweets, Human Language Technologies – The Baltic Perspective – Proceedings of the Ninth International Conference Baltic HLT 2020, vol. 328, 55-61. https://arxiv.org/abs/2010.12401

Thakkar, G., Preradović, N. M., & Tadić, M. 2021. Negation Detection Using NooJ, 44th International Convention on Information, Communication and Electronic Technology (MIPRO), 263.
https://zenodo.org/record/7940258

Thakkar, G., Preradović, N. M., Tadić, M. 2021. Multi-task Learning for Cross-Lingual Sentiment Analysis, CEUR Workshop Proceedings, CLEOPATRA Workshop 2021 co-located with TheWebConf, vol. 2829, 76-84. http://arxiv.org/abs/2212.07160

Thakkar, G., Preradović, N. M., Tadić, M. 2023. Croatian Film Review Dataset (Cro-FiReDa): A Sentiment Annotated Dataset of Film Reviews, Slavic NLP 2023 The 9th Workshop on  Slavic Natural Language Processing. https://arxiv.org/pdf/2305.08173.pdf

Thakkar, G., Preradović, N. M., Tadić, M. 2023. CroSentiNews 2.0: A Sentence-Level News Sentiment Corpus. 10th Language & Technology Conference. https://arxiv.org/pdf/2305.08187.pdf

Tahmasebi, S., Hakimov, S., Ewerth, R., & Müller-Budack, E. 2023. Improving Generalization for Multimodal Fake News Detection. ICMR 2023 – International Conference on Multimedia Retrieval. https://arxiv.org/abs/2305.18599

Tahmasebzadeh, G., Hakimov, S., Müller-Budack, E. & Ewerth, R. 2020. A Feature Analysis for Multimodal News Retrieval, CEUR Workshop Proceedings, CLEOPATRA Workshop 2020 co-located with ESWC. 2611. http://arxiv.org/abs/2007.06390

Tahmasebzadeh, G., Kacupaj, E., Müller-Budack, E., Hakimov, S., Lehmann, J., & Ewerth, R. 2021. GeoWINE: Geolocation based Wiki, Image, News and Event Retrieval, SIGIR 21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2565–2569. https://arxiv.org/pdf/2104.14994.pdf

Tahmasebzadeh, G., Müller-Budack, E., Hakimov, S., & Ewerth, R. 2023. MM-Locate-News: Multimodal Focus Location Estimation in News, International Conference on MultiMedia Modeling (MMM 2023). https://arxiv.org/abs/2211.08042

Tahmasebzadeh, G., Hakimov, S., Ewerth, R., & Müller-Budack, E. 2023. Multimodal Geolocation Estimation of News Photos. ECIR 2023 – European Conference on Information Retrieval. https://github.com/TIBHannover/mmg-newsphoto