-
Replication Data for: Svalbard through the prism of Russian media
The study applies Market Basket Analysis and Keymorph Analysis to analyze the articles related to Svalbard published in a sample of Russian mainstream federal and north-western... -
SciTweets - A Dataset and Annotation Framework for Detecting Scientific Onlin...
This repository contains an expert-annotated dataset of 1261 tweets and the corresponding annotation framework from the publication "SciTweets - A Dataset and Annotation... -
TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (Part 10, J...
TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning more... -
TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (Part 11, J...
TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning more... -
Invasion@Ukraine
We publish a dataset of raw tweets collected via the Twitter Streaming API in the context of the onset of the war, which Russia started in Ukraine on February 24, 2022. In... -
Legitimation Strategies of Regional Organizations (LegRO)
In an era of increasing political challenges to global and regional organizations, it is crucial to understand how they claim legitimacy and how successful they are in this... -
Meredith Giuliani - PhD project data for study 2
Study 2 - A Critical Review of Representation in the Development of Global Oncology Curricula and the Influence of Neocolonialism. This study was a systematic review using a... -
Czech RST Discourse Treebank 1.0
The Czech RST Discourse Treebank 1.0 (CzRST-DT 1.0) is a dataset of 54 Czech journalistic texts manually annotated using the Rhetorical Structure Theory (RST). Each text... -
Prague Discourse Treebank 3.0
The Prague Discourse Treebank 3.0 (PDiT 3.0) is a new version of annotation of discourse relations marked by primary and secondary discourse connectives in the data of the... -
DiscoMT 2017 Shared Task on Cross-lingual Pronoun Prediction
Data used in the 2017 shared task on cross-lingual pronoun prediction. -
EVALD 3.0 for Foreigners – Evaluator of Discourse
EVALD 3.0 for Foreigners is software for automatic evaluation of surface coherence (cohesion) in Czech texts written by non-native speakers of Czech. -
CzeDLex 1.0
CzeDLex 1.0 is the first production version (the fourth development version) of the Lexicon of Czech discourse connectives. The lexicon contains connectives partially... -
Lexicon of Czech and German Anaphoric Connectives
GeCzLex 1.0 is an online electronic resource for translation equivalents of Czech and German discourse connectives. It contains anaphoric connectives for both languages and... -
Self-paced reading experiments on explicit and implicit contrastive and tempo...
Supplementary materials for the paper “Processing of explicit and implicit contrastive and temporal discourse relations in Czech” (submitted to Discourse Processes) -
Prague Dependency Treebank 3.5
The Prague Dependency Treebank 3.5 is the 2018 edition of the core Prague Dependency Treebank (PDT). It contains all PDT annotation made at the Institute of Formal and Applied... -
DiscoMT 2016 Shared Task on Cross-lingual Pronoun Prediction
Files for the DiscoMT 2016 shared task on cross-lingual pronoun prediction -
EVALD 3.0 – Evaluator of Discourse
EVALD 3.0 serves for automatic evaluation of surface coherence (cohesion) in Czech texts written by native speakers of Czech. -
Prague Dependency Treebank - Consolidated 1.0 (PDT-C 1.0)
A richly annotated and genre-diversified language resource, The Prague Dependency Treebank – Consolidated 1.0 (PDT-C 1.0, or PDT-C in short in the sequel) is a consolidated...