Czech RST Discourse Treebank 1.0

The Czech RST Discourse Treebank 1.0 (CzRST-DT 1.0) is a dataset of 54 Czech journalistic texts manually annotated according to Rhetorical Structure Theory (RST). Each document in the treebank is represented as a single tree-like structure whose nodes (discourse units) are interconnected through hierarchical rhetorical relations.
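
As a rough illustration of the tree representation described above, the following Python sketch models a document as nested discourse units. The class name `DiscourseUnit`, its fields, the relation labels, and the English example sentences are all hypothetical and chosen for illustration only; they are not the treebank's actual file format or relation inventory.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class DiscourseUnit:
    """Hypothetical node type; NOT the treebank's actual data format."""
    text: Optional[str] = None        # filled in only for leaf units
    relation: Optional[str] = None    # label linking this unit to its parent
    children: List["DiscourseUnit"] = field(default_factory=list)

    def is_leaf(self) -> bool:
        return not self.children

    def leaves(self) -> List["DiscourseUnit"]:
        """Return the leaf units in left-to-right (text) order."""
        if self.is_leaf():
            return [self]
        result: List["DiscourseUnit"] = []
        for child in self.children:
            result.extend(child.leaves())
        return result

    def depth(self) -> int:
        """Number of levels in the subtree rooted at this unit."""
        return 1 + max((child.depth() for child in self.children), default=0)


# Invented example: one document as a single tree with three leaf units.
root = DiscourseUnit(children=[
    DiscourseUnit(text="The match was cancelled."),
    DiscourseUnit(relation="reason", children=[
        DiscourseUnit(text="It rained all night,"),
        DiscourseUnit(text="and the pitch was flooded.", relation="conjunction"),
    ]),
])

leaf_texts = [unit.text for unit in root.leaves()]
```

Here every document has exactly one root, and the hierarchy arises because a relation can hold between single units or between larger spans that are themselves subtrees.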

The dataset also contains the concurrent (parallel) annotations of five documents that were annotated twice.

The source texts are part of the data annotated in the Prague Dependency Treebank, although the two projects are independent.

Identifier
PID http://hdl.handle.net/11234/1-5174
Related Identifier https://ufal.mff.cuni.cz/czrst-dt1.0
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-5174
Provenance
Creator Poláková, Lucie; Zikánová, Šárka; Mírovský, Jiří; Hajičová, Eva
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2023
Rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0); http://creativecommons.org/licenses/by-nc-sa/4.0/; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Czech
Resource Type corpus
Format text/plain; charset=utf-8; application/zip; text/plain; downloadable_files_count: 2
Discipline Linguistics
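
The Metadata Access URL listed above has the shape of an OAI-PMH `GetRecord` request. As a minimal sketch, the Python snippet below rebuilds that URL from its three query parameters; the function name `build_getrecord_url` is hypothetical, and the snippet only constructs the URL without contacting the repository.

```python
from urllib.parse import urlencode

# Endpoint and identifier copied from the Metadata Access field of this record.
BASE_URL = "http://lindat.mff.cuni.cz/repository/oai/request"
RECORD_ID = "oai:lindat.mff.cuni.cz:11234/1-5174"


def build_getrecord_url(base_url: str, identifier: str, prefix: str = "oai_dc") -> str:
    """Hypothetical helper: assemble a GetRecord URL for one record."""
    query = urlencode(
        {"verb": "GetRecord", "metadataPrefix": prefix, "identifier": identifier},
        safe=":/",  # keep ':' and '/' unescaped, as in the URL listed in the record
    )
    return f"{base_url}?{query}"


url = build_getrecord_url(BASE_URL, RECORD_ID)
```

The resulting `url` can be passed to any HTTP client; the response is expected to be an XML document carrying the Dublin Core (`oai_dc`) metadata for this record.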