ParCzech PS7 2.0

PID

The ParCzech PS7 2.0 corpus is the second version of ParCzech PS7 consisting of stenographic protocols that record the Chamber of Deputies' meetings held in the 7th term between 2013-2017. The protocols are provided in their original HTML format, TEI format and TEI-derived format to make them searchable in the TEITOK corpus manager. Their audio recordings are available as well. The corpus is automatically enriched with the morphological, syntactic, and named-entity annotations using the procedures UDPipe 2 and NameTag 2.

Identifier
PID http://hdl.handle.net/11234/1-3436
Related Identifier http://hdl.handle.net/11234/1-3174
Related Identifier http://hdl.handle.net/11234/1-3631
Related Identifier https://ufal.mff.cuni.cz/parczech
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11234/1-3436
Provenance
Creator Hladká, Barbora; Kopp, Matyáš; Straňák, Pavel
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2020
Rights Public Domain Dedication (CC Zero); http://creativecommons.org/publicdomain/zero/1.0/; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Language Czech
Resource Type corpus
Format text/plain; charset=utf-8; application/x-gzip; application/x-tar; downloadable_files_count: 9
Discipline Linguistics