Error-annotated developmental corpus Šolar 2.0 Error

The corpus contains 2,094 texts from the Šolar 2.0 corpus (http://hdl.handle.net/11356/1214), namely only those texts that contain error annotations. For each text, metadata is provided on the school (elementary or secondary), subject, level (grade or year), text type, region, and date of production. The original error annotations from Šolar 1.0 have been re-categorized according to a new system (the specifications, in Slovene, are attached). The annotations, 36,671 in total, also include the corrections made by teachers. The student texts comprise 756,130 words; this count excludes teacher corrections.

Identifier
PID http://hdl.handle.net/11356/1231
Related Identifier http://hdl.handle.net/11356/1589
Related Identifier http://hdl.handle.net/11356/1036
Related Identifier https://solar.trojina.si/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1231
Provenance
Creator Arhar Holdt, Špela; Goli, Teja; Lavrič, Polona; Laskowski, Cyprian; Klemenc, Bojan; Rozman, Tadeja; Stritar Kučuk, Mojca; Krek, Simon; Krapš Vodopivec, Irena; Stabej, Marko; Kosem, Iztok
Publisher Trojina, Institute for Applied Slovene Studies; Centre for Language Resources and Technologies, University of Ljubljana
Publication Year 2019
Rights Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0); https://creativecommons.org/licenses/by-nc-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type corpus
Format application/zip; application/pdf; text/plain; charset=utf-8; downloadable_files_count: 2
Discipline Linguistics
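
The Metadata Access link above is an OAI-PMH GetRecord request. As a minimal sketch, the same request URL can be rebuilt from the handle and the Dublin Core fields of a response collected as shown below. The helper names (`build_get_record_url`, `fetch`, `dc_fields`) are illustrative only and are not part of any CLARIN.SI tooling; the parser assumes the response uses the standard Dublin Core element namespace.

```python
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

# Base URL taken from the Metadata Access field of this record.
OAI_BASE = "http://www.clarin.si/repository/oai/request"
# Standard Dublin Core element namespace (assumed for oai_dc responses).
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_get_record_url(handle, base=OAI_BASE, prefix="oai_dc"):
    """Build a GetRecord URL for a handle such as '11356/1231'."""
    params = {
        "verb": "GetRecord",
        "metadataPrefix": prefix,
        "identifier": f"oai:www.clarin.si:{handle}",
    }
    # Keep ':' and '/' unescaped so the URL matches the form used in the record.
    return f"{base}?{urlencode(params, safe=':/')}"


def fetch(url, timeout=30):
    """Download the raw XML response (performs a network request)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def dc_fields(xml_data):
    """Collect Dublin Core elements into a dict of name -> list of values."""
    root = ET.fromstring(xml_data)
    fields = {}
    for element in root.iter():
        if element.tag.startswith("{" + DC_NS + "}"):
            name = element.tag.split("}", 1)[1]
            fields.setdefault(name, []).append((element.text or "").strip())
    return fields
```

A call such as `dc_fields(fetch(build_get_record_url("11356/1231")))` would then return the record's Dublin Core fields; which fields are actually present depends on the repository's response.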