Tweet code-switching corpus Janes-Preklop 1.0

PID

Janes-Preklop is a corpus of Slovene tweets that is manually annotated for code-switching (the use of words from two or more languages within one sentence or utterance), according to the supplied typology. Words in the corpus are also automatically tagged with MSDs and lemmas.

Identifier
PID http://hdl.handle.net/11356/1154
Related Identifier http://nl.ijs.si/janes/wp-content/uploads/2017/09/Magistrsko-delo_%C5%A0pela-Reher_final.pdf
Related Identifier http://nl.ijs.si/janes/viri/rocno-oznaceni-korpusi/#Janes-Preklop
Related Identifier https://doi.org/10.1007/s10579-018-9425-z
Related Identifier http://nl.ijs.si/janes/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1154
Provenance
Creator Reher, Špela; Erjavec, Tomaž; Fišer, Darja
Publisher Jožef Stefan Institute
Publication Year 2017
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type corpus
Format text/plain; charset=utf-8; application/pdf; application/zip; downloadable_files_count: 4
Discipline Linguistics