Core vocabulary for Slovenian as L2 1.0

PID

The Core vocabulary for Slovenian as L2 is based on an analysis of the vocabulary appearing in the KUUS corpus (http://hdl.handle.net/11356/1696), which includes textbooks for Slovenian as a second and foreign language. By exporting lemmas, comparing them with the Reference list of Slovene frequent common words (Pollak et al. 2020, http://hdl.handle.net/11356/1346) and manual review, a list of 5273 words was compiled. The lemmas were classified into the first three CEFR levels. The list includes 350 words with the assigned label A1-core, 864 words with the label A1-larger, 1451 words with the label A2 and 2608 words at level B1. The file is in a tab separated format, containing lemma, part-of-speech (following the MULTEXT-East tagset for Slovenian), the information if the lemma appears in the Reference List of Slovene Frequent Common Words or not, and the relative average frequency.

The word lists are presented in more detail in: KLEMEN, Matej, ARHAR HOLDT, Špela, POLLAK, Senja, KOSEM, Iztok, HUBER, Damjan, LUTAR, Mateja, 2022: Korpus učbenikov za učenje slovenščine kot drugega in tujega jezika. Nataša Pirih Svetina, Ina Ferbežar (eds.): Na stičišču svetov: slovenščina kot drugi in tuji jezik. Obdobja 41. Ljubljana: Založba Univerze v Ljubljani. 165–174. DOI: https://doi.org/10.4312/Obdobja.41.2784-7152.

Identifier
PID http://hdl.handle.net/11356/1697
Related Identifier https://doi.org/10.4312/Obdobja.41.2784-7152
Related Identifier https://centerslo.si/KUUS
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1697
Provenance
Creator Klemen, Matej; Arhar Holdt, Špela; Pollak, Senja
Publisher Centre for Slovene as a Second and Foreign Language, University of Ljubljana; Centre for Language Resources and Technologies, University of Ljubljana
Publication Year 2022
Rights CLARIN.SI Licence ACA ID-BY-NC-INF-NORED 1.0; https://clarin.si/repository/xmlui/page/licence-aca-id-by-nc-inf-nored-1.0; ACA
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type lexicalConceptualResource
Format text/plain; charset=utf-8; text/plain; downloadable_files_count: 1
Discipline Linguistics