List of formulaic sequences in spoken Slovenian

PID

This document contains 2,374 formulaic sequences in spoken Slovenian, i.e. frequently recurring strings of two to five words, manually annotated for syntactic structure, pragmatic function, and dictionary relevance. The list of sequences with a minimum frequency threshold of 20/million is based on the Frequency lists of word-level n-grams from normalized word forms in GOS 1.0 (http://hdl.handle.net/11356/1271) and contains the union of top-1,000 formulaic sequences ranked by frequency and five association measures (Dice, t-test, MI, MI3, simple-LL).

Note that there exists a related entry, "List of formulaic sequences in standard written Slovenian", http://hdl.handle.net/11356/1280.

Identifier
PID http://hdl.handle.net/11356/1279
Related Identifier http://slovnica.ijs.si/wp-content/uploads/2019/12/NSSS_DS5-nizi_navodila_v6.pdf
Related Identifier http://slovnica.ijs.si/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1279
Provenance
Creator Dobrovoljc, Kaja; Roblek, Rebeka; Vianello, Chiara; Diaci, Ajda; Vuga, Zala
Publisher Jožef Stefan Institute; Centre for Language Resources and Technologies, University of Ljubljana
Publication Year 2020
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type lexicalConceptualResource
Format application/octet-stream; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics