Slovene ontology of semantic types for nouns SLONEST-noun 1.0


SLONEST stands for Slovene Ontologies of Semantic Types. The first subset – SLONEST-noun 1.0 – is an ontology developed for nouns. SLONEST-noun consists of an XML file with a total of 271 categories of semantic types: 21 top-level categories, which are further divided into up to three levels of hierarchical subcategories. The ontology was developed and evaluated using data from the Collocations Dictionary of Modern Slovene (Kosem et al. 2018; https://viri.cjvt.si/kolokacije; http://hdl.handle.net/11356/1250) and the Comprehensive Slovene-Hungarian Dictionary (https://www.cjvt.si/en/research/cjvt-projects/slovene-hungarian-dictionary), both of which are being compiled at the Centre for Language Resources and Technologies, University of Ljubljana.

Each semantic type in the SLONEST-noun ontology is accompanied by a numerical ID (given in the attribute SEMCODE; e.g. "1.1.1") and the full ontology path (attribute SEMFULLNAME; e.g. "HUMAN-ACTIVITY-OTHER"). Every semantic type is provided with a definition (e.g. "Other denominations for humans related to activities."). Where relevant, especially for top-level semantic types, the corresponding semantic type (i.e. lexicographer file) from WordNet (https://wordnet.princeton.edu/) is listed, along with the level of matching ("full" or "partial"). For most semantic types, examples of Slovene lemmas or multiword units are also provided.
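As an illustration of how the SEMCODE and SEMFULLNAME attributes might be read, the following Python sketch collects them from an XML string and derives the hierarchy level from the dotted code. It is not based on the actual file: the element names (`ontology`, `semtype`), the nesting, the intermediate entries "1" and "1.1", and the pairing of "1.1.1" with "HUMAN-ACTIVITY-OTHER" are assumptions made for the example. Only the two attribute names and the two example values come from the description above.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample. The tag names, the nesting, and the entries "1" and "1.1"
# are assumptions; only the SEMCODE and SEMFULLNAME attribute names and the
# values "1.1.1" / "HUMAN-ACTIVITY-OTHER" are taken from the description.
SAMPLE = """
<ontology>
  <semtype SEMCODE="1" SEMFULLNAME="HUMAN">
    <semtype SEMCODE="1.1" SEMFULLNAME="HUMAN-ACTIVITY">
      <semtype SEMCODE="1.1.1" SEMFULLNAME="HUMAN-ACTIVITY-OTHER"/>
    </semtype>
  </semtype>
</ontology>
"""


def read_semantic_types(xml_text):
    """Return {SEMCODE: SEMFULLNAME} for every element carrying both attributes."""
    root = ET.fromstring(xml_text)
    types = {}
    # Iterate over all elements regardless of tag, since the real tag names
    # are not known here.
    for elem in root.iter():
        code = elem.get("SEMCODE")
        name = elem.get("SEMFULLNAME")
        if code is not None and name is not None:
            types[code] = name
    return types


def parent_code(code):
    """Map '1.1.1' to '1.1'; a top-level code such as '1' has no parent (None)."""
    head, sep, _ = code.rpartition(".")
    return head if sep else None


types = read_semantic_types(SAMPLE)
# Hierarchy level = number of dot-separated parts in the code.
levels = Counter(code.count(".") + 1 for code in types)
```

To try this on the distributed file, `ET.parse(path).getroot()` could replace `ET.fromstring`. Because the lookup is by attribute rather than by tag, the sketch should tolerate tag names different from the ones assumed here, provided the attributes are spelled as described.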

As the ontology was also developed for, and tested on, collocation data, a selection of collocations is provided for most categories. For every collocation, the noun headword and the collocates are explicitly labelled, and the grammatical structure (id and name) is given, based on the most recent database of Slovene collocations (http://hdl.handle.net/11356/1415).

The ontology was developed as part of the KOLOS project. The authors acknowledge that the project, titled "Collocation as a basis for language description: semantic and temporal perspectives" (J6-8255), was financially supported by the Slovenian Research Agency.

Identifier
PID http://hdl.handle.net/11356/1428
Related Identifier https://www.cjvt.si/kolos/
Metadata Access http://www.clarin.si/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:www.clarin.si:11356/1428
Provenance
Creator Kosem, Iztok; Pori, Eva; Gantar, Polona; Logar, Nataša; Krek, Simon; Laskowski, Cyprian; Arhar Holdt, Špela; Čibej, Jaka; Dobrovoljc, Kaja; Gorjanc, Vojko; Klemenc, Bojan; Ljubešić, Nikola
Publisher Centre for Language Resources and Technologies, University of Ljubljana
Publication Year 2020
Rights Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0); https://creativecommons.org/licenses/by-sa/4.0/; PUB
OpenAccess true
Contact info(at)clarin.si
Representation
Language Slovenian; Slovene
Resource Type lexicalConceptualResource
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics