The Database of Lithuanian multiword expressions

The Database of Lithuanian multiword expressions (MWEs) has been freely accessible for online search since 2019 at https://resursai.pastovu.vdu.lt/paieska/paprastoji. It contains two-word and three-word MWEs extracted from the DELFI.lt corpus, which represents news texts on various topics (https://klc.vdu.lt/pastovuSearch.html). Initially, 12,000 MWEs (mostly collocations, with a few idioms) were included in the database. In 2022, the database was updated by adding new collocations from the same corpus and by identifying arbitrary collocations: of approximately 19,000 collocations, approximately 9,000 are marked as arbitrary, i.e., as having lexical collocability restrictions. The database provides rich information about the usage of collocations: lemmas, word forms, frequencies (in the DELFI.lt corpus), morphological information, syntactic relations, grammatical variants, text genres, and usage examples. Cases of usage variation are also illustrated, for example, changes in word order or insertions between collocation constituents.

Identifier
PID http://hdl.handle.net/20.500.11821/49
Related Identifier https://arka.pastovu.vdu.lt/
Metadata Access https://clarin.vdu.lt/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin.vdu.lt:20.500.11821/49
Provenance
Creator Bielinskienė, Agnė; Boizou, Loïc; Bumbulienė, Ieva; Kovalevskaitė, Jolanta; Krilavičius, Tomas; Mandravickaitė, Justina; Rimkutė, Erika; Vaičenonienė, Jurgita; Vilkaitė-Lozdienė, Laura
Publisher Vytautas Magnus University
Publication Year 2022
Rights ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT; https://clarin.vdu.lt/licenses/eula/ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm; ACA
OpenAccess true
Contact info(at)clarin.vdu.lt
Representation
Language Lithuanian
Resource Type toolService
Format text/plain; application/zip; downloadable_files_count: 2
Discipline Linguistics