Corpus of Discourse on Crime

PID

Specialised "Corpus of Discourse on Crime" is synchronic, monolingual, unannotated, consists of two subcorpora. Subcorpus 1: all texts on crime, published in criminal columns on the most popular Lithuanian web portals (15min.lt, delfi.lt, lrytas.lt) and other sources (police websites, specialized newspaper Akistata). Period: from 6 September 2015 to 5 October 2015. Size: 329,227 tokens. Subcorpus 2: texts on two crimes that have caused widespread public resonance: (a) Suspect AB (15min.lt, delfi.lt, lrytas.lt, tv3.lt, individual authors, police websites, Akistata). Period: from 2 January 2016 to 28 January 2016. Size: 32,915 tokens. (b) Suspect GK (15min.lt, delfi.lt, lrytas.lt, tv3.lt, police websites). Period: from 26 January 2017 to 14 March 2017. Size: 48,849 tokens. The selection of the texts meets the criteria of readability and accessibility. The principle of random selection was less relevant, as the corpus included all the texts on crimes published in the scheduled sources. The 2nd subcorpus consists of texts on crimes against children and provides the opportunity to obtain more emotional evaluation data. The analysis of this corpus is presented in the dissertation by S. Jakimovienė “Evaluation in the Discourse on Crime: Identification and Interpretation”.

Identifier
PID http://hdl.handle.net/20.500.11821/37
Metadata Access https://clarin.vdu.lt/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin.vdu.lt:20.500.11821/37
Provenance
Creator Jakimovienė, Sigita
Publisher Vytautas Magnus University
Publication Year 2020
Rights PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT; https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm; PUB
OpenAccess true
Contact info(at)clarin.vdu.lt
Representation
Language Lithuanian
Resource Type corpus
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics