Corpus of the Contemporary Lithuanian Language

PID

Corpus of the Contemporary Lithuanian Language, which comprises 208 million words, is a collection of texts designed to represent the current Lithuanian. The corpus has been compiled since 1990. The corpus is designed to represent as wide a range of contemporary written Lithuanian as possible. The largest part of the corpus is comprised of General Press (texts from regional and national newspapers), Popular Press, and Special Press (specialized newspapers and magazines). The rest of the corpus consists of Fiction, Nonfiction, Administrative documents, and Spoken language. The corpus is morphologically annotated and freely accessible for online search at http://corpus.vdu.lt.

Identifier
PID http://hdl.handle.net/20.500.11821/16
Related Identifier http://corpus.vdu.lt
Metadata Access https://clarin.vdu.lt/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:clarin.vdu.lt:20.500.11821/16
Provenance
Creator Utka, Andrius; Rimkutė, Erika; Kovalevskaitė, Jolanta; Bielinskienė, Agnė; Petkevičius, Mažvydas; Petrauskaitė, Rūta; Mikelionienė, Jurgita
Publisher Vytautas Magnus University
Publication Year 2017
OpenAccess true
Contact info(at)clarin.vdu.lt
Representation
Language Lithuanian
Resource Type corpus
Format downloadable_files_count: 0
Discipline Linguistics