-
SYN v4: large corpus of written Czech
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mostly the period of 1990–2014 and it is a traditional corpus (as opposed to the... -
SYN2013PUB: corpus of written Czech newspapers
Corpus of contemporary Czech newspapers and magazines sized 935 MW. It contains various titles published between 2005 and 2009. The corpus is lemmatized and morphologically tagged... -
SYN2010: balanced corpus of written Czech
Balanced corpus of contemporary written Czech sized 100 MW. It was created as a representation of written language from 2005–2009 and thus it contains a wide range of text types... -
SYN v9: large corpus of written Czech
Corpus of contemporary written (printed) Czech sized 4.7 GW (i.e. 5.7 billion tokens). It covers mostly the 1990–2019 period and features rich metadata including detailed... -
SYN2005: balanced corpus of written Czech
Balanced corpus of contemporary written Czech sized 100 MW. It was created as a representation of written language from 2000–2004 and thus it contains a wide range of text types... -
Thesaurus linguae Latinae
The Thesaurus linguae Latinae is the first comprehensive dictionary of ancient Latin; it is compiled on the basis of all Latin texts surviving from antiquity (until AD... -
AKCES 1
The AKCES 1 corpus includes texts written in Czech by young people (native speakers); it contains the same data as the SKRIPT 2012 corpus -
SYN2009PUB: corpus of Czech newspapers
Corpus of contemporary Czech newspapers and magazines sized 700 MW. It contains various titles published between 1995 and 2007. The corpus is lemmatized and morphologically tagged... -
SYN2015: representative corpus of written Czech
Representative corpus of contemporary written Czech sized 100 MW. It was created as a representation of printed language from 2010–2014 containing a wide range of text types... -
SYN2006PUB: corpus of Czech newspapers
Corpus of contemporary Czech newspapers and magazines sized 300 MW. It contains various titles published between the end of 1989 and 2004. The corpus is lemmatized and... -
Das Kiezdeutschkorpus "KiDKo": Zusatzkorpora
Additional corpus I "Frog Story": oral presentation of the picture story (Mayer 1969) and written reproduction of the "Frog Story" from memory. Additional corpus... -
Das Kiezdeutschkorpus "KiDKo": Einstellungen (KiDKo/E)
A corpus of e-mails and reader comments from the public debate about "Kiezdeutsch" ("Einstellungen", i.e. attitudes). Additional to the Kiezdeutsch corpus,...