41 datasets found

Keywords: Czech

Filter Results
  • Metonymy in Word-Formation: Russian, Czech, and Norwegian

    Publication abstract: A foundational goal of cognitive linguistics is to explain linguistic phenomena in terms of general cognitive strategies rather than postulating an...
  • Parent-child conversations about motion events (Russian, Russian-German, Czech)

    The dataset contains transcripts of parent-child communication over picture stimuli depicting motion events. The transcripts are partly-coded and transcribed in purpose of...
  • CoNLL-based Extended Czech Named Entity Corpus 2.0

    This is a Czech Named Entity Corpus 2.0 transformed into the CoNLL format. The original corpus can be downloaded from: http://hdl.handle.net/11858/00-097C-0000-0023-1B22-8. The...
  • VALLEX 3.0

    VALLEX 3.0 provides information on the valency structure (combinatorial potential) of verbs in their particular senses, which are characterized by glosses and examples. VALLEX...
  • Czech Models (MorfFlex CZ 160310 + PDT 3.0) for MorphoDiTa 160310

    Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ...
  • Czech Legal Text Treebank

    The Czech Legal Text Treebank (CLTT) is a collection of 1133 manually annotated dependency trees. CLTT consists of two legal documents: The Accounting Act (563/1991 Coll., as...
  • Czech Models for Korektor 2

    The Czech models for Korektor 2 created by Michal Richter, 02 Feb 2013. The models can either perform spellchecking and grammarchecking, or only generate diacritical marks.
  • Czech HS Contracts Dataset (CHSC) 1.0

    Czech Contracts dataset was created as a part of the thesis Low-resource Text Classification (2021), A. Szabó, MFF UK. Contracts are obtained from the Hlídač Státu web portal....
  • Khresmoi Query Translation Test Data 2.0

    This package contains data sets for development and testing of machine translation of medical queries between Czech, English, French, German, Hungarian, Polish, Spanish ans...
  • sqad 2.1

    Simple question answering database version 2.1 (SQAD_v2.1) created from Czech Wikipedia. Each record of SQAD consist of four files (in vertical form provided with lemmatization...
  • VALLEX 4.5

    VALLEX 4.5 provides information on the valency structure (combinatorial potential) of Czech verbs in their particular senses (almost 4 700 verbs in more than 11 080 lexical...
  • ORTOFON v1: balanced corpus of informal spoken Czech with multi-tier transcri...

    ORTOFON v1 is designed as a representation of authentic spoken Czech used in informal situations (private environment, spontaneity, unpreparedness etc.) in the area of the whole...
  • sqad 3.0

    Simple question answering database version 3 (SQAD v3) created from Czech Wikipedia. New version consits of 13477 records. Each record of SQAD consist of multiple files -...
  • MorfFlex CZ 161115

    Czech morphological dictionary developed originally by Jan Hajič as a spelling checker and lemmatization dictionary. Currently it contains full morphological information for...
  • Semantic annotation of noun/verb conversion in Czech

    The item contains a list of 2,058 noun/verb conversion pairs along with related formations (word-formation paradigms) provided with linguistic features, including semantic...
  • NomVallex I.

    The NomVallex I. lexicon describes valency of Czech deverbal nouns belonging to three semantic classes, i.e. Communication (dotaz 'question'), Mental Action (plán 'plan') and...
  • Czech Models (MorfFlex CZ + PDT) for MorphoDiTa

    Czech models for MorphoDiTa, providing morphological analysis, morphological generation and part-of-speech tagging. The morphological dictionary is created from MorfFlex CZ and...
  • Khresmoi Summary Translation Test Data 2.0

    This package contains data sets for development (Section dev) and testing (Section test) of machine translation of sentences from summaries of medical articles between Czech,...
  • VALLEX 4.0 (2021-02-12)

    VALLEX 4.0 provides information on the valency structure (combinatorial potential) of verbs in their particular senses; each sense is by a gloss and examples. VALLEX 4.0...
  • Khresmoi Summary Translation Test Data 1.1

    This package contains data sets for development and testing of machine translation of sentences from summaries of medical articles between Czech, English, French, and German.
You can also access this registry using the API (see API Docs).