27 datasets found

Keywords: inflection

Filter Results
  • Replication Data for: Predicting Stress in Russian using Modern Machine-Learn...

    This dataset consists of a TSV file with five columns of data originating in Zaliznyak's Grammar and Dictionary (1977). The data was programmatically scraped from Giella...
  • Toposław 2 (2016-05-31)

    Toposław 2 is an editor of multi-world unit inflection lexicons.
  • Polimorf

    PoliMorf is a morphological dictionary for Polish resulting from the standardization and merger of Morfeusz SGJP and Morfologik. The present version includes extended...
  • MWELexicon

    Lexicon of 55k multi-word lexical units linked to plWordNet, together with description of their syntactic bahaviour obtained in constraint language (WCCL).
  • MWELexicon 1.1

    Lexicon of 56,5k multi-word lexical units linked to plWordNet, together with description of their syntactic bahaviour obtained in constraint language (WCCL).
  • Morfeusz 2

    Morfeusz 2 is a dictionary based morphological analyser and generator for Polish. This version of the program is decoupled from the dictionary. Two dictionaries of Polish...
  • Toposław 2

    Toposław 2 is an editor of multi-word unit inflection lexicons.
  • Toposław

    Toposław is an editor of multi-word unit inflection lexicons.
  • Inflectional lexicon srLex 1.1

    srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • Inflectional lexicon hrLex 1.3

    hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, MSD features, UPOS, morphological features, frequency,...
  • Irish National Morphology Database (ELEXIS)

    Bunachar Gramadaí is a large collection of Irish words which records their inflected forms and linguistic properties. The database contains some 43,000 entries and covers nouns,...
  • Inflectional lexicon hrLex 1.2

    hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • Inflectional lexicon hrLex 1.1

    hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • Morphological lexicon Sloleks 3.0

    Sloleks is a reference morphological lexicon of Slovene that was developed to be used in various NLP applications and language manuals. It contains Slovene lemmas, their...
  • Morphological lexicon Sloleks 2.0

    Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains...
  • Morphological lexicon Sloleks 1.0

    Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains...
  • Beseda Corpus Lemmatisation Lexicon

    Beseda Corpus Lemmatisation Lexicon for Slovenian language was generated at the Fran Ramovš Institute of Slovenian Language, primarily through inflection of open class words...
  • MULTEXT-East non-commercial lexicons 4.0

    The MULTEXT-East morphosyntactic lexicons have a simple structure, where each line is a lexical entry with three tab-separated fields: (1) the word-form, the inflected form of...
  • Inflectional lexicon srLex 1.0

    hrLex is an large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD) triple. The MSD tagset follows the revised MULTEXT-East V4...
  • Inflectional lexicon srLex 1.2

    srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
You can also access this registry using the API (see API Docs).