99 datasets found

Keywords: morphology

Filter Results
  • Corpus extraction tool LIST 1.2

    The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI...
  • Inflectional lexicon srLex 1.1

    srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • ILSP Conceptual Dictionary of Modern Greek (ELEXIS)

    ConceptNet-el (Εννοιολογικό Λεξικό της Νέας Ελληνικής ΙΕΛ). ConceptNet-el is a conceptual dictionary of Modern Greek that assumes the form of a linguistic ontology. It...
  • Frequency lists of word parts from the Gigafida 2.0 corpus

    Frequency lists of words split into word parts were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (https://viri.cjvt.si/gigafida/) using the LIST corpus...
  • Corpus extraction tool LIST 1.0

    The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI...
  • Frequency lists of word parts from the GOS 1.0 corpus

    Frequency lists of words split into word parts were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle.net/11356/1040) using the LIST corpus extraction tool...
  • Inflectional lexicon hrLex 1.3

    hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, MSD features, UPOS, morphological features, frequency,...
  • Irish National Morphology Database (ELEXIS)

    Bunachar Gramadaí is a large collection of Irish words which records their inflected forms and linguistic properties. The database contains some 43,000 entries and covers nouns,...
  • Inflectional lexicon hrLex 1.2

    hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • Inflectional lexicon hrLex 1.1

    hrLex is a large inflectional lexicon of Croatian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • Morphological patterns from the Sloleks 2.0 lexicon 1.0

    This entry consists of XML files with 96,290 lexical units (nouns, verbs, adjectives, and adverbs) from the Sloleks Morphological Lexicon of Slovene 2.0...
  • Morphological lexicon Sloleks 2.0

    Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains...
  • Morphological lexicon Sloleks 1.0

    Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains...
  • Beseda Corpus Lemmatisation Lexicon

    Beseda Corpus Lemmatisation Lexicon for Slovenian language was generated at the Fran Ramovš Institute of Slovenian Language, primarily through inflection of open class words...
  • Inflectional lexicon srLex 1.0

    hrLex is an large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD) triple. The MSD tagset follows the revised MULTEXT-East V4...
  • Inflectional lexicon srLex 1.2

    srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, frequency, per-million frequency) 5-tuple. The (wordform, lemma,...
  • Inflectional lexicon srLex 1.3

    srLex is a large inflectional lexicon of Serbian language where each entry consists of a (wordform, lemma, MSD, MSD features, UPOS, morphological features, frequency,...
  • List of word relations from the Sloleks 2.0 lexicon 1.0

    This entry consists of a TSV file containing a list of 66,347 Slovene word pairs from the Sloleks Morphological Lexicon of Slovene (v2.0; http://hdl.handle.net/11356/1230) that...
  • Morphological lexicon Franček

    Morphological Lexicon Franček for Slovenian language contains non-stressed inflected word forms for 96,402 entries (out of 100,006 total) of the Franček Portal Headword List....
  • Morphological lexicon Sloleks 1.2

    Sloleks is the reference morphological lexicon for Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains...
You can also access this registry using the API (see API Docs).