33 datasets found

Keywords: learner corpus

Filter Results
  • HABE-IXA euskarazko idazmen proben corpusa HABE-IXA Basque written test corpus

    This corpus contains essays written in official HABE exams for assessing student's knowledge of the Basque language. We have collected 120 essays in each of the B1, B2, C1 and...
  • Slovene learner corpus KOST 2.0

    The corpus of Slovene as a foreign language KOST (Korpus slovenščine kot tujega jezika) contains 8,347 texts (almost 1.3 million words) written by adult speakers for whom...
  • Business English learner speech corpus SAPS

    SAPS is a specialized speech corpus which contains business meeting simulations in English between undergraduate students of Languages for Business and Economics at the School...
  • Corpus of Romanian Academic Genres ROGER

    The corpus contains academic papers from eight disciplines, written by the Romanian students in native Romanian and English L2. The corpus was collected over a three-year period...
  • Slovene learner corpus KOST 1.0

    The corpus of Slovene as a foreign language KOST (Korpus slovenščine kot tujega jezika) contains 6,311 texts (just over 1 million words) written by adult speakers for whom...
  • AKCES 5 (CzeSL-SGT)

    Essays written by non-native learners of Czech, a part of AKCES/CLAC – Czech Language Acquisition Corpora. CzeSL-SGT stands for Czech as a Second Language with Spelling, Grammar...
  • Czesl - Universal Dependencies Release 0.5

    Syntactic annotation of 1600 sentences from the Czesl-MAN corpus using the framework of Universal Dependencies 2.3
  • KAMOKO: KAsseler MOrgenstern KOrpus (2021-02-09)

    KAMOKO is a structured and commented french learner-corpus. It addresses the central structures of the French language from a linguistic perspective (18 different courses). The...
  • KAMOKO: KAsseler MOrgenstern KOrpus

    KAMOKO is a structured and commented french learner-corpus. It addresses the central structures of the French language from a linguistic perspective (18 different courses). The...
  • AKCES 5 (CzeSL-SGT) Release 2

    Essays written by non-native learners of Czech, a part of AKCES/CLAC – Czech Language Acquisition Corpora. CzeSL-SGT stands for Czech as a Second Language with Spelling, Grammar...
  • KAMOKO-Digitalizer

    This editor was developed especially for the needs of the KAMOKO project (https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-3261). The editor allows the quick entry...
  • KoKo German L1 Learner Corpus v1

    The KoKo Corpus is an error-annotated learner corpus of L1 German speakers. It has been created with the aim to investigate and describe the writing skills of German-speaking...
  • Kolipsi-2 Corpus v1.1

    The Kolipsi-2 Corpus is a written learner corpus of German and Italian L2 speakers originating from South Tyrol (Italy). It has been developed as a by-product of the KOLIPSI II...
  • MERLIN Written Learner Corpus for Czech, German, Italian 1.0

    The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR)...
  • Kolipsi-2 Corpus v1.0

    The Kolipsi-2 Corpus is a written learner corpus of German and Italian L2 speakers originating from South Tyrol (Italy). It has been developed as a by-product of the KOLIPSI II...
  • MERLIN Written Learner Corpus for Czech, German, Italian 1.1

    The MERLIN corpus is a written learner corpus for Czech, German, and Italian that has been designed to illustrate the Common European Framework of Reference for Languages (CEFR)...
  • KoKo German L1 Learner Corpus v3

    The KoKo Corpus is an error-annotated learner corpus of L1 German speakers. It has been created with the aim to investigate and describe the writing skills of German-speaking...
  • LEONIDE - Longitudinal Learner Corpus in Italiano, Deutsch and English 1.1

    LEONIDE is a longitudinal corpus of student essays documenting the language competences and writing development of lower secondary school students in three different languages....
  • Beldeko Summary Corpus v1.1.0

    Beldeko Summary Corpus v1.1.0 The Beldeko (Belgisches Deutschkorpus) Summary Corpus is a learner corpus that consists of summaries written by advanced L2 German learners (CEF...
  • KoKo German L1 Learner Corpus v2

    The KoKo Corpus is an error-annotated learner corpus of L1 German speakers. It has been created with the aim to investigate and describe the writing skills of German-speaking...
You can also access this registry using the API (see API Docs).