-
A harmonised testsuite for social media POS tagging (DE)
A harmonised POS testsuite of web data, CMC and Twitter microtext, with word forms and STTS pos tags (+ some additional CMC-specific tags). UD pos tags have been automatically... -
GER_SET: Situation Entity Type labelled corpus for German
Semantic clause types, also called Situation Entity (SE) types (Smith, 2003) are linguistic characterizations of aspectual properties shown to be useful for tasks like... -
GermEval-2018 Corpus (DE)
This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared on Offensive Language Detection. -
XRF analysis on metal castings from Lübeck, Germany
XRF analysis was performed in August 2022 on 7 metal artefacts at different places in Lübeck, Germany: Epitaph of Bartholomäus Heisegger (St. Anne's... -
Replication Data for: Zur Determiniererlosigkeit bei prädikativ verwendeten z...
This data set contains the replication data for the article "Zur Determiniererlosigkeit bei prädikativ verwendeten zählbaren Nomen im Deutschen: Korpusdaten und ihre... -
Subset of KoLaS (Commented Learner Corpus Academic Writing), Plain Text Version
For this upload, all Word files (.doc and .docx) in the original KoLaS corpus were converted to plain text. For more information... -
Germeval 2017 Embeddings
Word Embeddings to our paper and conll converted data of the shared task -
Evaluation of neural coreference annotation of simplified German
This poster presents our evaluation of a neural coreference resolver (Schröder et al. 2021) on simplified German texts as well as the results of an annotation study that we... -
Türkisch-Englisch-Deutsch bei Herkunftssprechern (TEDH)
The TEDH has been created as part of the project "Foreign Language Acquisition in German-Turkish bilinguals". The TEDH Corpus contains interviews in three languages:... -
Gold and Silver fire gilding of copper/bronze objects
By means of the NRCA signal generated by objects on the INES beam line we intend to quantify the amount of gold and silver applied on the surface of art objects. Replica samples... -
Time of flight neutron diffraction at pulsed neutron source using a side-on G...
This dataset has no description
-
Diffusion Kinetics in Multilayer Organic Films for OLEDs
Conjugated organic semiconductors form an exciting class of materials that can be used in a variety of cutting edge technologies including organic light-emitting diodes (OLED),... -
An archeometallurgical study of a German Harquebusier Breastplate dated to th...
In a preliminary study we investigated the metal characteristics of Harquebusier breastplates. The main body is ferrite (source bloomery iron). The object shows corrosion in the... -
Posture-verb constructions in Dutch and German
The dataset is part of the research published in Okabe (forthcoming). The data comprise two .csv files: database_nl.csv and database_de.csv. -
Covert translation: Business Communication (new)
Translation corpora of original texts with translations and comparable texts from the genre external business communication. Übersetzungs- und Vergleichskorpus mit authentischen... -
Consecutive and Simultaneous Interpreting (CoSi)Konsekutives und Simultanes D...
Audio and video recordings of three lectures in Portuguese, one simultaneously and two consecutively professionally interpreted into German. For the simultaneouly interpreted... -
BIPODE Media
Additional data from the BIPODE project. -
TU_DE_L1-Korpus
The TÜ_DE-L1-Korpus is a corpus of spoken child language that has been collected in the project Specific Language Impairment and Early Successive Language Acquisition... -
PhonBLA Longitudinalstudie Hamburg Media
Additional data from the PhonBLA Longitudinalstudie Hamburg project. -
ZISA_BR_ZI
Sub-corpus of the ZISA project with one Italian and one Portuguese learner. The ZISA project contains audio recordings of five adult learners of German as an L2 with L1s...