-
German causal language annotations and lexicon (verbs, nouns, prepositions) (DE)
Annotations of causal verbs, nouns and prepositions in context and lexicon file for causal verbs, nouns and prepositions. -
Early Chinese Periodicals Online (ECPO) [Metadata]
ECPO joins several important digital collections of the early Chinese press and puts them into a single overarching framework. In the first phase, several databases on early... -
Negative Sampling for Learning Knowledge Graph Embeddings
Reimplementation of four KG factorization methods and six negative sampling methods. Abstract: Knowledge graphs are large, useful, but incomplete knowledge repositories. They... -
Topological Field Labeler for German
This resource contains the code of the topological labeler used in the paper: Do and Rehbein (2020). "Parsers Know Best: German PP Attachment Revisited". For this tool, labeling... -
Supplementary material for: Siedlungsarchäologie im Alpenvorland XV. Die Pfahlbaus...
The volume "Siedlungsarchäologie im Alpenvorland XV" presents the results of the excavations and the dendrochronological investigations at the pile-dwelling site Sipplingen-Osthafen on... -
Genre-sensitive Neural Situation Entity classifier (DE, EN)
This is a classifier for situation entity types as described in Becker et al., 2017. These clause types depend on a combination of syntactic-semantic and contextual features. We... -
Paris und Versailles in Reisebeschreibungen deutscher Architekten um 1700. Pi...
This is the research data related to the publication "Paris und Versailles in Reisebeschreibungen deutscher Architekten um 1700. Pitzler, Corfey und Sturm" (Paris and Versailles... -
Pre-trained POS tagging models for German social media
Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007), the biLSTM-char-CRF tagger (Reimers & Gurevych 2017), and Online-Flors (Yin et al. 2015).... -
ACL word segmentation correction
The data in this collection consists of two parallel directories, one ("raw") containing the raw text of 18850 articles from the ACL 2013/02 collection, the other... -
Supplementary materials for: Die eisenzeitliche und römische Siedlung von Tönis...
Overall excavation plan, large format, as PDF; phase plan of the excavation, large format, as PDF; table of all features, as PDF. -
Encoder-Decoder Model for Semantic Role Labeling
Abstract (Daza & Frank 2019): We propose a Cross-lingual Encoder-Decoder model that simultaneously translates and generates sentences with Semantic Role Labeling annotations... -
AMR parse quality prediction [Source Code]
Accuracy prediction for AMR parsing: predicts 33 accuracy metrics for a given sentence and its (automatic) AMR parse. Abstract (Opitz and Frank, 2019): Semantic proto-role... -
tweeDe
A German UD Twitter treebank, with >12,000 tokens from 519 tweets, annotated in the Universal Dependencies framework -
Tool for Extracting PP Attachment Disambiguation Dataset
This resource contains code to extract a PP attachment disambiguation dataset as described in the paper: Do and Rehbein (2020). "Parsers Know Best: German PP Attachment... -
Accompanying data for: "PIA 1. Bericht des Pilotprojekts Inwertsetzung Ausgrabungen...
1) Neolithic settlement Cleebronn "Langwiesen IV": photo documentation of flint finds 2) Early medieval cemetery Cleebronn "Langwiesen IV": feature drawings,... -
Affixoid Dataset (DE)
The dataset contains the manual annotations for the COLING 2018 submission "Distinguishing affixoid formations from compounds" by Josef Ruppenhofer, Michael Wiegand, Rebecca... -
Kaiserchronik - digital
The digital edition presents the entire manuscript transmission of the Kaiserchronik, both known and extant, in dual format: digital facsimiles of the manuscripts (where these... -
Neural Rerankers for Dependency Parsing
This resource contains code for different types of neural rerankers (RCNN, RCNN-shared and GCN) from the paper: Do and Rehbein (2020). "Neural Reranking for Dependency Parsing:... -
Real-World PP Attachment Disambiguation Dataset
This resource contains a German dataset for real-world PP attachment disambiguation. The creation, analysis and experiment results of the dataset are described in the paper: Do... -
Lexicon of Abusive Words (EN)
This gold standard contains a bootstrapped lexicon of abusive words. The lexicon comprises a large set of English negative polar expressions annotated as either abusive or not.