-
FEDI Dataset
FEDI is the first task-oriented document-grounded dialogue dataset for learning from demographic information, user emotions and implicit user feedback. In its current version,... -
Document Structure in Long Document Transformers
This repository contains the data for the paper "Document Structure in Long Document Transformers", accepted at EACL 2024. Please see README.md for more information. -
Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals
This resource contains the dataset and models released as part of our EMNLP 2023 paper "Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals". -
Pose Prediction for Mobile Ground Robots Evaluation Dataset
This dataset provides ground truth robot trajectories in rough terrain for the evaluation of pose prediction approaches for mobile ground robots. It is composed of six datasets... -
Hector Enrich 2023 Radiation Mapping Dataset
Data set for the evaluation of radiation mapping methods for mobile robots accompanying our SSRR 2023 paper "Online 2D-3D Radiation Mapping and Source Localization using... -
DRZ Living Lab Tracked Robot SLAM Dataset
Data set for the evaluation of SLAM systems in challenging terrains. The data set covers four sequences with challenging terrain, each tracked with a high-performance Qualisys... -
Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future
This is the accompanying data for our paper "Annotation Error Detection: Analyzing the Past and Present for a More Coherent Future". Annotated data is an essential ingredient in... -
Lessons Learned from a Citizen Science Project for Natural Language Processing
This is the accompanying data for our paper "Lessons Learned from a Citizen Science Project for Natural Language Processing". Many Natural Language Processing (NLP) systems use... -
Analyzing Dataset Annotation Quality Management in the Wild
This is the accompanying data for the paper "Analyzing Dataset Annotation Quality Management in the Wild". Data quality is crucial for training accurate, unbiased, and... -
The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures
TYC dataset proposed in the paper "The TYC Dataset for Understanding Instance-Level Semantics and Motions of Cells in Microstructures" [ICCVW 2023]. Project page:... -
On emergence
Output files for the paper "Are Emergent Abilities in Large Language Models just In-Context Learning?". -
NLPEER: A Unified Resource for the Computational Study of Peer Review
Dataset of peer review reports and paper drafts from diverse domains and venues. We provide multiple versions of the dataset; when in doubt, download the newest version. You can... -
Self-supervised Augmentation Consistency for Adapting Semantic Segmentation
We propose an approach to domain adaptation for semantic segmentation that is both practical and highly accurate. In contrast to previous work, we abandon the use of... -
Single-stage Semantic Segmentation from Image Labels
Recent years have seen a rapid growth in new approaches improving the accuracy of semantic segmentation in a weakly supervised setting, i.e. with only image-level labels... -
Fast Axiomatic Attribution for Neural Networks
Mitigating the dependence on spurious correlations present in the training dataset is a quickly emerging and important topic of deep learning. Recent approaches include priors... -
Dense Unsupervised Learning for Video Segmentation
We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows learning dense feature representations... -
Verb Sense Labelling
Vocabulary used for the creation of sense patterns: -
Whittle Networks datasets
Datasets for the paper "Whittle Networks: A Deep Likelihood Model for Time Series". Paper: http://proceedings.mlr.press/v139/yu21c.html; code at... -
Visual Feature Track Dataset
This dataset contains 282 visual feature tracks. A visual feature track is a sequence of feature observations of the same real 3D-landmark in consecutive image frames. These... -
WWW 2019 X-Ling Question Retrieval Data v1
This repository contains the data and code to reproduce the results of our paper "Improved Cross-Lingual Question Retrieval for Community Question Answering"...