Wikipedia Edit-Turn-Pairs

Corresponding and Non-Corresponding Edit-Turn-Pairs from the English Wikipedia. The ETP-gold corpus is based on article edits and discussion page turns from the English Wikipedia. The ETP-gold-labels MTurk dataset contains the labels and metadata from the crowdsource annotation task. For the edit-turn-pair detection task, please refer to/cite: Johannes Daxenberger and Iryna Gurevych (2014): "Automatically Detecting Corresponding Edit-Turn-Pairs in Wikipedia." In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. Short Papers. For the crowdsource annotation, please refer to/cite: Emily K. Jamison and Iryna Gurevych (2014). "Needle in a Haystack: Reducing the Costs of Annotating Rare-Class Instances in Imbalanced Datasets." In: Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing.

Identifier
Source https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2355
Related Identifier https://www.aclweb.org/anthology/P14-2031/
Related Identifier https://www.aclweb.org/anthology/Y14-1030/
Metadata Access https://tudatalib.ulb.tu-darmstadt.de/oai/openairedata?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:tudatalib.ulb.tu-darmstadt.de:tudatalib/2355
Provenance
Creator Daxenberger, Johannes; Gurevych, Iryna; Jamison, Emily K.
Publisher TU Darmstadt
Publication Year 2014
Rights Creative Commons Attribution Share-Alike 4.0; info:eu-repo/semantics/openAccess
OpenAccess true
Contact https://tudatalib.ulb.tu-darmstadt.de/page/contact
Representation
Language English
Resource Type Text
Format application/octet-stream; application/zip
Version 1.0
Discipline Other