Post-edited and error annotated machine translation corpus PE²rr 1.0
The PE²rr corpus contains source language texts from different domains along with their automatically generated translations into several morphologically rich languages, their... -
DigiLing e-Learning Hub: e-Courses for Digital Linguistics
The files represent exported e-learning resources created within the DigiLing project, www.digiling.eu. We have identified seven core subjects in Digital Linguistics and built... -
APE Shared Task WMT17: Human Post-edits Test Data DE-EN
Human post-edited test sentences for the WMT 2017 Automatic post-editing task. The data consist of 2,000 tokenized English sentences from the IT domain. Source... -
Test Data DE-EN APE Shared Task WMT17
Test data for the WMT 2017 Automatic post-editing task (the same data used for the Sentence-level Quality Estimation task). They consist of German-English triplets (source and... -
Test Data EN-DE MT_NMT APE Shared Task WMT18
Test data for the WMT 2018 Automatic post-editing task. They consist of English-German pairs (source and target) belonging to the information technology domain and already... -
APE Shared Task WMT18: Human Post-edits and References Test Data EN-DE PBSMT
Human post-edited and reference test sentences for the En-De PBSMT WMT 2018 Automatic post-editing task. Each file consists of 2,000 German sentences belonging to the... -
APE Shared Task WMT17: Human Post-edits Test Data EN-DE
Human post-edited test sentences for the WMT 2017 Automatic post-editing task. The data consist of 2,000 tokenized German sentences from the IT domain. Source... -
WMT18 APE Shared Task: En-DE NMT Train and Dev Data
Training and development data for the WMT 2018 Automatic post-editing task. They consist of English-German triplets (source, target and post-edit) belonging to the information... -
Test Data EN-DE APE Shared Task WMT17
Test data for the WMT 2017 Automatic post-editing task (the same data used for the Sentence-level Quality Estimation task). They consist of 2,000 English-German pairs (source and... -
WMT17 En-De APE Shared Task Data
Training data for the WMT 2017 Automatic post-editing task (the same data used for the Sentence-level Quality Estimation task). They consist of 11,000 English-German triplets... -
QT21 Data
Post-editing and MQM annotations produced by the QT21 project. As described in @InProceedings{specia-etal_MTSummit:2017, author = {Specia, Lucia and Kim Harris and... -
WMT17 De-En APE Shared Task Data
Training and development data for the WMT 2017 Automatic post-editing task (the same data used for the Sentence-level Quality Estimation task). They consist of German-English... -
Depfix: Automatic Post-editing of SMT
Depfix, a tool for Automatic Post-editing of SMT. See the project website for more information. -
Test Data EN-DE MT_PBSMT APE Shared Task WMT18
Test data for the WMT 2018 Automatic post-editing task. They consist of English-German pairs (source and target) belonging to the information technology domain and already...