Model of English OCR Post-Correction

This is an OpenNMT-py model for post-correcting OCR errors in English text.

For usage instructions, see: https://github.com/mikahama/natas
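
As a rough illustration of how the model might be called through natas, the sketch below wraps the library's ocr_correct_words function and keeps the first candidate for each token. It assumes that natas and its models are installed as described in the repository above, and that ocr_correct_words returns one list of candidate corrections per input word. The helper name correct_tokens and the pluggable corrector argument are hypothetical and not part of natas.

```python
from typing import Callable, List, Optional, Sequence

# A corrector maps a list of words to one candidate list per word.
Corrector = Callable[[Sequence[str]], List[List[str]]]


def _natas_corrector(words: Sequence[str]) -> List[List[str]]:
    # Imported lazily so the helper below can be tried out without natas.
    # Assumption: natas and its models are installed per the natas README.
    import natas

    return natas.ocr_correct_words(list(words))


def correct_tokens(
    tokens: Sequence[str], corrector: Optional[Corrector] = None
) -> List[str]:
    """Return the first suggested correction for each token.

    Tokens with no candidates are passed through unchanged.
    """
    if corrector is None:
        corrector = _natas_corrector
    candidates = corrector(tokens)
    return [
        cands[0] if cands else token
        for token, cands in zip(tokens, candidates)
    ]
```

Calling correct_tokens(["paft", "friendlhip"]) with the default corrector would pass both words to natas. Taking only the first candidate is a simplification for this sketch; an application may prefer to inspect the full candidate lists.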

The model is part of the following publication:

Mika Hämäläinen and Simon Hengchen. 2019. From the Paft to the Fiiture: a Fully Automatic NMT and Word Embeddings Method for OCR Post-Correction. In the Proceedings of Recent Advances in Natural Language Processing.

Identifier
PID: http://hdl.handle.net/11304/e5a3013a-0854-4d09-b40f-4c7dbf19cb40
Metadata Access: https://b2share.eudat.eu/api/oai2d?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:b2share.eudat.eu:b2rec/3abc1bd0dd0c44e7a3d65c5cdf2607fa

Provenance
Creator: Hämäläinen, Mika; Hengchen, Simon
Publisher: CLARIN
Publication Year: 2020
Rights: info:eu-repo/semantics/openAccess; CC BY 4.0
Open Access: true

Representation
Discipline: Linguistics