NER-Modell 22 des Projekts Dehmel Digital

Dataset

DOI

This dataset contains two types of resources: Firstly, one Named Entity Recognition model developed in the context of the project "Dehmel digital" for the automatic annotation of persons, places, artworks and organisations in german-speaking letters from the period around 1900. The training corpus for model 20 consists of circa 270,000 manually annotated tokens. Second, a table in which the results of the performance test are broken down in detail. The performance was calculated on the basis of eight different test texts, each consisting of 10,000 manually annotated tokens.

Identifier
DOI	https://doi.org/10.25592/uhhfdm.10830
Related Identifier	https://doi.org/10.25592/uhhfdm.9790
Related Identifier	https://doi.org/10.25592/uhhfdm.10829
Metadata Access	https://www.fdr.uni-hamburg.de/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:fdr.uni-hamburg.de:10830

Provenance
Creator	Flüh, Marie (ORCID: 0000-0002-1707-284X)
Publisher	Universität Hamburg
Publication Year	2022
Rights	Creative Commons Attribution 4.0 International; Open Access; https://creativecommons.org/licenses/by/4.0/legalcode; info:eu-repo/semantics/openAccess
OpenAccess	true

Representation
Language	German
Resource Type	Dataset
Version	Version 1
Discipline	Humanities