GLips - German Lipreading Dataset

DOI

The German Lipreading dataset consists of 250,000 publicly available videos of the faces of speakers of the Hessian Parliament, which was processed for word-level lip reading using an automatic pipeline. The format is similar to that of the English language Lip Reading in the Wild (LRW) dataset, with each H264-compressed MPEG-4 video encoding one word of interest in a context of 1.16 seconds duration, which yields compatibility for studying transfer learning between both datasets. Choosing video material based on naturally spoken language in a natural environment ensures more robust results for real-world applications than artificially generated datasets with as little noise as possible. The 500 different spoken words ranging between 4-18 characters in length each have 500 instances and separate MPEG-4 audio- and text metadata-files, originating from 1018 parliamentary sessions. Additionally, the complete TextGrid files containing the segmentation information of those sessions are also included. The size of the uncompressed dataset is 16GB.

Copyright of original data: Hessian Parliament (https://hessischer-landtag.de). If you use this dataset, you agree to use it for research purpose only and to cite the following reference in any works that make any use of the dataset.

Reference: Gerald Schwiebert, Cornelius Weber, Leyuan Qu, Henrique Siqueira, Stefan Wermter (2022). A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning. arXiv:2202.13403

{"references": ["Gerald Schwiebert, Cornelius Weber, Leyuan Qu, Henrique Siqueira, Stefan Wermter (2022). A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning", "arXiv:2202.13403"]}

Identifier
DOI https://doi.org/10.25592/uhhfdm.10048
Related Identifier https://doi.org/10.25592/uhhfdm.10047
Metadata Access https://www.fdr.uni-hamburg.de/oai2d?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:fdr.uni-hamburg.de:10048
Provenance
Creator Schwiebert, Gerald; Weber, Cornelius; Qu, Leyuan; Siqueira, Henrique; Wermter, Stefan
Publisher Universität Hamburg
Publication Year 2022
Rights Creative Commons Attribution Non Commercial No Derivatives 4.0 International; Open Access; https://creativecommons.org/licenses/by-nc-nd/4.0/legalcode; info:eu-repo/semantics/openAccess
OpenAccess true
Representation
Language German
Resource Type Dataset
Version 1.0
Discipline Other