TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (Part 10, Jan 2021 - Dec 2021) - Dataset

Dataset

TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets (Part 10, Jan 2021 - Dec 2021)

DOI

TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning more than 9 years (February 2013 - August 2022). Metadata information about the tweets as well as extracted entities, sentiments, hashtags, user mentions and URLs are exposed in RDF using established RDF/S vocabularies. For the sake of privacy, we anonymize user IDs and we do not provide the text of the tweets. For a list of the previous dataset parts, example queries and more information see the TweetsKB's home page: https://data.gesis.org/tweetskb/.

Web Scraping

Identifier
DOI	https://doi.org/10.7802/2472
Source	https://search.gesis.org/research_data/SDN-10.7802-2472?lang=de
Metadata Access	https://datacatalogue.cessda.eu/oai-pmh/v0/oai?verb=GetRecord&metadataPrefix=oai_ddi25&identifier=820aba27a694c5659d2cf1013aafc2699da554633ef77d96766e3d4dcf62dadc

Provenance
Creator	Baran, Erdal; Bensmann, Felix; Dietze, Stefan
Publisher	GESIS Data Archive for the Social Sciences; GESIS Datenarchiv für Sozialwissenschaften
Publication Year	2022
Rights	Free access (without registration) - The research data can be downloaded directly by anyone without further limitations. Data can only be used for non-commercial research; Freier Zugang (ohne Registrierung) - Die Forschungsdaten können von jedem direkt heruntergeladen werden. Data can only be used for non-commercial research
OpenAccess	true
Contact	http://www.gesis.org/

Representation
Discipline	Social Sciences