TweetsKB is a public RDF corpus of anonymized data for a large collection of annotated tweets. The dataset currently contains data for nearly 3.0 billion tweets, spanning more than 9 years (February 2013 - August 2022). Metadata information about the tweets as well as extracted entities, sentiments, hashtags, user mentions and URLs are exposed in RDF using established RDF/S vocabularies. For the sake of privacy, we anonymize user IDs and we do not provide the text of the tweets. For a list of the previous dataset parts, example queries and more information see the TweetsKB's home page: https://data.gesis.org/tweetskb/.
Web Scraping