- The news dataset for discriminating between Bosnian, Croatian and Serbian SET...
The SETimes.HBS dataset consists of parallel documents written in Bosnian, Croatian and Serbian, harvested from the now-inactive setimes.com website, which published news in the...
- News sentiment analysis datasets for Serbian, Bosnian, Macedonian, Albanian a...
We provide datasets annotated on a three-point sentiment scale (positive, neutral and negative) for Serbian, Bosnian, Macedonian, Albanian, and Estonian. For all languages...
- The Twitter user dataset for discriminating between Bosnian, Croatian, Monten...
The Twitter-HBS dataset consists of Twitter users, their tweets, and a label for each user's predominantly used language: Bosnian, Croatian, Montenegrin, or Serbian. Among the...