-
User preference-interest dataset
The User preference-interest dataset is a comprehensive collection of preferences generated by a sequence of 6 regimes following the rules below: - initially, we have... -
Dataset of "Mediterranean" users interactions in Twitter, to evaluate polarit...
A dataset collecting Twitter interactions between generic users self-declaring in one of the following countries: Spain, France, Italy, Greece, to evaluate the polarity of...-
TSV
The resource: 'Mediterranean' is not accessible as guest user. You must login to access it!
-
TSV
-
Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...
Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...-
CSV
The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
-
CSV
-
FAIR-Edu: Gender Bias in Academic Promotion Dataset
Pseudo anonymized dataset on the publications made by researchers and professors in the Italian Informatics Community (Computer Science + Computer Engineering). Each sample is... -
Reducing radicalizism in social networks by feeds prioritization - Rebalancin...
Code and description of the methodology of the paper "Rebalancing Social Feed to Minimize Polarization and Disagreement" funded by SoBigData ++ -
Twitter dataset on coordinated behavior in 2019 UK General Election
This dataset contains ~11M tweets related to the 2019 United Kingdom General Election, published and collected between November 12, 2019, and December 12, 2019. In addition,... -
Twitter dataset on coordinated behavior in 2020 USA Presidential Election
This dataset contains ~140M tweets related to the 2020 United States Presidential Election, published and collected between October 2, 2020, and December 2, 2020. In addition,... -
Twitter Conspiracy Dataset
This repository contains the Twitter dataset used to investigate the traits of 7,394 conspiracy users and 7,394 random users collected in 2022. Both the profile's info and the... -
UK election abuse data
The GATE team (gate.ac.uk) at the University of Sheffield have collected 1.4 million tweets sent to and by UK members of parliament in the months leading up to the 2015 and...-
XLS
The resource: 'uk-election-abuse.tar.gz' is not accessible as guest user. You must login to access it!
-
XLS
-
Articles and comments of major Estonian newspapers
The dataset contains articles and comments of four major Estonian news portals since early 2000s to 2016. -
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information... -
Brexit Tweets Linked Domains
In this spreadsheet we share domains linked in the UK EU membership referendum tweet collection. Counts for links by leave voters and remain voters are given, enabling sites...-
ODS
The resource: 'Brexit Tweets Linked ...' is not accessible as guest user. You must login to access it!
-
ODS
-
Brexit Twitter User Vote Intent
A list of users for which vote intent in the UK EU membership referendum has been established. -
Twitter Newcomers Dataset
Twitter accounts detected right after registration and monitored for 21 days-
ZIP
The resource: 'New Accounts Dataset' is not accessible as guest user. You must login to access it!
-
ZIP
-
Sheffield NERD Tweet Corpus
The dataset contais 794 tweets annotated with named entities disambiguated against DBpedia, and split into equally sized training and test portions. 400 tweets from 2013 comes...-
FINF
The resource: 'Sheffield NERD Tweet Corpus' is not accessible as guest user. You must login to access it!
-
FINF
-
DE webarchive
The dataset consists of all the content from the .de top level domain as crawled by the Internet Archive.-
HTML
The resource: 'Internet Archive Wayback ...' is not accessible as guest user. You must login to access it!
-
HTML
-
UK General Election Vote Intent
A list of Twitter users for whom party political allegiance/vote intent has been established. -
Facebook Wallpost
Online interactions between users via the wall feature in the New Orleans regional network.-
HTML
The resource: 'Original data' is not accessible as guest user. You must login to access it!
-
HTML
-
Twitter Dataset 2013-2014
The dataset was collected by the Archive team through the Twitter Streaming API which provides free access to 1% of public tweets. The covered time period is from January 1st... -
Introduction to Data Curation
This course is an introduction to data collection, data preparation & transformation and data analysis. It contains the essential concepts for a researcher in order to...-
PDF
The resource: 'Introduction to Data Curation' is not accessible as guest user. You must login to access it!
-
PDF
