-
Telegram data conspiracyIT chats
This dataset contains Italian-language Telegram chats focused on conspiracy discussions. It was collected using a snowball sampling technique based on message forwarding,... -
Private Reddit Remote Work Dataset
The dataset was collected exclusively from Reddit using the Python library praw [Boe and Payne, 2023]. Posts were extracted from the subreddits remotework, workfromhome, and... -
Telegram data qanonEN chats
This dataset consists of English-language chats involved in conspiracy discussions on Telegram. The data was collected using a snowball crawling technique that leverages... -
Twitter dataset during COVID-19 pandemic
IDs of tweets collected from Twitter spanning across 7 years, from 2016-03-01 to 2022-03-01. The tweets are used to derive interactions between users and, therefore, their... -
Private Neural2PC
Neural2PC is a novel machine-learning approach that finds polarized communities in signed networks. The method explores suboptimal solutions to the relaxed 2PC problem and... -
Private Practice output
-
Annotazione semantica di delibere comunali
Progetto POC per l'uso delle tecniche di text mining su documenti della pubblica amministrazione per migliorare la trasparenza e l’accesso alle informazioni da parte dei... -
y/Politics 1k
Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...-
ZIP
The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
-
ZIP
-
Private A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings...
"A Decade of Reddit Politics: Comprehensive Dataset on User Political Leanings and Interaction Networks (2011-2021)" is a comprehensive dataset containing a 10 years long... -
Last.Fm UK User Graph Dataset: A Social Network and Music Listening Behavior ...
The Last.Fm UK User Graph Dataset is a comprehensive collection of social network and music listening behavior data obtained from the Last.Fm platform. The dataset includes user...-
Folder
The resource: 'Link to the folder ...' is not accessible as guest user. You must login to access it!
-
Folder
-
Online polarization: enriching models with data, understanding data through m...
Development of online polarization dynamics models and application to social media discussion data -
Brexit dataset
This dataset comprises a set of online footprints extracted from Twitter using the available APIs. It is centered around the Brexit debate on Twitter from the 2nd until the...-
RAR
The resource: 'BrexitDataset' is not accessible as guest user. You must login to access it!
-
RAR
-
User preference-interest dataset
The User preference-interest dataset is a comprehensive collection of preferences generated by a sequence of 6 regimes following the rules below: - initially, we have... -
Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...
Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...-
CSV
The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
-
CSV
-
Reducing radicalizism in social networks by feeds prioritization - Rebalancin...
Code and description of the methodology of the paper "Rebalancing Social Feed to Minimize Polarization and Disagreement" funded by SoBigData ++ -
ClueWeb12
The ClueWeb12 dataset consists of 733,019,372 English web pages, collected between February 10, 2012 and May 10, 2012. It was created to support research on information... -
Facebook Wallpost
Online interactions between users via the wall feature in the New Orleans regional network.-
HTML
The resource: 'Original data' is not accessible as guest user. You must login to access it!
-
HTML
-
ClueWeb09
The ClueWeb09 dataset consists of about 1 billion web pages in ten languages that were collected in January and February 2009. It was created to support research on... -
Twitter social bots
Spambots are automated accounts (i.e., accounts driven by a bot) that repeatedly advertise unsolicited and often harmful content (e.g., malware, URLs to phishing Web sites,... -
Twitter fake followers
Fake followers are fake accounts massively created to follow a target account and that can be bought from online markets. In other words, their goal is that of increasing the...
