-
Archive Crawling
Web archives are typically very broad in scope and extremely large in scale. This makes data analysis appear daunting, especially for non-computer scientists. These...
-
Semantic annotation of municipal resolutions
Proof-of-concept (POC) project on applying text mining techniques to public administration documents to improve transparency and access to information by...
-
Wi-Fi Dataset of wireless channel samplings
The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...
Resources (login required):
ZIP: SoBigData_Wi-Fi_Dataset
-
High Performance and Scalable Analytics Module
Mining with big data or big data mining has become an active research area. Running current analytical methodologies and software tools on a single personal computer cannot...
Resources (login required):
PDF: Introduction to Parallel ...
PDF: Introduction to Hadoop
PDF: Hadoop Patterns
PDF: Remote Connection and HDFS
ZIP: Exercises for Remote ...
PDF: Introduction to Spark
ZIP: Exercises for Introduction ...
PDF: Introduction to Spark SQL
ZIP: Exercises for Introduction ...
PDF: Hadoop Ecosystem and ...
PDF: Data Mining with Spark (MLLIB)
ZIP: Exercises for Data Mining ...
-
Deep Learning Course
This course, developed by Universitat Politècnica de Catalunya and Barcelona Supercomputing Center, provides an applied approach to Deep Learning. It chooses to present an...
Resources (login required):
DOCX: Instructions
ZIP: Deep_Learning_Course
ZIP: High_Performance_Computing_ ...
ZIP: Lesson_a_FeedForward_Neural ...
ZIP: Lesson_b_Recurrent_Neural_N ...
ZIP: Lesson_c_Embedding_Spaces
-
Interactive Learning Environments
King’s College London developed a variety of data science materials based on R and Python. R is a de facto standard in statistical computing and visualisation, while our...
-
Archive Spark
An Apache Spark framework for easy data processing, extraction, and derivation for archival collections. Originally developed for use with Web archives, it has now...
-
Efficiency - Effectiveness Trade-offs in Learning to Rank
This tutorial provides an 'Introduction to Learning to Rank' and focuses on 'Dealing with the Efficiency/Effectiveness trade-off in Web Search'. Moreover, it provides two...
Resources (login required):
PDF: Introduction to Learning ...
PDF: Dealing with the ...
Python: Hands-on Session 1
Python: Hands-on Session 2
PDF: Publicly available ...
ZIP: Istella Learning to Rank ...
-
Compressed and Learned Data Structures Seminar
In this seminar cycle, students are guided in the direct usage of a powerful C++ library implementing many state-of-the-art compressed data structures for big data. Other than...
Resources (login required):
PDF: A gentle introduction to ...
PDF: Learned indexes, the ...
ZIP: GitHub Repository
TXT: GitHub Repository Instructions
-
Resources (login required):
ZIP: Dataset
-
GATE Course
The material is the 2017 version of a week-long training course delivered annually by the GATE team. Over almost ten years, this course has been developed to provide basic and...
Resources (login required):
PDF: Module 1 - Introduction to ...
ZIP: Module 1 - Hands-on materials
PDF: Module 1 - Introduction to ...
PDF: Module 1 - Introduction to ...
ZIP: Module 1 - Hands-on ...
PDF: Module 1 - Advanced JAPE
ZIP: Module 1 - Hands-on ...
PDF: Module 2 - Crowdsourcing ...
ZIP: Module 2 - Hands-on ...
PDF: Module 2 - GATE Mímir and ...
ZIP: Module 2 - Hands-on ...
PDF: Module 2 - Introduction to ...
PDF: Module 2 - Classification ...
ZIP: Module 2 - Classification ...
ZIP: Module 2 - GATE ...
PDF: Module 2 - Chunking - ...
ZIP: Module 2 - Hands-on ...
PDF: Module 3 - GATE and Social ...
PDF: Module 3 - GATE and Social ...
PDF: Module 3 - GATE and Social ...
PDF: Module 3 - GATE and Social ...
PDF: Module 3 - GATE and Social ...
PDF: Module 3 - GATE and Social ...
ZIP: Hands-on materials for ...
PDF: Module 4 - Advanced GATE ...
ZIP: Module 4 - Hands-on ...
PDF: Module 4 - Opinion Mining
ZIP: Module 4 - Hands-on ...
PDF: Module 5 - The GATE ...
ZIP: Module 5 - Hands-on ...
PDF: Module 5 - Creating new ...
ZIP: Module 5 - Hands-on ...
PDF: Module 5 - Advanced GATE ...
PDF: Module 5 - Advanced GATE ...
ZIP: Module 5 - Hands-on ...
PDF: Module 6 - Applications - ...
PDF: Module 6 - Applications - ...
PDF: Module 6 - Applications - ...
ZIP: Module 6 - Hands-on ...
PDF: Module 6 - Entity Linking
PDF: Module 6 - JAPE Practical ...
ZIP: Module 6 - Hands-on ...
PDF: Module 6 - Summarisation ...
-
Jupyter Notebooks
King’s College London has developed complete stories around Jupyter Notebooks that form easy recipes for reproducible methods in social data science. Jupyter...
Resources (login required):
ZIP: Historical Cultures Repository
ZIP: Prediction Modelling ...
ZIP: Social and Cultural ...
ZIP: Social Sensing Repository
ZIP: Visual Arts Repository
ZIP: Ananke Guide
MP4: Ananke Guide Video
-
DataSeT IBIS ECO Project - IoT-based Building Information System for Energy...
The dataset was collected as part of the IBIS ECO project ("IoT-based Building Information System for Energy Efficiency & Comfort"), an initiative aimed at implementing an...
Resources (login required):
ZIP: DataSeT - IBIS ECO Project
-
Air Quality Datasets over L'Aquila Region
These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.
Resources (login required):
CSV: CeTEMPS Dataset up to 2023
CSV: ARTA AirQuality up to 2023
ZIP: ESA Sentinel 5P NO2 daily ...
HTML: Map of the area pollutants ...
-
y/Politics 1k
Social simulation data generated using Y Social, focused on politics-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
Resources (login required):
ZIP: y_politics_1k.db
-
Visual Analytics for Data Scientists
Participants in this module will: learn the principles and rules underlying the design of visual data representations and human-computer interactions; understand,...
Resources (login required):
PDF: Introduction to Visual ...
PDF: Using Partition-based ...
PDF: Visual Analytics of ...
PDF: Visual Analytics of ...
PDF: Use of Density-based ...
PDF: Use of Density-based ...
PDF: Analysis of Mobility ...
PDF: Analysis of Mobility ...
PDF: Further Abilities and ...
PDF: Data Import and Export in ...
ZIP: Exercises for the Visual ...
-
Data Visualisation and Visual Analytics Module
This module provides an introduction to the concepts of vision and perception in order to design an effective data visualisation. Moreover, it provides insight into visual...
Resources (login required):
PDF: Introduction
PDF: Visual Variables
PDF: Web Visualisation
PDF: Visual Analytics and D3.js
PDF: Geolocalisation
PDF: Layouts
ZIP: Exercises and Code Repository
ZIP: IPython Notebooks
-
Cybersecurity NER SecureBERT model
This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
Resources (login required):
JSON: config
TXT: merges
BIN: model
JSON: model_args
ZIP: optimizer
ZIP: scheduler
JSON: special_tokens_map
JSON: tokenizer
JSON: tokenizer_config
ZIP: training_args
TXT: vocab
Python: inference
-
Cybersecurity NER RoBERTa-base model
This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
Resources (login required):
JSON: config
TXT: merges
BIN: model
JSON: model_args
ZIP: scheduler
JSON: special_tokens_map
JSON: tokenizer_config
ZIP: training_args
JSON: tokenizer
JSON: vocab
ZIP: optimizer
Python: inference
-
Italian Common Procurement Vocabulary (CPV)
This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...
Resources (login required):
ZIP: 10007545