283 items found

Formats: ZIP

Filter Results
  • TrainingMaterial

    Archive Crawling

    Web archives are typically very broad in scope and extremely large in scale. This makes data analysis appear daunting, especially for non-computer scientists. These...
    • PDF
      The resource: 'Archive Crawling Tutorial' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Extracting Event-Centric ...' is not accessible as guest user. You must login to access it!
  • Experiment

    Annotazione semantica di delibere comunali

    Progetto POC per l'uso delle tecniche di text mining su documenti della pubblica amministrazione per migliorare la trasparenza e l’accesso alle informazioni da parte dei...
    • PDF
      The resource: 'Annotazione Delibere' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Codice sorgente' is not accessible as guest user. You must login to access it!
  • Dataset

    Wi-Fi Dataset of wireless channel samplings

    The dataset was acquired by periodically sampling a wireless channel with Wi-Fi frames. The main goal is to track the evolution of the channel quality by acquiring key...
    • ZIP
      The resource: 'SoBigData_Wi-Fi_Dataset' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    High Performance and Scalable Analytics Module

    Mining with big data or big data mining has become an active research area. Running current analytical methodologies and software tools on a single personal computer cannot...
    • PDF
      The resource: 'Introduction to Parallel ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Introduction to Hadoop' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Hadoop Patterns' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Remote Connection and HDFS' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Exercises for Remote ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Introduction to Spark' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Exercises for Introduction ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Introduction to Spark SQL' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Exercises for Introduction ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Hadoop Ecosystem and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Data Mining with Spark (MLLIB)' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Exercises for Data Mining ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Deep Learning Course

    This course developed by Universitat Politècnica de Catalunya and Barcelona Supercomputing Center provides an applied approach to Deep Learning. It chooses to present an...
    • DOCX
      The resource: 'Instructions' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Deep_Learning_Course' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'High_Performance_Computing_ ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Lesson_a_FeedForward_Neural ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Lesson_b_Recurrent_Neural_N ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Lesson_c_Embedding_Spaces' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Interactive Learning Environments

    King’s College London developed a variety of data science materials based on R and Python. R is a de facto standard in statistical computing and visualisation, while our...
    • ZIP
      The resource: 'Rstudio docker image' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'VirtualBox' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Swirl courses' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Archive Spark

    An Apache Spark framework for easy data processing, extraction as well as derivation for archival collections. Originally developed for the use with Web archives, it has now...
    • PDF
      The resource: 'Archive Spark Slides' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Archive Spark Jupyter ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Efficiency - Effectiveness Trade-offs in Learning to Rank

    This tutorial provides an 'Introduction to Learning to Rank' and focuses on 'Dealing with the Efficiency/Effectiveness trade-off in Web Search'. Moreover, it provides two...
    • PDF
      The resource: 'Introduction to Learning ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Dealing with the ...' is not accessible as guest user. You must login to access it!
    • python
      The resource: 'Hands-on Session 1 ' is not accessible as guest user. You must login to access it!
    • python
      The resource: 'Hands-on Session 2 ' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Publicly available ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Istella Learning to Rank ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Compressed and Learned Data Structures Seminar

    In this seminar cycle, students are guided in the direct usage of a powerful C++ library implementing many state-of-the-art compressed data structures for big data. Other than...
    • PDF
      The resource: 'A gentle introduction to ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Learned indexes, the ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'GitHub Repository' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'GitHub Repository Instructions' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Dataset' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    GATE Course

    The material is the 2017 version of a week-long training course delivered annually by the GATE team. Over almost ten years, this course has been developed to provide basic and...
    • PDF
      The resource: 'Module 1 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 1 - Hands-on materials' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 1 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 1 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 1 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 1 - Advanced JAPE' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 1 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Crowdsourcing ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - GATE Mímir and ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Introduction to ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Classification ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Classification ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - GATE ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 2 - Chunking - ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 2 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 3 - GATE and Social ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Hands-on materials for ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 4 - Advanced GATE ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 4 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 4 - Opinion Mining' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 4 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - The GATE ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 5 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - Creating new ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 5 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - Advanced GATE ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 5 - Advanced GATE ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 5 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Applications - ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Applications - ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Applications - ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 6 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Entity Linking' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - JAPE Practical ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Module 6 - Hands-on ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Module 6 - Summarisation ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Jupyter Notebooks

    King’s College London has developed complete stories around Jupyter Notebooks that form easy recipes for reproducible methods in social data science. Jupyter...
    • ZIP
      The resource: 'Historical Cultures Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Prediction Modelling ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Social and Cultural ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Social Sensing Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Visual Arts Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Ananke Guide' is not accessible as guest user. You must login to access it!
    • mp4
      The resource: 'Ananke Guide Video' is not accessible as guest user. You must login to access it!
  • Dataset

    DataSeT Progetto IBIS ECO - IoT- based Building Information System for Energy...

    The dataset was collected as part of the IBIS ECO project ("IoT-based Building Information System for Energy Efficiency & Comfort"), an initiative aimed at implementing an...
    • ZIP
      The resource: 'DataSeT - IBIS ECO Project' is not accessible as guest user. You must login to access it!
  • Dataset

    Air Quality Datasets over L'Aquila Region

    These datasets have been collected through ESA, CeTEMPS and ARTA. They are a work-in-progress deliverable of a virtual laboratory (VL-Disaster) in the context of the SoBigData.
    • CSV
      The resource: 'CeTEMPS Dataset up to 2023' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'ARTA AirQuality up to 2023' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'ESA Sentinel 5P NO2 daily ...' is not accessible as guest user. You must login to access it!
    • HTML
      The resource: 'Map of the area pollutants ...' is not accessible as guest user. You must login to access it!
  • Dataset

    y/Politics 1k

    Social simulation data generated using Y Social focused on political-related topics. Y Social is a Digital Twin of an online social media platform that allows researchers to...
    • ZIP
      The resource: 'y_politics_1k.db' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Visual Analytics for Data Scientists

    Participants to this module shall -    Learn the principles and rules underlying the design of visual data representations and human-computer interactions -    Understand,...
    • PDF
      The resource: 'Introduction to Visual ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Using Partition-based ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Visual Analytics of ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Visual Analytics of ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Use of Density-based ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Use of Density-based ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Analysis of Mobility ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Analysis of Mobility ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Further Abilities and ...' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Data Import and Export in ...' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Exercises for the Visual ...' is not accessible as guest user. You must login to access it!
  • TrainingMaterial

    Data Visualisation and Visual Analytics Module

    This module provides an introduction to the concepts of vision and perception in order to design an effective data visualisation. Moreover, it provides insight into visual...
    • PDF
      The resource: 'Introduction' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Visual Variables ' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Web Visualisation' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Visual Analytics and D3.js' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Geolocalisation' is not accessible as guest user. You must login to access it!
    • PDF
      The resource: 'Layouts' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'Exercises and Code Repository' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'IPhython Notebooks' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER SecureBERT model

    This method includes a Python script and files of a SecureBERT model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will be...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • text/x-python
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Method

    Cybersecurity NER RoBERTa-base model

    This method includes a Python script and files of a RoBERTa-base model fine-tuned on our Cybersecurity NER dataset. The method requires as input a list of sentences that will...
    • JSON
      The resource: 'config' is not accessible as guest user. You must login to access it!
    • TXT
      The resource: 'merges' is not accessible as guest user. You must login to access it!
    • BIN
      The resource: 'model' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'model_args' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'scheduler' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'special_tokens_map' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer_config' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'training_args' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'tokenizer' is not accessible as guest user. You must login to access it!
    • JSON
      The resource: 'vocab' is not accessible as guest user. You must login to access it!
    • ZIP
      The resource: 'optimizer' is not accessible as guest user. You must login to access it!
    • py
      The resource: 'inference' is not accessible as guest user. You must login to access it!
  • Dataset

    Italian Common Procurement Vocabulary (CPV)

    This dataset contains 5M pairs of Italian tender descriptions and the corresponding Common Procurement Vocabulary (CPV) code. The data are downloaded from the ANAC website...
    • ZIP
      The resource: '10007545' is not accessible as guest user. You must login to access it!