432 items found

Organisations: SoBigData Catalogue

  • Dataset

    Retail Market Data

    This dataset contains retail market data about food products, from 2007, for about 130 shops of an Italian distribution chain. The data cover about 1 M active clients, and...
  • Dataset

    Compas

    The compas dataset contains the features used by the COMPAS algorithm for scoring defendants and their risk (Low, Medium and High), for over 4,000 individuals. We considered...
    • CSV: 'https://www' (login required)
  • Method

    Twitter preprocessor

    Tokeniser, lemmatiser, extraction of negation. Under development.
    • xlsx: 'Wyroles' (login required)
  • Experiment

    Micro Project Experiments: Academic Migration and Academic Networks

    The experiments and results material for the micro project titled Academic Migration and Academic Networks: Evidence from Scholarly Big Data and the Iron Curtain
    • HTML: 'Micro Project Experiments ...' (login required)
  • Dataset

    Micro Project Datasets: Academic Migration and Academic Networks

    Datasets used in and produced by the micro project titled: Academic Migration and Academic Networks: Evidence from Scholarly Big Data and the Iron Curtain
    • HTML: 'Micro Project Datasets' (login required)
  • Method

    Micro Project Methods: Academic Migration and Academic Networks

    Methods used for the micro-project titled: Academic Migration and Academic Networks: Evidence from Scholarly Big Data and the Iron Curtain
    • HTML: 'Methods' (login required)
    • PDF: 'Machine Learning ...' (login required)
  • Dataset

    Activity data from the Covid19 period

    Activity data from the Telia telecommunications company, Finland, report the number of people dwelling in an area for a certain amount of time. More precisely, activity count...
  • Experiment

    Minimizing Hitting Time between Disparate Groups with Shortcut Edges

    Experiments on real-world datasets to evaluate the effectiveness of the algorithms proposed in paper...
    • GitHub: 'experiment data and code' (login required)
  • Experiment

    Self-Rated Health Among Italian Immigrants Living in Norway: A Cross-Sectiona...

    Most of the respondents (69%) rated their health as “good” or “very good”. This figure was not significantly different from that of the Norwegian population, nor from the Italians...
  • Experiment

    Private Epidemics Simulation Recovery

    The computation of Graph Epidemics Simulation Recovery performs a simulation of SEIR (Susceptible, Exposed, Infectious, Recovered) dynamics on a graph using the shortest path...
  • Dataset

    Private 64-tiles tessellation of Chicago

    Squared tessellation of the city center of Chicago, Illinois, into 64 tiles. Tessellation only of the central part of Chicago, namely the neighborhoods 'LOOP', 'NEAR SOUTH...
  • Experiment

    Private Workshopping on social big data and adversarial publics

    This is the report on the micro-project entitled “Workshopping on social big data and adversarial publics” carried out by CNRS. In this microproject, we propose to re-evaluate...
  • Method

    Measurement Expression Annotator

    Annotates numbers and measurement expressions in text. This method recognises many types of measurements including length, temperature, time and speed, and calculates their...
    • method-engine: 'Run method' (login required)
  • Method

    Digital DNA fingerprinting

    The "Digital DNA fingerprinting" is a spambot detection technique based on the "Digital DNA" online behavioral modeling technique. Given a set of Twitter user timelines, it is...
  • Method

    Python library for direct and indirect discrimination prevention in data mining

    This Python library implements the discrimination discovery and prevention method proposed in the paper: “A methodology for direct and indirect discrimination prevention in...
    • GitHub: 'Link to library' (login required)
  • Method

    MyWay - Trajectory Prediction

    MyWay is a prediction system which exploits the individual systematic behaviors modeled by mobility profiles to predict human movements. MyWay provides three strategies: the...
    • PDF: 'MyWay' (login required)
    • ZIP: 'MyWay - Source Code' (login required)
  • Application

    SWAT

    SWAT is an entity-salience system which identifies on-the-fly the semantic focus of a document, expressed by its Salient Wikipedia Entities. The core of this technology is...