20 items found

Organisations: SoBigData Catalogue Formats: CSV Types: Dataset

Filter Results
  • Dataset

    Common Crawl Financial News Dataset

    This dataset contains financial articles related to companies in the S&P500 index for the period from September 2016 to February 2020. The articles were extracted from the...
    • CSV
      The resource: 'Common_Crawl_Financial_News' is not accessible as guest user. You must login to access it!
  • Dataset

    World Trade Web_2000

    Weighted, directed adjacency matrix of the World Trade Web in the year 2000
    • CSV
      The resource: 'World Trade Web_2000' is not accessible as guest user. You must login to access it!
  • Dataset

    Papers on Gender Bias in Academic Promotions

    This dataset contains the result of a systematic mapping study conducted to analyse how the issue of gender bias in academic promotions has been addressed by the literature....
    • CSV
      The resource: 'Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Kinematic Features of Porto Taxi Trips

    This dataset comprises tabular information related to the movement of taxis in the city of Porto, Portugal. For every taxi journey, we segmented the trajectory into 20...
    • CSV
      The resource: 'TIF - taxi trajectory data' is not accessible as guest user. You must login to access it!
  • Dataset

    Carbon Trade Network_2000

    Weighted, directed adjacency matrix of the Carbon Trade Network in the year 2000
    • CSV
      The resource: 'CTN_adj_2000' is not accessible as guest user. You must login to access it!
  • Dataset

    Carbon Trade Network_2020

    Weighted, directed adjacency matrix of the Carbon Trade Network in the year 2020
    • CSV
      The resource: 'CTN_adj_2020' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Datasets for Fine-Grained Fairness Analysis of Abusive Language Det...

    Three synthetic datasets covering different types of bias grouped by target, namely sexism, racism and ableism. The reason for distinguishing the records by abuse targets is...
    • CSV
      The resource: 'Synthetic Datasets for ...' is not accessible as guest user. You must login to access it!
  • Dataset

    World Trade Web_2020

    Weighted, directed adjacency matrix of the World Trade Web in the year 2020
    • CSV
      The resource: 'WTN_adj_2020' is not accessible as guest user. You must login to access it!
  • Dataset

    Synthetic Dataset for Causal Analysis

    The dataset is a synthetic version of the well-known German Credit dataset (https://archive.ics.uci.edu/dataset/144/statlog+german+credit+data). It includes variables such as...
    • CSV
      The resource: 'synthetic german data' is not accessible as guest user. You must login to access it!
  • Dataset

    Post-earthquake Reconstruction Progress Datasets over L'Aquila Region

    Reconstruction data sets, provided by the National Public Entities of USRA and USRC} These data sets are stored in CSV files and provide comprehensive information related to...
    • CSV
      The resource: 'Dataset Fascicoli ...' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'Dataset Pratiche Ricostruzione' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'Churn Dataset' is not accessible as guest user. You must login to access it!
  • Dataset

    Testing NBA dataset

    Just for platform reviewing
    • CSV
      The resource: 'nbaStats.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Emergency Tweets 2011 Christchurch earthquake

    This dataset contains tweets related to the devastating earthquake occurred on 22 February 2011, at around 12 p.m. local time in Christchurch, New Zealand...
    • CSV
      The resource: 'EAQ-CHR_tweets.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Emergency Tweets 2013 Milan blackout

    This dataset is related to a power outage (i.e., a blackout) that occurred in the city of Milan, in northern Italy, in the night between 14 and 15 May 2013. Despite not...
    • CSV
      The resource: 'PWO-MIL_tweets.csv' is not accessible as guest user. You must login to access it!
  • Dataset

    Dataset Adult

    The adult dataset includes $48,842$ instances with demographic information like age, workclass, marital-status, race, capital-loss, capital-gain etc. The income attribute...
    • CSV
      The resource: 'Adult' is not accessible as guest user. You must login to access it!
  • Dataset

    GPS Origin Destination Matrix in Tuscany

    This dataset is the origin and destination matrix among the municipalities of Tuscany extracted starting from GPS tracks of private vehicles collected from 2014-02-10 to...
    • CSV
      The resource: ' GPS Origin Destination Matrix' is not accessible as guest user. You must login to access it!
  • Dataset

    German Credit

    In the german credit dataset each one of the 1,000 persons is classified as a good or bad creditor according to attributes like age, sex, checking_account, credit_amount,...
    • CSV
      The resource: 'German Credit' is not accessible as guest user. You must login to access it!
  • Dataset

    Mobility index for local quarantines in Chile

    Fighting the COVID-19 pandemic, most countries have implemented non-pharmaceutical interventions like wearing masks, physical distancing, lockdown, and travel restrictions....
    • CSV
      The resource: 'Mobility Index for Local ...' is not accessible as guest user. You must login to access it!
  • Dataset

    Compas

    The compas dataset contains the features used by the COMPAS algorithm for scoring defendants and their risk (Low, Medium and High), for over $4,000$ individuals. We considered...
    • CSV
      The resource: 'https://www' is not accessible as guest user. You must login to access it!
  • Dataset

    WIRE dataset

    This dataset consists of 503 pairs of Wikipedia entities drawn from the New York Times dataset with a human assigned relatedness score. The domain experts based their...
    • HTML
      The resource: 'WikipediaRelatedness' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'WIRE dataset' is not accessible as guest user. You must login to access it!