5 items found

Types: Dataset Tags: Web mining

Filter Results
  • Dataset

    Common Crawl Financial News Dataset

    This dataset contains financial articles related to companies in the S&P500 index for the period from September 2016 to February 2020. The articles were extracted from the...
    • CSV
      The resource: 'Common_Crawl_Financial_News' is not accessible as guest user. You must login to access it!
  • Dataset

    SWH Filenames

    A 69 GB dataset with ~2.3 billion strings representing deduplicated names of source code files collected by Software Heritage, the great library of source code...
    • ZIP
      The resource: 'SWH Filenames' is not accessible as guest user. You must login to access it!
  • Dataset

    FAIR-SWENG: dataset on gender fairness in software engineering academic lands...

    The dataset contains academic performance metrics of Software Engineers worldwide.
  • Dataset

    Post-earthquake Reconstruction Progress Datasets over L'Aquila Region

    Reconstruction data sets, provided by the National Public Entities of USRA and USRC} These data sets are stored in CSV files and provide comprehensive information related to...
    • CSV
      The resource: 'Dataset Fascicoli ...' is not accessible as guest user. You must login to access it!
    • CSV
      The resource: 'Dataset Pratiche Ricostruzione' is not accessible as guest user. You must login to access it!
  • Dataset

    Wikipedia Word Embeddings

    Embeddings were created through applying word2vec skipgram to a corpus of wikipedia non-stub articles from a December 2015 English dump with the following parameters: -cbow 0...
    • The resource: 'Embeddings' is not accessible as guest user. You must login to access it!