WIRE dataset

This dataset consists of 503 pairs of Wikipedia entities drawn from the New York Times dataset with a human assigned relatedness score. The domain experts based their evaluation on the textual description of the Wikipedia entities and further investigation of their relationships by possibly taking advantage of other sources.
WiRe has been devised for benchmarking entity relatedness solutions with a larger set of entities.

Data and Resources
To access the resources you must log in
  • WIRE datasetCSV

    The resource: 'WIRE dataset' is not accessible as guest user. You must login to access it!
Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Attribution requirements
Availability On-Line
Basic rights Download
ChildrenData No
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
CreationDate 2017-10-24
Creator Ponza, Marco, marco.ponza@di.unipi.it
DataProtectionDirective none
Display requirements
Distribution requirements
External Identifier
Field/Scope of use Any use
Language eng, English
License term /Not specified
ManifestationType Virtual
Personal data was manifestly made public by the data subject No
PersonalData No
PersonalSensitiveData Select PersonalSensitiveData
ProcessingDegree Primary
RelatedPaper Marco Ponza, Paolo Ferragina, and Soumen Chakrabarti. 2017. A Two-Stage Framework for Computing Entity Relatedness in Wikipedia. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM '17). ACM, New York, NY, USA, 1867-1876. https://dl.acm.org/citation.cfm?doid=3132847.3132890
Requirement of non-disclosure (confidentiality mark)
Restrictions on use
Semantic Coverage
Sublicense rights No
Territory of use World Wide
ThematicCluster Text and Social Media Mining
TimeCoverage 2007-01-01 /2007-12-31
system:type SoBigData.eu: Dataset
Management Info
Field Value
Author Ferragina Paolo
Maintainer Marco Ponza
Version 1
Last Updated 26 September 2019, 12:28 (CEST)
Created 26 September 2019, 12:28 (CEST)