Wikinews dataset

This dataset consists of a sample of 365 news articles published by Wikinews from November 2004 to June 2014. The Wikinews community annotated the articles with about 5000 entities, each associated with a saliency score.

Data and Resources
  • entity-saliency (JSON)

Additional Info
Field Value
Accessibility Both
AccessibilityMode Download
Attribution requirements
Availability On-Line
Basic rights Download
ChildrenData No
Consent obtained also covers the envisaged transfer of the personal data outside the EU No
Consent of the data subject No
CreationDate 2016-11-04
Creator Trani, Salvatore
DataProtectionDirective none
Display requirements
Distribution requirements
External Identifier
Field/Scope of use Any use
Language eng, English
License term /Not specified
ManifestationType Virtual
Personal data was manifestly made public by the data subject No
PersonalData No
PersonalSensitiveData
ProcessingDegree Primary
Requirement of non-disclosure (confidentiality mark)
Restrictions on use
Semantic Coverage
Sublicense rights No
Territory of use World Wide
ThematicCluster Text and Social Media Mining
TimeCoverage 2004-01-01 /2014-12-31
system:type Dataset
Management Info
Field Value
Author Ferragina Paolo
Maintainer Ferragina Paolo
Version 1
Last Updated 26 September 2019, 12:26 (CEST)
Created 26 September 2019, 12:26 (CEST)