File:Biohackathon report on reviewing Wikidata subsetting methods - Wikidata Reuse Days.pdf

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search
Go to page
next page →
next page →
next page →

Original file(1,500 × 843 pixels, file size: 3.82 MB, MIME type: application/pdf, 54 pages)



Slides used during the session: Biohackathon: report on reviewing Wikidata subsetting methods


English: Often Wikidata is too big to handle. Through a sequence of biohackathons we have been reviewing methods to extract subsets from Wikidata to facilitate downstream reuse. We have identified a set of tools and would like to report back on our intermediate results. We will address the different applicable file formats. Natively, Wikidata data is stored in json, but it is also available as RDF through for example the Wikidata Query Service. Different subsetting methods use either or both of those formats as input and output. We will also address the way how to define the subset. This can be a JSON file or a Shape Expression.
Source Own work
Author Andra Waagmeester, Jose Emilio Labra Goya, Hosseini Beghaeiraveri, Sabah Ul-Hasan


I, the copyright holder of this work, hereby publish it under the following license:
w:en:Creative Commons
This file is licensed under the Creative Commons Attribution 4.0 International license.
You are free:
  • to share – to copy, distribute and transmit the work
  • to remix – to adapt the work
Under the following conditions:
  • attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

File history

Click on a date/time to view the file as it appeared at that time.

current11:51, 22 March 2022Thumbnail for version as of 11:51, 22 March 20221,500 × 843, 54 pages (3.82 MB)Andrawaag (talk | contribs)Uploaded own work with UploadWizard

There are no pages that use this file.