Commons:SPARQL query service

Go to the Wikimedia Commons Query Service

How to use

This page needs to be updated with more information on how to use the Commons Query Service.

Examples of Commons queries
Note: To learn how to add data see Commons:Structured data/GLAM/Tools or Commons:Structured data/Get involved/Finding data.

Commons Query Service uses Wikibase, Wikidata Query Help provides the documentation needed to use it except M IDs which are described below.

M IDs

The only Commons specific part of the Query Service are M IDs, which are a unique identifier for each file on Commons, equivalent to Q IDs in Wikidata.

Find M IDs

Individual files

To find M IDs for individual images look at a file, in the left hand menu there is 'Concept URI', right click and copy link, this link contains the M ID. Alternatively on the left hand menu there is 'Page information'. The M ID is the 'Page ID' preceded by an 'M'.

Multiple files

PetScan can be used to find M IDs for all the files in a Wikimedia Commons category. Find the name of the Commons category and chose the following options:

# Language = commons

# Project = wikimedia

# Categories = Name of the category (replacing spaces in the name with _)

# Combination = Intersection

# Go to the 'Page properties' tab and under 'Namespaces' click the 'file' box

In the results the 'Page ID' is the M ID (the M must be added by the user). The results can be either copied manually or under the 'Output' tab there are a range of options for export.

Minefield can be used to convert a list of Commons file page titles to media M IDs

API endpoint

Because WCQS is an authenticated service, it is currently not as easy or straightforward to use it as an API endpoint in a way you might expect if you are familiar with Wikidata's unauthenticated endpoint.

See Commons:SPARQL query service/API endpoint for details on how to programmatically access the query service.

Release notes

This is a beta SPARQL endpoint exposing the Structured Data on Commons (SDoC) dataset. This endpoint can federate with WDQS. More work is needed as we iterate on the service, but feel free to begin using the endpoint. Known limitations are listed below:

The service is a beta endpoint that is updated via weekly dumps. Some caveats include limited performance, expected downtimes, and no interface, naming, or backward compatibility stability guarantees.

The service is hosted on Wikimedia Cloud Services, with limited resources and limited monitoring. This means there may be random unplanned downtime.

The data will be reloaded weekly on Mondays from dumps taken on Sunday. The dumps can be seen at https://dumps.wikimedia.org/other/wikibase/commonswiki/. The service will be down during data reload. With the current amount of SDoC data, downtime will last approximately 4 hours, but this may increase as SDoC data grows.

~~Due to an issue with the dump format, the data currently only dates back to July 5th. We’re working on getting more up-to-date data and hope to have a solution soon. (T258507 and T258474)~~
The MediaInfo concept URIs (e.g. http://commons.wikimedia.org/entity/M37200540) are currently HTTP; we may change these to HTTPS in the near future. Please comment on T258590 if you have concerns about this change.

~~The service is restricted behind OAuth authentication, backed by Commons. You will need an account on Commons to access the service.~~ It seems that OAuth is currently not working. See https://stackoverflow.com/questions/65303450/how-to-authenticate-to-wikimedia-commons-query-service-using-oauth-in-python#comment115828158_65424900. This is so that we can contact abusive bots and/or users and block them selectively as a last resort if needed.

Please note that to correctly logout of the service, you need to use the logout link in WCQS - logging out of just Wikimedia Commons will not work for WCQS. This limitation will be lifted once we move to production.

No documentation on the service is available yet except examples available at Commons:SPARQL query service/queries/examples following the format at Wikidata:SPARQL query service/queries/examples. Please add your own examples to this list.

Please use the SPARQL template. ~~Note that while there is currently a bug that doesn’t allow us to change the “Try it!” link endpoint, the examples will be displayed correctly on the WCQS GUI.~~

WCQS is a work in progress and some bugs are to be expected, especially related to generalizing WDQS to fit SDoC data. For example, current bugs include:

URI prefixes specific for SDoC data don’t yet work - you need to use full URIs if you want to query using them. Relations and Q items are defined by :* Autocomplete for SDoC items doesn’t work - without prefixes they’d be unusable anyway, but additional work will be required after we inject SDoC URI prefixes into WCQS GUI.
If you find any additional bugs or issues, please report them via Phabricator with the tag wikidata-query-service.

Future plans

We do plan to move the service to production, but we don’t have a timeline on that yet. We want to emphasize that while we do expect a SPARQL endpoint to be part of a medium to long-term solution, it will only be part of that solution. Even once the service is production-ready, it will still have limitations in terms of timeouts, expensive queries, and federation. Some use cases will need to be migrated, over time, to better solutions - once those solutions exist.

v t e Structured Data on Wikimedia Commons
Features	File captions Depicts Statements Glossary
Tools	Computer-aided tagging ISA Depictor AC/DC SDC tool OpenRefine QuickStatements SPARQL query service Image Annotator
About	About Structured Commons Why we work on this Structured data on other projects Press and presentations Project glossary Stable Interface Policy Contact
Get involved	How to get involved How to find structured data on Commons Help determine how Structured Data metadata should be modelled on Commons
GLAM	Structured Data on Commons for Galleries, Libraries, Archives, and Museums Value for GLAMs Commons or Wikidata How to upload Tools for GLAMs Workflow for GLAMs Examples Events and workshops
Modeling	Data modeling: guidelines on how to describe Wikimedia Commons files with structured data Properties table
Discussion	General talk page Talk page about data modeling
In other projects	Phabricator (tasks, bug reports) mediawiki.org (project reports) meta.wikimedia.org (general background)
Related	Media search Structured data across Wikimedia (SDAW) International Image Interoperability Framework (IIIF)
Archived info about Structured Commons Information and discussion about the project grant from the Alfred P. Sloan Foundation

Commons:SPARQL query service

Contents

How to use

M IDs

Find M IDs

Individual files

Multiple files

API endpoint

Release notes

Navigation menu

Commons:SPARQL query service

How to use

M IDs

Find M IDs

Individual files

Multiple files

API endpoint

Release notes

Navigation menu

Search