Commons:Structured data/About/Why

From Wikimedia Commons, the free media repository

This page is a work in progress, not an article or policy, and may be incomplete and/or unreliable.
Please offer suggestions on the talk page.

Structured Data on Commons aims to benefit the Wikimedia community in its broadest sense, and many re-users outside the Wikimedia ecosystem, including:

  • the communities of Commons and Wikimedia contributors;
  • cultural institutions (GLAMs) that publish their collections via Commons;
  • scientific and research organizations interested in working with large metadata datasets as a basis for research;
  • smaller and larger re-users of free media files online (from bloggers to large publishers);
  • smaller and larger developer communities (from app and web developers to the builders of search engines and operating systems).

Impact and benefits for the Wikimedia movement

The project will affect the Wikimedia movement in the following ways:

  1. Categories and metadata can be created multilingually, so that volunteers with different language skills can work together more easily, and files can be found in languages other than English. Multilingual categories on Commons have been a long-standing request from the Commons community.
  2. Wikimedia Commons becomes much friendlier and more usable for developers. Structured Commons provides a new infrastructure of fine-grained APIs and other machine-readable endpoints, so that developers both within and outside the Wikimedia community can create consistent, reusable and reliable software that helps with editing, reusing and analyzing Commons media and its associated data. Without structured data, such tools rely on short-term solutions that break or produce bad data when MediaWiki core changes or when the volunteer community updates wikitext or categories.
  3. When it becomes easier to search Wikimedia Commons, including in multiple languages, Wikimedia contributors can illustrate Wikimedia projects such as Wikipedia more effectively. Without structured data, Wikipedians need to know English, need to know the category system on Commons well, and/or need to know the specific terms uploaders used to describe the files, in order to find suitable illustrations on Commons.
  4. Structured data allows for easier and simpler partnerships with content providers, especially knowledge institutions and organizations with media collections (such as cultural institutions or GLAMs). Without structured data, mass uploads of larger sets of well-described media files to Commons are technically complicated, even with relatively user-friendly tools like Pattypan. With structured data, the precise and complex metadata of files in institutional databases can be integrated into Commons more easily, including at large scale.
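
The multilingual search described in items 1 and 3 can be illustrated with a minimal sketch. It assumes that each file carries language-independent concept identifiers (Wikidata-style IDs such as Q146) plus captions keyed by language code. The `MediaFile` class, the `CONCEPT_LABELS` table, the file titles and the helper functions are all hypothetical names made up for this illustration; they are not part of any Commons API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class MediaFile:
    """Hypothetical stand-in for a file with structured data attached."""
    title: str
    captions: Dict[str, str] = field(default_factory=dict)  # language code -> caption
    depicts: Set[str] = field(default_factory=set)          # concept IDs, e.g. "Q146"


# Illustrative label table (an assumption): one concept ID, many languages.
CONCEPT_LABELS: Dict[str, Dict[str, str]] = {
    "Q146": {"en": "cat", "de": "Katze", "fr": "chat"},
    "Q144": {"en": "dog", "de": "Hund", "fr": "chien"},
}


def find_concept(term: str, lang: str) -> Optional[str]:
    """Map a search term in the given language to a concept ID, if known."""
    wanted = term.casefold()
    if not wanted:
        return None
    for concept_id, labels in CONCEPT_LABELS.items():
        if labels.get(lang, "").casefold() == wanted:
            return concept_id
    return None


def search(files: List[MediaFile], term: str, lang: str) -> List[str]:
    """Return titles of files that depict the concept named by `term`."""
    concept_id = find_concept(term, lang)
    if concept_id is None:
        return []
    return [f.title for f in files if concept_id in f.depicts]


def caption_for(media: MediaFile, lang: str, fallback: str = "en") -> Optional[str]:
    """Pick the caption in the reader's language, falling back if missing."""
    return media.captions.get(lang) or media.captions.get(fallback)


# Made-up example files; only English captions were supplied by the uploader.
FILES = [
    MediaFile("File:Example cat.jpg", {"en": "A cat on a sofa"}, {"Q146"}),
    MediaFile("File:Example dog.jpg", {"en": "A dog in a park"}, {"Q144"}),
]

if __name__ == "__main__":
    # A German search term finds a file that was described only in English.
    print(search(FILES, "Katze", "de"))
    print(caption_for(FILES[0], "de"))
```

Because the match happens on the concept ID rather than on the caption text, the searcher does not need to know the language the uploader used.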

Impact and benefits for other organizations

  1. With structured data, Wikimedia Commons gains a significant and highly valued new advantage for partner organizations who donate media: it will finally become possible to follow and review changes made to 'their' media on Commons, such as improvements and translations of the metadata. Once Wikimedia Commons has refined, structured APIs, it also becomes possible to import these changes back into institutions' own catalogues. In this way, the Wikimedia community not only receives materials from GLAMs around the world, but can also give back, in the form of improved and updated metadata in a clean and consistent format.
  2. Structured data also makes Wikimedia Commons more attractive for knowledge institutions around the world, because a structured environment aligns much better with the advanced metadata in the specialized repositories that such institutions have built over recent decades. Better search and findability of media on Commons also provides a greater incentive to share collections there. Without structured data, the main incentive for institutions to upload to Commons is the volume of Wikipedia page views from pages that contain their media files. By improving Commons itself and expanding the ways people can search for and reuse images, we greatly increase the usefulness of Commons, including for files that are not used as illustrations on Wikipedia.
  3. Many knowledge organizations, especially in regions like South and Southeast Asia, Latin America and Africa, don't have support from online cultural aggregators like Europeana, Trove and DPLA, and sometimes don't even have the technical capacity to host their own digitized collections. With structured data in particular, Wikimedia Commons can fill this gap, becoming a de facto hosting platform and aggregator for cultural media across the world: a reliable venue for sharing cultural heritage content in free formats and under free licenses.

Impact and benefits for re-use of Commons media across the web

  1. Structured data on Commons makes it easier to dynamically re-use and embed Wikimedia Commons content with proper attribution: because the data behind media is provided in a structured form, via detailed APIs, many content management systems and platforms (such as Drupal and WordPress) can develop embed tools and plugins that help their end users to use media from Commons while complying correctly with our licensing.
  2. The vocabulary for describing media files (such as creators, institutions, depicted people, places, animals, plants, buildings, historical events…) is drawn from Wikidata. There, these concepts are linked with the wider internet via identifiers. This allows for cross-internet discovery of relationships between media files, a foundational principle of the semantic web and Linked Open Data.
  3. With structured data, the content on Wikimedia Commons can be archived more faithfully and more consistently by the Internet Archive and other digital archiving services, ensuring longevity of that content even if Wikimedia projects disappear. Digitally archiving media files becomes easier and more precise when their associated metadata is properly structured.
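
As a rough illustration of what "structured form" means for re-users and archivers, the sketch below reads a small, hand-written JSON record and pulls out the captions and the linked Wikidata identifiers. The record is an assumption: a simplified sample loosely modeled on the Wikibase entity JSON format, with a made-up entity ID (`M1`) and made-up captions. P180 is the Wikidata property "depicts" and Q146 is the item for the house cat; the function names are hypothetical. No network request is made.

```python
import json
from typing import Dict, List

# Hand-written, simplified sample record (an assumption, not real API output).
SAMPLE_RESPONSE = json.loads("""
{
  "entities": {
    "M1": {
      "type": "mediainfo",
      "id": "M1",
      "labels": {
        "en": {"language": "en", "value": "A cat on a sofa"},
        "de": {"language": "de", "value": "Eine Katze auf einem Sofa"}
      },
      "statements": {
        "P180": [
          {
            "type": "statement",
            "rank": "normal",
            "mainsnak": {
              "snaktype": "value",
              "property": "P180",
              "datavalue": {
                "type": "wikibase-entityid",
                "value": {"entity-type": "item", "id": "Q146"}
              }
            }
          }
        ]
      }
    }
  }
}
""")


def captions(entity: dict) -> Dict[str, str]:
    """Return captions keyed by language code."""
    labels = entity.get("labels") or {}
    return {lang: label["value"] for lang, label in labels.items()}


def linked_items(entity: dict) -> Dict[str, List[str]]:
    """Return, per property ID, the item IDs that the statements point to."""
    result: Dict[str, List[str]] = {}
    statements = entity.get("statements") or {}
    for prop, claims in statements.items():
        ids = []
        for claim in claims:
            snak = claim.get("mainsnak", {})
            if snak.get("snaktype") != "value":
                continue  # skip "unknown value" / "no value" statements
            datavalue = snak.get("datavalue", {})
            if datavalue.get("type") == "wikibase-entityid":
                ids.append(datavalue["value"]["id"])
        if ids:
            result[prop] = ids
    return result


if __name__ == "__main__":
    entity = SAMPLE_RESPONSE["entities"]["M1"]
    print(captions(entity))
    print(linked_items(entity))
```

An embed plugin or an archiving service could work from fields like these instead of parsing free-form wikitext, which is the kind of robustness the points above describe.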