Commons:Structured data/GLAM/Projects

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Official pilot projects (with mentorship and documentation)[edit]

In the course of 2019, Sandra Fauconnier (Program Officer for GLAM at the Wikimedia Foundation) and the Structured Commons team mentor 5 to 8 representative GLAM-Wiki projects that apply structured data on Wikimedia Commons for the first time.

The projects are typical examples of the main types of GLAM-Wiki projects on Commons:

  1. community-driven projects and campaigns;
  2. partnerships with cultural institutions, involving uploads of digital files representing cultural heritage in the broad sense.

Some of the projects also explore the potential of the new features of Wikimedia Commons:

  • adding new (structured) metadata, annotations and transcriptions to existing, already uploaded files;
  • development of new applications on top of Wikimedia Commons' structured data;
  • projects that encourage and emphasize re-use of the uploaded files.

These projects receive mentorship and inspire new documentation for Commons partnerships. Their documentation can then be used by Wikimedians and GLAM partners to apply structured data in their own projects.

Partnerships with cultural institutions[edit]

Art museum collections - Museum voor Schone Kunsten Gent[edit]

Images from the collections of Museum voor Schone Kunsten Gent, Belgium

  • Pilot project by Packed/VIAA, Belgium
  • Example structured data upload of a typical art museum collection
    • How to model such files in structured data?
    • How to link them with Wikidata?
  • Documentation how to do batch uploads to Wikimedia Commons in structured data format for a typical art museum collection

Archival highlights - FelixArchief[edit]

Images from the collections of FelixArchief, Belgium (Antwerp's municipal archive)

  • Pilot project by Packed/VIAA, Belgium
  • Example structured data upload of a typical collection of archival highlights: maps, prints and photographs (from a municipal archive)
    • How to model such files in structured data?
    • How to link them with Wikidata?
  • Documentation how to do batch uploads to Wikimedia Commons in structured data format for a typical archival highlights collection

Decorative arts - Project Wolfers[edit]

Images of decorative works (jewellery, decorative objects) by the Wolfers dynasty: Louis Wolfers, Philippe Wolfers, Marcel Wolfers and their companies and collaborators

  • Pilot project by Packed/VIAA, Belgium
  • Example structured data upload of a typical decorative arts collection, including designs and sketches (from various collections)
    • How to model such files in structured data?
    • How to link them with Wikidata?
    • How to correctly model copyright
  • Documentation how to do batch uploads to Wikimedia Commons and Wikidata in structured data format for a typical decorative arts collection

Encyclopedic museum collections - Metropolitan Museum of Art[edit]

Images of works described in the Heilbrunn Timeline of Art History, and of all works on public display at The Met

  • Pilot project by the Metropolitan Museum of Art, New York
  • Example structured data upload of a very culturally diverse museum collection
    • How to model such files in structured data?
    • How to link them with Wikidata?
  • Documentation how to do batch uploads to Wikimedia Commons and Wikidata in structured data format for a very diverse museum collection

Special materials - Maps and aerial photographs from Instituto Geográfico Nacional[edit]

Digitized topographical maps, aerial photographs and historical maps from Instituto Geográfico Nacional in Spain.

  • Pilot project by Wikimedia España
  • Example structured data uploads of topographical maps, aerial photographs and historical maps
    • How to model such files in structured data?
    • How to link them with Wikidata?
  • Documentation on how to do batch uploads to Wikimedia Commons in structured data format for topographical maps, historical maps and aerial photographs

Community-driven projects and campaigns[edit]

Digitized books and Wikisource - Punjabi Qisse[edit]

  • Pilot project by Punjabi Wikimedians, mentored by Satdeep Gill
  • In the course of 2019, a small pilot project takes place to practice the full Wikisource workflow, including uploading files to Wikimedia Commons in structured data format. It consists of
    • Digitization of a small set of Punjabi books (in the Qisse genre)
    • Upload of the digitized files to Wikimedia Commons, in structured data format
    • Upload of the books' metadata (and author data) to Wikidata
    • Indexing and transcribing the books on Wikisource
    • Inclusion of the metadata of the books on Wikisource
  • Example structured data uploads of digitized publications that will be transcribed on Wikisource
    • How to model such files in structured data?
    • How to link them in Wikidata?
  • Outline of a good and efficient workflow, from digitization to upload to Commons, data entry in Wikidata, and integration and transcription on Wikisource
  • Documentation on how to upload files for Wikisource in structured data format on Wikimedia Commons

Software and tools with Structured Data on Commons[edit]

ISA, a 'microcontributions' tool for Wikimedia Commons[edit]

ISA Structured Data logo.png

ISA is a fun, multilingual, mobile-first 'microcontributions' tool, that makes it easy for (groups of inexperienced) people to add structured data to images on Wikimedia Commons.

  • With ISA, you can choose a pre-defined set of images on Commons and then ask contributors to 'tag' these with multilingual structured metadata. Points are counted for each contribution, and therefore it is possible to organize 'tagging' or microcontributions competitions or challenges with ISA.
  • ISA is originally built to provide better multilingual and structured descriptions of Wiki Loves Africa images. But it is also developed to be useful to all of the Wiki Loves competitions, and eventually for all media files on Wikimedia Commons.
  • ISA is developed as a collaboration between Wiki In Africa, Histropedia and the Structured Data on Commons project.

Wikimedia Commons Data Roundtripping[edit]

  • The Swedish National Heritage Board researches and develops a prototype tool to provide improved metadata (translations, data additions...) from Wikimedia Commons back to the source institution.
  • Research report on the desirability and requirements of GLAM-collections managers in regards to retrieving metadata added to their files post-upload on Wikimedia Commons
  • A tested prototype tool that supports GLAM-collections managers in identifying, reviewing and retrieving added or changed metadata to media files.
  • Report on lessons learned and recommend future actions (or inaction).

Timeline[edit]

  • April 2018: Workshop at Wikimedia Conference, attended by more than 50 people, creating a longlist of potential pilot projects.
  • September 2018: A pre-selection of pilot projects has been made! The Structured Commons team is talking with the projects' representatives to refine the planning. The final list of projects will be published on this page.
  • First half of 2019: Pilot projects are executed. / Documentation and reporting.

Other GLAM projects with Structured Data on Commons[edit]

Do you also plan to do an early project with Structured Data on Commons in 2019? Feel free to list your project below:

GLAM projects with Structured Data on Commons in 2019[edit]

  • The FindingGLAMs project by Wikimedia Sweden will perform Wikimedia Commons uploads with structured data where possible.
  • National Library of Wales, oil paintings - I have 600 oil paintings ready to upload to Commons and Wikidata. It would be great to trial this upload with structured data in early 2019. Let me know if you are interested. Jason.nlw (talk) 09:45, 8 December 2018 (UTC)
Hello @Jason.nlw: this is great to hear :-) This should be possible after full SDC deployment, February-March according to current estimates. Do you intend to upload new works (i.e. ones that have no Wikidata items and/or Commons images yet)? The Sum of all Paintings folks should then be quite interested. SandraF (WMF) (talk) 09:00, 12 December 2018 (UTC)
Hi SandraF (WMF), yes these will be new to Wikidata and Commons. I have already had a chat with Maarten about this upload. I will see how things go. If we decide to upload these earlier, I think we might be able to get another batch ready by March any way. I will keep in touch. Jason.nlw (talk) 09:40, 13 December 2018 (UTC)
  • NOA project, see related blogpost: "This means we keep information in three(!) classifications for each image. One of the reasons for this is that Wikimedia Commons is currently restructuring: In the future, the categories on commons could be replaced by multilingual structured descriptions, which is currently being tested in the Structured Data project. In order to fit into the new data format, our indexing would also have to adapt. Since Structured Commons is mainly about describing what is depicted on an image with the help of Wikidata, our automatic indexing might reach its limits. Scientific images often depict information instead of clearly defined visual entities, making it difficult for people to formulate which concept is depicted, so teaching this to machine is even harder. We have already given feedback on the discussion pages so that images like ours can be effectively described in the future. A more intensive exchange of Structured Commons and NOA would be a good idea. We are ready for this :)"
Thank you for listing this project! It will be very valuable indeed if you actively engage with the Wikimedia Commons and Wikidata communities on the topic of data modelling and correct description of media files from scientific publications. I think WikiCite is one relevant project here, in which you are already active. Other suggestions from community members are welcome. SandraF (WMF) (talk) 07:28, 2 April 2019 (UTC)
  • Your project here:
  • ...

Questions and support[edit]

If you have questions and/or need support, feel free to post a message on the general Structured Data on Commons discussion page, and/or e-mail Sandra Fauconnier, Program Officer for Structured Data at the Wikimedia Foundation: sfauconnier@wikimedia.org.