Commons:Batch uploading/Kurt Rasmussen
- Source to upload from: Kurt Rasmussen, bahnbilder.de
- Did you observe a URL pattern?
- bahnbilder.de/xxxx/some-name.jpg where xxxx is a four-digit number
- Do you know whether the site has an API?
- I think not.
- What else can ease uploading (is the site valid XHTML, WCM they use…)?
- Essentially, the algorithm that needs to be implemented is:
- for each page in the linked search results
- for each div class="bildvorschau" in the search results
- download the url given in the first a href=, use this URL as source in the final information template
- in the now downloaded file, find div class="bildcontainer"
- in it, from the p class="beschreibung", extract the description to be used in the final information template
- from the img tag immediately following it, download the url in the src attribute
- upload it to Commons
- Describe the works to be uploaded in detail (audio files, images by …):
- All images by Kurt Rasmussen.
- Which license tag(s) should be applied?
- Is there a template that could be used on the file description pages? Do you think a special template should be created?
- Just standard {{Information}}, like File:Wien-wvb-sl-25-e1-554062.jpg. My suggestion would be to add at least a "needing categories" category - I'm prepared to do the categorizations at least for photos of Vienna.
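The scraping steps listed above could be sketched roughly as follows. This is only an illustration using the Python standard library, not the code that was actually run for this batch. It assumes the class names mentioned in the request (`bildvorschau`, `bildcontainer`, `beschreibung`) and leaves out fetching the pages, paging through the search results, and the upload itself. The helper names (`preview_links`, `parse_detail`, `information_wikitext`) and the sample HTML and paths are hypothetical. No license tag is filled in, since none is specified in the request.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PreviewLinkParser(HTMLParser):
    """Collect the first <a href> inside each <div class="bildvorschau">."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._depth = 0       # <div> nesting depth inside a preview div; 0 = outside
        self._found = False   # True once the current preview div has yielded a link

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "div":
            if self._depth:
                self._depth += 1
            elif "bildvorschau" in (attrs.get("class") or "").split():
                self._depth, self._found = 1, False
        elif tag == "a" and self._depth and not self._found and attrs.get("href"):
            self.links.append(attrs["href"])
            self._found = True

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1


class DetailPageParser(HTMLParser):
    """Inside <div class="bildcontainer">, read <p class="beschreibung">
    and the src of the first <img> that follows it."""

    def __init__(self):
        super().__init__()
        self.description = None
        self.image_src = None
        self._depth = 0
        self._in_desc = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "div":
            if self._depth:
                self._depth += 1
            elif "bildcontainer" in classes:
                self._depth = 1
        elif not self._depth:
            return
        elif tag == "p" and "beschreibung" in classes and self.description is None:
            self._in_desc = True
        elif tag == "img" and self.description is not None and self.image_src is None:
            self.image_src = attrs.get("src")

    def handle_endtag(self, tag):
        if tag == "div" and self._depth:
            self._depth -= 1
        elif tag == "p" and self._in_desc:
            self._in_desc = False
            # Collapse runs of whitespace in the extracted description.
            self.description = " ".join("".join(self._parts).split())

    def handle_data(self, data):
        if self._in_desc:
            self._parts.append(data)


def preview_links(search_html, base_url):
    """Return absolute detail-page URLs found on one search results page."""
    parser = PreviewLinkParser()
    parser.feed(search_html)
    return [urljoin(base_url, href) for href in parser.links]


def parse_detail(detail_html, page_url):
    """Return (description, absolute image URL) for one detail page."""
    parser = DetailPageParser()
    parser.feed(detail_html)
    image_url = urljoin(page_url, parser.image_src) if parser.image_src else None
    return parser.description, image_url


def information_wikitext(description, source_url):
    """Build a bare {{Information}} block; the license tag is left to the uploader."""
    return (
        "{{Information\n"
        f"|description={description}\n"
        f"|source={source_url}\n"
        "|author=Kurt Rasmussen\n"
        "}}\n"
    )


if __name__ == "__main__":
    # Hypothetical sample markup, shaped like the structure described above.
    page_url = "http://bahnbilder.de/bild/1.html"
    sample = ('<div class="bildcontainer"><p class="beschreibung">Wien, Linie 25</p>'
              '<img src="/1234/some-name.jpg"></div>')
    description, image_url = parse_detail(sample, page_url)
    print(image_url)
    print(information_wikitext(description, page_url))
```

Using an HTML parser rather than regular expressions keeps the sketch tolerant of attribute order and nested `div` elements. A real run would still need to check each page for a missing description or image before uploading.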
I am also, in parallel, trying to coordinate a manual upload of this huge collection of extremely valuable photos. For that, see User:Darkweasel94/Rasmussen. darkweasel94 13:42, 13 December 2013 (UTC)
Opinions
Assigned to | Progress | Bot name | Category |
---|---|---|---|
darkweasel94 | finished coding, will upload in the next days | will probably do this from my own user account | Category:Files uploaded by darkweasel94 (cleanup) (also contains other stuff) |