Commons:Batch uploading/Minerals from Rob Lavinsky on mindat

Minerals from Rob Lavinsky on mindat

Old issues

I have moved the old issues from this page to Commons:Batch uploading/Minerals from Rob Lavinsky on mindat/Old issues and will try a new jump start here. --Reinhard Kraasch (talk) 09:05, 12 April 2010 (UTC)

Quantity structure

  • 274905 images have been downloaded from mindat.org
  • 274078 of these have a description
  • 256480 of these are images of minerals
  • 34951 images are copyrighted by Rob Lavinsky
  • 34918 of these have sufficient size and will be uploaded
  • 34917 have been uploaded (1 image was defective)
  • 4005 different localities
  • 4370 different mineral descriptions in mindat.org
  • 1274 of these are used in conjunction with Rob Lavinsky's images
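The first six figures above form a filtering funnel. As a minimal sketch (the stage labels are paraphrased from the list, and the helper name `funnel` is only illustrative), the number of images dropped at each stage can be recomputed like this:

```python
# Counts copied from the list above; the stage labels are paraphrased.
stages = [
    ("downloaded from mindat.org", 274905),
    ("with a description", 274078),
    ("images of minerals", 256480),
    ("copyrighted by Rob Lavinsky", 34951),
    ("of sufficient size", 34918),
    ("uploaded", 34917),
]


def funnel(stages):
    """Return (label, count, dropped_since_previous_stage) for each stage."""
    rows = []
    previous = None
    for label, count in stages:
        dropped = 0 if previous is None else previous - count
        rows.append((label, count, dropped))
        previous = count
    return rows


for label, count, dropped in funnel(stages):
    print(f"{count:>7}  {label} (dropped: {dropped})")
```

The last two differences match the text: 33 images were too small, and 1 image was defective.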

Further proceeding

I will soon:

  • generate a new locality category list (here it is: User:Reinhard Kraasch/Localities) --Reinhard Kraasch (talk) 13:47, 12 April 2010 (UTC)
  • nuke the pages already uploaded / generated by the bot completely (this is much easier than modifying some and deleting others)
  • and then start a new upload (of 100 files) considering the modifications below. --Reinhard Kraasch (talk) 09:26, 12 April 2010 (UTC)
Some questions:
The locality list looks good, but why are there blanks between the ":" and the locality name?
Fixed. Furthermore I had to reassign a few categories which were not unique (like "Apex Mine", there are several "Apex Mines"). --Reinhard Kraasch (talk) 09:15, 13 April 2010 (UTC)
Before you delete the pictures, could you make a list of the pictures that have an added description (or have edits after the bot edit), like File:Silver and Acanthite - Imiter Mine, Boumalne-Dades, Ouarzazate, Souss-Massa-Draa, Morocco.jpg (translation into fr: and it:)? It would be a pity to lose those descriptions. greetings -- Ra'ike T C 21:36, 12 April 2010 (UTC)
You find the list here: User:Reinhard Kraasch/Modified pages. Most modifications seem to be fixes to the categories (like "Diamond" --> "Diamonds"), which should of course better be done by the bot (by fixing the category assignment in the category list).
The manually uploaded files like File:Silver and Acanthite - Imiter Mine, Boumalne-Dades, Ouarzazate, Souss-Massa-Draa, Morocco.jpg are another issue, however. They will be duplicates after the bot uploads, and these will have to be identified and merged manually.
And we will have to tell some users that changes like [1] are superfluous, since the creator category is set by the template. --Reinhard Kraasch (talk) 09:15, 13 April 2010 (UTC)
It shouldn't be a problem to merge the double pictures. There were only a few of them.
I messaged Johnny Controletti that user categorisation is needless and that he should have a look at this page for information. -- Ra'ike T C 20:06, 13 April 2010 (UTC)

Can anybody tell me why the category does not show up with the images, even though they are obviously in the category:

??? --Reinhard Kraasch (talk) 12:13, 14 April 2010 (UTC)

Category:Images by Rob Lavinsky is a hidden category, by default not visible. Change your "My Preferences"/appearance to enable hidden category display. --Foroa (talk) 12:43, 14 April 2010 (UTC)
OK, thanks (I thought I had enabled the display of hidden categories, but that was in de: and not in Commons). --Reinhard Kraasch (talk) 15:38, 14 April 2010 (UTC)
I would appreciate it if you deleted all categories that are to be deleted soon. I have already categorized hundreds of uncategorized mine categories in Category:Mines by country categories; I try to keep Special:UncategorizedCategories almost empty. When you produce new bot-created categories, it would be a major time saver if you could already categorize them in the respective Category:Mines by country and/or Category:Minerals by country categories. --Foroa (talk) 06:36, 15 April 2010 (UTC)
I have meanwhile deleted all uploads. The categorization of categories is already on my agenda, I have modified the list such that it shows the super-categories. --Reinhard Kraasch (talk) 17:58, 15 April 2010 (UTC)
I created all needed mines/minerals by country categories. There are only some problems with (Democratic) Congo, the republic of Macedonia and Korea. Not sure what to do with minerals of Antarctica and oceans. --Foroa (talk) 10:09, 16 April 2010 (UTC)
It's not too hard, I think. We have Category:Antarctica, Category:Republic of Macedonia, Category:Korea, Category:Republic of the Congo, and for the oceans we also have adequate categories (like Category:Atlantic Ocean), so the mineral categories can be sorted in there ;-) greetings -- Ra'ike T C 11:28, 16 April 2010 (UTC)
Oops, I forgot that there is a Category:Republic of the Congo and a different Category:Democratic Republic of the Congo. Korea is no country, so it should be possible to refine that to South or North Korea. For Macedonia, there is a "the" missing. --Foroa (talk) 12:14, 16 April 2010 (UTC)

Some pictures show not the mineral itself but a variety of it (like File:Smithsonite-155878.jpg - cuprian smithsonite). Which mineral category should these pictures get (a variety category or the category of the main mineral)? At the moment the pictures have a red variety category, but it is a lot of work to create categories for every variety (there is no systematic scheme or list of varieties). Or can the bot create these categories and sort them correctly as sub-categories? --Orci Disk. 11:36, 16 April 2010 (UTC)

Please do not spend efforts in doing this manually, this can (and will) easily be done by the bot. The same applies to other routine work (e.g. moving categories). --Reinhard Kraasch (talk) 12:26, 16 April 2010 (UTC)
If the bot can do it, it's OK. --Orci Disk. 13:04, 16 April 2010 (UTC)
Why it's "Republic of the Congo" on one hand and "Germany" on the other, is hard to understand... I fixed "Macedonia" meanwhile. With "Korea" we cannot do much, since "Korea" is the only information available for this location on Mindat. --Reinhard Kraasch (talk) 12:26, 16 April 2010 (UTC)
And I don't understand your strange combination "Republic of the Congo" on one hand and "Germany" on the other ;-), but I found a mistake in the category of (for example) File:Quartz-20131.jpg. It shouldn't be named "Minerals of Herkimer Co", but "Minerals of Herkimer County" (no abbreviation of the county). Could you please fix it, and other categories of that kind, if there are any? greetings -- Ra'ike T C 16:35, 16 April 2010 (UTC)
Well, when it comes to the official names, then "Germany" is "Federal Republic of Germany", isn't it? The abbreviation of "County" to "Co." (as well as "Municipio" to "Mun.", "Mountain" to "Mt." - or "Mt" - etc.) is specific to mindat.org. To really fix this, one would have to look up all the county (municipio, mountain ...) names as used in Wikipedia and build a matching table, and - regarding the counties - perhaps sort in the categories as sub-categories of the specific county. --Reinhard Kraasch (talk) 17:19, 16 April 2010 (UTC)
That's a language issue: when looking in category:Alps, there are many "xxx in the Alps" categories too. I did not dare to change other category names (mt to mount, co. to county) because that might break links in the uploading process. --Foroa (talk) 17:53, 16 April 2010 (UTC)
Well, the categorization system (both that of Commons and that of mindat.org) is not canonical to the very end and has several ambiguities and imperfections by design (e.g. the rule in Commons "to use English names where appropriate", which opens a wide field of ambiguities ...). And mindat of course uses another scheme than Commons does. If you look e.g. at [2] - it's "Germany and Czech Republic" in mindat; in Commons this would have to be two categories (at least): "Mine in Germany" and "Mine in the Czech Republic". Another problem is the lack of precision of some locations (as in the "Korea" example). I fear we just cannot cope with all these specifics and will have to leave a lot of things just the way they are. --Reinhard Kraasch (talk) 18:34, 16 April 2010 (UTC)
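The matching table discussed above (mindat.org abbreviations versus the spelled-out names) could start out as a simple lookup. The following is only a sketch: the three entries are the abbreviations named in this discussion, the function name `expand_abbreviations` is hypothetical, and a real table would still have to be checked against the names actually used in Wikipedia.

```python
import re

# Abbreviations named in the discussion, with the expansions given there.
# Assumption: each abbreviation appears as a whole word, with or without a dot.
ABBREVIATIONS = {
    "Co": "County",
    "Mun": "Municipio",
    "Mt": "Mountain",
}

_PATTERN = re.compile(r"\b(" + "|".join(ABBREVIATIONS) + r")\b\.?")


def expand_abbreviations(name):
    """Replace whole-word abbreviations (and an optional trailing dot)."""
    return _PATTERN.sub(lambda match: ABBREVIATIONS[match.group(1)], name)


print(expand_abbreviations("Minerals of Herkimer Co"))
```

Words that merely begin with an abbreviation (for example "Cobalt") are left alone because of the word-boundary check. Whether "Mt" should become "Mountain" or "Mount" depends on the individual name, which is exactly the kind of ambiguity mentioned above.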
Another fix is needed. In the description of File:Spodumene-18945.jpg the German variety is linked to de:Spodumen, but the article de:Kunzit exists, in contrast to the English Wikipedia, where Kunzite is a redirect to Spodumene. -- Ra'ike T C 16:46, 16 April 2010 (UTC)
The translations are a bit heuristic (I have used mineralienatlas.de for the translation and did not track all possible redirect combinations in Wikipedia), but it should be (Var.: [[:de:Spodumen|Kunzit]]) and not (Var.: [[:de:Spodumen]]), I will check this. --Reinhard Kraasch (talk) 17:19, 16 April 2010 (UTC)
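A sketch of how the variety link could be chosen, combining both comments: link directly to the variety article if it exists, otherwise pipe the variety name onto the article of the main mineral. The function name and the `existing_articles` set are hypothetical; how the bot actually looks up existing articles is not described here.

```python
def variety_link(lang, mineral_article, variety_name, existing_articles):
    """Build the '(Var.: ...)' interwiki link for a mineral variety.

    existing_articles is an assumed set of article titles known to exist
    in the target-language Wikipedia.
    """
    if variety_name in existing_articles:
        # The variety has its own article: link to it directly.
        return f"(Var.: [[:{lang}:{variety_name}]])"
    # Otherwise link to the main mineral but display the variety name.
    return f"(Var.: [[:{lang}:{mineral_article}|{variety_name}]])"


print(variety_link("de", "Spodumen", "Kunzit", existing_articles=set()))
print(variety_link("de", "Spodumen", "Kunzit", existing_articles={"Kunzit"}))
```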

I spot-checked some uploads and it seems OK. I made two minor corrections in the locality file. I think you can start the bot category creation (first, so you can see red fields when loading images with non-existing categories) and upload. Other category changes can be done later or by bot move. We will see what's in the Minerals of Korea cat. --Foroa (talk) 16:52, 16 April 2010 (UTC)

OK, if there is no contradiction I will generate the locality cats then (in the "Korea" cat are only two images, according to the list) -- Reinhard Kraasch (talk) 17:11, 16 April 2010 (UTC)

Last summary of picture descriptions and requests for picture description

File name

<Mineral1>[-<Mineral2>[-<Mineral3>]]-<MindatID>.jpg

  • consensus of Reinhard Kraasch, Orci and me: No need to change the name to RL-<Mineral1>[-<Mineral2>[-<Mineral3>]]-<MindatID>.jpg
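The naming scheme can be sketched as follows. This is an illustration, not the bot's actual code: the function name is hypothetical, and cutting the list at three minerals is an assumption read off the pattern. Repeated mineral names are dropped so that names of the sort Quartz-Quartz-xyz.jpg mentioned on this page cannot occur.

```python
def build_file_name(minerals, mindat_id):
    """Return <Mineral1>[-<Mineral2>[-<Mineral3>]]-<MindatID>.jpg."""
    unique = []
    for name in minerals:
        if name not in unique:  # drop repeats but keep the original order
            unique.append(name)
    # Assumption: at most three mineral names are used, as in the pattern.
    parts = unique[:3] + [str(mindat_id)]
    return "-".join(parts) + ".jpg"


print(build_file_name(["Quartz", "Quartz"], 20131))
print(build_file_name(["Smithsonite"], 155878))
```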
Description
English: Mineral1 (optional Mineral2 and so on)
Locality: complete locality description from mindat.org with links to existing wikipedia articles and link to the locality description of mindat.org
complete description of the mineral shown, from mindat.org
Deutsch: Mineral1 (optional Mineral2 usw.)
Fundort: Komplette Beschreibung des Fundortes mit Links zu existierenden WP-Artikeln und Link zur Fundortbeschreibung auf mindat.org
There were several files with names of the sort Quartz-Quartz-xyz.jpg (with duplicate mineral names); I will modify the naming procedure to eliminate such duplicates.
Source
no hidden link to the picture on mindat.org
Author
{{Creator:Rob Lavinsky}}
Date
{{other date|before|2010-03}}
Permission
(see below)

== {{int:license-header}} ==

{{Images by Rob Lavinsky}}
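Put together, the page text for one image might be assembled as in the sketch below. Wrapping the fields in the standard Commons {{Information}} template is an assumption, since the summary above only lists the field contents; the function name and the example URL are hypothetical. The language templates are written with an explicit "1=" so that an equals sign inside a description is not parsed as a parameter name.

```python
def build_page_text(english, german, source_url):
    """Assemble the wikitext of a file description page.

    Assumption: the fields are wrapped in {{Information}}; the summary
    only specifies what goes into each field.
    """
    lines = [
        "{{Information",
        "|description={{en|1=" + english + "}}",
        "{{de|1=" + german + "}}",
        "|source=" + source_url,  # visible link, not a hidden one
        "|author={{Creator:Rob Lavinsky}}",
        "|date={{other date|before|2010-03}}",
        "|permission=(see below)",
        "}}",
        "",
        "== {{int:license-header}} ==",
        "",
        "{{Images by Rob Lavinsky}}",
    ]
    return "\n".join(lines)


# The URL is a placeholder, not a real mindat.org address.
print(build_page_text("Quartz", "Quarz", "https://example.org/photo"))
```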

Categorisation
  • consensus: Change only those categories that don't have the suffix "Mine" (or "Quarry" ...) in the locality description to "Category:Minerals of <first location name of the mindat locality description>". -- Ra'ike T C 00:21, 4 April 2010 (UTC)
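The categorisation rule can be sketched like this. Only the two suffixes named in the rule are checked (the "..." in the rule is left open), and using the first location name unchanged as the category name for mines and quarries is an assumption. The function name and the example localities are illustrative.

```python
# Only the two suffixes named in the rule; the rule's "..." is not filled in.
SITE_SUFFIXES = ("Mine", "Quarry")


def locality_category(locality):
    """Derive a category name from a comma-separated mindat locality string."""
    first = locality.split(",")[0].strip()
    if first.endswith(SITE_SUFFIXES):
        # Assumption: mines and quarries keep their own name as category.
        return "Category:" + first
    return "Category:Minerals of " + first


print(locality_category(
    "Imiter Mine, Boumalne-Dades, Ouarzazate, Souss-Massa-Draa, Morocco"))
print(locality_category("Herkimer County, New York, USA"))
```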

Progress of the request (failed, uploading, coding, done)

Assigned to | Progress | Bot name | Category
Reinhard Kraasch | finished | RKBot | Category:Images by Rob Lavinsky

Details

Assigned to | Job | Status | Comments
Reinhard Kraasch | Image (and description) download from mindat.org | Done 12:24, 15 March 2010 (UTC) | All mindat.org images have been downloaded
Reinhard Kraasch | Generate image descriptions, autotranslate locality info | Done 19:45, 21 March 2010 (UTC) |
Reinhard Kraasch | Generate and autotranslate category info | Done 19:45, 21 March 2010 (UTC) |
Various | Discussion of image description layout | Done 21:38, 29 March 2010 (UTC) | Sample - discussion above
Various | Discussion of locality categories | Done 21:38, 29 March 2010 (UTC) | Sample - discussion above
Various | Discussion of category layout | Done 21:38, 29 March 2010 (UTC) | Sample - discussion above
Reinhard Kraasch | Test upload | Done 19:45, 21 March 2010 (UTC) | (9 images)
Reinhard Kraasch | Test upload | Done 21:38, 29 March 2010 (UTC) | (100 images)
Various | Discussion of test upload | Done 20:55, 15 April 2010 (UTC) |
Reinhard Kraasch | Fixes to upload procedure | Done 20:55, 15 April 2010 (UTC) |
Reinhard Kraasch | Nuke all pages created so far | Done 18:00, 15 April 2010 (UTC) |
Reinhard Kraasch | Test upload | Done 15:24, 16 April 2010 (UTC) | (200 images, 56 locality categories)
Various | Discussion of test upload | Done 21:34, 24 April 2010 (UTC) |
Reinhard Kraasch | Generation of locality categories | Done 21:34, 24 April 2010 (UTC) |
Reinhard Kraasch | Actual image upload | Done 18:23, 27 April 2010 (UTC) |
Reinhard Kraasch | Generation of mineral categories | Done 18:23, 27 April 2010 (UTC) |
Reinhard Kraasch | Identify and delete duplicates | Done 19:28, 27 April 2010 (UTC) |
Reinhard Kraasch | Reduce and upload oversize images | Done 19:28, 27 April 2010 (UTC) |

Caught mistakes

  • In this picture, I had to add the parameter "1=", because the complete description wasn't shown. -- Ra'ike T C 22:11, 24 April 2010 (UTC)
I fixed the description meanwhile (with the further uploaded files: [3]) --Reinhard Kraasch (talk) 07:15, 25 April 2010 (UTC)
Ok, thank you. At the moment I am checking the modified pages. When I see a mistake there, I mark the picture with "checked and corrected", so you can check whether something can be fixed with the bot. greetings -- Ra'ike T C 08:44, 25 April 2010 (UTC)
I propose to fix locality descriptions only in the locality category, but not in all of the image descriptions (which can be thousands). I can then (after a while) update the image descriptions from the localities with a bot run. --Reinhard Kraasch (talk) 11:00, 25 April 2010 (UTC)

To be done

  • update the locality info after a while (see above)
  • augment the other mineral categories by interlanguage links etc. generated in this project
  • ...
  • and, of course, upload the images from irocks.com - but that's a different job... --Reinhard Kraasch (talk) 21:42, 27 April 2010 (UTC)