Commons:Bots/Requests/Rybecbot (2)

From Wikimedia Commons, the free media repository

Rybecbot (2) (talk · contribs)

Operator: Rybec (talk · contributions)

Bot's tasks for which permission is being sought: Sometimes when files are transferred from Flickr by a bot, the tags on Flickr lead the bot to add inappropriate categories. For example, this photo of some rocks in shallow water was given the unhelpful categories Commons, Facebook, Flickr, Google, News, Pic, Wallpaper, Wiki, Wikipedia and Photographs. Mass importation of files causes a need for mass removal of categories, which is what this request is for.

Automatic or manually assisted: automatic with some supervision

Edit type (e.g. Continuous, daily, one time run): occasional

Maximum edit rate (e.g. edits per minute): 4

Bot flag requested: (Y/N): N (already has the flag)

Programming language(s): Python (standard pywikipedia category.py or replace.py script)

Rybec (talk) 02:14, 31 March 2013 (UTC)

Discussion

  • So do you remove every image in this kind of category, or only those from Flickr, or??? --99of9 (talk) 13:13, 31 March 2013 (UTC)
I think you could make better edit summaries, like Removing [[Category:News]]. Also, it would be a good idea to remove several categories in a single edit when applicable. --EugeneZelenko (talk) 14:22, 31 March 2013 (UTC)
  • The bots that import images from Flickr don't create new categories, but only add images to categories that already exist, so my intention is not to get rid of the categories entirely. When a category should contain no images but only subcategories, then I would want to empty it of images, but typically there would be some images which belong and others which don't. My intention is to manually identify the ones which don't belong, make a list of them, then have the script make the changes.
I've done another three test edits to show removal of multiple categories in a single edit with a non-default edit summary: [1] [2] [3]. Rybec (talk) 21:09, 31 March 2013 (UTC)
Looks OK to me, but I think it would be a good idea to link to the categories in the edit summary. --EugeneZelenko (talk) 14:36, 3 April 2013 (UTC)
I don't object to doing that. Rybec (talk) 00:48, 4 April 2013 (UTC)
I’ve had the same kind of trouble with stuff bot-uploaded from Flickr being piled up in Category:Lisbon (a much more specific category than those above, of course), which is a pain to clean up manually. There are a few things that the importing tool could do automatically, like avoiding over-categorization (e.g. if a photo is under Category:Streets in Lisbon it should not also be under Category:Lisbon), but more or less human input and manual work is going to be needed sooner or later. A big problem is how to tell apart media that really belong in a given category from media that should be moved to more detailed subcategories within the same tree. I’d really like to have separate "(cleanup)" subcategories for all categories afflicted by this kind of flooding. -- Tuválkin 20:18, 5 April 2013 (UTC)
  • If you have to manually identify some files, I'm not sure how this is much different to cat-a-lot where any user can simply select all then unselect anything that should stay. --99of9 (talk) 10:50, 11 July 2013 (UTC)
  • I had tried Cat-a-lot before making this request, but I don't know how to work with more than 200 files at a time using it. At least one of the sets of uploads I was hoping to clean up after comprised roughly 4000 files. I seem to remember some messages about rate-limiting, but several months have passed and I didn't take notes. Another consideration was the spamming of people's watch lists. Rybec (talk) 00:39, 24 July 2013 (UTC)
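
The cleanup discussed in this thread (strip several unwanted categories from a file page in one edit, and name them in the edit summary) could be sketched roughly as follows. This is a minimal illustration in plain Python operating on page text only; it is not the pywikipedia category.py or replace.py script named in the request, and it does not fetch or save pages. The function names `remove_categories` and `edit_summary` are hypothetical, and the summary wording simply follows the "Removing [[Category:News]]" example suggested above.

```python
import re

# Matches one category link, with an optional sort key, plus the rest of its line.
CATEGORY_LINK = re.compile(
    r"\[\[\s*[Cc]ategory\s*:\s*([^\]|]+?)\s*(?:\|[^\]]*)?\]\][ \t]*\n?"
)


def normalise(name):
    """Treat underscores as spaces and capitalise the first letter."""
    name = name.replace("_", " ").strip()
    return name[:1].upper() + name[1:]


def remove_categories(wikitext, unwanted):
    """Return (new_text, removed): the text without the unwanted
    category links, and the names that were actually removed."""
    targets = {normalise(n) for n in unwanted}
    removed = []

    def drop(match):
        name = normalise(match.group(1))
        if name in targets:
            removed.append(name)
            return ""              # delete the link and its line ending
        return match.group(0)      # keep every other category untouched

    return CATEGORY_LINK.sub(drop, wikitext), removed


def edit_summary(removed):
    """Build one summary naming every category removed in this edit."""
    return "Removing " + ", ".join("[[Category:%s]]" % n for n in removed)


if __name__ == "__main__":
    page = (
        "Rocks in shallow water.\n"
        "[[Category:News]]\n"
        "[[Category:Rocks]]\n"
        "[[Category:Wallpaper]]\n"
    )
    new_text, removed = remove_categories(page, ["News", "Wallpaper"])
    print(new_text)
    print(edit_summary(removed))
```

Because all unwanted categories are removed in a single pass, each file would need only one edit, which also keeps the number of watchlist entries down.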

Ok, I suggest we approve this. Sorry for the delay. Any final comments? --99of9 (talk) 12:00, 23 September 2013 (UTC)

I'm still interested in doing this. Rybec (talk) 01:03, 24 September 2013 (UTC)
Task approved, thanks. --99of9 (talk) 14:26, 28 September 2013 (UTC)
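
The over-categorization check raised in the discussion (a photo under Category:Streets in Lisbon should not also be under Category:Lisbon) can be illustrated with a small sketch. It assumes the category tree is already available as a dictionary mapping each category to its parent categories; that `parents` mapping, its sample contents, and the function names are hypothetical, and nothing here queries Commons.

```python
def ancestors(category, parents):
    """Collect every category reachable by following parent links upward."""
    seen = set()
    stack = [category]
    while stack:
        current = stack.pop()
        for parent in parents.get(current, ()):
            if parent not in seen:   # also guards against category loops
                seen.add(parent)
                stack.append(parent)
    return seen


def redundant_categories(file_categories, parents):
    """Return the categories on a file that are ancestors of another
    category on the same file, i.e. candidates for removal."""
    cats = set(file_categories)
    redundant = set()
    for cat in cats:
        redundant |= (ancestors(cat, parents) - {cat}) & cats
    return redundant


if __name__ == "__main__":
    # Hypothetical fragment of a category tree.
    parents = {
        "Streets in Lisbon": ["Lisbon"],
        "Lisbon": ["Cities in Portugal"],
    }
    print(redundant_categories(["Streets in Lisbon", "Lisbon"], parents))
```

A check like this only catches redundant parent categories; deciding whether a file in a broad category belongs in a more detailed subcategory still needs the manual review described above.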