Commons:Bots/Work requests

From Wikimedia Commons, the free media repository
Jump to: navigation, search

Shortcut: COM:BR · COM:BWR

Bot policy and list · Requests to operate a bot · Requests for work to be done by a bot  · Requests for batch uploads

Filing cabinet icon.svg
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days.


Could someone compile a list of SVGs missing an XML declaration (<?xml version="1.0" encoding="UTF-8"?> and similar <?xml ?> tags)? Without it, SVGs do not render but there are still some cached renders of files without XML declarations from before 2010. Furthermore, could the files be reuploaded with the declaration prepended? Thanks, Jc86035 (talk) 10:10, 29 November 2017 (UTC)

Does this only affect files before 2010? Currently, I could only provide a list by downloading and searching the SVG information. This might be easy doable for SVG files before 2010 or 2012, when these files where still uncommon. --Schlurcher (talk) 07:14, 13 December 2017 (UTC)
Commons SVG Uploads
Year SVGs GiB
2005 1,221 0.1
2006 38,406 7.3
2007 57,618 5.7
2008 61,617 8.0
2009 75,615 15.0
2010 86,851 22.7
2011 123,016 39.9
2012 77,589 21.8
2013 135,798 309.8
2014 104,601 76.5
2015 93,687 30.6
2016 111,873 31.8
2017 223,506 39.6
2018– 37,767 2.2
Total 1,229,165 610.8
SELECT SUBSTRING(img_timestamp,1,4) AS Year, COUNT(*), SUM(img_size) FROM image WHERE img_media_type="DRAWING" GROUP BY 1 ORDER BY 1;
Its a tradeoff between development time and bandwidth. While backend support HTTP range requests (thus <2 GiB bandwidth for complete scan), the requests python library does not. You'll have to add custom headers to urllib2. —Dispenser (talk) 22:15, 13 December 2017 (UTC)
We have a 2017 upload with an <?xml tag, but is missing xmlns= so it does not render.
Oddities: The thumbnailer scales up the smallest cached render if SVG rendering fails.
A full run would take 40+ hours. —Dispenser (talk) 12:10, 14 December 2017 (UTC)
Reporting back, it took 32 hours to scan 1,138,986 SVGs. At least 4 files were lost. 1,779 SVGs are missing xmlns= and I will need to check if they fail to render. I may attempt a second run to determine severity of other issues. —Dispenser (talk) 04:41, 18 December 2017 (UTC)
@Dispenser: Thank you, but isn't that about 58,811 files short of the "Total" in the table above?   — Jeff G. ツ 13:41, 18 December 2017 (UTC)
The database connection died at the very end and it was close enough I didn't notice it. This weekend I have some time to work on it again. Dispenser (talk) 21:57, 27 January 2018 (UTC)
@Jeff G.: The second scan is done and nearly everything was logged or packed into a 616 MB archive. It took 29.3 hours to scan 1,229,573 SVGs of which 1,810 are missing xmlns= critical for in-browser rendering. —Dispenser (talk) 05:45, 30 January 2018 (UTC)
@Dispenser: Thanks, I hope it didn't take too much babysitting.   — Jeff G. ツ please ping or talk to me 10:19, 30 January 2018 (UTC)

SVGs missing xmlns
Year Files
2005 25
2006 793
2007 571
2008 161
2009 163
2010 48
2011 35
2015 1
2016 7
2017 6

The SVG xmlns rendering problem was first noticed end of 2012. A bug (T43174) was filed and declined. And apparently (from the above table) whatever mechanism was to deny uploading SVG without an xmlns has been removed. And that's all the history I could find. —Dispenser (talk) 23:21, 30 January 2018 (UTC)

Full-text search in SVG files[edit]

Can all SVGs containing (in the latest revision) «-Bold», «-Italic» and so on be identified and, preferably, categorized? Can the same be done for other perversions of Adobe Illustrator?

Attention: I am speaking about searching in files – SVG code is a kind of text. I do not request searching in «File:» wiki pages.
Incnis Mrsi (talk) 14:25, 13 January 2018 (UTC)

@Incnis Mrsi: It's not impossible, but it would currently require all 1.2 million SVGs to be downloaded on someone's computer or on the WMF cloud servers. At that point it would be better to have the bot periodically reupload all such files (for fixes which are very unlikely to cause rendering issues, anyway). This would also help with fixing invalid XML declarations/<svg> tags (librsvg stopped rendering files without an xmlns about 7 years ago but there are still some files which don't have an xmlns yet), and possibly other librsvg/Inkscape/etc. formatting and rendering artefacts if someone is aware of how to fix those things. Jc86035 (talk) 14:39, 13 January 2018 (UTC)
@Incnis Mrsi, Jc86035: I downloaded the first 4 KB (see #SVGs above) and this is a rabbit hole:
  1. SVG with embedded proprietary(?) font
  2. The thumbnailer does font substitution (see meta:SVG fonts), every font-family should end with a generic-family (e.g. serif or sans-serif), but some don't and the thumbnail uses a sans-serif, but when downloaded it displays a serif typeface.
Some interesting font names I spotted:
  • Arial-BoldMT
  • DejaVuSerif-BoldItalic
  • LiberationSerif-Italic
  • MyriadPro-Light
  • MyriadPro-Regular
  • MyriadPro-Semibold
  • MyriadPro-Bold
  • MyriadPro-BoldIt
  • SourceHanSerifTC-SemiBold-B5pc-H
  • Ubuntu Bold
  • Verdana-Bold
In my dump there are 343 SVGs using typefaces ending with -Bold (specifically regex: font-family\s*:[^{};<>]*-(Bold)["\'\s]*[,;}]). Dispenser (talk) 04:05, 5 February 2018 (UTC)
@Dispenser: not a surprise to me that Commons SVGs are full of Arial. Incnis Mrsi (talk) 06:57, 5 February 2018 (UTC)

Create redirects for dates in Arabic[edit]

Hi! Is it possible to create automatic redirects for dates in Arabic. I mean for example 1 يناير to January 1. So that we can avoid redlinks in descriptions and summaries in Arabic. Especially for moved files. The list of dates can be found here [1]. --Helmoony (talk) 04:45, 26 January 2018 (UTC)

Location at the top of world => "90° 00′ 00″ N"[edit]

Hello, could you removed the template location when the location is "90° 00′ 00″ N".
ie {{Location|90| or {{Location dec|90.
It's never appropriate, see : search "90° 00′ 00″ N". - Drongou (talk) 13:33, 5 February 2018 (UTC)

@Drongou: What were the uploaders of those 611 files thinking? OTOH, do we really want to discourage intrepid explorers from posting their photos from the poles?   — Jeff G. ツ please ping or talk to me 03:06, 6 February 2018 (UTC)
@Drongou: It's correct for File:Object_rotate_right.png! After turning 270° it has turned 90° from where it was. ;-)
@Jeff G.: These were uploaded with the upload wizard. When you enter the location in the wizard, it instantly warns you that "The longitude must be a number between -180 and 180." or "The latitude must be a number between -90 and 90." even before you've had a chance to enter the other value. So I suspect some users simply enter whatever the warning told them is the maximum. Of course, by that logic, some users would have entered the minimum the warning informed them about. And they have! - Alexis Jazz 05:01, 6 February 2018 (UTC)

Category:Set of Maps[edit]

I was wondering if someone could add {{Mechanical Curator image}} to the images in the descriptions. Some already have it so, can the url be added to the template as well? . Thanks. Artix Kreiger 2 (talk) 22:34, 8 February 2018 (UTC)

Rename/Move files[edit]

The images in c:Category:Melodifestivalen 2018, Göteborg by Tarnic66 got the automatic numbering wrong, and should all be renamed Melodifestivalen 2018, Deltävling 2, Scandinavium, Göteborg, Name number.jpg. Is that possible to do by bot? Thanks in advance. /Haxpett (talk) 08:37, 10 February 2018 (UTC)

Done by AndreCostaWMSE-bot, T187500. /Haxpett (talk) 08:56, 17 February 2018 (UTC)
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment. Haxpett (talk) 08:56, 17 February 2018 (UTC)

Category:Frederik Willem Zürcher[edit]

Hi, anybody around being able to add the files "Author F.W. Zürcher" to the Category in one go? I did a couple of them, but 92 to is time consuming. Thank you for your time. :) Lotje (talk) 15:04, 13 February 2018 (UTC)

Please have a look at: Help:Gadget-Cat-a-lot. --Schlurcher (talk) 18:02, 13 February 2018 (UTC)
@Lotje: ✓ Done with Cat-a-lot.   — Jeff G. ツ please ping or talk to me 14:11, 17 February 2018 (UTC)
Checkmark This section is resolved and can be archived. If you disagree, replace this template with your comment.   — Jeff G. ツ please ping or talk to me 14:08, 17 February 2018 (UTC)