User talk:Zhuyifei1999

From Wikimedia Commons, the free media repository
Jump to: navigation, search


This is a Wikimedia Commons user talk page.

This is not an article, file or the talk page of an article or file. If you find this page on any site other than the Wikimedia Commons you are viewing a mirror site. Be aware that the page may be outdated and that the user to whom this talk page belongs may have no personal affiliation with any site other than the Wikimedia Commons itself. The original page is located at

This is the user talk page of Zhuyifei1999, where you can send messages and comments to Zhuyifei1999.
  • Please sign and date your entries by clicking on the appropriate button or by typing four tildes (~~~~) at the end.
  • Put new text under old text.
  • New to Wikimedia Commons? Welcome! Ask questions, get answers as soon as possible.
  • Click here to start a new topic.

čeština | Deutsch | Deutsch (Sie-Form)‎ | English | español | français | italiano | 한국어 | മലയാളം | português | русский | +/−

  • Be polite.
  • Be friendly.
  • Assume good faith.
  • No personal attacks.

Tech News: 2015-51[edit]

17:42, 14 December 2015 (UTC)

YiFeiBot adding Category:Pages using Information template with parsing errors[edit]

Hey there User:Zhuyifei1999,

I was looking around and found that User:YiFeiBot seems to be adding Category:Pages using Information template with parsing errors to files even if the InfoBox was being correctly formatted. Examples: File:Hgr-106.jpg, File:WikiDSC_9311CHoeltschl.jpg and File:Turnierhelme_im_Burgmuseum_Meersburg.jpg.

I wanted to check if this was a bug in the bot or is there some reason for it doing so that I'm unaware of. I've currently removed the category, as he category description says "File description pages with parsing errors in Template:Information resulting in non-rendered template" which didn't seem to be correct for these files. Let me know, and I'll revert the changes appropriately.AbdealiJK (talk) 01:47, 4 May 2016 (UTC)

Thanks. They are false positives. While my bot does two null edits and 3 (IIRC) checks before actually adding the category, false positive rate is difficult to reduce as MediaWiki page information updates are done in job queue and not real time. --Zhuyifei1999 (talk) 02:28, 4 May 2016 (UTC)
Maybe we could also check Category:Pages using Information template with parsing errors for files transcluding {{Information}} or {{Infobox template tag}} and automatically remove them from the library on a daily basis.
@Jarekt: Hmm. How much time do you suggest to have between the time of the category addition run and the category removal run? Or maybe it should run several times a day? --Zhuyifei1999 (talk) 10:44, 4 May 2016 (UTC)
I was thinking about processing the previous day just before the new run, so I guess 24 hours, but if we can do it more often that would be better. We could also add similar category removal runs for "no license" categories. We seem to have occasionally people helping with those categories by tagging files with "no license" templates, but often without removing the files from the categories. --Jarekt (talk) 12:02, 4 May 2016 (UTC)
@Jarekt: While 24 hours would give enough time for any page informations in the job queue to finish, I'm afraid some false positives would get tagged with "no source" or "no license" within the 24 hours, failing the purpose of reducing false positives. (Oh I can't run it more than like 6 times a day, as the query is quite expensive and running too often might "annoy" jynus) --Zhuyifei1999 (talk) 12:29, 4 May 2016 (UTC)
Two thoughts. I do hope that false positives would not get tagged with "no source" or "no license". Right now I am the one mostly tagging files with "no license" and fixing files in Category:Pages using Information template with parsing errors (I empty those categories on most days), and I make sure to check them all individually. The purpose of removal runs would be to keep number of false positives and fixed-but-not-removed files small, on the days nobody empties the categories. Removal run queries should not be expensive as they only look at files in their category. --Jarekt (talk) 15:55, 4 May 2016 (UTC)
Good point. I'll code it next week, and run 6 times a day --Zhuyifei1999 (talk) 08:46, 5 May 2016 (UTC)
great --Jarekt (talk) 20:55, 5 May 2016 (UTC)
@Jarekt: Does this query look good for undoing the licensing task? I'll finish the two scripts this weekend --Zhuyifei1999 (talk) 12:11, 13 May 2016 (UTC)
I will have to test the query with some files in the categories. The query should closely match your addition query, or otherwise we will end up with a lot of categorize/uncategorize cycles. You should use {[tl|Deletion template tag}} which should simplify the query a lot. Also in untested quarry:query/9706 I was trying to narrow down the subquery. In your version it would returned ids of all the files on Commons and that would be expensive. in my version it would only return files in the 2 categories that need to be removed. --Jarekt (talk) 15:35, 13 May 2016 (UTC)
Opps, quarry:query/9706 did not saved my changes. I will try to recreate it from memory. --Jarekt (talk) 01:54, 15 May 2016 (UTC)
Now quarry:query/9706 returns files that can be removed from Category:New_uploads_without_a_license --Jarekt (talk) 04:14, 15 May 2016 (UTC)
✓ Done added to crontab and now running 6 times a day. One hit on first run. --Zhuyifei1999 (talk) 11:27, 16 May 2016 (UTC)

Tech News: 2016-20[edit]

16:01, 16 May 2016 (UTC)

Wrong files in Category:Pages using Information template with parsing errors[edit]

Hallo Zhuyifei1999, there are a lot of files (such as File:Aerial photographs of Florida MM00034964x (9409340769).jpg or File:De-Abmachung.ogg), that are wronly contained in the mentioned category. Is there a way to clean that up? --Arnd (talk) 08:05, 18 May 2016 (UTC)

I got my bot on reverting those edits for licensing ones last weekend. I'll try to get the same functions for this category and the missing information category this weekend --Zhuyifei1999 (talk) 09:00, 18 May 2016 (UTC)

Tech News: 2016-21[edit]

18:40, 23 May 2016 (UTC)