Commons:Village pump
This page is used for discussions of the operations, technical issues, and policies of Wikimedia Commons. Recent sections with no replies for 7 days and sections tagged with {{Section resolved|1=--~~~~}} may be archived; for old discussions, see the archives; the latest archive is Commons:Village pump/Archive/2024/10. Please note:
Purposes which do not meet the scope of this page:
Search archives: |
Legend |
---|
|
|
|
|
|
Manual settings |
When exceptions occur, please check the setting first. |
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 1 day and sections whose most recent comment is older than 7 days. | |
September 11
Proposal: AI generated images must be clear they're AI in the file name
Now these are being used for a good purpose (reporting on the misuse of AI at en:Wikipedia:Signpost) so please don't just nominate them for deletion, but look at these filenames:
File:Amoeba moving.jpg File:Leukocytes.jpg
Now I've moved them to File:AI genetated image of... - but that's just actively setting people up to use semi-believable illustrations that have no scientific accuracy, and then making it relatively hard to catch what happened.
Should we be somewhat stricter about filenames for AI? There's cases where I think it matters less than these, but the capacity to mislead is high. Adam Cuerden (talk) 10:01, 11 September 2024 (UTC)
- Oh, there's also File:Cancer cell.jpg, now File:AI generated image of a cancer cell.jpg which is not used anywhere, and looks ridiculously misleading. That's just the AI giving the cell its own tumor. Actually, maybe that should be in that article about misleading AI Adam Cuerden (talk) 10:05, 11 September 2024 (UTC)
- File names for AI generated images not indicating that's what they are is definitely an issue. There's no reason there shouldn't be some indication in the file name that an image is AI generated. I think it would be in alignment with the changes to guideline on how to name files that was passed recently to. Regardless, file names should be as descriptive as possible and I can't see why that information shouldn't be included in the file name. --Adamant1 (talk) 11:02, 11 September 2024 (UTC)
- An alternative would be to add a tag to the media without requiring files to be moved or named with that in the title from the start. Just like any NSFW image has an indicator for that on sites like reddit. It would be shown on all files in Category:AI-generated images either at the end of the file-title or e.g. within a corner of the thumbnail. I think adding that automatically would be better. However, when uploading the file using the Upload Wizard and checking made with AI one could also automatically append (AI-generated) or (made using AI) to the file-title. Prototyperspective (talk) 11:29, 11 September 2024 (UTC)
- Is it really misleading if people cant be arsed to even read the template? Trade (talk) 11:30, 11 September 2024 (UTC)
- The template only shows on the file page. And even there it doesn't look very different from other common license templates which people only interested in the content usually probably don't look at either and many files like the linked examples don't even have these templates. Prototyperspective (talk) 12:47, 11 September 2024 (UTC)
- Support The file name and caption are the first things you see when you select an image that's used on Wikipedia for better viewing, right before you click through to the file's own page. For most people it'll probably be the only information they'll see. This information is absolutely important enough that it should be mentioned in the file name. ReneeWrites (talk) 13:03, 11 September 2024 (UTC)
- Support per ReneeWrites. I agree that having AI generated marked in the file name will give Wikipedia users much more transparency on the provenance of files. William Graham (talk) 15:36, 11 September 2024 (UTC)
- Oppose. Nice idea, but I prefer templates that can be translated and add properties. Latin letters in a filename are not a good clue in other scripts. We may have endless rename requests. File naming hacks are also not systematic; we do not routinely encode other properties in filenames. Glrx (talk) 15:47, 11 September 2024 (UTC)
- At the same time, an actively misleading filename is a problem. They are not leucocytes (for example. They don't even look much like cells. AI is very good at creating images that look like they're plausible depictions but really aren't, they just ape the - for lack of a better word - art style of real scientific illustrations, coloured electron microscope depictions, and so on. Adam Cuerden (talk) 16:05, 11 September 2024 (UTC)
- we do not routinely encode other properties in filenames - We routinely use naming systems like "Flag of [Country]" for other types of files. Using filenames to make important disclosures about the origin of files isn't a huge leap. Omphalographer (talk) 19:11, 11 September 2024 (UTC)
- @Omphalographer, Well said. -- Ooligan (talk) 17:11, 12 September 2024 (UTC)
- Support Update with caveat: my support is for this idea in principle, with the understanding that we would need an additional discussion about implementation to cover things like wording. — Rhododendrites talk | 12:32, 12 September 2024 (UTC) This is a good idea, and in line with the spirit of many off-wiki policies proposed for AI content. It also doesn't preclude a template. The question, though, is what label/language should be used. It would need to be something someone wouldn't choose accidentally for a non-AI image. Also, documentation for this rule would need to be clear that we're talking about media that is produced through generative AI models (as opposed to, say, a scientific visualization in which machine learning was used somewhere in the process). — Rhododendrites talk | 16:10, 11 September 2024 (UTC)
- I wouldn't be too concerned about language necessarily. If the filename is not in a language people speak, they're much more likely to check the decription. We don't need a perfect solution, just an improvement. Adam Cuerden (talk) 16:14, 11 September 2024 (UTC)
- Comment - This is a good idea, but it needs refinement. Besides Rhododendrites's caveat's above, I think it should only apply to images which depict something in a realistic manner. There's not much point requiring this for something like File:Portrait of a Unicorn.png. Otherwise, I would only support it as a recommendation, not a requirement. Nosferattus (talk) 18:21, 11 September 2024 (UTC)
- What harm would it be to apply it to that image, to make clear (a) this is not a human-made illustration, (b) this is not a Lisa Frank image, etc.? — Rhododendrites talk | 18:33, 11 September 2024 (UTC)
- We shouldn't be mass changing file titles without a compelling reason. It just generally causes chaos. I think files being factually misleading is a compelling reason, but otherwise, I'm not convinced. Nosferattus (talk) 16:57, 15 September 2024 (UTC)
- I'd say there's a class of images where it matters less. But a human-made illustration probably has some effort to get key aspects, whatever those might be. AI just tries to get something that looks like other images with similar key words, and might miss out important bits that a human wouldn't. Honestly, as a general rule, the higher the likelihood it'd be used on Wikipedia, the more that's an issue. Adam Cuerden (talk) 19:34, 11 September 2024 (UTC)
- Agreed on Nosferattus' conditional support: images that can be mistaken for something else, should be marked, and the filename is the most obvious place to do so. By the way, this also applies to photoshop fabrications of "real life flower elfs" etc. And from a filemover perspective: We are supposed to only rename files that are realistically going to be kept. Is there even a rationale to keep misleading non-scientific AI illustrations? I mean, beyond illustrating how you can't trust AI illustrations? --Enyavar (talk) 00:01, 13 September 2024 (UTC)
- Aye. These couple are useful to illustrate the problem, but we certainly don't need more. Adam Cuerden (talk) 15:22, 14 September 2024 (UTC)
- Oppose for now, unless proposal is substantially modified to address concerns above. Nosferattus (talk) 16:53, 15 September 2024 (UTC)
- What harm would it be to apply it to that image, to make clear (a) this is not a human-made illustration, (b) this is not a Lisa Frank image, etc.? — Rhododendrites talk | 18:33, 11 September 2024 (UTC)
- Support, as we should use any (and all) means to achieve maximum transparency for re-users about the non-authenticity of AI-generated images. --Túrelio (talk) 18:27, 11 September 2024 (UTC)
- Support: I don't see any downside. One remark, though: like everything else on Commons, this should not be restricted to English, and I don't imagine I would recognize something if it were marked in Chinese as AI. How do we intend to deal with the multilingual aspect of this? - Jmabel ! talk 19:32, 11 September 2024 (UTC)
- As I said above, I think that perfection isn't needed. If it's labelled in Chinese, as long as the whole filename is in Chinese, Anglosphere people will presumably go to the description. They might not for one that has a plausible English filename. Adam Cuerden (talk) 19:37, 11 September 2024 (UTC)
- Yep. In a Czech file name, the warning should be in Czech, and in a Japanese file name, in Japanese - tailored to the native languages these images are likely to get used for. And if I'm that determined to use a cool image with Tamil filename in the German WP, I the user must make sure to understand the filename and description. (GTranslate exists.) --Enyavar (talk) 00:10, 13 September 2024 (UTC)
- As I said above, I think that perfection isn't needed. If it's labelled in Chinese, as long as the whole filename is in Chinese, Anglosphere people will presumably go to the description. They might not for one that has a plausible English filename. Adam Cuerden (talk) 19:37, 11 September 2024 (UTC)
- Support: Let´s do it. Transparency first. Alexpl (talk) 20:11, 11 September 2024 (UTC)
- Support as a general idea. Would this extend to AI-upscaled images, which can get very strange at the deep end? Belbury (talk) 20:39, 11 September 2024 (UTC)
- How do I unsee that image?! Omphalographer (talk) 22:58, 11 September 2024 (UTC)
- Extremely disturbing heh Bedivere (talk) 04:49, 12 September 2024 (UTC)
- How do I unsee that image?! Omphalographer (talk) 22:58, 11 September 2024 (UTC)
- Comment from a filemover. If you want to apply this requirement to files after upload, you should amend Commons:File renaming to make it clear that lacking a statement of AI-generation in the filename is good cause for renaming. Either by adding a new numbered criterion or by finding a way to shoehorn it into an existing one (2 or 3, I'd guess). --bjh21 (talk) 21:01, 11 September 2024 (UTC)
- @Bjh21: You could argue, in clear cases like the ones I mentioned earlier, it's already covered by 2, since they aren't actually pictures of (say) leukocyctes, but I agree that adding an example would help. Adam Cuerden (talk) 20:40, 12 September 2024 (UTC)
- @Adam Cuerden: I think you mean 3 (obvious error), and I agree that would cover clear cases like those. But there are other cases that I don't think would be covered, like File:White generic hatchback.png or File:Wikimedia LGBT+ graphic illustration 1.png. --bjh21 (talk) 21:48, 12 September 2024 (UTC)
- It'd be nice to expand criterion 2 to allow adding information about the non-factual nature of an image in general (e.g. AI generated images, simulations, reenactments, historical reconstructions, artistic representations, etc). Omphalographer (talk) 22:40, 13 September 2024 (UTC)
- Aye. Certainly in the spirit of, but explicitly permitted never hurt. Adam Cuerden (talk) 15:23, 14 September 2024 (UTC)
- @Bjh21: You could argue, in clear cases like the ones I mentioned earlier, it's already covered by 2, since they aren't actually pictures of (say) leukocyctes, but I agree that adding an example would help. Adam Cuerden (talk) 20:40, 12 September 2024 (UTC)
- I believe that ALL substantially manipulated or fabricated images, that might be misunderstood as "real", should be very clearly marked. It shouldn't depend only on whether someone decides or rules that the fabrication method was "AI". ITookSomePhotos (talk) 21:37, 11 September 2024 (UTC)
- Fair, but this is an easy "bright line" case. We can easily spot AI by tags and upload tools, Adam Cuerden (talk) 15:23, 14 September 2024 (UTC)
- Support, probably difficult to enforce though given the backlogs of other bad file names needing renaming (screenshot, whatsapp, etc). Gnomingstuff (talk) 22:28, 11 September 2024 (UTC)
- Support AI generated images must have "AI" in the file name as a principle, perhaps even better would be "AI generated", which is more clear. And always at least in Latin letters. Yes, the backlog might be a problem, but we can start now for new uploads. --JopkeB (talk) 05:12, 12 September 2024 (UTC)
- What do people think about when uploading the file using the Upload Wizard and checking made with AI one could also automatically append (AI-generated) or (made using AI) to the file-title? (No replies on that above or on the idea of a tag displayed dynamically next to the file-title and in the thumbnail.) I think doing something automatically and in a standardized way would be better than just requiring this which many uploaders will not follow up on. Prototyperspective (talk) 10:49, 12 September 2024 (UTC)
- Given that the Upload Wizard already asks about AI tools, I think it would be appropriate for it to ensure that uploads using them follow whatever policy arises from this discussion. --bjh21 (talk) 17:26, 12 September 2024 (UTC)
- What do people think about when uploading the file using the Upload Wizard and checking made with AI one could also automatically append (AI-generated) or (made using AI) to the file-title? (No replies on that above or on the idea of a tag displayed dynamically next to the file-title and in the thumbnail.) I think doing something automatically and in a standardized way would be better than just requiring this which many uploaders will not follow up on. Prototyperspective (talk) 10:49, 12 September 2024 (UTC)
- Support All AI images must use this required prefix phrase: "AI generated image of..." -- Ooligan (talk) 05:58, 12 September 2024 (UTC)
- Support. on top of this, if we need to rename these files, i suggest requiring the new name to begin with " «AI generated» " or " ~AI generated ". this will make them appear behind all ascii letters when sorted alphabetically. RoyZuo (talk) 13:58, 12 September 2024 (UTC)
- If you want to change where something sorts, I think it's better to do it using
{{DEFAULTSORT}}
rather than by requiring a particular pattern in the filename. --bjh21 (talk) 15:46, 12 September 2024 (UTC)- @Bjh21, Can we do both? -- Ooligan (talk) 16:47, 12 September 2024 (UTC)
- @Ooligan: I can't see why you would want to, but you certainly can. --bjh21 (talk) 17:23, 12 September 2024 (UTC)
- this is just an idea that can be done with no extra cost, when the file will be renamed anyway. RoyZuo (talk) 20:50, 12 September 2024 (UTC)
- Probably doable through the AI templates. Something like {{DEFAULTSORT:«{{BASEPAGENAME}}}} Adam Cuerden (talk) 21:05, 12 September 2024 (UTC)
- @Ooligan: I can't see why you would want to, but you certainly can. --bjh21 (talk) 17:23, 12 September 2024 (UTC)
- @Bjh21, Can we do both? -- Ooligan (talk) 16:47, 12 September 2024 (UTC)
- Please no. This makes file names unnecessarily difficult to type - most keyboards don't have
«»
keys, and~
is difficult to find on many mobile devices. The goal is to label these files, not to make them difficult to use. Omphalographer (talk) 22:53, 12 September 2024 (UTC)- Agreed. DEFAULTSORT is the better solution for de-prioritizing AI images. ReneeWrites (talk) 07:23, 13 September 2024 (UTC)
- +3. Adding special characters to file names should be banned. --Adamant1 (talk) 07:42, 13 September 2024 (UTC)
- It really depends on the meaning of "Special", lest we ban, say, Korean file names, or accents. We have French filenames with French-style quotes in them, and we shouldn't change those. At the same time, we have default sort; let's not make it a policy to name AI images File:💩AI generated💩 Foobar.jpg Adam Cuerden (talk) 10:03, 13 September 2024 (UTC)
- That's not really what I'm talking about. I don't think arbitrary putting brackets in file names is useful though. Maybe circle brackets, but
«»
or~
. If for no other reason then most keyboards don't have them to begin with. I'm also super annoyed by file names with emojis them though. They should 100% be banned. I'd be totally fine with requiring people put (AI generated image) at the end of a file name though. --Adamant1 (talk) 10:50, 13 September 2024 (UTC)- Aye, merely trying to avoid bad policy coming out of this. Should we append characters at the start of filenames to deprioritise them? No. That's a job for {{DEFAULTSORT}}. But «» are the standard quotes used in French, so we shouldn't ban their use, lest we require bad French. I'm a little bit of a stickler for trying to avoid policy for one situation that screws up other situations. Adam Cuerden (talk) 15:38, 14 September 2024 (UTC)
- That's not really what I'm talking about. I don't think arbitrary putting brackets in file names is useful though. Maybe circle brackets, but
- It really depends on the meaning of "Special", lest we ban, say, Korean file names, or accents. We have French filenames with French-style quotes in them, and we shouldn't change those. At the same time, we have default sort; let's not make it a policy to name AI images File:💩AI generated💩 Foobar.jpg Adam Cuerden (talk) 10:03, 13 September 2024 (UTC)
- Keep it simple. Just put "AI" infront of the filename. Those who want to know more can check the summary / category of a file for details. Alexpl (talk) 11:06, 13 September 2024 (UTC)
- Please rather put it at the end of the filename. Moreover, "AI" is ambiguous and also included in many other files, so again I'd suggest (AI-generated) or (made using AI) and this could be appended to the initial file-titles automatically in the Upload Wizard. Prototyperspective (talk) 11:24, 13 September 2024 (UTC)
- the thing is, if you prepend filenames with A, it's counterproductive to your aim (discouraging use of ai files as illustrations) because then all the ai files will occupy the front rows in categories (unless you add defaultsort of a super "late" unicode to the ai template). RoyZuo (talk) 18:34, 16 September 2024 (UTC)
- If you want to change where something sorts, I think it's better to do it using
- Oppose Use a template, similar to {{Retouched}}. Note that we already have non-AI files with names beginning "Ai ". Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:17, 13 September 2024 (UTC)
- The examples people have brought up in support or as examples in this discussion include "AI generated", not just AI. Edit: Except for Alexpl's comment. ReneeWrites (talk) 14:22, 13 September 2024 (UTC)
- I'm not clear how this is a response to my comment. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 17:47, 23 September 2024 (UTC)
- The examples people have brought up in support or as examples in this discussion include "AI generated", not just AI. Edit: Except for Alexpl's comment. ReneeWrites (talk) 14:22, 13 September 2024 (UTC)
- Support When images are used internally editors know to check the filepage, when outside people reuse the images they are usually not aware of the info on the files page. We have had people reusing images without complying to the terms of the license by naming the creator. I would like to see (AI generated) the way we use "(cropped)" in file names. --RAN (talk) 18:06, 13 September 2024 (UTC)
- Ooh, that's a good point - if a user saves an image from Commons (or another wiki) to their computer, for instance, there's a decent chance that its file name will remain intact as a warning. A template on the file's talk page certainly won't. Omphalographer (talk) 18:13, 13 September 2024 (UTC)
- Yes, I agree. @Richard Arthur Norton (1958- ) & @Omphalographer -- Ooligan (talk) 02:57, 14 September 2024 (UTC)
- Strong oppose no no no, file names shouldn't be used for that. This opens up Pandora's box for controversial subjects. This should be in the file description, not in the name. Multichill (talk) 12:53, 15 September 2024 (UTC)
- Oppose i agree with Multichill, there is no need to make people suffer for not adding "transparency". description is enough. modern_primat ඞඞඞ ----TALK 18:27, 11 October 2024 (UTC)
- Comment Why do we even accept AI generated images to begin with? Most of them are misleading, useless for Wikipedia articles, fake-y look as standalone content, and can be barely trusted. AI images should only be limited to very specific scenarios, otherwise we end up with a bunch of superheros holding the Commons logo, which we can all agree is largely a set of very interesting trademark violations and not consistent with community practices. Moreover, there have been recently a number of court cases around copyright infringement for several of these AI companies, so I'm concerned that we can't distinguish the provenance from different models that may or may not be trained on infringing datasets. We have no idea how this is going to be regulated.
- Scann (talk) 00:12, 17 September 2024 (UTC)
- No thing such as "infringing datasets" exist Trade (talk) 20:42, 20 September 2024 (UTC)
- Support Helps a lot with the identification of the file as AI and not the real thing, it's immediate.-- Darwin Ahoy! 10:45, 18 September 2024 (UTC)
- Comment I generally think templates and/or structured data are a better way to do this, and more multilingual-friendly. But I'm not opposed to also requiring it in file names if the community agrees on that (and I will anyway include in the filename of any AI files I upload myself). Noting that I have made a related proposal below: #Proposal: de-prioritise AI images in search the wub "?!" 15:09, 19 September 2024 (UTC)
- Support this would prevent problems with AI images being mistaken for real ones, and I really don’t see how this could potentially be harmful. Dronebogus (talk) 11:13, 24 September 2024 (UTC)
- Support Helpful in identifying AI images. --A1Cafel (talk) 14:00, 25 September 2024 (UTC)
- Support it is misleading when the filename implies a photo or similar Immanuelle ❤️💚💙 (please tag me) 14:05, 25 September 2024 (UTC)
- Oppose File names should not be used like this; the proper way is to use multilingual templates. Thuresson (talk) 20:52, 4 October 2024 (UTC)
- templates are good too. modern_primat ඞඞඞ ----TALK 18:28, 11 October 2024 (UTC)
Working out changes
I'm sensing pretty widespread support, so let's plan out what would need changed:
- Commons:File renaming: #2 gains "To identify AI generated works" with a possible more general version of "to point out major manipulations" (colourization, etc). This is explicitly allowed to be in any language.
- 1b. Also Commons:File naming, under "Clear".
- Commons:AI-generated media notes that the AI-generation must be mentioned in the filename, ideally in the same language as the rest of the filename.
- File upload wizard appends "AI generated" if the AI creation option is ticked, with the option to change this after, but with a note saying that identifying AI art in filenames is important. Alternatively, this can just be a soft prompt, that suggests a new filename, but doesn't require. (Similar to others where you can click "ignore and upload file anyway)
- Possibly, {{PD-algorithm}} and similar can be edited to add a {{DEFAULTSORT}} to move AI works lower in categories.
Have I missed anything, and anyone have suggestions? Adam Cuerden (talk) 19:55, 14 September 2024 (UTC)
- I think #4 should either not be implemented or be for files in Category:AI misgeneration. Images shouldn't be sorted by how they were produced but by by where the user is expecting to find them / looking for them or generally the relevance and quality of the image as it relates to the category concept, not the method/techniques used to produce it. You may have missed an addition to Commons:File naming. Prototyperspective (talk) 20:40, 14 September 2024 (UTC)
- Added File naming, and you're probably right about #4. Wanted to pull all the suggestions made, but that may be too much (if nothing else, AI image categories wouldn't do the headers for first letter of filename). Adam Cuerden (talk) 11:44, 15 September 2024 (UTC)
- #3 and #4 are terrible ideas. #3 will cause uploader confusion, filename conflicts, language issues, etc. This needs to be done by humans, not machines. #4 will also be confusing as no one will expect {{PD-algorithm}} to mess with the sorting. Plus it's just unneeded and potentially unhelpful, as there may be other reasons an AI-generated file needs to be sorted in a particular way. Nosferattus (talk) 17:05, 15 September 2024 (UTC)
- I don't think they're terrible ideas, but I don't think these two are needed. We're not inundated with such a flood of AI-generated images being uploaded to Commons that these couldn't be done by hand, and a lot more people have filemover rights than admin rights, so this wouldn't add to the backlog of issues needing admin attention. ReneeWrites (talk) 09:46, 16 September 2024 (UTC)
- Okay. Then let's focus on points 1, 1b, and 2. For 1b, I'm thinking (under "Clear")
- "Where an image, either through method of creation or modifications, might mislead, this should be noted in the filename. This includes AI generation, colourization of a photograph, turning a sepia image black and white, upscaling an image, and other things that might not be immediately obvious. Simple, minor fixes do not need to be noted."
- Too much? Adam Cuerden (talk) 10:56, 16 September 2024 (UTC)
- This discussion is about AI, and I think we should stick with that. I've seen way too many discussions get killed the moment they gain any traction because people keep attaching stuff to it that is tangentially related that no consensus was reached on. ReneeWrites (talk) 15:22, 16 September 2024 (UTC)
- Okay. Let's get this implemented, and any further additions can be discussed on the talk pages after? Adam Cuerden (talk) 17:41, 16 September 2024 (UTC)
- This discussion is about AI, and I think we should stick with that. I've seen way too many discussions get killed the moment they gain any traction because people keep attaching stuff to it that is tangentially related that no consensus was reached on. ReneeWrites (talk) 15:22, 16 September 2024 (UTC)
- Okay. Then let's focus on points 1, 1b, and 2. For 1b, I'm thinking (under "Clear")
- I don't think they're terrible ideas, but I don't think these two are needed. We're not inundated with such a flood of AI-generated images being uploaded to Commons that these couldn't be done by hand, and a lot more people have filemover rights than admin rights, so this wouldn't add to the backlog of issues needing admin attention. ReneeWrites (talk) 09:46, 16 September 2024 (UTC)
- 1a: Support "To identify AI generated works" sounds good to me.
- 1b: Comment I think it would be more at home under "Descriptive", specifically the subheader "Correct". There's nothing particularly unclear about the filename "Cancer cell.jpg", but it leaves out a lot of pretty crucial context that makes it pretty misleading.
- I'd like to propose this change: Correct – The name should describe the file's content and convey what the subject is actually called. Inaccurate names for the file subject, although they may be common, should be avoided. The title given to a work of art by the artist that created it is considered appropriate, even if the name has nothing to do with what is depicted (for example, many works of Dadaism). The name should also be free of obvious errors, such as misspelled proper nouns, incorrect dates, and misidentified objects or organisms. Users are allowed to upload "unidentified" or "unknown" organisms but such files may be renamed upon identification. AI-generated images must disclose this fact in the file name.
- It's tempting to include a bit on the rationale as to why, but none of the other examples have that either, they simply state what is policy. So I think addressing this with just one line that's clear and unambiguous is both pragmatic and in line with how the rest is written. ReneeWrites (talk) 15:50, 16 September 2024 (UTC)
- Sounds good. Adam Cuerden (talk) 17:42, 16 September 2024 (UTC)
- Made a slight adjustment to the wording. A lot of the file names on Commons made with Dall-E or Midjourney have that in their file name, which should also cover this base. ReneeWrites (talk) 08:42, 17 September 2024 (UTC)
- Could maybe move it a sentence later to keep the talk about organisms together. Adam Cuerden (talk) 09:43, 17 September 2024 (UTC)
- Good idea, I moved the sentence. ReneeWrites (talk) 11:17, 17 September 2024 (UTC)
- Okay. If no-one else has suggestions after a couple days, let's bring 1, 1b (with your text), and 2 together, ping everyone involved in the original discussion, and implement. Secondary ideas can be considered after that. Adam Cuerden (talk) 17:21, 17 September 2024 (UTC)
- Good idea, I moved the sentence. ReneeWrites (talk) 11:17, 17 September 2024 (UTC)
- Could maybe move it a sentence later to keep the talk about organisms together. Adam Cuerden (talk) 09:43, 17 September 2024 (UTC)
- Made a slight adjustment to the wording. A lot of the file names on Commons made with Dall-E or Midjourney have that in their file name, which should also cover this base. ReneeWrites (talk) 08:42, 17 September 2024 (UTC)
- Sounds good. Adam Cuerden (talk) 17:42, 16 September 2024 (UTC)
- Support all, but this should have been at COM:VPP. — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 14:19, 21 September 2024 (UTC)
- May be worth a clear call on whether AI upscaled photos should fall under "AI generated" for all this, given their similar potential for being misleading when the viewer doesn't realise that an AI was involved (eg. File:2Pac Passport (cropped).jpg, where one Wikipedia editor was pleased to find what they described as a "free-use authentic high quality photograph" of the subject on Commons, but no, it's just an upscale of an old and extremely low quality passport photo). Belbury (talk) 12:58, 25 September 2024 (UTC)
- It's not AI-generated, it's AI-upscaled which is very different. It needs separate templates and categories which also warn the user about issues like potential inaccuracies. Prototyperspective (talk) 13:35, 25 September 2024 (UTC)
- It's different, but if there's going to be a policy change on naming and negative-boosting AI content, we should be clear whether that also applies to AI upscaling or whether it doesn't apply to it at all. Some users already (very understandably) tick "I generated this work using an artificial intelligence tool" when uploading an AI-upscaled image, causing it to be incorrectly filed as {{PD-algorithm}} with no human authorship. Belbury (talk) 14:17, 25 September 2024 (UTC)
- I think when this is checked the Upload Wizard should show another checkbox about whether img2img (an input image) was used or whether upscaling was used. If the former is checked, the user should enter some url to the input image(s). If the latter, it would add the template for AI upscaled image. Prototyperspective (talk) 14:21, 25 September 2024 (UTC)
- It's different, but if there's going to be a policy change on naming and negative-boosting AI content, we should be clear whether that also applies to AI upscaling or whether it doesn't apply to it at all. Some users already (very understandably) tick "I generated this work using an artificial intelligence tool" when uploading an AI-upscaled image, causing it to be incorrectly filed as {{PD-algorithm}} with no human authorship. Belbury (talk) 14:17, 25 September 2024 (UTC)
- It's not AI-generated, it's AI-upscaled which is very different. It needs separate templates and categories which also warn the user about issues like potential inaccuracies. Prototyperspective (talk) 13:35, 25 September 2024 (UTC)
Upload wizards capacity
I note in this AI discussion that the upload wizard asks the question of is it ai generated, so its possible that appending to file names or adding a template to identify AI generated media could be relatively easy to do automatically at upload. with a high degreee of consistance Gnangarra 01:06, 17 September 2024 (UTC)
- This was mentioned a couple of times, but there is currently not enough support (or opposition) to reach a consensus on this. ReneeWrites (talk) 09:42, 17 September 2024 (UTC)
- I'd say that's probably something to bring up after the policy changes go through. Though I am surprised a template isn't already auto-added. Adam Cuerden (talk) 17:20, 17 September 2024 (UTC)
- {{PD-algorithm}} is already added to any upload where this box is ticked, in addition to the licence template specified by the uploader.
- It's worth remembering that some users tick this box in error, fairly regularly. Any additional effects of ticking it will require additional steps of cleanup in that minority of cases. Belbury (talk) 13:46, 19 September 2024 (UTC)
- I'd say that's probably something to bring up after the policy changes go through. Though I am surprised a template isn't already auto-added. Adam Cuerden (talk) 17:20, 17 September 2024 (UTC)
- Support both suggestions, specially the automated AI template placement which should already be there. Darwin Ahoy! 10:47, 18 September 2024 (UTC)
- Pinging @Sannita (WMF) Darwin Ahoy! 13:30, 19 September 2024 (UTC)
- @DarwIn I already relayed the idea of adding automatically a template, if I remember correctly something is already added. Anyway, if there is community consensus to add a(nother) specific template, I can relay this too and discuss it with the team. It's going to take some time anyway. Sannita (WMF) (talk) 13:47, 19 September 2024 (UTC)
- Yes, {{PD-algorithm}} is already automatically added when the checkbox in UploadWizard is selected. I tested today by uploading this image. the wub "?!" 15:12, 19 September 2024 (UTC)
- @DarwIn I already relayed the idea of adding automatically a template, if I remember correctly something is already added. Anyway, if there is community consensus to add a(nother) specific template, I can relay this too and discuss it with the team. It's going to take some time anyway. Sannita (WMF) (talk) 13:47, 19 September 2024 (UTC)
- Pinging @Sannita (WMF) Darwin Ahoy! 13:30, 19 September 2024 (UTC)
- Is there also a way to insert "AI generated" into the metadata? --RAN (talk) 17:08, 18 September 2024 (UTC)
- The template exists for a reason Trade (talk) 20:41, 20 September 2024 (UTC)
- @Trade: The problem with {{PD-algorithm}} is that it's also used on images that have been modified by AI, not generated that way. Really there should be two separate templates depending on if the image is AI generated or just modified by AI. {{PD-algorithm}} doesn't really make sense in the later instance anyway. --Adamant1 (talk) 18:25, 2 October 2024 (UTC)
- The template exists for a reason Trade (talk) 20:41, 20 September 2024 (UTC)
AI enhancement/improvement/upscaling
I've started a discussion that is marginally related to this topic at Commons_talk:AI-generated_media#AI_enhancements/improvements/upscaling, which may or may not depend on this outcome. Bastique ☎ appelez-moi! 21:17, 4 October 2024 (UTC)
September 23
Hosting HDR images as JPEG with gain map
The tools for creating and displaying High Dynamic Range (HDR) images are starting to mature. HDR displays can render much brighter highlights than before, which leads to a big qualitative improvement in an image. Software for HDR production, and web-browser support, are becoming wide-spread. (Note that this is distinct from the tone-mapped HDR images you may have seen for the past decade or so.)
This post is partly a response to User:Hym3242 and User:PantheraLeo1359531 in Commons:Village pump/Archive/2024/08#Can I upload bt2020nc/bt2020/smpte2084(PQ) HDR AVIF images to commons and use them in wikipedia articles?. I was wondering the same thing, so I uploaded a couple files to see how well Commons would support them. They are formatted as JPEG with a gain map. The promise of this format is that it is backward-compatible with systems that process and serve standard JPEG. The base image is a JPEG, usable on any device. HDR information is inserted in the file as metadata. In the worst case HDR metadata is lost, resulting in a standard image. In the best case HDR metadata is preserved, the end-user has an HDR-capable display and web browser, and the image looks great.
My test results are at Category:HDR gain-mapped images. Both images survived the process of uploading and rendering previews. HDR metadata was stripped from preview images, but preserved in the original uploads. If you have a newish HDR screen and a compliant web browser, the originals of this house and this church will appear brighter than usual. The effect on the house is subtle, limited to where sunlight hits white paint. The effect on the church is more dramatic: the windows should appear much brighter than the rest of the interior.
Most users of Commons images will see one of the smaller standard files, so for now the benefits of publishing this sort of content are limited. Are there any downsides to publishing it on Commons?
This post isn't marked as a proposal, because hosting these images on Commons works already. At a later date, when the standards are settled and the hardware is widely available, it would be nice to preserve HDR metadata in the generated preview images. — Preceding unsigned comment added by Semiautonomous (talk • contribs) 23:51, 23 September 2024 (UTC)
September 26
How to change the text in a speedy deletion (GA1)?
I have changed the meaning of GA1 to "Gallery page without at least two images or other media files" in Commons:Criteria for speedy deletion, but when I use this code in a gallery, the old text is used in the deletion, see for example Giusto Le Court. How can I change that text as well? I think it is hidden somewhere, but I cannot find the right place. This question was posed at Commons talk:Criteria for speedy deletion#How to change the text in a deletion?, but there was no answer, so I hope I have more luck here. JopkeB (talk) 14:43, 26 September 2024 (UTC)
- It might be because the text hasn't been translated into other languages yet. It looks like the last update to other pages besides English was in May. That's the only thing I can think of though. --Adamant1 (talk) 16:04, 26 September 2024 (UTC)
- @Adamant1: Where can I find those texts to be translated? JopkeB (talk) 05:21, 2 October 2024 (UTC)
- @JopkeB: Template:Speedydelete/en, the Dutch subpage is here: Template:Speedydelete/nl. ReneeWrites (talk) 22:53, 6 October 2024 (UTC)
- Thanks @ReneeWrites: but I do not think this is what I mean. This text is before the gallery page has been deleted. I am looking for the text after a gallery page has been deleted with reason GA1. See Giusto Le Court JopkeB (talk) 15:05, 9 October 2024 (UTC)
- @JopkeB: that text is added by the admin who removed it. Sometimes they'll copy-paste the text you provided as the deletion reason and sometimes they put something else there. ReneeWrites (talk) 16:30, 9 October 2024 (UTC)
- So I cannot do anything about it? JopkeB (talk) 15:06, 10 October 2024 (UTC)
- Unfortunately not. ReneeWrites (talk) 21:22, 10 October 2024 (UTC)
- So I cannot do anything about it? JopkeB (talk) 15:06, 10 October 2024 (UTC)
- @JopkeB: that text is added by the admin who removed it. Sometimes they'll copy-paste the text you provided as the deletion reason and sometimes they put something else there. ReneeWrites (talk) 16:30, 9 October 2024 (UTC)
- Thanks @ReneeWrites: but I do not think this is what I mean. This text is before the gallery page has been deleted. I am looking for the text after a gallery page has been deleted with reason GA1. See Giusto Le Court JopkeB (talk) 15:05, 9 October 2024 (UTC)
- @JopkeB: Template:Speedydelete/en, the Dutch subpage is here: Template:Speedydelete/nl. ReneeWrites (talk) 22:53, 6 October 2024 (UTC)
- @Adamant1: Where can I find those texts to be translated? JopkeB (talk) 05:21, 2 October 2024 (UTC)
October 02
Intersection category of gender, occupation, nationality and decade of birth
There're these categories https://commons.wikimedia.org/w/index.php?search=intitle:%22born+in+the%22&sort=create_timestamp_asc e.g. Category:Actresses from the United States born in the 1990s. is it necessary?--RoyZuo (talk) 18:02, 2 October 2024 (UTC)
- Probably not. If I had my way there would just be flat lists or metacats of actors and actresses by name. Minus the whole "by birth" thing since it's totally pointless trivia. More so with "by decade of birth." I'd almost argue the same for "by nationality" to BTW. --Adamant1 (talk) 18:17, 2 October 2024 (UTC)
- It doesn't make it easier to find someone, it makes it more difficult, unless accompanied by a flat list that contains all the entries. If we made searching in Wikidata easier, we would not need these intersection lists. --RAN (talk) 23:22, 2 October 2024 (UTC)
- A commercial databank, wouldn't go to this level. We actually only need the Actors name here, Equity demands that names are unique, as it is.
- Other websites do it better. Wikipedia is where people are going to look for this kind of info, not here. I suppose computers search progs will use Wikidata. Wikidata gives us a detailed infox already.
- Main cats are flat cats, by definition. In this case the name of the actor. As an aside, I would be in favour of a rule that states we shouldn’t have more than 4 levels to any main cat. In this case the main cat being People. Broichmore (talk) 13:49, 4 October 2024 (UTC)
- 4 levels seems a little extreme but it's a good suggestion in general. There's some pretty cavernous category structures out there and most of this stuff is better off in infoboxes or otherwise stored on Wikidata's end anyway. With people specifically, most of the time we already know the person's gender from their first name. So it seems kind of pointless to have categories for it. Plus there's a risk of it getting really pointless and obscure depending on the situation. --Adamant1 (talk) 18:48, 4 October 2024 (UTC)
- Separate issue, but how do you feel about categories for other biographical metadata like Category:Births by location (e.g. Category:Births in Fairbanks, Alaska) or Category:Deaths by year? These, too, feel like situations where Commons categories are being misused as a sort of Wikidata-lite to describe people, rather than classifying media. For instance, the aforementioned "Births in Fairbanks, Alaska" contains categories of images of people who were born in Fairbanks, not images of people being born in Fairbanks. Omphalographer (talk) 22:59, 4 October 2024 (UTC)
- I've always thought it was weird that categories for "births" contain images of and categories for people who have already been born. I assume a lot of them are added through templates though. So I'm not really sure there's anything that can be done to fix the issue. That's assuming there would even be a consensus to deal with it to begin with. But if it were me I'd confine categories for "births" to actual images of the birthing process and/or babies being born. But then who knows where the sub-categories would go in that case. "People by birth location" maybe? Or just completely ditch the whole scheme as meaningless trivia outright. --Adamant1 (talk) 23:13, 4 October 2024 (UTC)
- I consider those unnecessary cats too.
- also same thing for the deaths by causes, buildings by height Category:123-meter-tall structures, bridges by length Category:3.1-kilometer bridges... RoyZuo (talk) 14:55, 6 October 2024 (UTC)
- Separate issue, but how do you feel about categories for other biographical metadata like Category:Births by location (e.g. Category:Births in Fairbanks, Alaska) or Category:Deaths by year? These, too, feel like situations where Commons categories are being misused as a sort of Wikidata-lite to describe people, rather than classifying media. For instance, the aforementioned "Births in Fairbanks, Alaska" contains categories of images of people who were born in Fairbanks, not images of people being born in Fairbanks. Omphalographer (talk) 22:59, 4 October 2024 (UTC)
- 4 levels seems a little extreme but it's a good suggestion in general. There's some pretty cavernous category structures out there and most of this stuff is better off in infoboxes or otherwise stored on Wikidata's end anyway. With people specifically, most of the time we already know the person's gender from their first name. So it seems kind of pointless to have categories for it. Plus there's a risk of it getting really pointless and obscure depending on the situation. --Adamant1 (talk) 18:48, 4 October 2024 (UTC)
Category:Miss Elizabeth is categorized under Category:Miss (surname) due to {{Wikidata Infobox}}, which is far from the most boneheaded thing I've seen emanating from WI. And some of you still think it's a good idea to let Wikidata hijack our category structure simply because you could never be bothered to do any of the hard work yourself? Excuse me while I go somewhere and laugh my balls off. RadioKAOS / Talk to me, Billy / Transmissions 23:23, 6 October 2024 (UTC)
- So fix it! (Which I just did.) Commons doesn't exist in a vacuum; we shouldn't duplicate work being done by other Wikimedia projects. Omphalographer (talk) 23:39, 6 October 2024 (UTC)
- "So fix it" = existing in a vacuum. I wouldn't even mention it if it weren't representative of a much bigger problem. As I hinted in the other discussion, I really hate mentioning examples because it ALWAYS provides an excuse for the other person to dwell on the example and ignore the big picture. RadioKAOS / Talk to me, Billy / Transmissions 02:57, 7 October 2024 (UTC)
- Strongly agree with Omphalographer. We should minimize time and effort required as well as deduplicate work. For example, I think there could be a bot that suggests categories for Commons categories if the cats differ between WMC and WP so both are in sync (they don't have to be the same but often cats here miss some valid useful cats). Just saying "So fix it" would indeed be a case of existing in a vacuum, it would be a good point but that's not all the user said and elaboration of what the "much bigger problem" would be is missing. I could make an actual argument there for your case but I don't know if that's what you mean(?): Wikidata items may have inaccurate data and its items are less well maintained in the sense of watched for vandalism or flawed edits. An argument against that is that people here can see this flawed cat in the category page, they don't see the change in their Watchlist, but they can edit the Wikidata item if they notice it's flawed and also inaccurate data in the few (imo too few) fields that the infobox auto-adds to categories aren't that common. The main argument against this point is a potential solution: scripts that check for difference that likely need manual checking such as when a (surname) category set on an item differs from the category's first word, or items that have "Miss (surname)" where the first word differs and so on [a comprehensive ruleset could be developed]. Prototyperspective (talk) 12:06, 7 October 2024 (UTC)
October 06
Wikidata infobox forcing Wikidata based DEFAULTSORT
The {{Wikidata Infobox}} template seems to impose an (anglosaxonic?) DEFAULTSORT ordering on categories of people based on Wikidata, without broad discussion with the community and completely messing up a number of categories. Example: Category:Male politicians of São Paulo, try to find Category:Professor Fernando there, even if you know his real full name. It's impossible. Not only he is not know by his full name, and we are expecting an A-Z order of category names, but even if you happen to know his full name, someone added a surname on Wikidata that doesn't even exist in Portuguese. Shouldn't this Wikidata DEFAULTSORT imposition be removed or at least "opt-in" for that template? Pinging @Mike Peel: , who develops the tool. Darwin Ahoy! 14:16, 6 October 2024 (UTC)
Pinging @Animalparty: who also noticed this problem some years ago. This situation seems to be lingering here for long, I wonder why it hasn't been fixed yet with a simple opt-in policy for that automatic Wikidata based DEFAULTSORT ordering, instead of forcing it everywhere and causing a mess.-- Darwin Ahoy! 14:36, 6 October 2024 (UTC)
- You can disable its DEFAULTSORT.
- wdib is used millions of times. obviously its practical solution is going by "opt out" instead of "opt in". RoyZuo (talk) 14:50, 6 October 2024 (UTC)
- @RoyZuo The problem is not WDIB, just the Wikidata based defaultsort "feature". Darwin Ahoy! 15:07, 6 October 2024 (UTC)
- You can just add defaultsort and the bot then de-activates the infobox based sorting. That the default is English comes from category names being in English.
∞∞ Enhancing999 (talk) 14:51, 6 October 2024 (UTC)- @Enhancing999 people names in Portuguese "being in English"? 🤔 Darwin Ahoy! 15:05, 6 October 2024 (UTC)
- Is there are particular category where the outlined approach isn't working?
∞∞ Enhancing999 (talk) 15:06, 6 October 2024 (UTC)- @Enhancing999 Basically all the Lusphone people categories, like Category:Male politicians of São Paulo Darwin Ahoy! 15:07, 6 October 2024 (UTC)
- I checked Category:Eliseu Gabriel: seems ok.
∞∞ Enhancing999 (talk) 15:11, 6 October 2024 (UTC)- @Enhancing999 It isn't, Gabriel is one of his given names, not a surname. Good example of why this shouldn't be enabled by default. Darwin Ahoy! 15:28, 6 October 2024 (UTC)
- It just means it was incorrectly set up at Wikidata (which you seem to have fixed). How about Category:José Pires do Rio? Or should all be under first names?
∞∞ Enhancing999 (talk) 15:48, 6 October 2024 (UTC)- Yes, it was incorrectly set up in Wikidata, that's why it was barely findable in that system. Now that I fixed it adding his surname Pieri he is impossible to find there, unless you know that rather obscure information. And please stop picking singular cases among the almost 100 which are there, obviously some of them would be correct, while many others will not. That one you brought now has the sorting key "Pires", so it's not even an appropiate example as it's not using that automated DEFAULTSORT. DEFAULTSORT key is the name the person is generally known for, in that case "Pires do Rio". There's no general rule related to using given names, surnames or pseudonyms for teh defaultsort. Darwin Ahoy! 15:53, 6 October 2024 (UTC)
- Well, there doesn't seem to be a clearcut rule, so you will have to set DEFAULTSORT if needed.
∞∞ Enhancing999 (talk) 16:39, 6 October 2024 (UTC)- If there is no rule, it never should have been implemented by default, instead of bringing this additional burden to the community here. 🙄 Darwin Ahoy! 17:12, 6 October 2024 (UTC)
- The default MediaWiki rule for categories doesn't seem to be any better.
∞∞ Enhancing999 (talk) 17:30, 6 October 2024 (UTC)- @Enhancing999 And we have been adapting it for more than 20 years. What was the point of changing that to something else equally bad? Darwin Ahoy! 17:56, 6 October 2024 (UTC)
- I doubt it was available before Commons nor at the beginning of Commons: nostalgia, eh?
- In any case, you can still use it. The infobox is getting somewhat old too, maybe time for a new system?
∞∞ Enhancing999 (talk) 19:22, 6 October 2024 (UTC)- Fun fact: the infobox is designed in such a way that it can be replaced by a new system when that becomes necessary - e.g., if infoboxes could be built into MediaWiki, or we migrate away from the category structure, since it's just a way of displaying the relevant information from Wikidata. With this issue, it's been great to see the discussion here, it seems no change is needed for now. Thanks. Mike Peel (talk) 22:15, 12 October 2024 (UTC)
- @Enhancing999 And we have been adapting it for more than 20 years. What was the point of changing that to something else equally bad? Darwin Ahoy! 17:56, 6 October 2024 (UTC)
- The default MediaWiki rule for categories doesn't seem to be any better.
- If there is no rule, it never should have been implemented by default, instead of bringing this additional burden to the community here. 🙄 Darwin Ahoy! 17:12, 6 October 2024 (UTC)
- Well, there doesn't seem to be a clearcut rule, so you will have to set DEFAULTSORT if needed.
- Yes, it was incorrectly set up in Wikidata, that's why it was barely findable in that system. Now that I fixed it adding his surname Pieri he is impossible to find there, unless you know that rather obscure information. And please stop picking singular cases among the almost 100 which are there, obviously some of them would be correct, while many others will not. That one you brought now has the sorting key "Pires", so it's not even an appropiate example as it's not using that automated DEFAULTSORT. DEFAULTSORT key is the name the person is generally known for, in that case "Pires do Rio". There's no general rule related to using given names, surnames or pseudonyms for teh defaultsort. Darwin Ahoy! 15:53, 6 October 2024 (UTC)
- It just means it was incorrectly set up at Wikidata (which you seem to have fixed). How about Category:José Pires do Rio? Or should all be under first names?
- @Enhancing999 It isn't, Gabriel is one of his given names, not a surname. Good example of why this shouldn't be enabled by default. Darwin Ahoy! 15:28, 6 October 2024 (UTC)
- I checked Category:Eliseu Gabriel: seems ok.
- @Enhancing999 Basically all the Lusphone people categories, like Category:Male politicians of São Paulo Darwin Ahoy! 15:07, 6 October 2024 (UTC)
- Is there are particular category where the outlined approach isn't working?
- @Enhancing999 people names in Portuguese "being in English"? 🤔 Darwin Ahoy! 15:05, 6 October 2024 (UTC)
I've had a lot of issues with DEFAULTSORT myself. It really shouldn't be the default, but then that would kind of defeat the purpose of the whole thing. So maybe it should just be axed? There's no reason people can't, or shouldn't, sort categories whatever way they want to without it being imposed on them through a template though. --Adamant1 (talk) 20:47, 6 October 2024 (UTC)
If adding the DEFAULTSORT template manually overrides the Infobox's DEFAULTSORT, I don't really see what the issue is. Most casual or novice editors won't think to add the template themselves (and even experienced editors sometimes forget to, i.e. me). In the vast majority of cases it does its job correctly and the few instances where it didn't it can be fixed manually. The alternative is having to add it manually to all categories. ReneeWrites (talk) 09:28, 7 October 2024 (UTC)
It seems to me entirely natural that the Wikidata rule that, given the right data, is correct well upwards of 95% of the time (98% would not surprise me), is used by default. Just so long as we can override, which we can, that's fine. - Jmabel ! talk 09:42, 7 October 2024 (UTC)
Per ReneeWrites and Jmabel, the status quo is fine. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:51, 7 October 2024 (UTC)
October 07
Dashes in category names
Sinigh and I disagree about dashes in category names, especially for dates. I favor the ISO/IEC 646 character known as the hyphen-minus. He favors "–" (which I copy-pasted here, I'm not certain of the code point and I'm headed out the door in about 2 minutes). Our discussion so far can be found at https://commons.wikimedia.org/w/index.php?title=User_talk:Sinigh&oldid=934116270. Probably further discussion should continue here, not there. Other opinions sought, because we each have a decent rationale, but are unlikely to convince the other. - Jmabel ! talk 06:00, 7 October 2024 (UTC)
- En dash indicates a range, so 1800–1885 would be correct. This is as far as I can tell the norm on Commons for both category and file names. The universal hyphen/minus sign is used in category names that don't indicate a date range, see for example every entry here: Category:Days by day. ReneeWrites (talk) 06:31, 7 October 2024 (UTC)
- Category names should use hyphen/minus sign, not en/em dashes. Per Commons:Categories#Category names policy "Basic English characters (ISO/IEC 646) are preferred over national variants or extension character sets (for instance, 'straight' apostrophes over 'curly'), where reasonable." Category names should be something that can normally be typed with a keyboard and should not use extension characters where there is no need. Hyphen is perfectly understandable range sign. MKFI (talk) 06:37, 7 October 2024 (UTC)
- The policy says "where reasonable", which affords flexibility, and its use for ranges is not only completely reasonable but also grammatically correct. The en dash serves a specific purpose that is universally recognized and does not introduce ambiguity, which is why most English-language style guides will call for the en dash to be used in this way. That it can't be found on most keyboards seems to be the only real argument against it, and I find it completely unconvincing. ReneeWrites (talk) 09:25, 7 October 2024 (UTC)
- I agree, especially since it's very common to use Commons:HotCat for categorizing, so the optimal solution is "Title with en dash" and "Category redirect with hyphen". I appreciate that it is somewhat cumbersome to input en dashes, but they are typographically and semantically correct and while HotCat or similar tools are certainly not mandatory, they can easily resolve the issue for those who really care. —Justin (koavf)❤T☮C☺M☯ 09:34, 7 October 2024 (UTC)
- I just wish to point out that I believe typeability is generally a valid concern, so I don't take issue with the basic premise of the discussion. But I only agree to the extent that I'm certain this could have been a real problem. My main objection, however, is that it seems very unlikely that the range dash (or hyphen) will ever be typed. To clarify, the original examples were Category:Hendrik Koolwijk (1800–1885) and Category:Iraqi Army in the Gulf War (1990–1991), which made me wonder: When is anyone ever going to type full category titles like those? And if they were to, when would they actually have to?
- If typeability is indeed relevant here, I don't see why it is only considered important when it comes to the dash, while diacritics and language-specific characters derived from the Latin alphabet are preferred. Don't they, more often than not, cause an arguably worse version of same perceived problem?
- "Title with en dash" and "Category redirect with hyphen" sounds like good strategy, and hopefully it's also an acceptable compromise. From that point of view, it's actually quite convenient that categories with range hyphens often already exist.
- Sinigh (talk) 09:59, 7 October 2024 (UTC)
- I hope this will be accepted as a compromise too. I know Commons isn't Wikipedia and we do our own thing here, but I do find it useful to look at how things are done on Wikipedia when an issue like this crops up (one that's not Commons-specific, I mean) on how they've dealt with it, and "Title with en dash, redirect with hyphen" is the compromise they settled on as well. ReneeWrites (talk) 10:12, 7 October 2024 (UTC)
- I agree, especially since it's very common to use Commons:HotCat for categorizing, so the optimal solution is "Title with en dash" and "Category redirect with hyphen". I appreciate that it is somewhat cumbersome to input en dashes, but they are typographically and semantically correct and while HotCat or similar tools are certainly not mandatory, they can easily resolve the issue for those who really care. —Justin (koavf)❤T☮C☺M☯ 09:34, 7 October 2024 (UTC)
- The policy says "where reasonable", which affords flexibility, and its use for ranges is not only completely reasonable but also grammatically correct. The en dash serves a specific purpose that is universally recognized and does not introduce ambiguity, which is why most English-language style guides will call for the en dash to be used in this way. That it can't be found on most keyboards seems to be the only real argument against it, and I find it completely unconvincing. ReneeWrites (talk) 09:25, 7 October 2024 (UTC)
- Category names should use hyphen/minus sign, not en/em dashes. Per Commons:Categories#Category names policy "Basic English characters (ISO/IEC 646) are preferred over national variants or extension character sets (for instance, 'straight' apostrophes over 'curly'), where reasonable." Category names should be something that can normally be typed with a keyboard and should not use extension characters where there is no need. Hyphen is perfectly understandable range sign. MKFI (talk) 06:37, 7 October 2024 (UTC)
- If I understand it correctly the short summary of what this is about is whether the - or – character should be used. If there is some decision either way I think this should be in some guidance page where it's also clarified where each character is to be used as well as a bot/script that automatically moves categories accordingly. Additionally, it would be best if not needed that a technical change is implemented that makes HotCat autocompletes with – show up when using the more common minus - character. This is a broad subject and also affects for example Wikipedia lists which sometimes intermingle both. If there is a code issue or proposal about the autocompletes showing up, please link it here. Prototyperspective (talk) 11:57, 7 October 2024 (UTC)
- Agreed on documentation and to be clear, HotCat will correct/autocomplete any category redirect, either those that use standard MediaWiki syntax (i.e. "#redirect[[:Category:Foo]]") or by using {{Catredirect}}. —Justin (koavf)❤T☮C☺M☯ 12:07, 7 October 2024 (UTC)
- To clarify: I meant that it should show the autocomplete without requiring there to be a redirect with the same name but a different hyphen character. E.g. always whenever there is a – between numbers show this cat in the autocompletes even if the user entered a minus - despite of there not being a redirect. Maybe it's not really needed because that character usually comes only near the end of a cat title, this would be needed if sometimes this is near the start of a cat title. Prototyperspective (talk) 12:13, 7 October 2024 (UTC)
- That would still be a brilliant addition, though. Would it even be possible to automatically redirect searches whenever someone (understandably) types a hyphen where the dash goes? I mean, sort of like the way searches aren't case sensitive, could they also not be hyphen/dash sensitive? Only when needed, of course, i.e. if there actually is a category with a dash where the person types a hyphen. Sinigh (talk) 13:34, 7 October 2024 (UTC)
- It is possible, the question is how difficult it would be and what the best way to implement it would be. In any case, it probably needs a code issue for HotCat. Prototyperspective (talk) 14:11, 7 October 2024 (UTC)
- That would still be a brilliant addition, though. Would it even be possible to automatically redirect searches whenever someone (understandably) types a hyphen where the dash goes? I mean, sort of like the way searches aren't case sensitive, could they also not be hyphen/dash sensitive? Only when needed, of course, i.e. if there actually is a category with a dash where the person types a hyphen. Sinigh (talk) 13:34, 7 October 2024 (UTC)
- To clarify: I meant that it should show the autocomplete without requiring there to be a redirect with the same name but a different hyphen character. E.g. always whenever there is a – between numbers show this cat in the autocompletes even if the user entered a minus - despite of there not being a redirect. Maybe it's not really needed because that character usually comes only near the end of a cat title, this would be needed if sometimes this is near the start of a cat title. Prototyperspective (talk) 12:13, 7 October 2024 (UTC)
- Agreed on documentation and to be clear, HotCat will correct/autocomplete any category redirect, either those that use standard MediaWiki syntax (i.e. "#redirect[[:Category:Foo]]") or by using {{Catredirect}}. —Justin (koavf)❤T☮C☺M☯ 12:07, 7 October 2024 (UTC)
- Can we use a character that is found on keyboards without requiring to use the alt key?
∞∞ Enhancing999 (talk) 12:51, 7 October 2024 (UTC)- Thats the best solution, the hyphen. Juts because Wikipedia has a preference for en dash) does not need to apply to us. I keep on saying we're a databank, names of cats should be as simple as possible and contain easily available on the keyboard. No need for affectations. Broichmore (talk) 12:40, 12 October 2024 (UTC)
- Wikipedia's preference is not and has never been the main argument here; ReneeWrites only told us how they solved it over there and pointed out that "we do our own thing here." As for the keyboard layout, everyone already knows that the hyphen is easier to type. That argument has been addressed in several ways, yet noone seems to have anything to say in response to those objections. Simply reiterating the original argument obviously isn't going to convince anyone. Sinigh (talk) 13:06, 12 October 2024 (UTC)
- Thats the best solution, the hyphen. Juts because Wikipedia has a preference for en dash) does not need to apply to us. I keep on saying we're a databank, names of cats should be as simple as possible and contain easily available on the keyboard. No need for affectations. Broichmore (talk) 12:40, 12 October 2024 (UTC)
Uploading photos taken in a commercial establishment
I took these photos of an old Heidelberg printing press at a commercial establishment. Removed any identification of the location, but didn't ask for permission. I want to upload them as PD, but unsure of their copyright status.
- https://ibb.co/z4WBMxw
- https://ibb.co/ysWWvF2
- https://ibb.co/dWmvV2h
- https://ibb.co/pPRqvwf
- https://ibb.co/StwsSXT
— Ineuw talk 06:48, 7 October 2024 (UTC)
- I'm not an expert on this, but my assumption is that it's fine to post these. The focus of these images is the printing press, which isn't copyrighted, and the commercial establishment it's placed in would fall under de minimis as it's barely visible. I would not post the third picture however, and I would crop out the sign at the top of the first. You can use the information on that sign (by paraphrasing or summarizing) but I assume the text itself is copyrighted and can't be copied or posted in full. ReneeWrites (talk) 09:51, 7 October 2024 (UTC)
- Also as a non-expert, maybe you could be concerned that the text in some of these is copyright-able, so you may want to crop that out. Seems like a pretty small concern, but better safe than sorry. Edit conflict-y: I agree with Renee—Justin (koavf)❤T☮C☺M☯ 09:59, 7 October 2024 (UTC)
- There is no need to remove location details. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:43, 7 October 2024 (UTC)
- Indeed. Keep the location details, they are relevant. - Jmabel ! talk 16:51, 7 October 2024 (UTC)
- Thanks for all the comments. I also noticed the first photo with some possible identification. But, I doubt that the establishment would object to the promotion. My concern is acceptability on the Commons. BTW, I already removed a lot of identifying information. As for location details, what does that mean? The 3rd photo with the 1985 date is meaningless. It's when the company folded.— Ineuw talk 12:51, 8 October 2024 (UTC)
- You said "Removed any identification of the location". Don't. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:20, 9 October 2024 (UTC)
- Thanks for all the comments. I also noticed the first photo with some possible identification. But, I doubt that the establishment would object to the promotion. My concern is acceptability on the Commons. BTW, I already removed a lot of identifying information. As for location details, what does that mean? The 3rd photo with the 1985 date is meaningless. It's when the company folded.— Ineuw talk 12:51, 8 October 2024 (UTC)
- Indeed. Keep the location details, they are relevant. - Jmabel ! talk 16:51, 7 October 2024 (UTC)
- There is no need to remove location details. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:43, 7 October 2024 (UTC)
We are talking about two different concerns. Please clarify.— Ineuw talk 17:50, 9 October 2024 (UTC)
- Are we? What concern are you talking about? Do not remove details of the location from your images or their metadata. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 18:11, 9 October 2024 (UTC)
- Thanks for the much appreciated clarification. In that case, I will let them know.— Ineuw talk 18:43, 9 October 2024 (UTC)
- I am soon uploading the unedited originals, as well as some additional close-ups. Thanks again.— Ineuw talk 00:55, 10 October 2024 (UTC)
- Thanks for the much appreciated clarification. In that case, I will let them know.— Ineuw talk 18:43, 9 October 2024 (UTC)
New reports: categories with only redcats & cats with only infobox cats
I just set up these two new reports as an addendum to Special:UncategorizedCategories.
- Commons:Report UncategorizedCategories with redcats (Quarry:query/86864, 10583 items)
categories that have only nonexisting categories set (redlinks) - Commons:Report UncategorizedCategories with only infobox categories (Quarry:query/86867, 4475 items)
categories that only have meta categories set by the Wikidata infobox like Category:Uses of Wikidata Infobox
The request for these reports to get updated by a bot instead of manually is here at Commons:Bots/Work requests.
I requested the Quarry query here so a big thanks to Matěj Suchánek who implemented it. (Collaboration like this makes everyone more productive and saves time as well as making each one's expertise/skills/knowledge more fruitful.)
In previous discussions about uncategorized categories I clarified multiple times that these categories are also missing categories or have issues but don't yet have a report where they show up. What I did not know at first is that this could be solved via a Quarry query and what the place to request such queries is.
Most of the categories on these reports need some work. So if you're interested in helping out please take a look, especially if you already have experience with cutting down the Special:UncategorizedCategories list. You can add /Page_1
to the two report links if you want to see only a small sample of the first 500 items.
Lots of work needed there. Probably, the list will not grow back as quickly as the entirely uncategorized cats in UncategorizedCategories.
--Prototyperspective (talk) 17:12, 7 October 2024 (UTC)
- Thanks for putting this together! Any chance it could be formatted as a table instead of a comma-separated list? That'd make it a lot easier to work with. Omphalographer (talk) 22:28, 7 October 2024 (UTC)
- I thought so at first but then found it would be easier if one was able to see just as many items as possible in one view. One can see lots of links without having to scroll around. The link color changes once you visited the pages so e.g. row-numbers don't add anything. A table would be useful if there were more columns....maybe at some point there could be additional columns that show e.g. whether a category only has 1 file (or the count of files) but I currently don't see how that would be useful (if anything that could go into a separate report page). Is there any reason you'd prefer a table? It wouldn't be difficult to convert it to a table but I don't see what advantage it would have. Maybe you mean a table with multiple columns but only these wikilinks instead of a one-column table but then I wouldn't see the difference to the current format. One thing that may be useful is a page with raw URLs so one can open many of these in new tabs at once (select and drag onto a tab or right click->open in new tabs). Prototyperspective (talk) 22:36, 7 October 2024 (UTC)
- There's one way a table format may be useful: making it possible to sort these alphabetically. However, the query could be changed to order the categories that way without losing the advantages of the comma separated list which I think only needs
ORDER BY page_title DESC
in the query (probably will be done before the next report update). Prototyperspective (talk) 22:43, 7 October 2024 (UTC)- I, for one, find the current format very off-putting (but thank you for doing this at all).
- It's OK if we have a format that doesn't end up showing everything in one view while there is so much to show. People will whittle down the list pretty quickly, in my experience, and it can be re-run periodically. It doesn't matter if not everything is in front of people at once while the content is still voluminous; it's a lot more important for this sort of task that what is in front of us is sane to skim. - Jmabel ! talk 20:51, 9 October 2024 (UTC)
- Alright the question is simply which other format would be better and why? Currently this is only one link and doesn't even other data for other columns and people often dislike things at first they are not familiar with. Prototyperspective (talk) 21:48, 9 October 2024 (UTC)
- If a list format would be better the page would be very large and I don't know if it causes other problems but one would have to scroll a lot. Without alphabetic sorting, a list format makes less sense. But I guess once the new version is sorted it could be easier to skim or skip over similarly named items if it was in a list format. Maybe there could be multiple columns so the page isn't so large. Prototyperspective (talk) 21:58, 9 October 2024 (UTC)
- A wiki table (with one row for each category) would be easiest to work with. Even if the report itself isn't sorted, presenting it as a table will allow users to sort it themselves. Omphalographer (talk) 22:16, 9 October 2024 (UTC)
- The report has been updated now. It has the same sorting as before (no ORDER BY SQL added) but now has a table format so it can be sorted alphabetically by clicking on the column header. Moreover, empty categories have been split off from the report. Prototyperspective (talk) 12:45, 12 October 2024 (UTC)
- A wiki table (with one row for each category) would be easiest to work with. Even if the report itself isn't sorted, presenting it as a table will allow users to sort it themselves. Omphalographer (talk) 22:16, 9 October 2024 (UTC)
- If a list format would be better the page would be very large and I don't know if it causes other problems but one would have to scroll a lot. Without alphabetic sorting, a list format makes less sense. But I guess once the new version is sorted it could be easier to skim or skip over similarly named items if it was in a list format. Maybe there could be multiple columns so the page isn't so large. Prototyperspective (talk) 21:58, 9 October 2024 (UTC)
- Alright the question is simply which other format would be better and why? Currently this is only one link and doesn't even other data for other columns and people often dislike things at first they are not familiar with. Prototyperspective (talk) 21:48, 9 October 2024 (UTC)
- Pretty hard to figure out any kind of pattern in that list. Maybe a simple word count helps. I used that when the "uncategorized" reports were longer. It allows to fix several similar ones at once. Sample wordcount (some "words" are years or other numbers): from 7899, Fire 7879, Sanborn 7877, Insurance 7877, Map 7876, County 7525, in 1170, North 829, Oklahoma 681, Texas 572, Rhine-Westphalia 546, California 546, New 537, Pennsylvania 513, Washington 492, Missouri 464, of 382, 1911 356, 1909 346, Carolina 332, 1912 328, 1910 324, Ohio 317, Jersey 317, Montana 296, 1914 287, 1913 282, Oregon 281, Indiana 272, Kansas 271, 1921 266, 1908 256, 1915 255, 1920 245, 1916 245, 1922 242, Georgia 241, 1905 235, Florida 232, Virginia 222, Maine 219, City 218, by 212, South 207, 1904 203, 1917 194, 1901 190, 1900 180, Massachusetts 178, West 176, 1919 175.
∞∞ Enhancing999 (talk) 23:46, 7 October 2024 (UTC)- Thanks. At the next update it will be sorted alphabetically, maybe that helps. I noticed most of the Rhine-Westphalia cats do have categories but they have been set by a template by the first and only edit by XrayBot and I don't know how to best make such categories disappear from the report – one example for that would be to have a query show all cats included there that have a template set (which usually sets cats) so all of them could get a nulledit to remove all which do have a category from the report (that's just an example, I don't think that's readily possible). Prototyperspective (talk) 09:23, 8 October 2024 (UTC)
Sanborn maps categories
- Looking at the Sanborn categories, it appears someone ran a script to dump content on the site and then walked away from it. Why is it anyone else's responsibility to clean up their messes for them? RadioKAOS / Talk to me, Billy / Transmissions 13:45, 8 October 2024 (UTC)
- You can use this report to ask the respective people. e.g. @Nowakki:
Also when it comes to files like those in Category:Sanborn Fire Insurance Map from Syracuse, Onondage County, New York, 1911, Volume 2 I have suggested that instead of dumping 110 niche files 1911 onto WMC that clutter search results and putting them into categories, it would be better if one instead uploads one document/PDF file. Prototyperspective (talk) 14:34, 8 October 2024 (UTC)- I uploaded the files with the intent of creating the most best collection of sanborn maps anywhere.
- Not only did i fail to find a single person with the slightest enthusiasm for it, but i encountered a number of obstacles. At first i tried to slalom around and through the stupidity, but it was going to be a long process of grinding through ever more peanut brainery, so i started a vote to give me the necessary authority to shove it aside and to flip the switch on the bureaucrats (of which i conversed with a total of at least 5, who were all not talking to each other) in one stroke of genius. I lost the vote, so i pulled the plug on the enterprise. As a bonus, i witnessed a bunch of people who most be on probation or something, manually rename 20000 files, because nobody writes scripts for them, and i learned that commons cannot reliably rename files and the ticket is 10 or so years old (i think there was more than one ticket in that general problem domain). It is not possible to work on a data-mega-center on complex issues if the number of people with a problem solving attitude hovers around 0%. Nowakki (talk) 18:05, 8 October 2024 (UTC)
- @Nowakki: Hi, Your intend is good, but your working process isn't. 1. You shouldn't count on anyone else to fix problems you created with your uploads. 2. You should have start slowly, only uploading a few dozens or hundreds of files, and check if everything was OK before uploading gazillions of files, where the problems have overcome you. Yann (talk) 20:19, 8 October 2024 (UTC)
- Nobody cared about sanborn maps until they suddenly started caring in the middle of the upload campaign. Even after the debate was over involving more than 5 people, at no point did a person appear who was in charge of these maps. I can compensate for lack of leadership with initiative, but if you don't want my sanborn power-user setup, then you don't want it. Nowakki (talk) 20:43, 8 October 2024 (UTC)
- I run into similar issues myself sometimes where people criticize and attack me for doing things that are supposedly controversial. But it's almost impossible to know what is or isn't going to trigger any of the over sensitive snowflakes on here or otherwise cause a controversy until after the fact. There's really no way what-so-ever to know before doing something what random nonsense is going to be an issue or otherwise cause problems though. And good luck getting anyone who throws a fit about it to actually help you fix the issue. Which 100% they should if their the ones making an issue out of it to begin with. More on topic, is there a reason that a bot can't just add parent categories to the ones for Sanborn maps? --Adamant1 (talk) 23:35, 9 October 2024 (UTC)
- Creating the categories is easy. Just got to find somebody to devise a scheme and write a bot.
- I had something in the making like so: User:Nowakki/test3 for the city-level category pages, which i presume all the redcats are pointing to. Nowakki (talk) 04:30, 10 October 2024 (UTC)
- @Adamant1: I spent like two weeks or so writing code for this. I don't really need anyone to do anything, except give me mass rename privilege for a selected number of files.
- Isn't there a way to override borderline imbecile snowflakes, who have the power to vote, with the help of specially trained overlords who act in the interest of progress being made? Nowakki (talk) 12:30, 10 October 2024 (UTC)
- It is a simple and straightforward process to request filer mover permissions which enables you to use the massrename tool. Prototyperspective (talk) 13:11, 10 October 2024 (UTC)
- I think people who refused the privilege back then were using about as many brain cells as you just did when you imagined you knew how to be of help. Nowakki (talk) 13:35, 10 October 2024 (UTC)
- You're quite condescending, even more than Adamant1 here once again. I prefer you do not upload these files versus uploading them the way you did. Apply again with the info why you need these permissions. Prototyperspective (talk) 13:45, 10 October 2024 (UTC)
- Somebody should step in and clear a way through the bureaucratic bloat. Then it will be a team effort. Nowakki (talk) 14:05, 10 October 2024 (UTC)
- Yeah well, I find your attitude towards Nowakki rather patronizing myself. I don't think it really helps to go off about how they shouldn't have uploaded the files to begin with at this point. Different strokes for different folks I guess. I'm sure your the one who's in the right here though just because your acting more agitated about it then I am.
- You're quite condescending, even more than Adamant1 here once again. I prefer you do not upload these files versus uploading them the way you did. Apply again with the info why you need these permissions. Prototyperspective (talk) 13:45, 10 October 2024 (UTC)
- I think people who refused the privilege back then were using about as many brain cells as you just did when you imagined you knew how to be of help. Nowakki (talk) 13:35, 10 October 2024 (UTC)
- It is a simple and straightforward process to request filer mover permissions which enables you to use the massrename tool. Prototyperspective (talk) 13:11, 10 October 2024 (UTC)
- I run into similar issues myself sometimes where people criticize and attack me for doing things that are supposedly controversial. But it's almost impossible to know what is or isn't going to trigger any of the over sensitive snowflakes on here or otherwise cause a controversy until after the fact. There's really no way what-so-ever to know before doing something what random nonsense is going to be an issue or otherwise cause problems though. And good luck getting anyone who throws a fit about it to actually help you fix the issue. Which 100% they should if their the ones making an issue out of it to begin with. More on topic, is there a reason that a bot can't just add parent categories to the ones for Sanborn maps? --Adamant1 (talk) 23:35, 9 October 2024 (UTC)
- Nobody cared about sanborn maps until they suddenly started caring in the middle of the upload campaign. Even after the debate was over involving more than 5 people, at no point did a person appear who was in charge of these maps. I can compensate for lack of leadership with initiative, but if you don't want my sanborn power-user setup, then you don't want it. Nowakki (talk) 20:43, 8 October 2024 (UTC)
- @Nowakki: Hi, Your intend is good, but your working process isn't. 1. You shouldn't count on anyone else to fix problems you created with your uploads. 2. You should have start slowly, only uploading a few dozens or hundreds of files, and check if everything was OK before uploading gazillions of files, where the problems have overcome you. Yann (talk) 20:19, 8 October 2024 (UTC)
- You can use this report to ask the respective people. e.g. @Nowakki:
- Looking at the Sanborn categories, it appears someone ran a script to dump content on the site and then walked away from it. Why is it anyone else's responsibility to clean up their messes for them? RadioKAOS / Talk to me, Billy / Transmissions 13:45, 8 October 2024 (UTC)
- More on topic, the files have already been uploaded. So the question is what to do about it now. I'm a little fuzzy on the original details, but assuming this mainly (or only) has to do with uncategorized categories then I don't see what renaming the files would have to do with it. Someone could use User:Nowakki/test3 as a starting point to create the categories through a bot edit. Then the file names could hopefully be fixed in the future, but that's tangential to creating the categories. Or I assume it is. Someone can correct me if I'm wrong on that though. --Adamant1 (talk) 22:09, 10 October 2024 (UTC)
- I'm not agitated at all and nowhere was I acting patronizingly or condescendingly towards Nowakki. I don't really care about this case at all. A user asked about it and I looked at the history page and pinged the respective user, then Nowakki said there's problems with renaming without any further details and I helpfully mentioned this can be done by requesting file-moving rights, that's all. Yes, the question is what to do about the files & cats now. The answer to your question of what file naming has to do it is that Nowakki wrote […] except give me mass rename privilege for a selected number of files. Prototyperspective (talk) 22:24, 10 October 2024 (UTC)
- I think you were. We'll have to disagree though. But that's not all you said. You seem to have a real problem with being patronizing towards other users and then trying to weasel out of it by acting like everything you said was totally innocent and on topic when it wasn't. Regardless, I think the best way to deal with this is by doing a bot edit based on User:Nowakki/test3. But I have other things to do myself. So someone else is going to have to do it, but that would be my suggestion. I see you asked Nowakki to create a bot request. So at least we're in agreement about it. --Adamant1 (talk) 22:56, 10 October 2024 (UTC)
- @Nowakki: Creating the categories is easy. Just got to find somebody to devise a scheme and write a bot. Could you please create a Bot work request with the details? and if you don't really need anyone to do anything, except give me mass rename privilege for a selected number of files could you please request these permissions? Thanks for your efforts so far. Prototyperspective (talk) 22:45, 10 October 2024 (UTC)
- The Sanborn map collection needs to be fixed properly. If you fix the categories you are hiding an indicator that it is broken, which will make the real problem worse. I would therefore advise you to not touch it and move on.
- You can copy+paste the above to where to redlink cats problem is being discussed, so they know that a fix is nontrivial and it will be fixed together with the underlying problem eventually. Nowakki (talk) 23:19, 10 October 2024 (UTC)
- What about just up-merging the files to more general ones and deleting the categories their currently in outright then? Or conversely the files could just be mass nominated for deletion. That sounds like a less then optimal solution though lol. --Adamant1 (talk) 23:26, 10 October 2024 (UTC)
- If you delete them now they might have to be downloaded again when somebody wants them. Therefore that should be not an option.
- You are thinking of ways to hide indicators of a problem. To me that is a character flaw. Nowakki (talk) 23:43, 10 October 2024 (UTC)
- I'm a little hazy on the details, but from what I remember your proposal to rename the files was rejected. So that's clearly not an option. I don't think just leaving the categories as is really helps either. So what's your suggestion to deal with the issue aside from leaving the red links or renaming the files when that already failed? --Adamant1 (talk) 23:46, 10 October 2024 (UTC)
- I only state my original proposition. You can proceed in any way you see fit. I would advise you to develop a scheme that make the sanborn map collection awesome and then implement it. Nowakki (talk) 23:50, 10 October 2024 (UTC)
- I'm a little hazy on the details, but from what I remember your proposal to rename the files was rejected. So that's clearly not an option. I don't think just leaving the categories as is really helps either. So what's your suggestion to deal with the issue aside from leaving the red links or renaming the files when that already failed? --Adamant1 (talk) 23:46, 10 October 2024 (UTC)
- What about just up-merging the files to more general ones and deleting the categories their currently in outright then? Or conversely the files could just be mass nominated for deletion. That sounds like a less then optimal solution though lol. --Adamant1 (talk) 23:26, 10 October 2024 (UTC)
- I'm not agitated at all and nowhere was I acting patronizingly or condescendingly towards Nowakki. I don't really care about this case at all. A user asked about it and I looked at the history page and pinged the respective user, then Nowakki said there's problems with renaming without any further details and I helpfully mentioned this can be done by requesting file-moving rights, that's all. Yes, the question is what to do about the files & cats now. The answer to your question of what file naming has to do it is that Nowakki wrote […] except give me mass rename privilege for a selected number of files. Prototyperspective (talk) 22:24, 10 October 2024 (UTC)
- More on topic, the files have already been uploaded. So the question is what to do about it now. I'm a little fuzzy on the original details, but assuming this mainly (or only) has to do with uncategorized categories then I don't see what renaming the files would have to do with it. Someone could use User:Nowakki/test3 as a starting point to create the categories through a bot edit. Then the file names could hopefully be fixed in the future, but that's tangential to creating the categories. Or I assume it is. Someone can correct me if I'm wrong on that though. --Adamant1 (talk) 22:09, 10 October 2024 (UTC)
The above discussion has gotten terribly convoluted. If there are things that one or more of the participants wish to see happen, could you please each state those, without a bunch of cross-talk? - Jmabel ! talk 07:45, 11 October 2024 (UTC)
- I think this short comment sums it up. I do not understand Nowakki's reply to it. It's unclear what is meant with needs to be fixed properly. And it's also unclear what is meant with I would advise you to develop a scheme that […]. If somebody understands what exactly needs to be done or is proposed please explain it briefly and if not I repeat my question for Nowakki to create a request with the explanation/details. Prototyperspective (talk) 09:37, 11 October 2024 (UTC)
- You are right, the underlying issue has never been properly explained to you.
- User:Nowakki/test3
- hover over the links in the right column.
- The filenames for map files prior to 1900 are bad. They were chosen by a not so brilliant coworker who came ahead of me.
- The map files after 1900 have a better naming scheme.
- If you think this is a cosmetic change, click on one of the files named "ind". The plate number and LOC sequential IDs don't match for the old scheme. It's broken. Suppose you were to run the index through OCR. With the old scheme, you need a separate database to map plate numbers to files. Unsustainable stupidity.
- The index page should be generated for each city when the above issue has been resolved. Nowakki (talk) 11:41, 11 October 2024 (UTC)
- Alright, thanks for explaining! That was the missing info. If you requested file-mover permissions and it was declined that doesn't mean you can't apply again. I think it the permissions will be granted if you provide them with this info. If not, you could also check what else is required to get these permissions but I don't see why the request would fail so please request the file-moving permissions, afterwards you could make use of the mass rename tool and implement the filetitle changes. Prototyperspective (talk) 15:40, 11 October 2024 (UTC)
- If I wanted file mover privileges I could have requested them at any time since January. I just came here to talk about stupidity. Nowakki (talk) 17:06, 11 October 2024 (UTC)
- @Nowakki: Not a very productive thing to do. There is not a lot we can do with "I'd rather have my grievance than a solution." - Jmabel ! talk 06:07, 12 October 2024 (UTC)
- You need at least a few people who can do what you perceive to be an impossible feat. Nowakki (talk) 06:47, 12 October 2024 (UTC)
- Agree. Not sure what the user is up to now when writing sth like that.
What about throwing all of these Sanborn cats into a flat category (for now)? Also maybe they should be separated from the report (if they are in a flat category they wouldn't show up in it anymore anyway). Prototyperspective (talk) 12:42, 12 October 2024 (UTC)- Or remove them from the report, then they don't show up in the report. Nowakki (talk) 12:55, 12 October 2024 (UTC)
- @Nowakki: Not a very productive thing to do. There is not a lot we can do with "I'd rather have my grievance than a solution." - Jmabel ! talk 06:07, 12 October 2024 (UTC)
- If I wanted file mover privileges I could have requested them at any time since January. I just came here to talk about stupidity. Nowakki (talk) 17:06, 11 October 2024 (UTC)
- Alright, thanks for explaining! That was the missing info. If you requested file-mover permissions and it was declined that doesn't mean you can't apply again. I think it the permissions will be granted if you provide them with this info. If not, you could also check what else is required to get these permissions but I don't see why the request would fail so please request the file-moving permissions, afterwards you could make use of the mass rename tool and implement the filetitle changes. Prototyperspective (talk) 15:40, 11 October 2024 (UTC)
October 08
Exhibition at the M.H. de Young Memorial Museum, 1932
Hi, I am looking for the catalog of the exhibition at the M.H. de Young Memorial Museum, San Francisco, 1932, by the Group f/64, or at least a list of artworks. Any idea? In newspaper reports? This would help finding copyright status for these notable pictures. Thanks, Yann (talk) 14:03, 8 October 2024 (UTC)
- Try asking at en:WP:RX and/ or en:WP:REFDESK/H. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:14, 9 October 2024 (UTC)
- Viriditas has got this information. en:Talk:Group_f/64#Exhibition_catalog. Thanks a lot! Yann (talk) 10:38, 10 October 2024 (UTC)
October 09
OsmappBot uploads from the OpenStreetMap app
Hi wikipedians!
I am creating an open-source app for browsing OpenStreetMap – the OsmAPP. You can access it at osmapp.org. One of main features of OsmAPP is showing a side panel with information on any clicked map feature or POI. This includes showing photos from many available sources, which is mainly from connected Wikipedia articles, Wikidata entries and directly linked Wikimedia Commons images (see eg. this page).
This connection naturally led me to think about making upload possible as well, as this could benefit both projects – supply Commons with real world images under correct license, and add photos to various map features in OpenStreetMap.
After a year of effort (the wiki api is quite challenging 😃), I have the upload script ready. I asked for the permission to make a test run, which succeeded (OsmappBot contributions). Now I would like to ask broader public about your opinions, and ideas how to make the most of it for Wikimedia Commons.
Regarding Freedom of panorama – the Upload dialog would warn users if the specifc country forbids public photos of buildings etc. I can't add the specific {{FoP-country}} templates programatically (that would need a AI object recongnition), but I will review uploaded images manually and add it if needed. OsmAPP doesn't have many users, I expect only handful of uploads per month. Mind that, this is not an import bot, it rather uploads images on behalf of logged in OSM users, any change I make to the bot will only affect future uploads.
Looking forward to your ideas and opinions 🙂 Zbytovsky (talk) 10:05, 9 October 2024 (UTC)
- @Zbytovsky: Sounds interesting. Taking File:Hammertalwand (Climbing Crag) - OsmAPP (4).JPG as an example, the most striking thing is that the image lacks categories. Also, the structured data lacks as "depicts" statement, and the coordinates are not clickable. What if the volume exceeds your capability (or availability) to review? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:12, 9 October 2024 (UTC)
- Interesting. I suggest you create a video demonstrating its features and put it on YouTube and also upload it here. Please enable users to add one or several categories with autocomplete similar to the Upload Wizard on this site (maybe the HotCat script could be used for that). Prototyperspective (talk) 16:21, 9 October 2024 (UTC)
- I would recommend rewriting Public photography is forbidden in this country. to Public photography is restricted in this country. as all but five countries with freedom of panorama provisions grant permission for some works but not others. For example, the United States has freedom of panorama for architecture but not sculpture. JohnCWiesenthal (talk) 19:20, 9 October 2024 (UTC)
- Looks very interesting and promising! Thank you for the work! It would be a nice addition if login and upload with Wikimedia account would be possible. That would also relieve you from the responsibility at least for uploads done through that way. --Marsupium (talk) 06:42, 11 October 2024 (UTC)
Category:Minority schools came to my attention when User:Elkost added it as a parent of Category:Jewish schools and my immediate reaction was, "What about a cheder in Israel (where Jews are not a minority)? We had a bit of a discussion and both agree that this area of categorization needs more thought than it has had, and more than just he and I can bring to it. In particular (some of this per our discussion, some not):
- Right now, Category:Minority schools includes Category:Minority schools in Bulgaria, Category:Armenian schools, Category:Jewish schools, and Category:Sami schools. Category:Armenian schools seems a particularly odd inclusion, because it, in turn, includes Category:Schools in Armenia, most of which presumably serve the ethnic Armenian majority there.
- @NeverDoING: as creator of Category:Minority schools, can you explain your intention and, in particular, the intended criteria for inclusion in this category?
- Quoting myself, rather than try to word this again from scratch: There are also Japanese and Korean schools in the U.S., American schools in practically every major city of the world, French schools in Spain, Romania, and who knows where else, etc. I'm not sure how we should handle these all, but the emphasis on minority seems wrong to me. Also, there is some sort of distinction to be made between (for example) a Japanese school in the U.S. that focuses entirely on people of Japanese ancestry and one where non-Japanese might study aspects of language and culture as well. I don't think any of this has been well thought through. Plus "Jewish" especially raises the issue of religious vs. ethnic. For example, a cheder is a very different matter than a Jewish day school that teaches a broad curriculum. Analogously, but not a school: the 92nd Street Y in Manhattan is emphatically culturally Jewish, and equally emphatically secular.
Jmabel ! talk 21:04, 9 October 2024 (UTC)
- @Jmabel: What do you think about something like Category:Historically black colleges and universities it relates to this? Surely if it's fine to have a category for historically black colleges and universities then the same would (or at least should) apply for schools having to do with other minority groups. --Adamant1 (talk) 01:14, 10 October 2024 (UTC)
- HBCUs are very simple, because they are self-organized into an association.
- I have no problem with the existence of any of these subcats. My issue is with the term "minority schools." Since you bring up Black Americans: would an African-American academy at the high-school level in Detroit (a city that is over 75% Black) be a "minority school"? And, again, why on a global basis are Jews and Armenians deems "minorities" (even though each has a country where they make up the majority) but Hungarians (the majority in Hungary, and an important minority in Romania, Slovakia, and elsewhere) are not? (All three are within about 20% of the same size populations.) - Jmabel ! talk 12:51, 10 October 2024 (UTC)
- @Jmabel: What do you think about something like Category:Historically black colleges and universities it relates to this? Surely if it's fine to have a category for historically black colleges and universities then the same would (or at least should) apply for schools having to do with other minority groups. --Adamant1 (talk) 01:14, 10 October 2024 (UTC)
- My thoughts are that the term "Minority" is really poorly defined and that maybe "Ethnic schools" would be better and more precise, we can also use "Religious schools" where relevant Oxyman (talk) 03:59, 13 October 2024 (UTC)
October 10
Internet Archive attacked
Hi, I just learned that Internet Archive has been attacked, and suffered a security breach: [1]. :((( We rely a lot on IA for various reasons, and that's very sad. Yann (talk) 10:36, 10 October 2024 (UTC)
- Sad but from the linked news article it doesn't seem like the contents there are affected. Prototyperspective (talk) 11:46, 10 October 2024 (UTC)
- somebody deleted 150 of my buckets in recent days. support said i deleted them. i didn't.
- this will take weeks before order is restored if this happened to many peeps. Nowakki (talk) 15:02, 10 October 2024 (UTC)
- Yes, earthquakes in San Francisco area, a concern that I have expressed several times here, is only one of the risks that may affect Internet Archive. Fortunately, as Prototyperspective said, it seems that contents weren't affected, but if, as their only publicly available information says, they only have 2 production copies of each file (and no proper backups), some attack could possibly delete information completely (Archive stores so many files that a complete destruction due to an attack seems highly unlikely, though). I hope this event will help them to greatly improve their infrastructure, and to get the money they would need for that. I hope that the recent Archive's partnership with Google also helps with this. MGeog2022 (talk) 20:04, 10 October 2024 (UTC)
- By the way, while WMF has much better resources and infrastructure than Internet Archive, absolute security does not exist, so this is a reminder that Commons media dumps are a real need (as well as true offline backups ready to be restored, both for media and for text content in all WMF wikis). MGeog2022 (talk) 20:26, 10 October 2024 (UTC)
Category slideshows
Does anyone understand why the UI to start a category slideshow uses such "mystery meat" navigation? Nothing obvious about the symbol at all. - Jmabel ! talk 14:32, 10 October 2024 (UTC)
New report: self-categorized categories
- Commons:Database reports/Self-categorized categories shows categories that directly contain themselves
When using the deepcategory search operator it should detect self-categorization so it doesn't break. Until that is implemented, fixing self-categorization is especially needed (e.g. to enable it to always show results phab:T376440) but it's also a general problem if categories contain themselves (in these cases directly).
The report was empty / out of date for a long time but now MZMcBride (talk · contribs) updated it – thanks for that!
I worked on it until Ludwig_August_Fallon (default sorting not alphabetic sorting) except for year_in_India cats. Fixing these problems is usually quite simple – removing that category or replacing a template that sets the category with the categories. Sometimes one also needs to add other categories such as some taxonomic category. I requested on the talk page that it (a separate page / report) also shows categories that include themselves somewhere in their subcategories.
If this is done and with the two other reports announced above there would then be reports for nearly all issues with categories. After these have been solved to some extent what would remain are the issue of categories with just 1 file (debatable whether that's preferable over just categorizing the file like the cat), empty categories (see Quarry:query/7200 and many/most of them should be deleted), and mismatches between WMC and WP (mainly ENWP) categories which often means some cats of the WMC cat are missing and just need to be synced with WP. --Prototyperspective (talk) 14:59, 10 October 2024 (UTC)
- A bot could attempt to remove them directly.
∞∞ Enhancing999 (talk) 16:15, 10 October 2024 (UTC)- That would be great. It would only fail for categories that have themselves set via templates. If you neither create such a bot nor request one, I'll sooner or later propose one at Commons:Bots/Work requests. Some of these cats may benefit from human checking (because they lack cats) but that's not worth the time required and may get addressed via the mentioned category-mismatch detection that could implemented at some point. Prototyperspective (talk) 16:36, 10 October 2024 (UTC)
- If no other categories remain, the category will end up on Commons:Report Special:UncategorizedCategories.
- As I wont create one and the request seems uncontroversial, I think you could add it to "work requests" directly.
∞∞ Enhancing999 (talk) 16:40, 10 October 2024 (UTC)- It's not about categories with no remaining issues, it's about the main category missing. For example, in the taxonomic Category:Phaeolepiota aurea Category:Agaricaceae was missing or Category:Rendered texts in Gothic script did not have Category:Gothic script or any subcat of it set. The request may be uncontroversial but people aren't even implementing the other requests on that page so adding another one quickly would overwhelm capacities more and make the page longer causing people to not read the existing requests. Prototyperspective (talk) 17:25, 10 October 2024 (UTC)
- That would be great. It would only fail for categories that have themselves set via templates. If you neither create such a bot nor request one, I'll sooner or later propose one at Commons:Bots/Work requests. Some of these cats may benefit from human checking (because they lack cats) but that's not worth the time required and may get addressed via the mentioned category-mismatch detection that could implemented at some point. Prototyperspective (talk) 16:36, 10 October 2024 (UTC)
- Seems the majority might be template based categorization. I tried to fix some (or most) of the others.
∞∞ Enhancing999 (talk) 07:31, 11 October 2024 (UTC)- Thanks! I don't know what the query for it is and hope MZMcBride updates it again at some point. Categories that contain themselves in some subcategory are just as much of a problem (e.g. cause deepcategory to fail and are logically contradictory) but aren't included there. If somebody is interested in these or can adjust the query (which is it?) to also scan several layers of subcategories, please see the report's talk page. When it comes to templates I suggest people who added the template are pinged or it's asked about at the template's talk page – often the best solution is to comment out or remove the template and replace it with the categories that it automatically sets except of the self-categorizing cat. Prototyperspective (talk) 09:27, 11 October 2024 (UTC)
- Not sure, maybe the templates can be fixed to do it more selectively.
- The query might be from w:Wikipedia:Database reports/Self-categorized categories/Configuration.
- Not sure if loops can be avoided entirely, but we could try to identify the categories where the parent category is also a subcategory.
∞∞ Enhancing999 (talk) 12:15, 11 October 2024 (UTC)- Thanks for the link, I've seen it before but don't know where I found the link and forgot about it – however, that's just some code without any info on how to run it. Can it be run in the Quarry tool? Loops can be avoided by putting all the scanned categories in an array and the checking whether the category currently checked is in that array. If so, it would not look into subcategories of that category and show it as one of the categories that contain themselves. This way it would scan layer by layer and the number of layers to scan could be increased over time. Prototyperspective (talk) 12:20, 11 October 2024 (UTC)
- In the meantime, I ran it: quarry:query/87026. It's down to 612 categories now.
∞∞ Enhancing999 (talk) 12:24, 11 October 2024 (UTC)- All direct inclusions should be fixed now (if my regex missed some, please tell me).
- This means that the remaining cases would be template based (151 categories), Some template talk pages have notes about the problem: [2].
∞∞ Enhancing999 (talk) 07:44, 12 October 2024 (UTC)
- In the meantime, I ran it: quarry:query/87026. It's down to 612 categories now.
- Here is a query for pages with parent category equaling subcategory: Quarry:history/87030/938798/910988.
- It has 7448 such pairs, sample: Category:(26)_Proserpina_symbols + Category:Persephone_symbols.
∞∞ Enhancing999 (talk) 04:23, 12 October 2024 (UTC)- Great! So this is already solved? Could you please put that in a separate query instead of just in some revision of a query about something else and then put that in some report?
- Putting it in a report on Commons is also useful for people here to find that query, not just for accessing the results and working on fixing the issues. When it comes to fixing the issues what is still needed is showing a categorization path per each item showing why the second category is located somewhere in the former like in the attached image. Do you think it would be possible to add a column for that in the query? It would also be relevant to this request for a new tool/gadget and this issue for petscan. Such is one of the most needed functionalities to improve categorizations and petscan often shows files that do not belong into categories without a way to find out why (examples in the two links). Prototyperspective (talk) 13:17, 12 October 2024 (UTC)
- Maybe we should try to reduce the list before making in into a report .. not quite sure how though.
∞∞ Enhancing999 (talk) 16:46, 12 October 2024 (UTC)- If you won't forget about it and keep track of all the relevant info and the query then that may be a good idea. I don't think there's an issue with putting it into a report if nobody finds it anyway. A categorization path is also needed for e.g. the two other things I linked and maybe that could be used there. Didn't you say the number of self-categorizations is way down now and the remaining ones are due to templates? In that case it seems like the problem is already solved: making template creators/writers aware of the problem on the template talk pages so the problem gets fixed at the template and/or simply commenting out the template on the respective pages to replace them with the other cats. If you mean the 7500 cats that contain themselves somewhere, I don't think a bot could correct [many of] these self-categorizations – it does need quick human checking and that is best done by enabling users to only need very little time per correction (quick succession editing) including displaying the necessary info of the categorization path. The optimal solution would be displaying the categorization path with each cat having an x button to remove the cat that caused the self-categorization but requiring the user to click the cat and remove it with a second click is also good. Prototyperspective (talk) 17:21, 12 October 2024 (UTC)
- They are all pairs as the sample above (Category:(26)_Proserpina_symbols + Category:Persephone_symbols). Longer chains are not include (e.g. grandparent = child).
- For some the fix is obvious (Category:Aerial_photographs_of_Bernrieth and Category:Bernrieth).
∞∞ Enhancing999 (talk) 17:43, 12 October 2024 (UTC)- Oh too bad, I thought I checked some and it was not a direct self-categorization and also I don't understand then why the query shows so many results while the report only has a few hundred or so or 151 remaining ones. In that case maybe it does need a report with two columns with a button to directly remove the problematic cat on either cat directly, maybe this can can be done with some specific URL and the URL could be attached next to each wikilink. No other idea, e.g. I think even for your example case where it's quite obvious to a human it would be difficult to a bot (one that has some chance of getting developed). Prototyperspective (talk) 17:50, 12 October 2024 (UTC)
- The 150 are just the ones that are self-categorizations (1 category involved, as your report above).
- The 7500 involve 2 categories (parent=child).
∞∞ Enhancing999 (talk) 17:53, 12 October 2024 (UTC)
- Oh too bad, I thought I checked some and it was not a direct self-categorization and also I don't understand then why the query shows so many results while the report only has a few hundred or so or 151 remaining ones. In that case maybe it does need a report with two columns with a button to directly remove the problematic cat on either cat directly, maybe this can can be done with some specific URL and the URL could be attached next to each wikilink. No other idea, e.g. I think even for your example case where it's quite obvious to a human it would be difficult to a bot (one that has some chance of getting developed). Prototyperspective (talk) 17:50, 12 October 2024 (UTC)
- If you won't forget about it and keep track of all the relevant info and the query then that may be a good idea. I don't think there's an issue with putting it into a report if nobody finds it anyway. A categorization path is also needed for e.g. the two other things I linked and maybe that could be used there. Didn't you say the number of self-categorizations is way down now and the remaining ones are due to templates? In that case it seems like the problem is already solved: making template creators/writers aware of the problem on the template talk pages so the problem gets fixed at the template and/or simply commenting out the template on the respective pages to replace them with the other cats. If you mean the 7500 cats that contain themselves somewhere, I don't think a bot could correct [many of] these self-categorizations – it does need quick human checking and that is best done by enabling users to only need very little time per correction (quick succession editing) including displaying the necessary info of the categorization path. The optimal solution would be displaying the categorization path with each cat having an x button to remove the cat that caused the self-categorization but requiring the user to click the cat and remove it with a second click is also good. Prototyperspective (talk) 17:21, 12 October 2024 (UTC)
- Maybe we should try to reduce the list before making in into a report .. not quite sure how though.
- Thanks for the link, I've seen it before but don't know where I found the link and forgot about it – however, that's just some code without any info on how to run it. Can it be run in the Quarry tool? Loops can be avoided by putting all the scanned categories in an array and the checking whether the category currently checked is in that array. If so, it would not look into subcategories of that category and show it as one of the categories that contain themselves. This way it would scan layer by layer and the number of layers to scan could be increased over time. Prototyperspective (talk) 12:20, 11 October 2024 (UTC)
- Thanks! I don't know what the query for it is and hope MZMcBride updates it again at some point. Categories that contain themselves in some subcategory are just as much of a problem (e.g. cause deepcategory to fail and are logically contradictory) but aren't included there. If somebody is interested in these or can adjust the query (which is it?) to also scan several layers of subcategories, please see the report's talk page. When it comes to templates I suggest people who added the template are pinged or it's asked about at the template's talk page – often the best solution is to comment out or remove the template and replace it with the categories that it automatically sets except of the self-categorizing cat. Prototyperspective (talk) 09:27, 11 October 2024 (UTC)
Stacking hands
Any category exists for Diverse people stacking hands together.jpg? RoyZuo (talk) 17:23, 10 October 2024 (UTC)
- Several categories have been added to the image for you. Bastique ☎ appelez-moi! 20:03, 10 October 2024 (UTC)
Categories vs subjects
In the Upload Wizard there are separate fields for "Main subjects visible in this work" and "Category" (which possibly should read "Categories"). I wonder why it is necessary to have both. In what case would (or could) a "main subject visible" NOT be a category? ITookSomePhotos (talk) 17:59, 10 October 2024 (UTC)
- Agree, it's duplication of work. I think categories should automatically sync to structured data if SD is to be used at all. However, note that a file may not depict some things that are in a category...however people also add depict statements for such things (e.g. when a video is only about sth but not depicting it) and there's also categories that are not about the contents in terms of what is being depicted such as Category:Videos without audio. Prototyperspective (talk) 18:50, 10 October 2024 (UTC)
- If I take a picture of a door handle inside a building, it may belong in the category for the building, but it does not depict the building. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 14:01, 11 October 2024 (UTC)
- Concur with Andy Mabbett. Ditto for an event at the building where it is mainly a photo of people, probably for a floor plan, certainly for a building permit, etc. - Jmabel ! talk 14:08, 11 October 2024 (UTC)
- It makes sense and I don't want to argue against it...I'd just like to leave a note about four things to consider:
- people often set SD for depicts: the building (using that example)
- when it has some depicts statements set the main thing depicted (door handle using that example) may not be in the SD while it's in the cats if cats are set in ca >95% of cases
- the category could (often should) have a dedicated subcategory for pictures like that
- the depicted thing can be inferred from the combination of categories set (e.g. close-up photos of door handles + building xy + other cats = pic depicts door handle of that building)
- Prototyperspective (talk) 15:19, 11 October 2024 (UTC)
- It makes sense and I don't want to argue against it...I'd just like to leave a note about four things to consider:
- Note that I deliberately asked how a "main subject visible" would or could not be a category, and not the other way around. In the case of the door handle, still, my point holds. We would have to specify "door handle" twice, once as a category, and once as a depicted object, apparently redundantly. ITookSomePhotos (talk) 17:25, 11 October 2024 (UTC)
- Though I guess the way things are set up now, if we only specified the categories, "door handle" and "building X", and no separate "subjects visible" information, then we would lose the information that "building X" is NOT visible. If this is important to capture, then I suppose in an ideal world we would specify visible objects once, and then have another field for "other categories". ITookSomePhotos (talk) 17:54, 11 October 2024 (UTC)
- Concur with Andy Mabbett. Ditto for an event at the building where it is mainly a photo of people, probably for a floor plan, certainly for a building permit, etc. - Jmabel ! talk 14:08, 11 October 2024 (UTC)
Cities in Belgium by year
Good evening! Who can help distribute of categoties for Belgian cities by year with following for example of Germany's category. This template for categories on {{Belgiumbycity}}. MasterRus21thCentury (talk) 18:30, 10 October 2024 (UTC)
October 11
Red fruits or flowers?
On a closer look it looks like to be flowers. Google Lens does not help me much. It shows a lot of commercial results with red things and little certainty. I got one Commons result, File:Ilex serrata7.jpg but this is not it.Smiley.toerist (talk) 12:28, 11 October 2024 (UTC)
- I suggest you use another plant identification tool instead of asking humans here. These tools could also be useful for suggesting categories and I think several work better for plants than Google Lens but I don't know if there's any that is free software. Prototyperspective (talk) 12:48, 11 October 2024 (UTC)
- @Smiley.toerist: This is the Euonymus europaeus, in Dutch it's known as a "wilde kardinaalsmuts". The red parts resemble flowers but are actualy its fruits, and the orange are its seeds. ReneeWrites (talk) 13:18, 11 October 2024 (UTC)
- For future reference, Com:iNaturalist. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:58, 11 October 2024 (UTC)
Category:Raster quick response codes (QR codes category)
i believe we have enough images that represents what QR codes looks like. and also there is many more redundant files. right now, 358 files. i believe we should do something about this. Category:Raster quick response codes. we should make DR'd these redundant files. and, i didnt check, also, i believe there is plenty of advertisements out there. modern_primat ඞඞඞ ----TALK 17:58, 11 October 2024 (UTC)
- Support. I'll go through these and open a batch DR later today. FWIW, you can decode most of these by pasting the image URL into Zxing - much easier than scanning them off the computer screen. Omphalographer (talk) 19:03, 11 October 2024 (UTC)
- Done - Commons:Deletion requests/Files in Category:Raster quick response codes. I've included the decoded content of most of the images. Omphalographer (talk) 03:17, 12 October 2024 (UTC)
- @Omphalographer brother, thats sucha hard work. Thank you.! modern_primat ඞඞඞ ----TALK 05:42, 12 October 2024 (UTC)
- Done - Commons:Deletion requests/Files in Category:Raster quick response codes. I've included the decoded content of most of the images. Omphalographer (talk) 03:17, 12 October 2024 (UTC)
Template:YouTubeReview change
I know the template is deprecated. But it’s still being used in lot of files. So I want someone to help by modifying the texts in Template:YouTubeReview/i18n. As User:YouTubeReviewBot is currently blocked, so it also need to be removed from the text or change to what is necessary. Regards, –TANBIRUZZAMAN (💬) 21:19, 11 October 2024 (UTC)
October 12
Tracking deletion requests concerning WLM-2024 files
As it is becoming evident that some submitted files do infringe on sculptural copyrights due to lack of FoP in their countries (I nominated some myself), is it appropriate to create tracking categories of deletion requests concerning WLM-2024-related files?
Proposed category:
- Category:Wiki Loves Monuments 2024-related deletion requests, to be split into deleted and pending subcategories (kept may come later, if some requests closed as kept).
- To be categorized under Category:Deletion requests by specific subject/Buildings and monuments.
In this way, organizers may be able to reassess their rules for future competitions and some may even try the stepping stones to reform their copyright laws. We may no longer be able to categorize those of past contests due to the deleted files not being visible to non-sysops like me.
Ping @Ciell: (who pioneered the contests way back in 2010 in the Netherlands) if they agree to this or not. _ JWilz12345 (Talk|Contrib's.) 00:50, 12 October 2024 (UTC)
- Support It's to bad the same thing can't be done for previous competitions. --Adamant1 (talk) 00:56, 12 October 2024 (UTC)
- I think this should be done on the level of the local contests and not on the global level. GPSLeo (talk) 06:09, 12 October 2024 (UTC)
- @Adamant1@GPSLeo on a second thought, I think it won't help much. With little to no initiatives of various Wikimedia chapters and local user groups as well as unwillingness of several countries to permit full exposure of their public spaces in commercially-available media, I don't think we can expect improvements in copyright laws very soon. Even the U.S. architectural FoP is not immune to criticism, albeit from a small sector composed of some architects and lawyers who are critical of the 1990 AWCPA as having deprived architects the right to control the U.S. public's images of U.S. buildings. Expect some "unsurprising surprises" on FoP statuses of some countries. Withdrawing my categorization proposal, to reduce some stress on my part. JWilz12345 (Talk|Contrib's.) 06:24, 12 October 2024 (UTC)
Organisation logo to Commons
I uploaded a logo and it is stored locally on the English Wikipedia (en:File:Linpra logo.png). I'd like to upload it to Commons with the same rationale as logos from Volvo, Ikea, Saab and others. How do I upload it to Commons?
Edit: I see that Nokia, Kimberly-Clark, Docusign, Klarna, Decathlon, Volvo and others are uploaded with the Upload Wizard. Even though Upload Wizard specifically asks to not upload logos. Is it ok to use Upload Wizard then if the logo clearly meets the licensing criteria listed below? — Preceding unsigned comment added by Drifting in Music (talk • contribs) 09:50, 12 October 2024 (UTC)
==={{int:license-header}}===
{{tl|PD-textlogo}}
{{tl|Trademarked}}
— Preceding unsigned comment added by Drifting in Music (talk • contribs) 09:25, 12 October 2024 (UTC)
- I don't see any documentation of the threshold of originality for Lithuania. Does someone have something? For the U.S. it should be OK. - Jmabel ! talk 16:20, 12 October 2024 (UTC)
- Thank you for the reply @Jmabel.I did an online search but have no conclusive results. I checked a bunch of prominent Lithuanian organisations that have logotypes on Commons and all except two used "Upload Wizzard", e.g. – Lithuanian National Television, Lithuanian National Radio and Television, Mailer Lite, Vinted, Paysera, CarVertical and NordVPN. The two exceptions among my sample were TV3 and Baltic Amadeus.
- The absolute majority seem to be using UploadWizzard. Using the same logic, could I simply upload the logo using the UploadWizard? Drifting in Music (talk) 17:10, 12 October 2024 (UTC)
- From a technical point of view, of course you could, but that doesn't make the copyright status in Lithuania any clearer. - Jmabel ! talk 20:11, 12 October 2024 (UTC)
Music band in Innsbruck
I am trying to find wich music band was playing. Websites such as https://www.songkick.com/metro-areas/26767-austria-innsbruck/october-2024 do not go back to to past events. I do have recordings (free acces to the podium) but I suppose it is not permitted to upload these to the Commons. There is no Freedom of Panorama for sounds.Smiley.toerist (talk) 11:21, 12 October 2024 (UTC)
- There is no Freedom of Panorama for sounds Does somebody have info on that? Videos of live music would be useful often and also there are several/many videos of such which if they aren't free would need to be deleted (example). I am trying to find wich music band was playing Use the Wayback Machine when it's functional – if they don't have the relevant page archived and other archival websites like archive.today and the Google Cache don't have neither and Web search engines don't show anything for relevant searches then you're probably out of luck but I don't see any problem with that since it's unlikely to be a very notable band where a photo is much needed and I think most such bands already have some photos on WMC. Prototyperspective (talk) 12:22, 12 October 2024 (UTC)
- A band playing not permanent and therefore definitely not covered by FOP in most regions as FOP requires works to be permanent in public space. For cases like bell installations it could be a difficult question. GPSLeo (talk) 11:22, 13 October 2024 (UTC)
- Thanks for the info. I wonder about the music but I think it can't be copyrighted if it's played in a public space. Nevertheless, I don't know if a concert etc is a public space in regards to whether videos of the live music can be uploaded under CCBY. Prototyperspective (talk) 11:45, 13 October 2024 (UTC)
- A band playing not permanent and therefore definitely not covered by FOP in most regions as FOP requires works to be permanent in public space. For cases like bell installations it could be a difficult question. GPSLeo (talk) 11:22, 13 October 2024 (UTC)
Change in editing behaviour for keyboard shortcuts?
I've probably missed an announcement, but now if I try to copy something with command-C I get ''Italic text'', and if I paste something with command-V I get <sup>Superscript text</sup> - any pointers to where this changed, and how to disable it? Thanks. Mike Peel (talk) 22:10, 12 October 2024 (UTC)
- With more investigation, this seems to be related to my keyboard overlay - if I switch to querty, it works fine, but with dvorak, it isn't. Weird. (I becomes C and > becomes V with dvorak, so the keyboard shortcuts then make sense...) Thanks. Mike Peel (talk) 16:44, 13 October 2024 (UTC)
Could someone verify if we are actually allowed to host all these files or do i have to start a DR?--Trade (talk) 23:37, 12 October 2024 (UTC)
- Seems plausible. The linked web site has a footer which states Музыка, тексты и изображения сайта распространяются по лицензии Creative Commons BY-SA 4.0. Next question is whether it's in scope; I can't help but notice that the artist doesn't have an article on any Wikipedia. Omphalographer (talk) 01:53, 13 October 2024 (UTC)
- Would you mind doing license review of the files in the category? Trade (talk) 03:32, 13 October 2024 (UTC)
October 13
Unidentified train station
The video ends with the arrival in a small train stop. A have been checking the stations in the en:Kyūdai Main Line article, but not finding it. Nearly all the bigger stations have a footbridge and this one does not. There is a Japanese text, but I am not certain that this the station name. Is there a handy way the extract an image from a video? I can do a print screen, but I am not certain this is the best way. (I am not used to working with videos). Is there an Japanese/English speaking forum, where I can ask the question? Unfortunatly OpenRailmap does not have an translate function. Smiley.toerist (talk) 08:29, 13 October 2024 (UTC)
- Is there a handy way the extract an image from a video? In Firefox you can simply do what is described in Category:Still images from videos by Terra X. Alternatively, you can open the video locally and press shift+s (or ctrl+s?). A tool to directly upload stills from videos on WMC would be handy (especially for the linked cat). Prototyperspective (talk) 10:11, 13 October 2024 (UTC)
- In Firefox: Pause video; right click; "Take snapshot". Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:07, 13 October 2024 (UTC)
- Note: It will be lower resolution than the video if one does not click Original file first as described in the cat. This may be irrelevant in many cases but when uploading stills to WMC the stills shouldn't be lower-quality than the video. Prototyperspective (talk) 11:10, 13 October 2024 (UTC)
- I edited the thumbnail to the right of the section to show it here.
∞∞ Enhancing999 (talk) 11:23, 13 October 2024 (UTC)
- I edited the thumbnail to the right of the section to show it here.
- Note: It will be lower resolution than the video if one does not click Original file first as described in the cat. This may be irrelevant in many cases but when uploading stills to WMC the stills shouldn't be lower-quality than the video. Prototyperspective (talk) 11:10, 13 October 2024 (UTC)
- In Firefox: Pause video; right click; "Take snapshot". Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:07, 13 October 2024 (UTC)
- Found it: Bungo-Kokubu train station at 33.194799,131.550949, the video starts at 33.201671,131.557507 . en:Kyūdai Main Line GeorgDerReisende (talk) 11:46, 13 October 2024 (UTC)
- If you like specifying train stations, this video (among several others) shows many of them. Maybe some of the places and villages shown could also be identified but I wonder about which use-cases that has, at least in the current form. Prototyperspective (talk) 20:27, 13 October 2024 (UTC)
- I would be handy to remove the long tunnel sections wich show only a black screen.Smiley.toerist (talk) 22:11, 13 October 2024 (UTC)
- I used the last modified time parameter of the original MP4 file, to add the correct time. I have added the start location. Is there a format to indicate the start and end location of a video? I have experimented with timetext in Dutch. Can I add the English text or is a completely different timetext necessary?Smiley.toerist (talk) 11:42, 14 October 2024 (UTC)
Page about empty categories?
Is there any Help/Commons page about empty categories? I'd like to ask if these are noindexed and if not propose that they are made noindexed.
Moreover, at some point it may be good to discuss what to do about empty categories created over e.g. 2 years ago of which there seem to be many.
--Prototyperspective (talk) 11:14, 13 October 2024 (UTC)
- Commons:Help_desk/Archive/2024/08#Why_should_we_keep_unused_empty_categories,_instead_of_deleting_them?.
∞∞ Enhancing999 (talk) 11:33, 13 October 2024 (UTC)
What to do with advertising in the description?
With this usable photo, the description is pure advertising. This is something I come across more often. What to do? Leave it as it is or reduce the text in this case to “Machu Picchu”? Wouter (talk) 17:43, 13 October 2024 (UTC)
- This is common in YouTube video descriptions which are abused for all kinds of spam and junk as well. Ranked:
- A meaningful description "Tourists posing on steps at Machu Picchu in 2023"
- Generic "Machu Picchu"
- Whatever spam-y noise someone initially inserted.
- —Justin (koavf)❤T☮C☺M☯ 17:58, 13 October 2024 (UTC)
- The text doesn't seem to have any relation to the image: delete beyond "Tourists at Machu Picchu".
∞∞ Enhancing999 (talk) 18:00, 13 October 2024 (UTC)- Thanks, problem solved. Wouter (talk) 08:16, 14 October 2024 (UTC)
Best way to upload a derived image?
I often do photo editing on low-quality images and upload better versions (corrected exposure, etc). For example, File:Admiralty Centre Tower 2 (adjusted).jpg Is there an easy way to do this? Its a pain to have to manually go through the upload wizard, copy over the original author, license, description, generate a new file name with "(adjusted)" stuffed into the original name, etc. It seems like some bit of javascript could do all of this with one click. Does such a thing exist? — Preceding unsigned comment added by RoySmith (talk • contribs) 19:52, 13 October 2024 (UTC)
- Others might be able to point you to a better tool, but the Basic Upload Form in Special:Upload might be useful for you. You can copy the entire description from one file and paste it here. This should almost definitely save you some clicks. Rubýñ (Scold) 20:25, 13 October 2024 (UTC)
- If I am cropping a file already on Commons, Commons:CropTool is good. If I am overwriting an image, User talk:Rillke/bigChunkedUpload.js is far less cumbersome than the normal upload interface. —Justin (koavf)❤T☮C☺M☯ 20:34, 13 October 2024 (UTC)
October 14
Preliminary results of the 2024 Wikimedia Foundation Board of Trustees elections
Hello all,
Thank you to everyone who participated in the 2024 Wikimedia Foundation Board of Trustees election. Close to 6000 community members from more than 180 wiki projects have voted.
The following four candidates were the most voted:
While these candidates have been ranked through the vote, they still need to be appointed to the Board of Trustees. They need to pass a successful background check and meet the qualifications outlined in the Bylaws. New trustees will be appointed at the next Board meeting in December 2024.
Learn more about the results on Meta-Wiki.
Best regards,
The Elections Committee and Board Selection Working Group
MPossoupe_(WMF) 08:24, 14 October 2024 (UTC)
Copyright question regarding document from "Memory of the People" (pamyat-naroda.ru)
Hi all,
I've uploaded a PDF document to Wikimedia Commons that I downloaded from the Russian "Memory of the People" website (pamyat-naroda.ru). This document details the World War II service record of my grandfather, Georgiy Pavlovich Osipov, and includes information about his unit, awards, and activities during the Siege of Leningrad.
The "Memory of the People" website states the following copyright information:
Правовая информация
Правообладателем ИС «Память народа», включая входящие в ее состав программы для ЭВМ, электронные банки (базы) данных, включающие электронные копии архивных документов, является Российская Федерация в лице Министерства обороны Российской Федерации (далее – Правообладатель).
Собственниками архивных документов, электронные копии которых размещены в ИС «Память народа», является Российская Федерация (в части документов, предоставленных российскими государственными архивами, государственными органами и организациями), а также иностранные государственные архивы и иные организации, предоставившие электронные копии этих документов.
Правообладателями аудиовизуальных произведений, предоставленных Российским государственным архивом кинофотодокументов и размещенных на сайтах ИС «Память народа», являются авторы этих произведений.
Оператором сайтов ИС «Память народа» (далее – Оператор) является Министерство обороны Российской Федерации, либо уполномоченная им организация.
Все сервисы, а также информационный контент, предоставляемые сайтами ИС «Память народа», являются государственными информационными услугами, оказываемыми бесплатно.
Обязательным условием использования документов с сайта в изданиях, связанных с историей Великой Отечественной войны, является ссылка на сайт.
English Translation:
Legal Information
The copyright holder of the "Memory of the People" information system, including the computer programs, electronic data banks (databases) containing electronic copies of archival documents, is the Russian Federation, represented by the Ministry of Defense of the Russian Federation (hereinafter referred to as the Copyright Holder).
The owners of the archival documents, electronic copies of which are posted in the "Memory of the People" information system, are the Russian Federation (with respect to documents provided by Russian state archives, state bodies, and organizations), as well as foreign state archives and other organizations that provided electronic copies of these documents.
The copyright holders of audiovisual works provided by the Russian State Archive of Film and Photo Documents and posted on the websites of the "Memory of the People" information system are the authors of these works.
The operator of the websites of the "Memory of the People" information system (hereinafter referred to as the Operator) is the Ministry of Defense of the Russian Federation or an organization authorized by it.
All services, as well as information content provided by the websites of the "Memory of the People" information system, are state information services provided free of charge.
A mandatory condition for the use of documents from the website in publications related to the history of the Great Patriotic War is a reference to the website.
My understanding is that the website considers its services and content to be state-provided information freely available, but requires attribution.
Could someone please advise on the most appropriate license to use? At this point of time, I gave it CC-BY-4.0. I want to ensure full compliance with copyright law and Wikimedia Commons guidelines. Any guidance would be greatly appreciated.
Thank you for your time and expertise.
Sincerely,