Commons:Bots/Requests/Smallbot (10)

From Wikimedia Commons, the free media repository

Smallbot (talk · contribs)

Operator: Smallman12q (talk · contributions)

Bot's tasks for which permission is being sought: To upload ~500k files from the US National Archives and Records Administration based on a database dump provided in partnership with the Digital Public Library of America.

Yes, We Have No Ambitions Today! - Nara - 1693425.jpg
Author Berryman, Clifford Kennedy, 1869-1949
English: This cartoon plays off a line from a popular 1923 song ("Yes, We Have No Bananas!") to characterize car maker Henry Ford's Presidential ambitions--or lack thereof. Ford blames his busy schedule for his hesitation to jump into the "Presidential contest pool," while eager supporters encourage him to "come on in!" Berryman was correct in his prediction: Ford chose not to pursue the Presidency.
National Archives and Records Administration (wikidata:Q518155)
Location Washington, D.C. (headquarters), and many regional facilities and presidential libraries nationwide in the USA
Coordinates 38° 53′ 34″ N, 77° 01′ 23″ W
Established 1934
Record ID
This media is available in the holdings of the National Archives and Records Administration, cataloged under the National Archives Identifier (NAID) 1693425.

This tag does not indicate the copyright status of the attached work. A normal copyright tag is still required. See Commons:Licensing for more information.


  • NWL-46-BERRYMAN-H009
Source U.S. National Archives and Records Administration
(Reusing this file)
Public domain This work is in the public domain in the United States because it is a work prepared by an officer or employee of the United States Government as part of that person’s official duties under the terms of Title 17, Chapter 1, Section 105 of the US Code. See Copyright.

Note: This only applies to original works of the Federal Government and not to the work of any individual U.S. state, territory, commonwealth, county, municipality, or any other subdivision. This template also does not apply to postage stamp designs published by the United States Postal Service since 1978. (See § 313.6(C)(1) of Compendium of U.S. Copyright Office Practices). It also does not apply to certain US coins; see The US Mint Terms of Use.

This file has been identified as being free of known restrictions under copyright law, including all related and neighboring rights.
Other versions

Please do not overwrite this file: any restoration work should be uploaded with a new name and linked in this page's "other versions=" parameter, so that this file represents the exact file found in the NARA catalog record to which it links. The metadata on this page was imported directly from NARA's catalog record; additional descriptive text may be added by Wikimedians to the template below with the "description=" parameter, but please do not modify the other fields.

(Note: Editors who post this notice are strongly encouraged to add details explaining how it applies to this file.)
Please help us by reporting errors! This may include misidentifications, erroneous images, typos in the metadata, possible copyright issues, and poor-quality images needing rescanning. (Be aware that, for documentary purposes, NARA often retains the original image captions, which may be erroneous, biased, or even misspelled.)

Automatic or manually assisted: Automatic

Edit type (e.g. Continuous, daily, one time run): One time

Maximum edit rate (e.g. edits per minute): 10-15, as fast as it uploads

Bot flag requested: (Y/N): No

Programming language(s): Python 3.2

Will use metadata from the DPLA bulk download for NARA. The metadata is in JSON, and the bot converts and formats it to fit the template.

Smallman12q (talk) 23:41, 8 May 2013 (UTC)
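
The JSON-to-template step described above could look roughly like the following. This is only a minimal sketch, not the bot's actual code (the source was never published in this request). The record keys (`title`, `description`, `creator`, `naid`, `local_id`), the one-record-per-line dump layout, the template name, and the parameter names are all assumptions made for illustration; they are not taken from the DPLA schema or the Commons template documentation.

```python
import json

# Hypothetical mapping from template parameter to record key.
# Neither side is confirmed by the request; adjust to the real schema.
FIELD_MAP = [
    ("Title", "title"),
    ("Description", "description"),
    ("Author", "creator"),
    ("NAID", "naid"),
    ("Local identifier", "local_id"),
]


def records_from_lines(lines):
    """Parse a dump that is assumed to hold one JSON object per line."""
    return [json.loads(line) for line in lines if line.strip()]


def record_to_wikitext(record, template="NARA-image-full"):
    """Render one metadata record as a wikitext template call."""
    out = ["{{" + template]
    for param, key in FIELD_MAP:
        value = str(record.get(key, ""))
        # Escape pipes so a value cannot start a new template parameter.
        value = value.replace("|", "{{!}}")
        out.append("|{}={}".format(param, value))
    out.append("}}")
    return "\n".join(out)
```

Missing keys are rendered as empty parameters rather than dropped, so every page carries the same set of fields and gaps are easy to spot during review.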


For reference, a previous NARA batch upload was approved at Commons:Bots/Requests/US National Archives bot. Smallman12q (talk) 23:41, 8 May 2013 (UTC)

  • Yeah sure, looks good to me. --Dschwen (talk) 21:21, 9 May 2013 (UTC)
Usual suggestion: please use language template for Author/Source/Record ID fields. --EugeneZelenko (talk) 13:44, 11 May 2013 (UTC)
  • Please can you put a deeplink in the "source" field, as that is where most editors will look. I tried to get the original of this example, but apparently "The Online Public Access (OPA) system will be down for maintenance from May 10 to May 25.", so we may not be able to thoroughly test this for a couple of weeks. --99of9 (talk) 13:01, 14 May 2013 (UTC)
Yes, I recently heard some details about that as well. I'll try to keep updated on the status. Bdcousineau (talk) 14:50, 16 May 2013 (UTC)
  • What kind of label is: "NWL-46-BERRYMAN-H009"? It might help to add the name of this kind of identifier. --99of9 (talk) 13:02, 14 May 2013 (UTC)
That is an old catalog number used by NARA. It is no longer in use, but since it is in the current template used by NARA on Commons, it has been included. It most likely refers to the "NAIL" database, which was in use prior to ARC/OPA, the current database. For a sample, see File:Football team on the field, Haskell Institute, Lawrence, Kansas, 1914 - NARA - 519149.jpg. Better removed? Bdcousineau (talk) 14:50, 16 May 2013 (UTC)
I'd suggest leaving it in there, but having the template do nothing with it (i.e. not display it). That way we can easily reintroduce it if someone thinks it is useful later. --99of9 (talk) 15:36, 16 May 2013 (UTC)
  • 500k files! Wow, this is huge, congratulations and good luck! --99of9 (talk) 13:09, 14 May 2013 (UTC)
  • Great start! Since this is a large set, and since the metadata will not be perfect (never is for a transfer of this size): are you thinking of staging this? Say, a few hundred to start, then 1k, then 10k, with pauses to see what sort of cleanup is needed? --SJ+ 22:40, 15 May 2013 (UTC)

On hold: As stated at Online Public Access, access to records is suspended from the 10th to the 25th (2 weeks is a loong roll out). Once access is restored, will do an initial batch upload of 100, then 1000, then auto after that. Will also make source available once upload starts. Smallman12q (talk) 00:06, 24 May 2013 (UTC)
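
The staged rollout mentioned above (100, then 1000, then everything else) might be sketched as a generator that hands out one batch per stage, leaving room for a manual review between batches. The function name and the default stage sizes are illustrative assumptions, not the operator's code.

```python
from itertools import islice


def staged_batches(items, stage_sizes=(100, 1000)):
    """Yield one list per stage, then a final list with the remainder."""
    it = iter(items)
    for size in stage_sizes:
        batch = list(islice(it, size))
        if not batch:
            # Nothing left to upload; skip the remaining stages.
            return
        yield batch
    rest = list(it)
    if rest:
        yield rest
```

A caller would upload each yielded batch, pause for cleanup feedback, and only then ask the generator for the next one.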

Can someone explain the process here? What does this have to do with DPLA (which does not host NARA images)? If you are just planning on copying the mostly low-resolution images from the catalog, I think we should slow down and concentrate on acquiring more of the high-resolution TIFF files like we did for the first mass upload. Also, with a separate mass upload based on a different set of source files, how are you planning to prevent uploading tens of thousands of duplicates? Dominic (talk) 16:37, 4 June 2013 (UTC)
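
One partial answer to the duplicate question, offered only as a sketch under stated assumptions: files from the earlier upload appear to carry the NAID in their titles (for example "... - NARA - 519149.jpg"), so records whose NAID already appears in an existing title could be skipped. The helper names and the `naid` record key are hypothetical, and this would not catch duplicates uploaded under other naming schemes; those would need a content comparison such as a file hash.

```python
import re

# Matches the " - NARA - <digits>.<ext>" suffix seen in the sample titles.
NAID_IN_TITLE = re.compile(r"-\s*NARA\s*-\s*(\d+)\.[A-Za-z0-9]+$", re.IGNORECASE)


def naid_from_title(title):
    """Return the NAID embedded in a file title, or None if absent."""
    match = NAID_IN_TITLE.search(title)
    return int(match.group(1)) if match else None


def filter_new(records, existing_titles):
    """Drop records whose NAID already appears in an existing title."""
    seen = {naid_from_title(t) for t in existing_titles}
    seen.discard(None)
    return [r for r in records if r["naid"] not in seen]
```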

@Smallman12q, @99of9, @Dominic: What's the state of this? Is there anything we're waiting for here? odder (talk) 16:17, 16 December 2013 (UTC)
If you ask me, this proposal wasn't very well-formed from the beginning. We already have a full-time staff member inside NARA (myself) who is working on preparing this sort of an upload, and I am working on doing it so we get the high resolution, use the full metadata from their own catalog, not DPLA, and so that it is consistent with the tens of thousands of other uploads already done. I think it is telling that my questions were never answered. Dominic (talk) 16:15, 21 December 2013 (UTC)
Ok, unless User:Smallman12q speaks up soon, I propose that we decline this request given that User:Dominic has something superior in the works. --99of9 (talk) 03:30, 9 January 2014 (UTC)

Declined per above. Also, noting Smallman12q's retirement, I'd like to thank him for all his efforts in bot writing. --99of9 (talk) 03:23, 13 January 2014 (UTC)