early modern digital collections

A view of the Folger's vault (© @pleasant_peasants)
A view of the Folger’s vault (©pleasant_peasants)

A list of digital collections of early printed books with open-access reuse policies, ranging from public domain to CC BY-SA; descriptions of copyright and licensing are simplifications focused on early modern materials, linked to full information. 1 Know of other open digital collections I should include? Email me. (Want more peeks into the Folger’s collections? Follow @pleasant_peasants’s Instagram! My thanks to Pleasant for permission to use this photo of STC books in the vault.) (updated November 30, 2017)

open-use digital collections

Aggregators

  • The Biodiversity Heritage Library has more than you might think and the interface is easy to use; I like to browse by year and downloading is easy, with helpful info on saving hi-res single-page images (lots of public domain with specifications on each item and clear info on what different statements mean)
  • Digital Public Library of America (DPLA) is not great for browsing, although the timeline option helps, but it pulls together a lot of American libraries (their partners include some of the ones I list elsewhere on this page) that would be exhausting to search individually. Downloadable format and licenses vary according to institution.
  • Europeana is hard to search and hard to browse but it does aggregate from a lot of European collections (their hints might help, and the site is only beta, but the inability to sort results by date makes it hard to separate first editions from later ones)
  • Google Books. Should we talk about clean metadata? And being able to download image files? Well, then. Proceed at your own risk.
  • HathiTrust is better for searching than browsing; quality of imagining varies and some works can only be downloaded from within partner institutions (although non-partners can download individual pages); right-click to save as image not pdf. Early modern works are generally public domain, though see records for details.
  • Internet Archive has plenty of early printed material. Be very careful, though, that what you’re looking at is not a facsimile—don’t trust the metadata but verify in the book itself, at the front and at the back (see, e.g., this book which appears to be a 1541 Hungarian New Testament, but is actually a 1960 facsimile).
    • via Smithsonian, instructions on how to download single pages in high JPEG2000 resolution, rather than a zip of the whole entire book—super handy!
      “1) Click on the link for “Read Full Screen” or “Find in: Internet Archive” located below the book. You should be redirected to a URL like this: http://archive.org/details/butterflybookpop00smholl – find the page you want, and open the image in a new window. Look at its URL and you should now know the page image’s filename, usually something like butterflybookpop00smholl_0008.jpg
      2) Go back to the original URL and replace /details/ with /download/ like so: http://archive.org/download/butterflybookpop00smholl
      3) Now, copy the last part of that URL, which is the book’s identifier. put a / at the end of the URL and then paste in the identifier, followed by _jp2.zip/ (n.b. the trailing slash is important) your URL will now look like http://archive.org/download/butterflybookpop00smholl/butterflybookpop00smholl_jp2.zip/
      4) Hit enter, and you should now see a list of all the individual JPEG2000 images in the book. Download the filename for the image you want.”
    • If you want to download the unprocessed jp2 (which often shows more detail and usually includes the stand holding the item and often a target card) follow the same process but the last part of the url should be _orig_jp2.tar/ (eg, https://archive.org/download/psalteriumcumapp00ratd/psalteriumcumapp00ratd_orig_jp2.tar/)
  • Primeros Libros de las Americas brings together copies (sometimes multiple copies) of books printed in the Americas in the 16th century in an interface that’s easy to browse and navigate; right-click to save jpg, although max is 1000px and you might in some cases get a larger image by going to the owning institution (public domain)
  • Catalogs! Many union catalogs include links to digitized copies; if you’re looking for something specific, those can often be the best place to start. You can browse this list of catalogs of early printed works; you’ll need to visit each one and what they link to in order to determine licensing. A few big or especially nice ones are below:
    • The Universal Short Title Catalogue in its original release covered European imprints through 1600; its beta site is expanding to cover through 1700. Both let you view only records that include links to open-access digitizations; licensing terms vary, but many are public domain or NC.
    • The English Short Title Catalogue includes primarily links to images in the subscription databases EEBO and ECCO, but they are adding OA images. One time-consuming trick if you’re looking for a specific work is to look at the holdings information for a record and then visit each institution’s digital collections.
    • Short Title Catalogus Vlaanderen (STCV) lists pre-1801 Flemish publications (a geographically small but very rich area for the history of printing!); you can view only records with available digitizations; quality, licensing, and ability to download varies.

Wait, I can’t believe you don’t include ……

The following collections aren’t included in my list above because they don’t meet my set of terms: 2

  • Digital Bodleian is chock full of great things, so many of them. But their terms explicitly state that they are “only for non-commercial purposes, including but not limited to private study, research, or teaching and instruction within an educational establishment” [bold theirs; italic mine] and, just to be crystal clear, “For the purposes of this user licence, commercial purposes means any use of the content that is primarily intended for or directed toward commercial advantage or monetary compensation. This includes any use on or in anything that is itself charged for, is connected with something that is charged for or is intended to make a profit.” Since my project is not within an educational establishment and it will be connected to a book that will be charged for, I am SOL here.
  • Biblioteca Nationale Hispánica, the digital collection of the Biblioteca Nacional de España, has great early print and manuscripts, but is mostly CC BY-NC-SA.
  • Cambridge’s University Library, especially the Royal Library and Lines of Thought exhibition, is also chock full of great things and often super contextual information, but it is governed by terms (no direct or indirect commercial use) that put it out of option for me.
  • Gallica, the digital collection of the Bibliothèque nationale de France, is awesome but only for non-commercial use.
  • Harry Ransom Center has some great public domain materials, but with the exception of Double Falshood and their First Folio, their early modern stuff is manuscripts
  • The John Carter Brown Library has amazing early American collections (both North and South American) but non-commercial use only (unless you access copies through Primeros Libros; see above)
  • Linda Hall Library has great science- and technology-related collections but non-commercial only.
  • The Max Planck Institut is CC BY-SA but the images, as far as I can figure out, are on the small side (e.g., 434 x 636 pixels)
  • Missouri Botanical Garden Library’s Botanicus portal is full of great stuff, some of which is lovely hi-res, but CC BY-NC-SA (a contributor to BHL, but sometimes you just want to browse a botanical collection)
  • Das Münchener DigitalisierungsZentrum der Bayerischen Staatsbibliothek (the Munich DigitiZation Center or MDZ in either language) is chock-full of relevant material, some of it hi-res and some of it less ideal, but it’s all for non-commercial use only
  • University of Oklahoma’s History of Science Collections has long had lots of images of early printed book. They are in the midst of implementing a new repository (see the curator’s explanation). I’ll update this once more information about licensing and the launch happens. In the meantime, you can browse their beta site.
  • Die Sächsische Landesbibliothek – Staats- und Universitätsbibliothek Dresden (aka SLUB, apparently) has a huge collection of early printed books, including 15th- , 16th- , 17th- , and 18th- century collections; do not browse the books in the English-language interface (CC BY-SA but downloadable only as pdfs)
notes
  1. I have included a couple of collections that do not state their terms. I have interpreted their “we conform to the law” in light of Bridgeman v Corel, which states that faithful reproductions of public domain texts are themselves public domain. I am not a lawyer, of course, so you should rely on your own—or your legal counsel’s—judgement here.[]
  2. This doesn’t list everything that doesn’t meet my criteria because that would be crazy; it does list the notable ones, especially those that use non-commercial licensing.[]