Skip to main content
Ctrl+K
Trove Data Guide - Home
  • Trove Data Guide
  • About this Guide
  • Who is this for?
  • The possibilities of Trove data

What is Trove?

  • 1. Trove is…
  • 2. Categories and zones
  • 3. Works and versions
  • 4. Collections within collections
  • 5. Links and identifiers
  • 6. Interfaces

Understanding search

  • 7. Understanding search
  • 8. ‘Simple’ search options
  • 9. Date searches
  • 10. Search interface hacks
  • 11. Finding NLA digitised content you can download

Accessing data

  • 12. Data access options
  • 13. Downloading data from the Trove web interface
  • 14. Trove API introduction
  • 15. How to
    • 15.1. HOW TO: Download higher resolution versions of images from the web interface
    • 15.2. HOW TO: Harvest a complete set of search results using the Trove API

Digitised newspapers and gazettes

  • 16. Understanding the digitised newspapers
  • 17. Accessing data
    • 17.1. Articles
    • 17.2. Pages
    • 17.3. Issues
    • 17.4. Titles
  • 18. How to
    • 18.1. HOW TO: Get a newspaper issue or article as a PDF
    • 18.2. HOW TO: Get information about the position of OCRd newspaper text
    • 18.3. HOW TO: Create a dataset of digitised newspaper articles

Other digitised resources

  • 19. Understanding digitised resources
  • 20. Accessing data from digitised resources
  • 21. Books
  • 22. Periodicals
    • 22.1. Overview of periodicals
    • 22.2. Finding digitised periodicals
    • 22.3. Accessing data from periodicals
  • 23. Parliamentary papers
    • 23.1. Overview of Parliamentary Papers
    • 23.2. Finding Parliamentary Papers in Trove
  • 24. Oral histories
    • 24.1. Overview of oral histories
    • 24.2. Accessing data from digitised oral histories
  • 25. How to
    • 25.1. HOW TO: Harvest data relating to digitised resources
    • 25.2. HOW TO: Extract additional metadata from the digitised resource viewer
    • 25.3. HOW TO: Get a list of items from a digitised collection
    • 25.4. HOW TO: Get text, images, and PDFs using Trove’s download link
    • 25.5. HOW TO: Create download links for images using nla.obj identifiers
    • 25.6. HOW TO: Get and use OCR data from a book or periodical page
    • 25.7. HOW TO: Scrape metadata from the Trove audio player

Research pathways

  • 26. Introduction
  • 27. Using text
    • 27.1. Data sources
    • 27.2. Tools and resources
    • 27.3. Tutorials and examples
      • Analysing keywords in Trove’s digitised newspapers
  • 28. Using images
    • 28.1. Data sources
    • 28.2. Tools and resources
    • 28.3. Tutorials and examples
      • Working with a Trove collection in Tropy
      • Comparing manuscript collections in Mirador
  • 29. Collection and system data
    • 29.1. Data sources
    • 29.2. Tools and resources
    • 29.3. Tutorials and examples
  • 30. Maps and places
    • 30.1. Data sources
    • 30.2. Tools and resources
    • 30.3. Tutorials and examples
      • Create a layer in GHAP using metadata from Trove’s digitised maps
  • 31. Creating collections
    • 31.1. Tools and resources
    • 31.2. Tutorials and examples
      • Sharing a Trove List as a CollectionBuilder exhibition
  • Contributing to the Trove Data Guide
  • References
  • Repository
  • Open issue
  • .ipynb

HOW TO: Create download links for images using nla.obj identifiers

Contents

  • 25.5.1. Introduction
  • 25.5.2. Method
  • 25.5.3. More examples
  • 25.5.4. Getting image/page identifiers
  • 25.5.5. Availability of high-resolution TIFF files

25.5. HOW TO: Create download links for images using nla.obj identifiers#

On this page

  • Introduction

  • Method

  • More examples

  • Getting image/page identifiers

  • Availability of high-resolution TIFF files

25.5.1. Introduction#

Many of the resources digitised by the NLA and its partners are made up of images. These might be digitised copies of visual material like photos and maps, or scanned pages of print publications like books or periodicals. In Trove, each image or page has its own unique nla.obj identifier. You can use these identifiers to construct urls that lead directly to downloadable versions of the image file.

25.5.2. Method#

See also

While this method is particularly useful in developing computational processes for downloading and processing images, you can also use it in the web interface to make sure you’re downloading the highest available resolution of an image. See HOW TO: Download higher resolution versions of images from the web interface

To construct a url to an image file you just add a suffix to the identifier url. For example, this photograph of a group of school children with gardening tools has the identifier nla.obj-141828112. To create a direct link to the image, you just add /image to the identifier url:

https://nla.gov.au/nla.obj-141828112/image

The /image suffix is probably the most useful option as it provides access to the image at its highest available resolution. In many cases this will be at a higher resolution than is available through the download option provided by the web interface. There are, however, other possible image suffixes:

url suffix

description

/image

leads to a higher-resolution JPEG version of the image (longest dimension is a maximum of 5000px)

-t

leads to a thumbnail version of the image (usually around 123px wide)

/representativeImage

leads to an image which has been selected to represent a collection

/m

leads to a very high-resolution TIFF version of the image (only available for selected resources, mostly maps)

../../_images/journal-cover-thumbnails.png

Fig. 25.3 An example of using the -t suffix to assemble a collection of periodical cover thumbnails#

There are additional parameters you can use with /image and /representativeImage, though I’m not sure how reliably they work:

parameter

description

wid

desired width in pixels

hei

desired height in pixels

For example: https://nla.gov.au/nla.obj-141828112/image?wid=500

Image sizes

The sizes of images downloaded using the /image suffix vary unpredictably. Sizes seem to range up to a maximum of 5000 pixels along the longest dimension, but some are much smaller, including many digitised photographs. However, images obtained this way are at the same, or higher, resolution than those available through Trove’s built-in download option.

25.5.3. More examples#

This photograph of some angry penguins on Heard Island has the identifier nla.obj-147135602.

../../_images/nla.obj-141171021.jpg

Fig. 25.4 Two Rockhopper Penguins and a predatory Skua, Heard Island, Antarctica, ca. 1930 (by Frank Hurley) http://nla.gov.au/nla.obj-141171021#

The persistent url is created by adding http://nla.gov.au/ to the identifier

http://nla.gov.au/nla.obj-141171021

To view the photograph in Trove’s digitised image viewer, you just add view to the persistent url (this is where the persistent url redirects to anyway)

http://nla.gov.au/nla.obj-141171021/view

To access a thumbnail version of the image, you add -t to the persistent url

http://nla.gov.au/nla.obj-141171021-t

To access a high-resolution version of the image, you add /image to the persistent url

http://nla.gov.au/nla.obj-141171021/image

To access a version of the image that is 1000 pixels wide, you add /image?wid=1000 to the persistent url

http://nla.gov.au/nla.obj-141171021/image?wid=1000

This works the same way with pages in books and periodicals, however, the urls are a bit more complicated. For example, this page in The Home also features a photo of penguins by Frank Hurley. The page’s identifier is nla.obj-387326197.

../../_images/nla.obj-387326197.jpg

Fig. 25.5 ‘Penguin pageant’ by Frank Hurley, The Home, vol. 20, no. 1, January 1940, p. 44 http://nla.gov.au/nla.obj-387326197#

The persistent url is created by adding http://nla.gov.au/ to the identifier

http://nla.gov.au/nla.obj-387326197

If you access the page’s persistent url you are redirected to the issue, with the page identifier included as a partId parameter

https://nla.gov.au/nla.obj-387284380/view?partId=nla.obj-387326197

To access a thumbnail version of the page image, you add -t to the persistent url

http://nla.gov.au/nla.obj-387326197-t

To access a high-resolution version of the page image, you add /image to the persistent url

http://nla.gov.au/nla.obj-387326197/image

To access a version of the image that is 500 pixels high, you add /image?hei=500 to the persistent url

http://nla.gov.au/nla.obj-387326197/image?hei=500

25.5.4. Getting image/page identifiers#

See also

The GLAM Workbench notebook Download a collection of digitised images provides a full working example of obtaining a list of image identifiers from a collection and then downloading each image by adding the /image suffix.

If you want to use this method in a computational process to download all the images in a collection or publication you need some way of finding all the image/page identifiers. The method for doing depends on the type of digitised resource you’re dealing with.

If you’re downloading images from a resource that is made up of pages, such as a book or periodical, you need to:

  • extract the metadata embedded in the digitised book or journal viewer

  • get a list of page identifiers from the metadata

If you’re downloading images from a collection of photographs, maps, or manuscripts, you need to:

  • harvest item identifiers from the digitised collection viewer

In fact, the latter method will also work with books and periodicals because they’re treated like collections of pages, but it’s much more efficient to grab the page identifiers from the embedded metadata.

25.5.5. Availability of high-resolution TIFF files#

See also

According to the Exploring digitised maps in Trove notebook in the GLAM Workbench there are more than 30,000 digitised maps with high-resolution TIFF downloads. The largest weighs in at more than 3gb!

As noted the /m suffix can be used to download huge, high-resolution TIFF versions of some images. I’ve only come across this option amongst the digitised maps, though it could be available elsewhere. If you add the /m suffix to an image that doesn’t have a TIFF version you’ll end up downloading a jpeg placeholder image that says ‘Not available online’. So how can you determine if a TIFF version is available to download? You need to:

  • extract the metadata embedded in the digitised map viewer

  • inspect the copies metadata to find a verion with copyrole set to m and access set to `true

previous

25.4. HOW TO: Get text, images, and PDFs using Trove’s download link

next

25.6. HOW TO: Get and use OCR data from a book or periodical page

Contents
  • 25.5.1. Introduction
  • 25.5.2. Method
  • 25.5.3. More examples
  • 25.5.4. Getting image/page identifiers
  • 25.5.5. Availability of high-resolution TIFF files

By Tim Sherratt

© Copyright 2024 Australian Research Data Commons.

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.

Version: v1.0-beta.16 (07 November 2024)

The Trove Data Guide received investment from the Australian Research Data Commons (ARDC). The ARDC is funded by the National Collaborative Research Infrastructure Strategy (NCRIS).