Conditions
- User needs to identify media records based on specimen record search.
- User needs to download media files automatically.
- Download process must not fail when an unresolvable download URL is encountered.
- Output should be a directory with image files.
- Number of images expected is relatively small (roughly fewer than 100,000).
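As a rough illustration of the conditions above, the following Python sketch downloads a list of media URLs into one directory and records failures instead of stopping at the first unresolvable URL. It is not the solution hosted in this repo. The function name `download_all`, the file naming scheme, and the 30-second timeout are assumptions made for the example.

```python
import http.client
import shutil
import urllib.error
import urllib.parse
import urllib.request
from pathlib import Path


def download_all(urls, out_dir, timeout=30):
    """Download each URL into out_dir and return (saved, failed) lists.

    Hypothetical helper for illustration only.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved, failed = [], []
    for i, url in enumerate(urls):
        # Assumed naming scheme: running index plus the last path segment of the URL.
        basename = Path(urllib.parse.urlparse(url).path).name or "media"
        target = out / f"{i:06d}_{basename}"
        try:
            with urllib.request.urlopen(url, timeout=timeout) as resp, open(target, "wb") as fh:
                shutil.copyfileobj(resp, fh)
            saved.append(target)
        except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError) as exc:
            # Remove any partial file, note the failure, and continue with the next URL.
            target.unlink(missing_ok=True)
            failed.append((url, str(exc)))
    return saved, failed
```

The returned `failed` list pairs each problem URL with its error message, so it can be logged or retried later without rerunning the whole batch.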
Example scenario
A researcher searches for herbarium specimens of species in the genus “Acer” and would like to download the media associated with any records returned by this search. Downloaded media must retain their relationship to the specimen record.
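One possible way to keep that relationship is to encode the specimen and media identifiers in each file name and write a manifest alongside the images. The sketch below assumes the search results have already been reduced to dictionaries with the hypothetical keys `specimen_id`, `media_id`, and `url`. The `.jpg` extension and the `manifest.csv` file name are also assumptions.

```python
import csv
from pathlib import Path

# Hypothetical column layout for the manifest.
FIELDS = ["specimen_id", "media_id", "url", "filename"]


def build_manifest(media_rows, out_dir):
    """Assign a file name to each media record and write manifest.csv in out_dir."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    manifest = []
    for row in media_rows:
        manifest.append({
            "specimen_id": row["specimen_id"],
            "media_id": row["media_id"],
            "url": row["url"],
            # The file name carries both identifiers, so the link to the
            # specimen survives even if the manifest is lost.
            "filename": f"{row['specimen_id']}__{row['media_id']}.jpg",
        })
    with open(out / "manifest.csv", "w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(manifest)
    return manifest
```

Each manifest row can then be joined back to the specimen search results on `specimen_id`.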
Links to external solutions
- Module 3: Acquiring images, Python, as part of BOTANY 2021 Workshop: Using deep learning with digitized herbarium specimen image data
Solutions hosted in this repo
- download-media-from-specimens_ridigbio (view in browser) (download for local use)