Hespi

Software used to process images of herbarium sheets by detecting the various elements in a sheet and creating structured data from those with the help of Optical Character Recognition (OCR) and Handwritten Text Recognition (HTR) then using Large Language Models (LLM) to further process and correct the data.

Additional Info

Field Value
URL https://github.com/rbturnbull/hespi
Task Clusters Digital Data Capture
Task Label Transcription
Preparations Herbarium Sheets
Discipline
  • Botany
  • Botany - Seed Plants
Audience
  • Collections Professionals
  • Researchers
Category Command-Line Application
First Author Robert Turnbull
Other Authors
  1. Emily Fitzgerald
Date Modified 2025-08-28
Version v0.6.1
Operating System Platform Independent
Programming Language Python
Maintainer Robert Turnbull

Other versions

This dataset has no data