Methods for Automated Text Digitisation

In this report, the authors survey the tools and techniques available for automating data capture from text elements on specimens. They cover approaches to optical character recognition, handwritten text recognition, language identification, named entity recognition, and terminology extraction.

Additional Info

URL: https://zenodo.org/records/3364502
Task Clusters: Digital Data Capture
Preparations: General
Discipline: General
Audience: Collections Professionals
Language: English
Category: Technical Report
DOI: https://doi.org/10.5281/zenodo.3364501
First Author: David Owen
Other Authors:
  1. Quentin Groom
  2. Alex Hardisty
  3. Thijs Leegwater
  4. Myriam van Walsum
  5. Noortje Wijkamp
  6. Irena Spasić
Publisher: Zenodo
Date Published: 2019-01-31
Version: v1
Accessible for Free: Yes
