Quantifying the impact of dirty OCR on historical text analysis: Eighteenth Century Collections Online as a case study

Mark John Hill, Simon Hengchen

Research output: Contribution to journal, Article, Scientific, peer-review

Original language: English
Journal: Digital Scholarship in the Humanities
Issue number: 4
Pages (from-to): 825–843
Number of pages: 19
Publication status: Published - 2019
MoE publication type: A1 Journal article-refereed

Fields of Science

  • 615 History and Archaeology
  • 6160 Other humanities
  • 113 Computer and information sciences
