Document Layout Error Rate (DLER) metric to Evaluate Image Segmentation Methods

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Scholarly editions play a crucial role in humanities research, particularly in
the study of literature and historical documents. The primary objective of
these editions is to reconstruct the original text or provide insights into the
author’s intentions. Traditionally, crafting a critical edition required a life
time of dedication. However, thanks to recent advancements in deep learning
and computer vision, modern text recognition tools can now be used to ex
pedite this process. A key part of these tools is document layout analysis
(DLA), where image segmentation methods are used to detect different text
elements. Most existing DLA solutions have focused on evaluating the accu
racy of these methods, often neglecting to study the practical consequences
of method selection. In this study, we have developed a new metric, the Doc
ument Layout Error Rate (DLER), which evaluates the performance of fine
grained DLA methods within the overall pipeline. This metric helps identify
the method with the lowest error rate, thereby minimizing the manual effort
required for corrections. We applied this evaluation method to assess four
different methods and their efficacy for the DLA task in the context of David
Hume’s History of England.
Original languageEnglish
Article number100606
JournalMachine learning with applications
Volume18
Pages (from-to)1-28
Number of pages10
ISSN2666-8270
DOIs
Publication statusPublished - Dec 2024
MoE publication typeA1 Journal article-refereed

Fields of Science

  • 113 Computer and information sciences
  • Computer vision
  • Deep learning
  • Document layout analysis

Cite this