TY - JOUR
T1 - NetLay
T2 - Layout Classification Dataset for Enhancing Layout Analysis
AU - Gogawale, Sharva
AU - Bambaci, Luigi
AU - Kurar-Barakat, Berat
AU - Shapira, Daria Vasyutinsky
AU - Ezra, Daniel Stökl Ben
AU - Dershowitz, Nachum
N1 - Publisher Copyright:
© 2024 Gogawale et al.
PY - 2024/12
Y1 - 2024/12
N2 - Within the domain of historical document image analysis, the process of identifying the spatial structure of a document image is an essential step in many document processing tasks, such as optical character recognition and information extraction. Advancements in layout analysis promise to enhance efficiency and accuracy using specialized models tailored to distinct layouts. We introduce NetLay, a new dataset for benchmarking layout classification algorithms for historical works. It consists of over 1,300 images of pages of printed Hebrew (or Hebrew-character) books in a variety of styles, categorized into four different classes based on their layout (the number of text columns and regions). Ground truth was crafted manually at the page level. Furthermore, we conduct an in-depth performance evaluation of various layout classification algorithms, which are based on deep-learning models that learn to extract spatial features from images. We evaluate our algorithms on NetLay and achieve state-of-the-art results on the task of layout classification for historical books.
AB - Within the domain of historical document image analysis, the process of identifying the spatial structure of a document image is an essential step in many document processing tasks, such as optical character recognition and information extraction. Advancements in layout analysis promise to enhance efficiency and accuracy using specialized models tailored to distinct layouts. We introduce NetLay, a new dataset for benchmarking layout classification algorithms for historical works. It consists of over 1,300 images of pages of printed Hebrew (or Hebrew-character) books in a variety of styles, categorized into four different classes based on their layout (the number of text columns and regions). Ground truth was crafted manually at the page level. Furthermore, we conduct an in-depth performance evaluation of various layout classification algorithms, which are based on deep-learning models that learn to extract spatial features from images. We evaluate our algorithms on NetLay and achieve state-of-the-art results on the task of layout classification for historical books.
KW - Convolutional neural networks
KW - Deep learning
KW - Historical document analysis
KW - Layout analysis
KW - Layout classification
KW - Multi-label classification
UR - https://www.scopus.com/pages/publications/85213358505
U2 - 10.30687/mag/2724-3923/2024/02/003
DO - 10.30687/mag/2724-3923/2024/02/003
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85213358505
SN - 2724-3923
VL - 5
SP - 223
EP - 240
JO - Magazen
JF - Magazen
IS - 2
ER -