TY - GEN
T1 - Text line segmentation for challenging handwritten document images using fully convolutional network
AU - Barakat, Berat
AU - Droby, Ahmad
AU - Kassis, Majeed
AU - El-Sana, Jihad
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/12/5
Y1 - 2018/12/5
N2 - This paper presents a method for text line segmentation of challenging historical manuscript images. These manuscript images contain narrow interline spaces with touching components, interpenetrating vowel signs and inconsistent font types and sizes. In addition, they contain curved, multi-skewed and multi-directed side note lines within a complex page layout. Therefore, bounding polygon labeling would be very difficult and time consuming. Instead we rely on line masks that connect the components on the same text line. Then these line masks are predicted using a Fully Convolutional Network (FCN). In the literature, FCN has been successfully used for text line segmentation of regular handwritten document images. The present paper shows that FCN is useful with challenging manuscript images as well. Using a new evaluation metric that is sensitive to over segmentation as well as under segmentation, testing results on a publicly available challenging handwritten dataset are comparable with the results of a previous work on the same dataset.
AB - This paper presents a method for text line segmentation of challenging historical manuscript images. These manuscript images contain narrow interline spaces with touching components, interpenetrating vowel signs and inconsistent font types and sizes. In addition, they contain curved, multi-skewed and multi-directed side note lines within a complex page layout. Therefore, bounding polygon labeling would be very difficult and time consuming. Instead we rely on line masks that connect the components on the same text line. Then these line masks are predicted using a Fully Convolutional Network (FCN). In the literature, FCN has been successfully used for text line segmentation of regular handwritten document images. The present paper shows that FCN is useful with challenging manuscript images as well. Using a new evaluation metric that is sensitive to over segmentation as well as under segmentation, testing results on a publicly available challenging handwritten dataset are comparable with the results of a previous work on the same dataset.
KW - Fully convolutional network
KW - challenging historical document
KW - text line segmentation
UR - http://www.scopus.com/inward/record.url?scp=85060021499&partnerID=8YFLogxK
U2 - 10.1109/ICFHR-2018.2018.00072
DO - 10.1109/ICFHR-2018.2018.00072
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85060021499
T3 - Proceedings of International Conference on Frontiers in Handwriting Recognition, ICFHR
SP - 374
EP - 379
BT - Proceedings - 2018 16th International Conference on Frontiers in Handwriting Recognition, ICFHR 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th International Conference on Frontiers in Handwriting Recognition, ICFHR 2018
Y2 - 5 August 2018 through 8 August 2018
ER -