TY - GEN
T1 - Optical character recognition guided image super resolution
AU - Hildebrandt, Philipp
AU - Schulze, Maximilian
AU - Cohen, Sarel
AU - Doskoč, Vanja
AU - Saabni, Raid
AU - Friedrich, Tobias
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/9/20
Y1 - 2022/9/20
N2 - Recognizing disturbed text in real-life images is a difficult problem, as information that is missing due to low resolution or out-of-focus text has to be recreated. Combining text super-resolution and optical character recognition deep learning models can be a valuable tool to enlarge and enhance text images for better readability, as well as recognize text automatically afterwards. We achieve improved peak signal-to-noise ratio and text recognition accuracy scores over a state-of-the-art text super-resolution model TBSRN on the real-world low-resolution dataset TextZoom while having a smaller theoretical model size due to the usage of quantization techniques. In addition, we show how different training strategies influence the performance of the resulting model.
AB - Recognizing disturbed text in real-life images is a difficult problem, as information that is missing due to low resolution or out-of-focus text has to be recreated. Combining text super-resolution and optical character recognition deep learning models can be a valuable tool to enlarge and enhance text images for better readability, as well as recognize text automatically afterwards. We achieve improved peak signal-to-noise ratio and text recognition accuracy scores over a state-of-the-art text super-resolution model TBSRN on the real-world low-resolution dataset TextZoom while having a smaller theoretical model size due to the usage of quantization techniques. In addition, we show how different training strategies influence the performance of the resulting model.
KW - deep learning
KW - image super-resolution
KW - optical character recognition
KW - unfocused images
UR - http://www.scopus.com/inward/record.url?scp=85143154388&partnerID=8YFLogxK
U2 - 10.1145/3558100.3563841
DO - 10.1145/3558100.3563841
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85143154388
T3 - DocEng 2022 - Proceedings of the 2022 ACM Symposium on Document Engineering
BT - DocEng 2022 - Proceedings of the 2022 ACM Symposium on Document Engineering
PB - Association for Computing Machinery, Inc
T2 - 22nd ACM Symposium on Document Engineering, DocEng 2022
Y2 - 20 September 2022 through 23 September 2022
ER -