Image and text correction using language models

Ido Kissos, Nachum Dershowitz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We report on experiments with the use of learned classifiers for improving OCR accuracy and generating word-level correction candidates. The method involves the simultaneous application of several image- and text- correction models, followed by a performance evaluation that enables the selection of the best image-processing model for each document and the most likely corrections for each word. It relies on a training set comprising document images and their transcriptions, plus a domain corpus used to build the language model. It is applicable to any language with simple segmentation rules and performs well on morphologically-rich languages. Experiments with an Arabic newspaper corpus show a 50% reduction in word error rate, with per-document image enhancement a major contributor.

Original languageEnglish
Title of host publication1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages158-162
Number of pages5
ISBN (Electronic)9781509066285
DOIs
StatePublished - 13 Oct 2017
Event1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017 - Nancy, France
Duration: 3 Apr 20175 Apr 2017

Publication series

Name1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017

Conference

Conference1st IEEE International Workshop on Arabic Script Analysis and Recognition, ASAR 2017
Country/TerritoryFrance
CityNancy
Period3/04/175/04/17

Fingerprint

Dive into the research topics of 'Image and text correction using language models'. Together they form a unique fingerprint.

Cite this