We present one of the first attempts at automatic document retrieval in the noisy environment of unconstrained, multiple-author handwritten forms. The documents were written in cursive script, for which conventional OCR and text retrieval engines are not adequate. We focus on a visual word-spotting indexing scheme for scanned documents housed in the Archives of the Indies in Seville, Spain. The framework utilizes pattern recognition, learning, and information fusion methods, and is motivated by studies of human word spotting. The proposed system is described and initial results are presented.
|Number of pages||8|
|State||Published - 1997|
|Event||Proceedings of the 1997 Workshop on Document Image Analysis, DIA - San Juan, Puerto Rico|
|Duration||20 Jun 1997 → 20 Jun 1997|