TY - GEN
T1 - Writer characterization and identification of short modern and historical documents
T2 - 19th ACM Symposium on Document Engineering, DocEng 2019
AU - Faigenbaum-Golovin, Shira
AU - Levin, David
AU - Piasetzky, Eli
AU - Finkelstein, Israel
N1 - Publisher Copyright:
© 2019 Association for Computing Machinery.
PY - 2019/9/23
Y1 - 2019/9/23
N2 - Handwriting is considered a unique “fingerprint” that characterizes a scribe (it is even used as evidence in modern forensics). In paleography (the study of ancient writing), it is presumed that each writer has a one prototype for each letter in the alphabet. Commonly, for ancient inscriptions, letters are organized into paleographic tables (where the rows are the alphabet letters, and the columns represent the examined inscriptions). These tables play a significant role in dating inscriptions based on their resemblance to columns in the table. In this paper, we argue that each scribe "fingerprint" is not represented by a single character prototype, but in fact by a distribution of characters. We introduce a framework for automatically identifying the writer style and constructing paleographic tables based on character histograms. Subsequently, we propose a method for comparing short documents utilizing letter distribution. We demonstrate the validity of the methods on two handwritten datasets: Modern and Ancient Hebrew pertaining to the First Temple period. Our methodology on the ancient dataset enables us to provide additional evidence concerning the level of literacy in the kingdom of Judah ca. 600 BCE.
AB - Handwriting is considered a unique “fingerprint” that characterizes a scribe (it is even used as evidence in modern forensics). In paleography (the study of ancient writing), it is presumed that each writer has a one prototype for each letter in the alphabet. Commonly, for ancient inscriptions, letters are organized into paleographic tables (where the rows are the alphabet letters, and the columns represent the examined inscriptions). These tables play a significant role in dating inscriptions based on their resemblance to columns in the table. In this paper, we argue that each scribe "fingerprint" is not represented by a single character prototype, but in fact by a distribution of characters. We introduce a framework for automatically identifying the writer style and constructing paleographic tables based on character histograms. Subsequently, we propose a method for comparing short documents utilizing letter distribution. We demonstrate the validity of the methods on two handwritten datasets: Modern and Ancient Hebrew pertaining to the First Temple period. Our methodology on the ancient dataset enables us to provide additional evidence concerning the level of literacy in the kingdom of Judah ca. 600 BCE.
KW - Epigraphy
KW - Handwriting comparison
KW - Hebrew ostraca
KW - Historical documents
KW - Iron Age
KW - Judah
KW - Paleographic tables
UR - http://www.scopus.com/inward/record.url?scp=85073329992&partnerID=8YFLogxK
U2 - 10.1145/3342558.3345413
DO - 10.1145/3342558.3345413
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85073329992
T3 - Proceedings of the ACM Symposium on Document Engineering, DocEng 2019
BT - Proceedings of the ACM Symposium on Document Engineering, DocEng 2019
PB - Association for Computing Machinery, Inc
Y2 - 23 September 2019 through 26 September 2019
ER -