TY - GEN
T1 - Learning Document Graphs with Attention for Image Manipulation Detection
AU - James, Hailey
AU - Gupta, Otkrist
AU - Raviv, Dan
N1 - Publisher Copyright:
© 2022, Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Detecting manipulations in images is becoming increasingly important for combating misinformation and forgery. While recent advances in computer vision have lead to improved methods for detecting spliced images, most state-of-the-art methods fail when applied to images containing mostly text, such as images of documents. We propose a deep-learning method for detecting manipulations in images of documents which leverages the unique structured nature of these images in comparison with those of natural scenes. Specifically, we re-frame the classic image splice detection problem as a node classification problem, in which Optical Character Recognition (OCR) bounding boxes form nodes and edges are added according to a text-specific distance heuristic. We propose a Variational Autoencoder (VAE)-based embedding algorithm, which when combined with a graph neural network with attention, outperforms both a state-of-the-art image splice detection method and a document-specific method.
AB - Detecting manipulations in images is becoming increasingly important for combating misinformation and forgery. While recent advances in computer vision have lead to improved methods for detecting spliced images, most state-of-the-art methods fail when applied to images containing mostly text, such as images of documents. We propose a deep-learning method for detecting manipulations in images of documents which leverages the unique structured nature of these images in comparison with those of natural scenes. Specifically, we re-frame the classic image splice detection problem as a node classification problem, in which Optical Character Recognition (OCR) bounding boxes form nodes and edges are added according to a text-specific distance heuristic. We propose a Variational Autoencoder (VAE)-based embedding algorithm, which when combined with a graph neural network with attention, outperforms both a state-of-the-art image splice detection method and a document-specific method.
KW - Graph Neural Networks
KW - Manipulation detection
KW - Variational auto-encoders
UR - http://www.scopus.com/inward/record.url?scp=85131920042&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-09037-0_22
DO - 10.1007/978-3-031-09037-0_22
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85131920042
SN - 9783031090363
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 263
EP - 274
BT - Pattern Recognition and Artificial Intelligence - 3rd International Conference, ICPRAI 2022, Proceedings
A2 - El Yacoubi, Mounîm
A2 - Granger, Eric
A2 - Yuen, Pong Chi
A2 - Pal, Umapada
A2 - Vincent, Nicole
PB - Springer Science and Business Media Deutschland GmbH
T2 - 3rd International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2022
Y2 - 1 June 2022 through 3 June 2022
ER -