4.7 Article

Self-supervised deep reconstruction of mixed strip-shredded text documents

Journal

PATTERN RECOGNITION
Volume 107, Issue -, Pages -

Publisher

ELSEVIER SCI LTD
DOI: 10.1016/j.patcog.2020.107535

Keywords

Deep learning; Self-supervised learning; Fully convolutional neural networks; Document reconstruction; Forensics; Optimization search

Funding

  1. Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior - Brasil (CAPES) [001]
  2. Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq, Brazil) [311120/2016-4, 311504/2017-5, PDE: 200864/2019]

Ask authors/readers for more resources

The reconstruction of shredded documents consists of coherently arranging fragments of paper (shreds) to recover the original document(s). A great challenge in computational reconstruction is to properly evaluate the compatibility between the shreds. While traditional pixel-based approaches are not robust to real shredding, more sophisticated solutions compromise significantly time performance. The solution presented in this work extends our previous deep learning method for single-page reconstruction to a more realistic/complex scenario: the reconstruction of several mixed shredded documents at once. In our approach, the compatibility evaluation is modeled as a two-class (valid or invalid) pattern recognition problem. The model is trained in a self-supervised manner on samples extracted from simulated-shredded documents, which obviates manual annotation. Experimental results on three datasets - including a new collection of 100 strip-shredded documents produced for this work - have shown that the proposed method outperforms the competing ones on complex scenarios, achieving accuracy superior to 90%. (C) 2020 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available