TorMentor: Deterministic dynamic-path, data augmentations with fractals

7 Apr 2022  ยท  Anguelos Nicolaou, Vincent Christlein, Edgar Riba, Jian Shi, Georg Vogeler, Mathias Seuret ยท

We propose the use of fractals as a means of efficient data augmentation. Specifically, we employ plasma fractals for adapting global image augmentation transformations into continuous local transforms. We formulate the diamond square algorithm as a cascade of simple convolution operations allowing efficient computation of plasma fractals on the GPU. We present the TorMentor image augmentation framework that is totally modular and deterministic across images and point-clouds. All image augmentation operations can be combined through pipelining and random branching to form flow networks of arbitrary width and depth. We demonstrate the efficiency of the proposed approach with experiments on document image segmentation (binarization) with the DIBCO datasets. The proposed approach demonstrates superior performance to traditional image augmentation techniques. Finally, we use extended synthetic binary text images in a self-supervision regiment and outperform the same model when trained with limited data and simple extensions.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
No real Data Binarization DIBCO 2010 Synth BIUnet Plasma Branching F-Score 87.78 # 1
No real Data Binarization DIBCO 2012 Synth BIUnet Plasma Branching F-Score 87.35 # 1
No real Data Binarization DIBCO 2014 Synth BIUnet Plasma Branching F-Score 90.39 # 1
No real Data Binarization DIBCO 2016 Synth BIUnet Plasma Branching F-Score 89.07 # 1
No real Data Binarization DIBCO 2018 Synth BIUnet Plasma Branching F-Score 82.82 # 1
No real Data Binarization DIBCO 2019 Synth BIUnet Plasma Branching Top 1 Accuracy 69.84 # 1
No real Data Binarization DIBCO and H_DIBCO 2009 Synth BIUnet Plasma Branching F-Score 87.85 # 1
No real Data Binarization DIBCO and H_DIBCO 2011 Synth BIUnet Plasma Branching F-Score 88.45 # 1
No real Data Binarization DIBCO and H_DIBCO 2013 Synth BIUnet Plasma Branching F-Score 89.34 # 1
No real Data Binarization DIBCO and H_DIBCO 2017 Synth BIUnet Plasma Branching F-Score 89.69 # 1

Methods