A MS-SSIM score helps to analyze how much a De-warping module has been able to de-warp a document image from its initial distorted view.
We describe an image compression method, consisting of a nonlinear analysis transformation, a uniform quantizer, and a nonlinear synthesis transformation.
This paper presents CompressAI, a platform that provides custom operations, layers, models and tools to research, develop and evaluate end-to-end image and video compression codecs.
While it is well known that autoregressive models come with a significant computational penalty, we find that in terms of compression performance, autoregressive and hierarchical priors are complementary and, together, exploit the probabilistic structure in the latents better than all previous learned models.
Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information.
In this work, we propose DewarpNet, a deep-learning approach for document image unwarping from a single image.
Ranked #1 on
MS-SSIM
on DocUNet
(using extra training data)
During training, the auto-encoder makes use of the context model to estimate the entropy of its representation, and the context model is concurrently updated to learn the dependencies between the symbols in the latent representation.
In order to show the effectiveness of our proposed JointIQ-Net, extensive experiments have been performed, and showed that the JointIQ-Net achieves a remarkable performance improvement in coding efficiency in terms of both PSNR and MS-SSIM, compared to the previous learned image compression methods and the conventional codecs such as VVC Intra (VTM 7. 1), BPG, and JPEG2000.
In this work, we present the Text Conditioned Auxiliary Classifier Generative Adversarial Network, (TAC-GAN) a text to image Generative Adversarial Network (GAN) for synthesizing images from their text descriptions.
At the time of writing this report, several learned video compression methods are superior to DVC, but currently none of them provides open source codes.
The experiments show that our approach achieves the state-of-the-art learned video compression performance in terms of both PSNR and MS-SSIM.