Search Results for author: Shanshe Wang

Found 27 papers, 14 papers with code

STIP: A SpatioTemporal Information-Preserving and Perception-Augmented Model for High-Resolution Video Prediction

1 code implementation9 Jun 2022 Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

To solve the information loss problem, the proposed model aims to preserve the spatiotemporal information for videos during the feature extraction and the state transitions, respectively.

Video Prediction

Hierarchical Similarity Learning for Aliasing Suppression Image Super-Resolution

no code implementations7 Jun 2022 Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

As a highly ill-posed issue, single image super-resolution (SISR) has been widely investigated in recent years.

Image Super-Resolution

Learning Weighting Map for Bit-Depth Expansion within a Rational Range

1 code implementation26 Apr 2022 Yuqing Liu, Qi Jia, Jian Zhang, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

Existing BDE methods have no unified solution for various BDE situations, and directly learn a mapping for each pixel from LBD image to the desired value in HBD image, which may change the given high-order bits and lead to a huge deviation from the ground truth.

SSIM

STAU: A SpatioTemporal-Aware Unit for Video Prediction and Beyond

no code implementations20 Apr 2022 Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In this paper, we propose a SpatioTemporal-Aware Unit (STAU) for video prediction and beyond by exploring the significant spatiotemporal correlations in videos.

Action Recognition object-detection +2

Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution

no code implementations5 Jan 2022 Yuqing Liu, Qi Jia, Xin Fan, Shanshe Wang, Siwei Ma, Wen Gao

It is challenging to restore low-resolution (LR) images to super-resolution (SR) images with correct and clear details.

Super-Resolution

MAU: A Motion-Aware Unit for Video Prediction and Beyond

1 code implementation NeurIPS 2021 Zheng Chang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Yan Ye, Xiang Xinguang, Wen Gao

The attention module aims to learn an attention map based on the correlations between the current spatial state and the historical spatial states.

Action Recognition Video Prediction

Rethinking Lightweight Convolutional Neural Networks for Efficient and High-quality Pavement Crack Detection

2 code implementations13 Sep 2021 Kai Li, Jie Yang, Siwei Ma, Bo wang, Shanshe Wang, Yingjie Tian, Zhiquan Qi

For the second issue, we reconsider how to improve detection efficiency with excellent performance, and then propose our lightweight encoder-decoder architecture termed CarNet.

Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation

1 code implementation29 Jul 2021 Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao

To alleviate these two problems, we propose a relative information encoding method that yields positional and temporal enhanced representations.

Monocular 3D Human Pose Estimation

Rate Distortion Characteristic Modeling for Neural Image Compression

no code implementations24 Jun 2021 Chuanmin Jia, Ziqing Ge, Shanshe Wang, Siwei Ma, Wen Gao

End-to-end optimized neural image compression (NIC) has obtained superior lossy compression performance recently.

Image Compression

Visual Analysis Motivated Rate-Distortion Model for Image Coding

no code implementations21 Apr 2021 Zhimeng Huang, Chuanmin Jia, Shanshe Wang, Siwei Ma

We first propose the region of interest for machine (ROIM) to evaluate the degree of importance for each coding tree unit (CTU) in visual analysis.

Image Classification object-detection +2

Implicit Subspace Prior Learning for Dual-Blind Face Restoration

1 code implementation12 Oct 2020 Lingbo Yang, Pan Wang, Zhanning Gao, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao

Face restoration is an inherently ill-posed problem, where additional prior constraints are typically considered crucial for mitigating such pathology.

Blind Face Restoration

Towards Fine-grained Human Pose Transfer with Detail Replenishing Network

no code implementations26 May 2020 Lingbo Yang, Pan Wang, Chang Liu, Zhanning Gao, Peiran Ren, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Xian-Sheng Hua, Wen Gao

Human pose transfer (HPT) is an emerging research topic with huge potential in fashion design, media production, online advertising and virtual reality.

Pose Transfer Retrieval

Iterative Network for Image Super-Resolution

1 code implementation20 May 2020 Yuqing Liu, Shiqi Wang, Jian Zhang, Shanshe Wang, Siwei Ma, Wen Gao

A novel iterative super-resolution network (ISRN) is proposed on top of the iterative optimization.

Image Super-Resolution SSIM

HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment

5 code implementations11 May 2020 Lingbo Yang, Chang Liu, Pan Wang, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao

Existing face restoration researches typically relies on either the degradation prior or explicit guidance labels for training, which often results in limited generalization ability over real-world images with heterogeneous degradations and rich background contents.

Blind Face Restoration Face Hallucination +3

Towards Analysis-friendly Face Representation with Scalable Feature and Texture Compression

no code implementations21 Apr 2020 Shurun Wang, Shiqi Wang, Wenhan Yang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In particular, we study the feature and texture compression in a scalable coding framework, where the base layer serves as the deep learning feature and enhancement layer targets to perfectly reconstruct the texture.

Image Compression

Masked Non-Autoregressive Image Captioning

no code implementations3 Jun 2019 Junlong Gao, Xi Meng, Shiqi Wang, Xia Li, Shanshe Wang, Siwei Ma, Wen Gao

Existing captioning models often adopt the encoder-decoder architecture, where the decoder uses autoregressive decoding to generate captions, such that each token is generated sequentially given the preceding generated tokens.

Image Captioning Machine Translation +1

Self-critical n-step Training for Image Captioning

no code implementations CVPR 2019 Junlong Gao, Shiqi Wang, Shanshe Wang, Siwei Ma, Wen Gao

Existing methods for image captioning are usually trained by cross entropy loss, which leads to exposure bias and the inconsistency between the optimizing function and evaluation metrics.

Image Captioning

Image and Video Compression with Neural Networks: A Review

no code implementations7 Apr 2019 Siwei Ma, Xinfeng Zhang, Chuanmin Jia, Zhenghui Zhao, Shiqi Wang, Shanshe Wang

Deep convolution neural network (CNN) which makes the neural network resurge in recent years and has achieved great success in both artificial intelligent and signal processing fields, also provides a novel and promising solution for image and video compression.

Video Compression

Scalable Facial Image Compression with Deep Feature Reconstruction

no code implementations14 Mar 2019 Shurun Wang, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao

In this paper, we propose a scalable image compression scheme, including the base layer for feature representation and enhancement layer for texture representation.

Image Compression

Spatial-Temporal Residue Network Based In-Loop Filter for Video Coding

no code implementations25 Sep 2017 Chuanmin Jia, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma

Deep learning has demonstrated tremendous break through in the area of image/video processing.

Multimedia

Cannot find the paper you are looking for? You can Submit a new open access paper.