Search Results for author: Zhongwei Qiu

Found 11 papers, 4 papers with code

Bridging the Semantic-Numerical Gap: A Numerical Reasoning Method of Cross-modal Knowledge Graph for Material Property Prediction

no code implementations • 15 Dec 2023 • Guangxuan Song, Dongmei Fu, Zhongwei Qiu, Zijiang Yang, Jiaxin Dai, Lingwei Ma, Dawei Zhang

In this paper, we propose a numerical reasoning method for material KGs (NR-KG), which constructs a cross-modal KG using semantic nodes and numerical proxy nodes.

Graph Neural Network Knowledge Graphs +1

Paper
Add Code

HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception

1 code implementation • NeurIPS 2023 • Junkun Yuan, Xinyu Zhang, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, Jingdong Wang

To further capture human characteristics, we propose a structure-invariant alignment loss that enforces different masked views, guided by the human part prior, to be closely aligned for the same image.

2D Pose Estimation Attribute +3

Paper
Code

MM-NeRF: Multimodal-Guided 3D Multi-Style Transfer of Neural Radiance Field

no code implementations • 24 Sep 2023 • Zijiang Yang, Zhongwei Qiu, Chang Xu, Dongmei Fu

3D style transfer aims to generate stylized views of 3D scenes with specified styles, which requires high-quality generating and keeping multi-view consistency.

Incremental Learning Style Transfer

Paper
Add Code

Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation

no code implementations • 29 Jun 2023 • Zhongwei Qiu, Qiansheng Yang, Jian Wang, Xiyu Wang, Chang Xu, Dongmei Fu, Kun Yao, Junyu Han, Errui Ding, Jingdong Wang

One of the mainstream schemes for 2D human pose estimation (HPE) is learning keypoints heatmaps by a neural network.

2D Human Pose Estimation Denoising +1

Paper
Add Code

Stable Diffusion is Unstable

1 code implementation • NeurIPS 2023 • Chengbin Du, Yanxi Li, Zhongwei Qiu, Chang Xu

Recently, text-to-image models have been thriving.

Paper
Code

PSVT: End-to-End Multi-person 3D Pose and Shape Estimation with Progressive Video Transformers

no code implementations • CVPR 2023 • Zhongwei Qiu, Yang Qiansheng, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Chang Xu, Dongmei Fu, Jingdong Wang

To handle the variances of objects as time proceeds, a novel scheme of progressive decoding is used to update pose and shape queries at each frame.

Ranked #27 on 3D Human Pose Estimation on 3DPW

3D human pose and shape estimation Decoder

Paper
Add Code

Learning Spatiotemporal Frequency-Transformer for Low-Quality Video Super-Resolution

1 code implementation • 27 Dec 2022 • Zhongwei Qiu, Huan Yang, Jianlong Fu, Daochang Liu, Chang Xu, Dongmei Fu

Video Super-Resolution (VSR) aims to restore high-resolution (HR) videos from low-resolution (LR) videos.

Ranked #2 on Video Super-Resolution on REDS4- 4x upscaling

Video Enhancement Video Super-Resolution

148

Paper
Code

Weakly-supervised Pre-training for 3D Human Pose Estimation via Perspective Knowledge

no code implementations • 22 Nov 2022 • Zhongwei Qiu, Kai Qiu, Jianlong Fu, Dongmei Fu

Based on MCPC, we propose a weakly-supervised pre-training (WSP) strategy to distinguish the depth relationship between two points in an image.

3D Human Pose Estimation 3D Pose Estimation

Paper
Add Code

IVT: An End-to-End Instance-guided Video Transformer for 3D Pose Estimation

no code implementations • 6 Aug 2022 • Zhongwei Qiu, Qiansheng Yang, Jian Wang, Dongmei Fu

In particular, we firstly formulate video frames as a series of instance-guided tokens and each token is in charge of predicting the 3D pose of a human instance.

Ranked #11 on 3D Multi-Person Pose Estimation on Panoptic (using extra training data)

2D Pose Estimation 3D Multi-Person Pose Estimation +1

Paper
Add Code

Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution

1 code implementation • 5 Aug 2022 • Zhongwei Qiu, Huan Yang, Jianlong Fu, Dongmei Fu

First, we divide a video frame into patches, and transform each patch into DCT spectral maps in which each channel represents a frequency band.

Ranked #3 on Video Super-Resolution on REDS4- 4x upscaling

Video Enhancement Video Super-Resolution

148

Paper
Code

Dynamic Graph Reasoning for Multi-person 3D Pose Estimation

no code implementations • 22 Jul 2022 • Zhongwei Qiu, Qiansheng Yang, Jian Wang, Dongmei Fu

Finally, the 3D poses are decoded according to dynamic decoding graphs for each detected person.

Ranked #6 on 3D Multi-Person Pose Estimation (root-relative) on MuPoTS-3D

3D Multi-Person Pose Estimation (absolute) 3D Multi-Person Pose Estimation (root-relative) +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.