Search Results for author: Hezhen Hu

Found 12 papers, 1 papers with code

Comp4D: LLM-Guided Compositional 4D Scene Generation

no code implementations • 25 Mar 2024 • Dejia Xu, Hanwen Liang, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Plataniotis, Zhangyang Wang

Recent advancements in diffusion models for 2D and 3D content creation have sparked a surge of interest in generating 4D content.

Object Scene Generation +1

Paper
Add Code

PersonMAE: Person Re-Identification Pre-Training with Masked AutoEncoders

no code implementations • 8 Nov 2023 • Hezhen Hu, Xiaoyi Dong, Jianmin Bao, Dongdong Chen, Lu Yuan, Dong Chen, Houqiang Li

Pre-training is playing an increasingly important role in learning generic feature representation for Person Re-identification (ReID).

Person Re-Identification

Paper
Add Code

Sign Language Translation with Iterative Prototype

no code implementations • ICCV 2023 • Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li

Technically, IP-SLT consists of feature extraction, prototype initialization, and iterative prototype refinement.

Ranked #5 on Sign Language Translation on CSL-Daily

Sentence Sign Language Translation +1

Paper
Add Code

Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video

no code implementations • 8 Aug 2023 • Weichao Zhao, Hezhen Hu, Wengang Zhou, Li Li, Houqiang Li

Reconstructing interacting hands from monocular RGB data is a challenging task, as it involves many interfering factors, e. g. self- and mutual occlusion and similar textures.

Paper
Add Code

SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding

no code implementations • 8 May 2023 • Hezhen Hu, Weichao Zhao, Wengang Zhou, Houqiang Li

In our framework, the hand pose is regarded as a visual token, which is derived from an off-the-shelf detector.

Ranked #1 on Sign Language Recognition on WLASL

Self-Supervised Learning Sign Language Recognition +1

Paper
Add Code

DIRE for Diffusion-Generated Image Detection

1 code implementation • ICCV 2023 • Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Hezhen Hu, Hong Chen, Houqiang Li

We find that existing detectors struggle to detect images generated by diffusion models, even if we include generated images from a specific diffusion model in their training data.

201

Paper
Code

BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization

no code implementations • 10 Feb 2023 • Weichao Zhao, Hezhen Hu, Wengang Zhou, Jiaxin Shi, Houqiang Li

In this work, we are dedicated to leveraging the BERT pre-training success and modeling the domain-specific statistics to fertilize the sign language recognition~(SLR) model.

Pseudo Label Sign Language Recognition

Paper
Add Code

Hand-Object Interaction Image Generation

no code implementations • 28 Nov 2022 • Hezhen Hu, Weilun Wang, Wengang Zhou, Houqiang Li

In this work, we are dedicated to a new task, i. e., hand-object interaction image generation, which aims to conditionally generate the hand-object image under the given hand, object and their interaction status.

Image Generation Object

Paper
Add Code

SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition

no code implementations • ICCV 2021 • Hezhen Hu, Weichao Zhao, Wengang Zhou, Yuechen Wang, Houqiang Li

To validate the effectiveness of our method on SLR, we perform extensive experiments on four public benchmark datasets, i. e., NMFs-CSL, SLR500, MSASL and WLASL.

Ranked #1 on Sign Language Recognition on WLASL100 (using extra training data)

Self-Supervised Learning Sign Language Recognition

Paper
Add Code

Model-Aware Gesture-to-Gesture Translation

no code implementations • CVPR 2021 • Hezhen Hu, Weilun Wang, Wengang Zhou, Weichao Zhao, Houqiang Li

Then, a transformation flow is calculated based on the correspondence of the source and target topology map.

Gesture-to-Gesture Translation Sign Language Production +1

Paper
Add Code

Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

no code implementations • 11 Oct 2020 • Junfu Pu, Wengang Zhou, Hezhen Hu, Houqiang Li

Continuous sign language recognition (SLR) deals with unaligned video-text pair and uses the word error rate (WER), i. e., edit distance, as the main evaluation metric.

Sentence Sign Language Recognition

Paper
Add Code

Global-local Enhancement Network for NMFs-aware Sign Language Recognition

no code implementations • 24 Aug 2020 • Hezhen Hu, Wengang Zhou, Junfu Pu, Houqiang Li

Sign language recognition (SLR) is a challenging problem, involving complex manual features, i. e., hand gestures, and fine-grained non-manual features (NMFs), i. e., facial expression, mouth shapes, etc.

Sign Language Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.