Search Results for author: Jinhui Ye

Found 5 papers, 3 papers with code

Improving Gloss-free Sign Language Translation by Reducing Representation Density

1 code implementation • 23 May 2024 • Jinhui Ye, Xing Wang, Wenxiang Jiao, Junwei Liang, Hui Xiong

In this paper, we identify a representation density problem that could be a bottleneck in restricting the performance of gloss-free SLT.

Contrastive Learning Gloss-free Sign Language Translation +2

Paper
Code

GeoDeformer: Geometric Deformable Transformer for Action Recognition

no code implementations • 29 Nov 2023 • Jinhui Ye, Jiaming Zhou, Hui Xiong, Junwei Liang

Specifically, at the core of GeoDeformer is the Geometric Deformation Predictor, a module designed to identify and quantify potential spatial and temporal geometric deformations within the given video.

Action Recognition

Paper
Add Code

Spatial-Temporal Alignment Network for Action Recognition

no code implementations • 19 Aug 2023 • Jinhui Ye, Junwei Liang

This paper studies introducing viewpoint invariant feature representations in existing action recognition architecture.

Action Recognition

Paper
Add Code

Cross-modality Data Augmentation for End-to-End Sign Language Translation

1 code implementation • 18 May 2023 • Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong

To tackle these challenges, we propose a novel Cross-modality Data Augmentation (XmDA) framework to transfer the powerful gloss-to-text translation capabilities to end-to-end sign language translation (i. e. video-to-text) by exploiting pseudo gloss-text pairs from the sign gloss translation model.

Ranked #4 on Sign Language Translation on CSL-Daily

Data Augmentation Knowledge Distillation +3

Paper
Code

Scaling Back-Translation with Domain Text Generation for Sign Language Gloss Translation

1 code implementation • 13 Oct 2022 • Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu

In this paper, to overcome the limitation, we propose a Prompt based domain text Generation (PGEN) approach to produce the large-scale in-domain spoken language text data.

Language Modelling Text Generation +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.