1 code implementation • 23 May 2024 • Jinhui Ye, Xing Wang, Wenxiang Jiao, Junwei Liang, Hui Xiong
In this paper, we identify a representation density problem that could be a bottleneck limiting the performance of gloss-free SLT.
Contrastive Learning • Gloss-free Sign Language Translation
no code implementations • 29 Nov 2023 • Jinhui Ye, Jiaming Zhou, Hui Xiong, Junwei Liang
Specifically, at the core of GeoDeformer is the Geometric Deformation Predictor, a module designed to identify and quantify potential spatial and temporal geometric deformations within the given video.
no code implementations • 19 Aug 2023 • Jinhui Ye, Junwei Liang
This paper studies how to introduce viewpoint-invariant feature representations into existing action recognition architectures.
1 code implementation • 18 May 2023 • Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong
To tackle these challenges, we propose a novel Cross-modality Data Augmentation (XmDA) framework to transfer the powerful gloss-to-text translation capabilities to end-to-end sign language translation (i.e., video-to-text) by exploiting pseudo gloss-text pairs from the sign gloss translation model.
Ranked #4 on Sign Language Translation on CSL-Daily
1 code implementation • 13 Oct 2022 • Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu
In this paper, to overcome the limitation, we propose a Prompt based domain text Generation (PGEN) approach to produce large-scale in-domain spoken language text data.