Search Results for author: Guanxiong Sun

Found 6 papers, 4 papers with code

TDViT: Temporal Dilated Video Transformer for Dense Video Tasks

1 code implementation • 14 Feb 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

Deep video models, for example, 3D CNNs or video transformers, have achieved promising performance on sparse video tasks, i. e., predicting one result per video.

Instance Segmentation object-detection +3

Paper
Code

Efficient One-stage Video Object Detection by Exploiting Temporal Consistency

1 code implementation • 14 Feb 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

Based on the analysis, we present a simple yet efficient framework to address the computational bottlenecks and achieve efficient one-stage VOD by exploiting the temporal consistency in video frames.

object-detection Video Object Detection

Paper
Code

Spatio-temporal Prompting Network for Robust Video Feature Extraction

2 code implementations • ICCV 2023 • Guanxiong Sun, Chi Wang, Zhaoyu Zhang, Jiankang Deng, Stefanos Zafeiriou, Yang Hua

Then, these video prompts are prepended to the patch embeddings of the current frame as the updated input for video feature extraction.

Instance Segmentation object-detection +5

Paper
Code

MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection

1 code implementation • 18 Jan 2024 • Guanxiong Sun, Yang Hua, Guosheng Hu, Neil Robertson

However, we argue that these memory structures are not efficient or sufficient because of two implied operations: (1) concatenating all features in memory for enhancement, leading to a heavy computational cost; (2) frame-wise memory updating, preventing the memory from capturing more temporal information.

object-detection Video Object Detection

Paper
Code

Neural Sign Actors: A diffusion model for 3D sign language production from text

no code implementations • 5 Dec 2023 • Vasileios Baltatzis, Rolandos Alexandros Potamias, Evangelos Ververas, Guanxiong Sun, Jiankang Deng, Stefanos Zafeiriou

Sign Languages (SL) serve as the primary mode of communication for the Deaf and Hard of Hearing communities.

Sign Language Production

Paper
Add Code

Focus On What's Important: Self-Attention Model for Human Pose Estimation

no code implementations • 22 Sep 2018 • Guanxiong Sun, Chengqin Ye, Kuanquan Wang

In this work, we proposed a convolutional network architecture combined with the novel attention model.

Pose Estimation Self-Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.