Search Results for author: Guangting Wang

Found 11 papers, 6 papers with code

Visual Perception by Large Language Model's Weights

no code implementations • 30 May 2024 • Feipeng Ma, Hongwei Xue, Guangting Wang, Yizhou Zhou, Fengyun Rao, Shilin Yan, Yueyi Zhang, Siying Wu, Mike Zheng Shou, Xiaoyan Sun

Following this paradigm, we propose VLoRA with the perceptual weights generator.

Paper
Add Code

Multi-Modal Generative Embedding Model

no code implementations • 29 May 2024 • Feipeng Ma, Hongwei Xue, Guangting Wang, Yizhou Zhou, Fengyun Rao, Shilin Yan, Yueyi Zhang, Siying Wu, Mike Zheng Shou, Xiaoyan Sun

Existing models usually tackle these two types of problems by decoupling language modules into a text decoder for generation, and a text encoder for embedding.

Caption Generation Cross-Modal Retrieval +7

Paper
Add Code

Correlation-Aware Deep Tracking

1 code implementation • CVPR 2022 • Fei Xie, Chunyu Wang, Guangting Wang, Yue Cao, Wankou Yang, Wenjun Zeng

In contrast to the Siamese-like feature extraction, our network deeply embeds cross-image feature correlation in multiple layers of the feature network.

Feature Correlation Visual Object Tracking

Paper
Code

When Shift Operation Meets Vision Transformer: An Extremely Simple Alternative to Attention Mechanism

2 code implementations • 26 Jan 2022 • Guangting Wang, Yucheng Zhao, Chuanxin Tang, Chong Luo, Wenjun Zeng

It can be even replaced by a zero-parameter operation.

Ranked #67 on Object Detection on COCO minival (APM metric)

Image Classification Object Detection +1

2,671

Paper
Code

Learning Tracking Representations via Dual-Branch Fully Transformer Networks

1 code implementation • 5 Dec 2021 • Fei Xie, Chunyu Wang, Guangting Wang, Wankou Yang, Wenjun Zeng

We present a Siamese-like Dual-branch network based on solely Transformers for tracking.

Object Tracking

Paper
Code

Sparse MLP for Image Recognition: Is Self-Attention Really Necessary?

2 code implementations • 12 Sep 2021 • Chuanxin Tang, Yucheng Zhao, Guangting Wang, Chong Luo, Wenxuan Xie, Wenjun Zeng

Specifically, we replace the MLP module in the token-mixing step with a novel sparse MLP (sMLP) module.

Ranked #395 on Image Classification on ImageNet

Image Classification

192

Paper
Code

A Battle of Network Structures: An Empirical Study of CNN, Transformer, and MLP

1 code implementation • 30 Aug 2021 • Yucheng Zhao, Guangting Wang, Chuanxin Tang, Chong Luo, Wenjun Zeng, Zheng-Jun Zha

Convolutional neural networks (CNN) are the dominant deep neural network (DNN) architecture for computer vision.

192

Paper
Code

Self-Supervised Visual Representations Learning by Contrastive Mask Prediction

no code implementations • ICCV 2021 • Yucheng Zhao, Guangting Wang, Chong Luo, Wenjun Zeng, Zheng-Jun Zha

In this paper, we propose a novel contrastive mask prediction (CMP) task for visual representation learning and design a mask contrast (MaskCo) framework to implement the idea.

Representation Learning Self-Supervised Learning

Paper
Add Code

Unsupervised Visual Representation Learning by Tracking Patches in Video

1 code implementation • CVPR 2021 • Guangting Wang, Yizhou Zhou, Chong Luo, Wenxuan Xie, Wenjun Zeng, Zhiwei Xiong

The proxy task is to estimate the position and size of the image patch in a sequence of video frames, given only the target bounding box in the first frame.

Action Classification Action Recognition +1