Search Results for author: Fangrui Zhu

Found 6 papers, 3 papers with code

Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions

1 code implementation28 Nov 2023 Zeyu Han, Fangrui Zhu, Qianru Lao, Huaizu Jiang

After that, grounding is accomplished by calculating the structural similarity matrix between visual and textual triplets with a VLA model, and subsequently propagate it to an instance-level similarity matrix.

Disentanglement Referring Expression +2

Diagnosing Human-object Interaction Detectors

1 code implementation16 Aug 2023 Fangrui Zhu, Yiming Xie, Weidi Xie, Huaizu Jiang

To address this issue, in this paper, we introduce a diagnosis toolbox to provide detailed quantitative break-down analysis of HOI detection models, inspired by the success of object detection diagnosis toolboxes.

Classification Human-Object Interaction Detection +3

A Unified Efficient Pyramid Transformer for Semantic Segmentation

no code implementations29 Jul 2021 Fangrui Zhu, Yi Zhu, Li Zhang, Chongruo wu, Yanwei Fu, Mu Li

Semantic segmentation is a challenging problem due to difficulties in modeling context in complex scenes and class confusions along boundaries.

Segmentation Semantic Segmentation

Self-supervised Video Object Segmentation

no code implementations22 Jun 2020 Fangrui Zhu, Li Zhang, Yanwei Fu, Guodong Guo, Weidi Xie

The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a. k. a.

Object One-shot visual object segmentation +4

Long-Term Cloth-Changing Person Re-identification

no code implementations26 May 2020 Xuelin Qian, Wenxuan Wang, Li Zhang, Fangrui Zhu, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, xiangyang xue

Specifically, we consider that under cloth-changes, soft-biometrics such as body shape would be more reliable.

Cloth-Changing Person Re-Identification

A Two-point Method for PTZ Camera Calibration in Sports

1 code implementation26 Jan 2018 Jianhui Chen, Fangrui Zhu, James J. Little

We also propose a fast random forest method to predict pan-tilt angles without image-to-image feature matching, leading to an efficient calibration method for new images.

Camera Calibration Vocal Bursts Valence Prediction

Cannot find the paper you are looking for? You can Submit a new open access paper.