Search Results for author: Zhaoheng Zheng

Found 14 papers, 4 papers with code

GaitSTR: Gait Recognition with Sequential Two-stream Refinement

no code implementations • 2 Apr 2024 • Wanrong Zheng, Haidong Zhu, Zhaoheng Zheng, Ram Nevatia

We demonstrate that with refined skeletons, the performance of the gait recognition model can achieve further improvement on public gait recognition datasets compared with state-of-the-art methods without extra annotations.

Gait Recognition

Large Language Models are Good Prompt Learners for Low-Shot Image Classification

1 code implementation • 7 Dec 2023 • Zhaoheng Zheng, Jingmin Wei, Xuefeng Hu, Haidong Zhu, Ram Nevatia

Thus, we propose LLaMP, Large Language Models as Prompt learners, that produces adaptive prompts for the CLIP text encoder, establishing it as the connecting bridge.

Classification, Few-Shot Image Classification, +1

ShARc: Shape and Appearance Recognition for Person Identification In-the-wild

no code implementations • 24 Oct 2023 • Haidong Zhu, Wanrong Zheng, Zhaoheng Zheng, Ram Nevatia

PSE encodes the body shape via binarized silhouettes, skeleton motions, and 3-D body shape, while AAE provides two levels of temporal appearance feature aggregation: attention-based feature aggregation and averaging aggregation.

Person Identification

CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning

1 code implementation • 26 May 2023 • Zhaoheng Zheng, Haidong Zhu, Ram Nevatia

In this paper, we study the problem of Compositional Zero-Shot Learning (CZSL), which is to recognize novel attribute-object combinations with pre-existing concepts.

Attribute, Compositional Zero-Shot Learning

Adversarial Demonstration Attacks on Large Language Models

no code implementations • 24 May 2023 • Jiongxiao Wang, Zichen Liu, Keun Hee Park, Zhuojun Jiang, Zhaoheng Zheng, Zhuofeng Wu, Muhao Chen, Chaowei Xiao

We propose a novel attack method named advICL, which aims to manipulate only the demonstration without changing the input to mislead the models.

GPT-4, In-Context Learning

GaitRef: Gait Recognition with Refined Sequential Skeletons

1 code implementation • 16 Apr 2023 • Haidong Zhu, Wanrong Zheng, Zhaoheng Zheng, Ram Nevatia

Two common modalities used for representing the walking sequence of a person are silhouettes and joint skeletons.

Multiview Gait Recognition

CAT-NeRF: Constancy-Aware Tx$^2$Former for Dynamic Body Modeling

1 code implementation • 16 Apr 2023 • Haidong Zhu, Zhaoheng Zheng, Wanrong Zheng, Ram Nevatia

This paper addresses the problem of human rendering in the video with temporal appearance constancy.

Neural Rendering

MoMo: A shared encoder Model for text, image and multi-Modal representations

no code implementations • 11 Apr 2023 • Rakesh Chada, Zhaoheng Zheng, Pradeep Natarajan

The results on downstream text-only, image-only and multimodal tasks show that our model is competitive with several strong models while using fewer parameters and less pre-training data.

Gait Recognition Using 3-D Human Body Shape Inference

no code implementations • 18 Dec 2022 • Haidong Zhu, Zhaoheng Zheng, Ram Nevatia

Gait recognition, which identifies individuals based on their walking patterns, is an important biometric technique since it can be observed from a distance and does not require the subject's cooperation.

Gait Identification, Gait Recognition

PatchZero: Defending against Adversarial Patch Attacks by Detecting and Zeroing the Patch

no code implementations • 5 Jul 2022 • Ke Xu, Yao Xiao, Zhaoheng Zheng, Kaijie Cai, Ram Nevatia

Despite the diversity in attack patterns, adversarial patches tend to be highly textured and different in appearance from natural images.

Image Classification, object-detection, +3

FashionVLP: Vision Language Transformer for Fashion Retrieval With Feedback

no code implementations • CVPR 2022 • Sonam Goenka, Zhaoheng Zheng, Ayush Jaiswal, Rakesh Chada, Yue Wu, Varsha Hedau, Pradeep Natarajan

Fashion image retrieval based on a query pair of reference image and natural language feedback is a challenging task that requires models to assess fashion-related information from visual and textual modalities simultaneously.

Image Retrieval, Retrieval

Weakly Supervised Scene Graph Grounding

no code implementations • 1 Jan 2021 • Yizhou Zhang, Zhaoheng Zheng, Yan Liu

Recent research has achieved substantial advances in learning structured representations from images.

Utilizing Every Image Object for Semi-supervised Phrase Grounding

no code implementations • 5 Nov 2020 • Haidong Zhu, Arka Sadhu, Zhaoheng Zheng, Ram Nevatia

The annotated language queries available during training are limited, which also limits the variations of language combinations that a model can see during training.

Phrase Grounding, Referring Expression
