Search Results for author: Chuhan Zhang

Found 7 papers, 1 papers with code

Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings

no code implementations • 25 Apr 2024 • Olivia Wiles, Chuhan Zhang, Isabela Albuquerque, Ivana Kajić, Su Wang, Emanuele Bugliarello, Yasumasa Onoe, Chris Knutsen, Cyrus Rashtchian, Jordi Pont-Tuset, Aida Nematzadeh

Human-rated prompt sets are generally small and the reliability of the ratings -- and thereby the prompt set used to compare models -- is not evaluated.

Paper
Add Code

NiSNN-A: Non-iterative Spiking Neural Networks with Attention with Application to Motor Imagery EEG Classification

no code implementations • 9 Dec 2023 • Chuhan Zhang, Wei Pan, Cosimo Della Santina

Motor imagery, an important category in electroencephalogram (EEG) research, often intersects with scenarios demanding low energy consumption, such as portable medical devices and isolated environment operations.

EEG Motor Imagery

Paper
Add Code

Helping Hands: An Object-Aware Ego-Centric Video Recognition Model

1 code implementation • ICCV 2023 • Chuhan Zhang, Ankush Gupta, Andrew Zisserman

We demonstrate the performance of the object-aware representations learnt by our model, by: (i) evaluating it for strong transfer, i. e. through zero-shot testing, on a number of downstream video-text retrieval and classification benchmarks; and (ii) by using the representations learned as input for long-term video understanding tasks (e. g. Episodic Memory in Ego4D).

Decoder Object +4

Paper
Code

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime

no code implementations • 3 May 2023 • Chuhan Zhang, Antoine Miech, Jiajun Shen, Jean-Baptiste Alayrac, Pauline Luc

Large-scale visual language models are widely used as pre-trained models and then adapted for various downstream tasks.

Image Captioning Question Answering +1

Paper
Add Code

Is an Object-Centric Video Representation Beneficial for Transfer?

no code implementations • 20 Jul 2022 • Chuhan Zhang, Ankush Gupta, Andrew Zisserman

The model learns a set of object-centric summary vectors for the video, and uses these vectors to fuse the visual and spatio-temporal trajectory 'modalities' of the video clip.

Action Classification Object +1