Saliency Prediction

88 papers with code • 3 benchmarks • 7 datasets

A saliency map is a model that predicts eye fixations on a visual scene. Saliency prediction is informed by the human visual attention mechanism and predicts the possibility of the human eyes to stay in a certain position in the scene.

Libraries

Use these libraries to find Saliency Prediction models and implementations

Latest papers with no code

Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method

no code yet • 11 Apr 2024

In this paper we have present an improved Cycle GAN based model for under water image enhancement.

How is Visual Attention Influenced by Text Guidance? Database and Model

no code yet • 11 Apr 2024

Finally, considering the effect of text descriptions on visual attention, while most existing saliency models ignore this impact, we further propose a text-guided saliency (TGSal) prediction model, which extracts and integrates both image features and text features to predict the image saliency under various text-description conditions.

SalFoM: Dynamic Saliency Prediction with Video Foundation Models

no code yet • 3 Apr 2024

Recent advancements in video saliency prediction (VSP) have shown promising performance compared to the human visual system, whose emulation is the primary goal of VSP.

GreenSaliency: A Lightweight and Efficient Image Saliency Detection Method

no code yet • 30 Mar 2024

Image saliency detection is crucial in understanding human gaze patterns from visual stimuli.

Selective Attention-based Modulation for Continual Learning

no code yet • 29 Mar 2024

We present SAM, a biologically-plausible selective attention-driven modulation approach to enhance classification models in a continual learning setting.

DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360° Images

no code yet • 26 Mar 2024

We present DiffGaze, a novel method for generating realistic and diverse continuous human gaze sequences on 360{\deg} images based on a conditional score-based denoising diffusion model.

Learning User Embeddings from Human Gaze for Personalised Saliency Prediction

no code yet • 20 Mar 2024

At the core of our method is a Siamese convolutional neural encoder that learns the user embeddings by contrasting the image and personal saliency map pairs of different users.

A Modified Word Saliency-Based Adversarial Attack on Text Classification Models

no code yet • 17 Mar 2024

This paper introduces a novel adversarial attack method targeting text classification models, termed the Modified Word Saliency-based Adversarial At-tack (MWSAA).

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

no code yet • 2 Mar 2024

Audio-visual saliency prediction can draw support from diverse modality complements, but further performance enhancement is still challenged by customized architectures as well as task-specific loss functions.