Saliency Prediction
87 papers with code • 3 benchmarks • 7 datasets
A saliency map is a model that predicts eye fixations on a visual scene. Saliency prediction is informed by the human visual attention mechanism and predicts the possibility of the human eyes to stay in a certain position in the scene.
Libraries
Use these libraries to find Saliency Prediction models and implementationsLatest papers
Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection
Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.
An Energy-Based Prior for Generative Saliency
We propose a novel generative saliency prediction framework that adopts an informative energy-based model as a prior distribution.
Saliency in Augmented Reality
Therefore, in this paper, we mainly analyze the interaction effect between background (BG) scenes and AR contents, and study the saliency prediction problem in AR.
Pyramidal Attention for Saliency Detection
Consequently, we present a new SOD perspective of generating RGB-D SOD without acquiring depth data during training and testing and assist RGB methods with depth clues for improved performance.
Multi-task UNet: Jointly Boosting Saliency Prediction and Disease Classification on Chest X-ray Images
To support the use of visual attention, this paper describes a novel deep learning model for visual saliency prediction on chest X-ray (CXR) images.
Learning From Pixel-Level Noisy Label: A New Perspective for Light Field Saliency Detection
Saliency detection with light field images is becoming attractive given the abundant cues available, however, this comes at the expense of large-scale pixel level annotated data which is expensive to generate.
Does Text Attract Attention on E-Commerce Images: A Novel Saliency Prediction Dataset and Method
E-commerce images are playing a central role in attracting people's attention when retailing and shopping online, and an accurate attention prediction is of significant importance for both customers and retailers, where its research is yet to start.
Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos
In this paper, we propose a multitask learning method for visual-audio saliency prediction and sound source localization on multi-face video by leveraging visual, audio and face information.
MaskSplit: Self-supervised Meta-learning for Few-shot Semantic Segmentation
To alleviate this need, we propose a self-supervised training approach for learning few-shot segmentation models.
TranSalNet: Towards perceptually relevant visual saliency prediction
Due to the inherent inductive biases of CNN architectures, there is a lack of sufficient long-range contextual encoding capacity.