no code implementations • 2 Mar 2024 • Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei Zha
Audio-visual saliency prediction can draw on complementary modalities, but further performance gains are still hindered by customized architectures and task-specific loss functions.
no code implementations • 15 Sep 2023 • Junwen Xiong, Peng Zhang, Chuanyue Li, Wei Huang, Yufei Zha, Tao You
While many approaches have crafted task-specific training paradigms for either video saliency prediction or video salient object detection, little attention has been devoted to devising a generalized saliency modeling framework that seamlessly bridges these two distinct tasks.