no code implementations • 15 Apr 2024 • Xinyu Xie, Yawen Cui, Chio-in Ieong, Tao Tan, Xiaozhi Zhang, Xubin Zheng, Zitong Yu
In this paper, we propose FusionMamba, a novel dynamic feature enhancement method for multimodal image fusion with Mamba.
1 code implementation • 7 Mar 2024 • Qilang Ye, Zitong Yu, Rui Shao, Xinyu Xie, Philip Torr, Xiaochun Cao
This paper focuses on the challenge of answering questions in scenarios that are composed of rich and complex dynamic audio-visual components.
Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +5
no code implementations • 20 Sep 2021 • Haitao Liu, Jiaqi Ding, Xinyu Xie, Xiaomo Jiang, Yusong Zhao, Xiaofang Wang
Multi-task regression attempts to exploit the task similarity in order to achieve knowledge transfer across related tasks for performance improvement.