1 code implementation • NeurIPS 2023 • Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng
Audio-Visual Source Localization (AVSL) aims to locate sounding objects within video frames given the paired audio clips.
no code implementations • 5 Mar 2024 • Yuxin Guo, Shijie Ma, Yuhao Zhao, Hu Su, Wei Zou
Audio-Visual Source Localization (AVSL) is the task of identifying specific sounding objects in the scene given audio cues.