Search Results for author: Qilang Ye

Found 2 papers, 2 papers with code

Answering Diverse Questions via Text Attached with Key Audio-Visual Clues

1 code implementation11 Mar 2024 Qilang Ye, Zitong Yu, Xin Liu

Audio-visual question answering (AVQA) requires reference to video content and auditory information, followed by correlating the question to predict the most precise answer.

Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +3

Cannot find the paper you are looking for? You can Submit a new open access paper.