Search Results for author: Jianguo Mao

Found 4 papers, 0 papers with code

Dynamic Multistep Reasoning based on Video Scene Graph for Video Question Answering

no code implementations NAACL 2022 Jianguo Mao, Wenbin Jiang, Xiangdong Wang, Zhifan Feng, Yajuan Lyu, Hong Liu, Yong Zhu

Then, it performs multistep reasoning between the representations of the question and the video to reach a better answer decision, and dynamically integrates the reasoning results.

Question Answering, Video Question Answering

Audio Generation with Multiple Conditional Diffusion Model

no code implementations 23 Aug 2023 Zhifang Guo, Jianguo Mao, Rui Tao, Long Yan, Kazushige Ouchi, Hong Liu, Xiangdong Wang

To address this issue, we propose a novel model that enhances the controllability of existing pre-trained text-to-audio models by incorporating additional conditions including content (timestamp) and style (pitch contour and energy contour) as supplements to the text.

Audio Generation, Language Modelling
