no code implementations • 28 Apr 2024 • Zirui Song, Yaohang Li, Meng Fang, Zhenhao Chen, Zecheng Shi, Yuan Huang, Ling Chen
Autonomous virtual agents are often limited by their singular mode of interaction with real-world environments, restricting their versatility.
1 code implementation • 19 Feb 2024 • Loka Li, Guangyi Chen, Yusheng Su, Zhenhao Chen, Yixuan Zhang, Eric Xing, Kun Zhang
We have experimentally observed that LLMs possess the capability to understand the "confidence" in their own responses.
no code implementations • 25 Jan 2024 • Guangyi Chen, Yifan Shen, Zhenhao Chen, Xiangchen Song, Yuewen Sun, Weiran Yao, Xiao Liu, Kun Zhang
Identifying the underlying time-delayed latent causal processes in sequential data is vital for grasping temporal dynamics and making downstream reasoning.
2 code implementations • 5 Dec 2023 • Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Xing Luo, Chenyu Yi, Alex Kot
Large Multimodal Models (LMMs) such as GPT-4V and LLaVA have shown remarkable capabilities in visual reasoning with common image styles.
Ranked #1000000000 on Visual Question Answering on MS COCO
1 code implementation • CVPR 2023 • Guangyi Chen, Zhenhao Chen, Shunxing Fan, Kun Zhang
Specifically, we model the trajectory sampling as a Gaussian process and construct an acquisition function to measure the potential sampling value.
1 code implementation • 25 Nov 2020 • Yi Zhou, Zhenhao Chen
Memes are used for spreading ideas through social networks.