no code implementations • 8 Mar 2024 • ZiHao Wang, Anji Liu, Haowei Lin, Jiaqi Li, Xiaojian Ma, Yitao Liang
We explore how iterative revising a chain of thoughts with the help of information retrieval significantly improves large language models' reasoning and generation ability in long-horizon generation tasks, while hugely mitigating hallucination.
no code implementations • 29 Feb 2024 • Yang Chen, Yitao Liang, Zhouchen Lin
Causality has been combined with machine learning to produce robust representations for domain generalization.
no code implementations • 4 Feb 2024 • Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, ZiHao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang
The ever-growing ecosystem of LLMs has posed a challenge in selecting the most appropriate pre-trained model to fine-tune amidst a sea of options.
no code implementations • 10 Nov 2023 • ZiHao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang
Achieving human-like planning and control with multimodal observations in an open world is a key milestone for more functional generalist agents.
no code implementations • 31 Oct 2023 • Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang
A popular paradigm for offline Reinforcement Learning (RL) tasks is to first fit the offline trajectories to a sequence model, and then prompt the model for actions that lead to high expected return.
1 code implementation • 12 Oct 2023 • Haowei Lin, ZiHao Wang, Jianzhu Ma, Yitao Liang
To pursue the goal of creating an open-ended agent in Minecraft, an open-ended game environment with unlimited possibilities, this paper introduces a task-centric framework named MCU for Minecraft agent evaluation.
no code implementations • 12 Oct 2023 • Shaofei Cai, Bowei Zhang, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
We propose to follow reference videos as instructions, which offer expressive goal specifications while eliminating the need for expensive text-gameplay annotations.
no code implementations • 22 Aug 2023 • Ceyao Zhang, Kaijie Yang, Siyi Hu, ZiHao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang
Building agents with adaptive behavior in cooperative tasks stands as a paramount goal in the realm of multi-agent systems.
1 code implementation • 24 May 2023 • Xiaojuan Tang, Zilong Zheng, Jiaqi Li, Fanxu Meng, Song-Chun Zhu, Yitao Liang, Muhan Zhang
On the whole, our analysis provides a novel perspective on the role of semantics in developing and evaluating language models' reasoning abilities.
no code implementations • 16 Feb 2023 • Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang
In this paper, we theoretically and empirically discover that the performance of a PC can exceed that of its teacher model.
1 code implementation • 3 Feb 2023 • ZiHao Wang, Shaofei Cai, Guanzhou Chen, Anji Liu, Xiaojian Ma, Yitao Liang
We investigate the challenge of task planning for multi-task embodied agents in open-world environments.
2 code implementations • CVPR 2023 • Shaofei Cai, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang
We study the problem of learning goal-conditioned policies in Minecraft, a popular, widely accessible yet challenging open-ended environment for developing human-level multi-task agents.
1 code implementation • 20 Nov 2022 • Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma
To bridge this gap, we study the problem of few-shot adaptation in the context of human-in-the-loop reinforcement learning.
1 code implementation • 24 Oct 2022 • Xiaojuan Tang, Song-Chun Zhu, Yitao Liang, Muhan Zhang
In this paper, we propose a novel and principled framework called \textbf{RulE} (stands for {Rul}e {E}mbedding) to effectively leverage logical rules to enhance KG reasoning.
1 code implementation • 14 Oct 2022 • Xiaojian Ma, Silong Yong, Zilong Zheng, Qing Li, Yitao Liang, Song-Chun Zhu, Siyuan Huang
We propose a new task to benchmark scene understanding of embodied agents: Situated Question Answering in 3D Scenes (SQA3D).
Ranked #1 on Referring Expression on SQA3D
no code implementations • 4 Oct 2022 • Qing Li, Yixin Zhu, Yitao Liang, Ying Nian Wu, Song-Chun Zhu, Siyuan Huang
In experiments, NSR achieves state-of-the-art performance in three benchmarks from different domains: SCAN for semantic parsing, PCFG for string manipulation, and HINT for arithmetic reasoning.
no code implementations • 16 Jul 2021 • Rushil Gupta, Vishal Sharma, Yash Jain, Yitao Liang, Guy Van Den Broeck, Parag Singla
We work with models which are object-centric, i. e., explicitly work with object representations, and propagate a loss in the latent space.
no code implementations • 29 Jun 2020 • Pasha Khosravi, Antonio Vergari, YooJung Choi, Yitao Liang, Guy Van Den Broeck
As such, handling missing data in decision trees is a well studied problem.
no code implementations • 15 Jun 2020 • Anji Liu, Yitao Liang, Ji Liu, Guy Van Den Broeck, Jianshu Chen
Second, and more importantly, we demonstrate how the proposed necessary conditions can be adopted to design more effective parallel MCTS algorithms.
1 code implementation • 25 Feb 2020 • Anji Liu, Yitao Liang, Guy Van Den Broeck
Off-policy reinforcement learning (RL) is concerned with learning a rewarding policy by executing another policy that gathers samples of experience.
1 code implementation • 6 Dec 2019 • Albert Zhao, Tong He, Yitao Liang, Haibin Huang, Guy Van Den Broeck, Stefano Soatto
To learn this representation, we train a squeeze network to drive using annotations for the side task as input.
1 code implementation • NeurIPS 2019 • Pasha Khosravi, YooJung Choi, Yitao Liang, Antonio Vergari, Guy Van Den Broeck
In this paper, we identify a pair of generative and discriminative models that enables tractable computation of expectations, as well as moments of any order, of the latter with respect to the former in case of regression.
1 code implementation • 5 Mar 2019 • Pasha Khosravi, Yitao Liang, YooJung Choi, Guy Van Den Broeck
While discriminative classifiers often yield strong predictive performance, missing feature values at prediction time can still be a challenge.
1 code implementation • 27 Feb 2019 • Yitao Liang, Guy Van Den Broeck
This paper proposes a new classification model called logistic circuits.
2 code implementations • 1 Nov 2018 • Jason Gauci, Edoardo Conti, Yitao Liang, Kittipat Virochsiri, Yuchen He, Zachary Kaden, Vivek Narayanan, Xiaohui Ye, Zhengxing Chen, Scott Fujimoto
In this paper we present Horizon, Facebook's open source applied reinforcement learning (RL) platform.
no code implementations • NeurIPS 2018 • Zehong Hu, Yitao Liang, Yang Liu, Jie Zhang
Incentive mechanisms for crowdsourcing are designed to incentivize financially self-interested workers to generate and report high-quality labels.
1 code implementation • ICML 2018 • Jingyi Xu, Zilu Zhang, Tal Friedman, Yitao Liang, Guy Van Den Broeck
This paper develops a novel methodology for using symbolic knowledge in deep learning.
1 code implementation • 4 Dec 2015 • Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael Bowling
The recently introduced Deep Q-Networks (DQN) algorithm has gained attention as one of the first successful combinations of deep neural networks and reinforcement learning.