Search Results for author: Yitao Liang

Found 28 papers, 15 papers with code

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

no code implementations8 Mar 2024 ZiHao Wang, Anji Liu, Haowei Lin, Jiaqi Li, Xiaojian Ma, Yitao Liang

We explore how iterative revising a chain of thoughts with the help of information retrieval significantly improves large language models' reasoning and generation ability in long-horizon generation tasks, while hugely mitigating hallucination.

Code Generation Hallucination +3

DIGIC: Domain Generalizable Imitation Learning by Causal Discovery

no code implementations29 Feb 2024 Yang Chen, Yitao Liang, Zhouchen Lin

Causality has been combined with machine learning to produce robust representations for domain generalization.

Causal Discovery Domain Generalization +1

Selecting Large Language Model to Fine-tune via Rectified Scaling Law

no code implementations4 Feb 2024 Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, ZiHao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang

The ever-growing ecosystem of LLMs has posed a challenge in selecting the most appropriate pre-trained model to fine-tune amidst a sea of options.

Language Modelling Large Language Model

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

no code implementations10 Nov 2023 ZiHao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang

Achieving human-like planning and control with multimodal observations in an open world is a key milestone for more functional generalist agents.

Expressive Modeling Is Insufficient for Offline RL: A Tractable Inference Perspective

no code implementations31 Oct 2023 Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang

A popular paradigm for offline Reinforcement Learning (RL) tasks is to first fit the offline trajectories to a sequence model, and then prompt the model for actions that lead to high expected return.

Offline RL Reinforcement Learning (RL)

MCU: A Task-centric Framework for Open-ended Agent Evaluation in Minecraft

1 code implementation12 Oct 2023 Haowei Lin, ZiHao Wang, Jianzhu Ma, Yitao Liang

To pursue the goal of creating an open-ended agent in Minecraft, an open-ended game environment with unlimited possibilities, this paper introduces a task-centric framework named MCU for Minecraft agent evaluation.

Out-of-Distribution Generalization

GROOT: Learning to Follow Instructions by Watching Gameplay Videos

no code implementations12 Oct 2023 Shaofei Cai, Bowei Zhang, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang

We propose to follow reference videos as instructions, which offer expressive goal specifications while eliminating the need for expensive text-gameplay annotations.

Instruction Following

Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners

1 code implementation24 May 2023 Xiaojuan Tang, Zilong Zheng, Jiaqi Li, Fanxu Meng, Song-Chun Zhu, Yitao Liang, Muhan Zhang

On the whole, our analysis provides a novel perspective on the role of semantics in developing and evaluating language models' reasoning abilities.

Understanding the Distillation Process from Deep Generative Models to Tractable Probabilistic Circuits

no code implementations16 Feb 2023 Xuejie Liu, Anji Liu, Guy Van Den Broeck, Yitao Liang

In this paper, we theoretically and empirically discover that the performance of a PC can exceed that of its teacher model.

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction

2 code implementations CVPR 2023 Shaofei Cai, ZiHao Wang, Xiaojian Ma, Anji Liu, Yitao Liang

We study the problem of learning goal-conditioned policies in Minecraft, a popular, widely accessible yet challenging open-ended environment for developing human-level multi-task agents.

Representation Learning Zero-shot Generalization

Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation

1 code implementation20 Nov 2022 Zhizhou Ren, Anji Liu, Yitao Liang, Jian Peng, Jianzhu Ma

To bridge this gap, we study the problem of few-shot adaptation in the context of human-in-the-loop reinforcement learning.

Meta Reinforcement Learning reinforcement-learning +1

RulE: Neural-Symbolic Knowledge Graph Reasoning with Rule Embedding

1 code implementation24 Oct 2022 Xiaojuan Tang, Song-Chun Zhu, Yitao Liang, Muhan Zhang

In this paper, we propose a novel and principled framework called \textbf{RulE} (stands for {Rul}e {E}mbedding) to effectively leverage logical rules to enhance KG reasoning.

Knowledge Graph Embedding Knowledge Graphs +1

SQA3D: Situated Question Answering in 3D Scenes

1 code implementation14 Oct 2022 Xiaojian Ma, Silong Yong, Zilong Zheng, Qing Li, Yitao Liang, Song-Chun Zhu, Siyuan Huang

We propose a new task to benchmark scene understanding of embodied agents: Situated Question Answering in 3D Scenes (SQA3D).

Question Answering Referring Expression +1

Neural-Symbolic Recursive Machine for Systematic Generalization

no code implementations4 Oct 2022 Qing Li, Yixin Zhu, Yitao Liang, Ying Nian Wu, Song-Chun Zhu, Siyuan Huang

In experiments, NSR achieves state-of-the-art performance in three benchmarks from different domains: SCAN for semantic parsing, PCFG for string manipulation, and HINT for arithmetic reasoning.

Arithmetic Reasoning Semantic Parsing +1

Towards an Interpretable Latent Space in Structured Models for Video Prediction

no code implementations16 Jul 2021 Rushil Gupta, Vishal Sharma, Yash Jain, Yitao Liang, Guy Van Den Broeck, Parag Singla

We work with models which are object-centric, i. e., explicitly work with object representations, and propagate a loss in the latent space.

Contrastive Learning Inductive Bias +2

On Effective Parallelization of Monte Carlo Tree Search

no code implementations15 Jun 2020 Anji Liu, Yitao Liang, Ji Liu, Guy Van Den Broeck, Jianshu Chen

Second, and more importantly, we demonstrate how the proposed necessary conditions can be adopted to design more effective parallel MCTS algorithms.

Atari Games

Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration

1 code implementation25 Feb 2020 Anji Liu, Yitao Liang, Guy Van Den Broeck

Off-policy reinforcement learning (RL) is concerned with learning a rewarding policy by executing another policy that gathers samples of experience.

Continuous Control reinforcement-learning +1

SAM: Squeeze-and-Mimic Networks for Conditional Visual Driving Policy Learning

1 code implementation6 Dec 2019 Albert Zhao, Tong He, Yitao Liang, Haibin Huang, Guy Van Den Broeck, Stefano Soatto

To learn this representation, we train a squeeze network to drive using annotations for the side task as input.

Semantic Segmentation

On Tractable Computation of Expected Predictions

1 code implementation NeurIPS 2019 Pasha Khosravi, YooJung Choi, Yitao Liang, Antonio Vergari, Guy Van Den Broeck

In this paper, we identify a pair of generative and discriminative models that enables tractable computation of expectations, as well as moments of any order, of the latter with respect to the former in case of regression.

Fairness Imputation +1

What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

1 code implementation5 Mar 2019 Pasha Khosravi, Yitao Liang, YooJung Choi, Guy Van Den Broeck

While discriminative classifiers often yield strong predictive performance, missing feature values at prediction time can still be a challenge.

Imputation regression

Learning Logistic Circuits

1 code implementation27 Feb 2019 Yitao Liang, Guy Van Den Broeck

This paper proposes a new classification model called logistic circuits.

General Classification

Inference Aided Reinforcement Learning for Incentive Mechanism Design in Crowdsourcing

no code implementations NeurIPS 2018 Zehong Hu, Yitao Liang, Yang Liu, Jie Zhang

Incentive mechanisms for crowdsourcing are designed to incentivize financially self-interested workers to generate and report high-quality labels.

Bayesian Inference reinforcement-learning +1

State of the Art Control of Atari Games Using Shallow Reinforcement Learning

1 code implementation4 Dec 2015 Yitao Liang, Marlos C. Machado, Erik Talvitie, Michael Bowling

The recently introduced Deep Q-Networks (DQN) algorithm has gained attention as one of the first successful combinations of deep neural networks and reinforcement learning.

Atari Games reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.