Search Results for author: Lin Guan

Found 15 papers, 5 papers with code

"Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors

no code implementations6 Feb 2024 Lin Guan, Yifan Zhou, Denis Liu, Yantian Zha, Heni Ben Amor, Subbarao Kambhampati

Large-scale generative models are shown to be useful for sampling meaningful candidate solutions, yet they often overlook task constraints and user preferences.

Automated Theorem Proving Game of Go

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks

no code implementations2 Feb 2024 Subbarao Kambhampati, Karthik Valmeekam, Lin Guan, Kaya Stechly, Mudit Verma, Siddhant Bhambri, Lucas Saldyt, Anil Murthy

On the other side are perhaps over-pessimistic claims that all that LLMs are good for in planning/reasoning tasks are as mere translators of the problem specification from one syntactic format to another, and ship the problem off to external symbolic solvers.

TWIN: TWo-stage Interest Network for Lifelong User Behavior Modeling in CTR Prediction at Kuaishou

no code implementations5 Feb 2023 Jianxin Chang, Chenbin Zhang, Zhiyi Fu, Xiaoxue Zang, Lin Guan, Jing Lu, Yiqun Hui, Dewei Leng, Yanan Niu, Yang song, Kun Gai

And for the user-item cross features, we compress each into a one-dimentional bias term in the attention score calculation to save the computational cost.

Click-Through Rate Prediction

Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences

no code implementations28 Oct 2022 Lin Guan, Karthik Valmeekam, Subbarao Kambhampati

We propose two practical methods that can learn to model any kind of behavioral attributes from ordered behavior clips.

Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill Diversity

1 code implementation6 Feb 2022 Lin Guan, Sarath Sreedharan, Subbarao Kambhampati

At the low level, we learn a set of diverse policies for each possible task subgoal identified by the landmark, which are then stitched together.

reinforcement-learning Reinforcement Learning (RL)

MuSCLe: A Multi-Strategy Contrastive Learning Framework for Weakly Supervised Semantic Segmentation

no code implementations18 Jan 2022 Kunhao Yuan, Gerald Schaefer, Yu-Kun Lai, Yifan Wang, Xiyao Liu, Lin Guan, Hui Fang

Weakly supervised semantic segmentation (WSSS) has gained significant popularity since it relies only on weak labels such as image level annotations rather than pixel level annotations required by supervised semantic segmentation (SSS) methods.

Contrastive Learning Segmentation +2

Enhanced Exploration in Neural Feature Selection for Deep Click-Through Rate Prediction Models via Ensemble of Gating Layers

no code implementations7 Dec 2021 Lin Guan, Xia Xiao, Ming Chen, Youlong Cheng

Inspired by gradient-based neural architecture search (NAS) and network pruning methods, people have tackled the NFS problem with Gating approach that inserts a set of differentiable binary gates to drop less informative features.

Click-Through Rate Prediction Ensemble Learning +3

Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning

1 code implementation11 Oct 2021 Yantian Zha, Lin Guan, Subbarao Kambhampati

Our main contribution is to propose the Self-Explanation for RL from Demonstrations (SERLfD) framework, which can overcome the limitations of traditional RLfD works.

reinforcement-learning Reinforcement Learning (RL)

Symbols as a Lingua Franca for Bridging Human-AI Chasm for Explainable and Advisable AI Systems

no code implementations21 Sep 2021 Subbarao Kambhampati, Sarath Sreedharan, Mudit Verma, Yantian Zha, Lin Guan

The jury is still out on whether AI systems will need to use symbols in their internal reasoning to achieve general intelligence capabilities.

Contrastively Learning Visual Attention as Affordance Cues from Demonstrations for Robotic Grasping

1 code implementation2 Apr 2021 Yantian Zha, Siddhant Bhambri, Lin Guan

In this work, our goal is instead to fill the gap between affordance discovery and affordance-based policy learning by integrating the two objectives in an end-to-end imitation learning framework based on deep neural networks.

Contrastive Learning Imitation Learning +1

QoS-aware Link Scheduling Strategy for Data Transmission in SDVN

no code implementations1 Feb 2021 Yong Zhang, Mao Ye, Lin Guan

The original contributions of this paper are summarized as follows: (1) Model the packets collision probability of broadcast or NACK transmission in VANET with the combination theory and investigate the potential influence of miss my packets (MMP) problem.

Networking and Internet Architecture

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

1 code implementation NeurIPS 2021 Lin Guan, Mudit Verma, Sihang Guo, Ruohan Zhang, Subbarao Kambhampati

We focus on the task of learning from feedback, in which the human trainer not only gives binary evaluative "good" or "bad" feedback for queried state-action pairs, but also provides a visual explanation by annotating relevant features in images.

Atari Games Data Augmentation +3

Leveraging Human Guidance for Deep Reinforcement Learning Tasks

no code implementations21 Sep 2019 Ruohan Zhang, Faraz Torabi, Lin Guan, Dana H. Ballard, Peter Stone

Reinforcement learning agents can learn to solve sequential decision tasks by interacting with the environment.

Imitation Learning reinforcement-learning +1

Atari-HEAD: Atari Human Eye-Tracking and Demonstration Dataset

1 code implementation15 Mar 2019 Ruohan Zhang, Calen Walshe, Zhuode Liu, Lin Guan, Karl S. Muller, Jake A. Whritner, Luxin Zhang, Mary M. Hayhoe, Dana H. Ballard

We hope that the scale and quality of this dataset can provide more opportunities to researchers in the areas of visual attention, imitation learning, and reinforcement learning.

Imitation Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.