Search Results for author: Qingyun Wu

Found 27 papers, 12 papers with code

Embodied LLM Agents Learn to Cooperate in Organized Teams

no code implementations19 Mar 2024 Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang

Large Language Models (LLMs) have emerged as integral tools for reasoning, planning, and decision-making, drawing upon their extensive world knowledge and proficiency in language-related tasks.

Decision Making World Knowledge

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows

2 code implementations17 Mar 2024 Yiran Wu, Tianwei Yue, Shaokun Zhang, Chi Wang, Qingyun Wu

In StateFlow, we distinguish between "process grounding" (via state and state transitions) and "sub-task solving" (through actions within a state), enhancing control and interpretability of the task-solving procedure.

Management

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks

1 code implementation2 Mar 2024 Yifan Zeng, Yiran Wu, Xiao Zhang, Huazheng Wang, Qingyun Wu

Through conducting extensive experiments on a large scale of harmful and safe prompts, we validate the effectiveness of the proposed AutoDefense in improving the robustness against jailbreak attacks, while maintaining the performance at normal user request.

Instruction Following LLM real-life tasks +1

Training Language Model Agents without Modifying Language Models

no code implementations17 Feb 2024 Shaokun Zhang, Jieyu Zhang, Jiale Liu, Linxin Song, Chi Wang, Ranjay Krishna, Qingyun Wu

Researchers and practitioners have recently reframed powerful Large Language Models (LLMs) as agents, enabling them to automate complex tasks largely via the use of specialized functions.

Language Modelling

Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications

no code implementations14 Feb 2024 Negar Arabzadeh, Julia Kiseleva, Qingyun Wu, Chi Wang, Ahmed Awadallah, Victor Dibia, Adam Fourney, Charles Clarke

The rapid development in the field of Large Language Models (LLMs) has led to a surge in applications that facilitate collaboration among multiple agents to assist humans in their daily tasks.

Math

Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints

no code implementations15 Nov 2023 Xiaobo Xia, Jiale Liu, Shaokun Zhang, Qingyun Wu, Hongxin Wei, Tongliang Liu

Coreset selection is powerful in reducing computational costs and accelerating data processing for deep learning algorithms.

IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models

no code implementations16 Oct 2023 Shaokun Zhang, Xiaobo Xia, Zhaoqing Wang, Ling-Hao Chen, Jiale Liu, Qingyun Wu, Tongliang Liu

However, since the prompts need to be sampled from a large volume of annotated examples, finding the right prompt may result in high annotation costs.

In-Context Learning

Adversarial Attacks on Combinatorial Multi-Armed Bandits

no code implementations8 Oct 2023 Rishab Balasubramanian, Jiawei Li, Prasad Tadepalli, Huazheng Wang, Qingyun Wu, Haoyu Zhao

Contrary to prior understanding of multi-armed bandits, our work reveals a surprising fact that the attackability of a specific CMAB instance also depends on whether the bandit instance is known or unknown to the adversary.

Multi-Armed Bandits

Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective

2 code implementations NeurIPS 2023 Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang

Building upon this, we leverage offline RL techniques for off-policy LTR and propose the Click Model-Agnostic Unified Off-policy Learning to Rank (CUOLR) method, which could be easily applied to a wide range of click models.

Learning-To-Rank Offline RL +2

An Empirical Study on Challenging Math Problem Solving with GPT-4

1 code implementation2 Jun 2023 Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields.

Elementary Mathematics Math

HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts

no code implementations28 May 2023 Shaokun Zhang, Yiran Wu, Zhonghua Zheng, Qingyun Wu, Chi Wang

In this work, we propose a hyperparameter optimization method named \emph{HyperTime} to find hyperparameters robust to potential temporal distribution shifts in the unseen test data.

Hyperparameter Optimization Philosophy

Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization

no code implementations29 Jun 2022 Kaixuan Huang, Yu Wu, Xuezhou Zhang, Shenyinying Tu, Qingyun Wu, Mengdi Wang, Huazheng Wang

Online influence maximization aims to maximize the influence spread of a content in a social network with unknown network model by selecting a few seed nodes.

Model-based Reinforcement Learning reinforcement-learning +1

FairAutoML: Embracing Unfairness Mitigation in AutoML

no code implementations11 Nov 2021 Qingyun Wu, Chi Wang

In this work, we propose an Automated Machine Learning (AutoML) system to search for models not only with good prediction accuracy but also fair.

AutoML BIG-bench Machine Learning +1

ChaCha for Online AutoML

1 code implementation9 Jun 2021 Qingyun Wu, Chi Wang, John Langford, Paul Mineiro, Marco Rossi

We propose the ChaCha (Champion-Challengers) algorithm for making an online choice of hyperparameters in online learning settings.

AutoML Scheduling

When and Whom to Collaborate with in a Changing Environment: A Collaborative Dynamic Bandit Solution

no code implementations14 Apr 2021 Chuanhao Li, Qingyun Wu, Hongning Wang

However, all existing collaborative bandit learning solutions impose a stationary assumption about the environment, i. e., both user preferences and the dependency among users are assumed static over time.

Bayesian Inference Collaborative Filtering +3

ECONOMIC HYPERPARAMETER OPTIMIZATION WITH BLENDED SEARCH STRATEGY

no code implementations ICLR 2021 Chi Wang, Qingyun Wu, Silu Huang, Amin Saied

We study the problem of using low cost to search for hyperparameter configurations in a large search space with heterogeneous evaluation cost and model quality.

Hyperparameter Optimization

Unifying Clustered and Non-stationary Bandits

no code implementations5 Sep 2020 Chuanhao Li, Qingyun Wu, Hongning Wang

Non-stationary bandits and online clustering of bandits lift the restrictive assumptions in contextual bandits and provide solutions to many important real-world scenarios.

Change Detection Clustering +2

Fast Distributed Bandits for Online Recommendation Systems

no code implementations16 Jul 2020 Kanak Mahadik, Qingyun Wu, Shuai Li, Amit Sabne

This algorithm lazily creates clusters in a distributed manner, and dramatically reduces the network data sharing requirement, achieving high scalability.

Clustering Recommendation Systems

Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users

1 code implementation23 May 2020 Shijun Li, Wenqiang Lei, Qingyun Wu, Xiangnan He, Peng Jiang, Tat-Seng Chua

In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively.

Collaborative Filtering Thompson Sampling

Estimation-Action-Reflection: Towards Deep Interaction Between Conversational and Recommender Systems

no code implementations21 Feb 2020 Wenqiang Lei, Xiangnan He, Yisong Miao, Qingyun Wu, Richang Hong, Min-Yen Kan, Tat-Seng Chua

Recommender systems are embracing conversational technologies to obtain user preferences dynamically, and to overcome inherent limitations of their static models.

Recommendation Systems

FLAML: A Fast and Lightweight AutoML Library

2 code implementations12 Nov 2019 Chi Wang, Qingyun Wu, Markus Weimer, Erkang Zhu

We study the problem of using low computational cost to automate the choices of learners and hyperparameters for an ad-hoc training dataset and error metric, by conducting trials of different configurations on the given training data.

Hyperparameter Optimization

Variance Reduction in Gradient Exploration for Online Learning to Rank

no code implementations10 Jun 2019 Huazheng Wang, Sonwoo Kim, Eric McCord-Snook, Qingyun Wu, Hongning Wang

We prove that the projected gradient is an unbiased estimation of the true gradient, and show that this lower-variance gradient estimation results in significant regret reduction.

Learning-To-Rank

Factorization Bandits for Online Influence Maximization

1 code implementation9 Jun 2019 Qingyun Wu, Zhige Li, Huazheng Wang, Wei Chen, Hongning Wang

We capitalize on an important property of the influence maximization problem named network assortativity, which is ignored by most existing works in online influence maximization.

Bandit Learning with Implicit Feedback

1 code implementation NeurIPS 2018 Yi Qi, Qingyun Wu, Hongning Wang, Jie Tang, Maosong Sun

Implicit feedback, such as user clicks, although abundant in online information service systems, does not provide substantial evidence on users' evaluation of system's output.

Bayesian Inference Thompson Sampling

Learning Contextual Bandits in a Non-stationary Environment

1 code implementation23 May 2018 Qingyun Wu, Naveen Iyer, Hongning Wang

Multi-armed bandit algorithms have become a reference solution for handling the explore/exploit dilemma in recommender systems, and many other important real-world problems, such as display advertisement.

Multi-Armed Bandits Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.