Search Results for author: Yiran Wu

Found 9 papers, 7 papers with code

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows

2 code implementations • 17 Mar 2024 • Yiran Wu, Tianwei Yue, Shaokun Zhang, Chi Wang, Qingyun Wu

In StateFlow, we distinguish between "process grounding" (via state and state transitions) and "sub-task solving" (through actions within a state), enhancing control and interpretability of the task-solving procedure.

Management

Paper
Code

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks

1 code implementation • 2 Mar 2024 • Yifan Zeng, Yiran Wu, Xiao Zhang, Huazheng Wang, Qingyun Wu

Through conducting extensive experiments on a large scale of harmful and safe prompts, we validate the effectiveness of the proposed AutoDefense in improving the robustness against jailbreak attacks, while maintaining the performance at normal user request.

Instruction Following LLM real-life tasks +1

Paper
Code

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation

1 code implementation • 16 Aug 2023 • Qingyun Wu, Gagan Bansal, Jieyu Zhang, Yiran Wu, Beibin Li, Erkang Zhu, Li Jiang, Xiaoyun Zhang, Shaokun Zhang, Jiale Liu, Ahmed Hassan Awadallah, Ryen W White, Doug Burger, Chi Wang

AutoGen is an open-source framework that allows developers to build LLM applications via multiple agents that can converse with each other to accomplish tasks.

3D Human Pose Estimation College Computer Science +2

25,197

Paper
Code

Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective

2 code implementations • NeurIPS 2023 • Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang

Building upon this, we leverage offline RL techniques for off-policy LTR and propose the Click Model-Agnostic Unified Off-policy Learning to Rank (CUOLR) method, which could be easily applied to a wide range of click models.

Learning-To-Rank Offline RL +2

Paper
Code

An Empirical Study on Challenging Math Problem Solving with GPT-4

1 code implementation • 2 Jun 2023 • Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields.

Elementary Mathematics Math

25,197

Paper
Code

HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts

no code implementations • 28 May 2023 • Shaokun Zhang, Yiran Wu, Zhonghua Zheng, Qingyun Wu, Chi Wang

In this work, we propose a hyperparameter optimization method named \emph{HyperTime} to find hyperparameters robust to potential temporal distribution shifts in the unseen test data.

Hyperparameter Optimization Philosophy

Paper
Add Code

Improving Policy Optimization with Generalist-Specialist Learning

1 code implementation • 26 Jun 2022 • Zhiwei Jia, Xuanlin Li, Zhan Ling, Shuang Liu, Yiran Wu, Hao Su

Generalization in deep reinforcement learning over unseen environment variations usually requires policy learning over a large set of diverse training variations.

Imitation Learning

Paper
Code

PointSIFT: A SIFT-like Network Module for 3D Point Cloud Semantic Segmentation

4 code implementations • 2 Jul 2018 • Mingyang Jiang, Yiran Wu, Tianqi Zhao, Zelin Zhao, Cewu Lu

Recently, 3D understanding research sheds light on extracting features from point cloud directly, which requires effective shape pattern description of point clouds.

Point Cloud Segmentation Semantic Segmentation

636

Paper
Code

Size-to-depth: A New Perspective for Single Image Depth Estimation

no code implementations • 13 Jan 2018 • Yiran Wu, Sihao Ying, Lianmin Zheng

To overcome these problems, we propose a new perspective for single monocular image depth estimation problem: size to depth.

Depth Estimation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.