Search Results for author: Yiran Wu

Found 9 papers, 7 papers with code

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows

2 code implementations17 Mar 2024 Yiran Wu, Tianwei Yue, Shaokun Zhang, Chi Wang, Qingyun Wu

In StateFlow, we distinguish between "process grounding" (via state and state transitions) and "sub-task solving" (through actions within a state), enhancing control and interpretability of the task-solving procedure.

Management

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks

1 code implementation2 Mar 2024 Yifan Zeng, Yiran Wu, Xiao Zhang, Huazheng Wang, Qingyun Wu

Through conducting extensive experiments on a large scale of harmful and safe prompts, we validate the effectiveness of the proposed AutoDefense in improving the robustness against jailbreak attacks, while maintaining the performance at normal user request.

Instruction Following LLM real-life tasks +1

Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective

2 code implementations NeurIPS 2023 Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang

Building upon this, we leverage offline RL techniques for off-policy LTR and propose the Click Model-Agnostic Unified Off-policy Learning to Rank (CUOLR) method, which could be easily applied to a wide range of click models.

Learning-To-Rank Offline RL +2

An Empirical Study on Challenging Math Problem Solving with GPT-4

1 code implementation2 Jun 2023 Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields.

Elementary Mathematics Math

HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts

no code implementations28 May 2023 Shaokun Zhang, Yiran Wu, Zhonghua Zheng, Qingyun Wu, Chi Wang

In this work, we propose a hyperparameter optimization method named \emph{HyperTime} to find hyperparameters robust to potential temporal distribution shifts in the unseen test data.

Hyperparameter Optimization Philosophy

Improving Policy Optimization with Generalist-Specialist Learning

1 code implementation26 Jun 2022 Zhiwei Jia, Xuanlin Li, Zhan Ling, Shuang Liu, Yiran Wu, Hao Su

Generalization in deep reinforcement learning over unseen environment variations usually requires policy learning over a large set of diverse training variations.

Imitation Learning

PointSIFT: A SIFT-like Network Module for 3D Point Cloud Semantic Segmentation

4 code implementations2 Jul 2018 Mingyang Jiang, Yiran Wu, Tianqi Zhao, Zelin Zhao, Cewu Lu

Recently, 3D understanding research sheds light on extracting features from point cloud directly, which requires effective shape pattern description of point clouds.

Point Cloud Segmentation Semantic Segmentation

Size-to-depth: A New Perspective for Single Image Depth Estimation

no code implementations13 Jan 2018 Yiran Wu, Sihao Ying, Lianmin Zheng

To overcome these problems, we propose a new perspective for single monocular image depth estimation problem: size to depth.

Depth Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.