Search Results for author: Xiaodong Zeng

Found 16 papers, 0 papers with code

PrivateLoRA For Efficient Privacy Preserving LLM

no code implementations23 Nov 2023 Yiming Wang, Yu Lin, Xiaodong Zeng, Guannan Zhang

To our knowledge, our proposed framework is the first efficient and privacy-preserving LLM solution in the literature.

Language Modelling Large Language Model +1

MultiLoRA: Democratizing LoRA for Better Multi-Task Learning

no code implementations20 Nov 2023 Yiming Wang, Yu Lin, Xiaodong Zeng, Guannan Zhang

Further investigation into weight update matrices of MultiLoRA exhibits reduced dependency on top singular vectors and more democratic unitary transform contributions.

Multi-Task Learning Natural Language Understanding +1

Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

no code implementations6 Sep 2023 Tianchi Cai, Jiyan Jiang, Wenpeng Zhang, Shiji Zhou, Xierui Song, Li Yu, Lihong Gu, Xiaodong Zeng, Jinjie Gu, Guannan Zhang

We further show that this method is guaranteed to converge to the optimal policy, which cannot be achieved by previous value-based reinforcement learning methods for marketing budget allocation.

Marketing reinforcement-learning

Adversarial Learning for Incentive Optimization in Mobile Payment Marketing

no code implementations28 Dec 2021 Xuanying Chen, Zhining Liu, Li Yu, Sen Li, Lihong Gu, Xiaodong Zeng, Yize Tan, Jinjie Gu

This bias deteriorates the performance of the response model and misleads the linear programming process, dramatically degrading the performance of the resulting allocation policy.

Marketing

Multi-Objective Online Learning

no code implementations29 Sep 2021 Jiyan Jiang, Wenpeng Zhang, Shiji Zhou, Lihong Gu, Xiaodong Zeng, Wenwu Zhu

This paper presents a systematic study of multi-objective online learning.

A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning

no code implementations29 Aug 2021 Tianchi Cai, Wenpeng Zhang, Lihong Gu, Xiaodong Zeng, Jinjie Gu

To apply value-based methods to CRL, a recent groundbreaking line of game-theoretic approaches uses the mixed policy that randomizes among a set of carefully generated policies to converge to the desired constraint-satisfying policy.

General Reinforcement Learning reinforcement-learning +1

LinkLouvain: Link-Aware A/B Testing and Its Application on Online Marketing Campaign

no code implementations3 Feb 2021 Tianchi Cai, Daxi Cheng, Chen Liang, Ziqi Liu, Lihong Gu, Huizhi Xie, Zhiqiang Zhang, Xiaodong Zeng, Jinjie Gu

In this paper, we analyze the network A/B testing problem under a real-world online marketing campaign, describe our proposed LinkLouvain method, and evaluate it on real-world data.

Link Prediction Marketing

A Reduction Approach to Constrained Reinforcement Learning

no code implementations1 Jan 2021 Tianchi Cai, Wenjie Shi, Lihong Gu, Xiaodong Zeng, Jinjie Gu

In this paper, we present a reduction approach to find sparse policies that randomize among a constant number of policies for the constrained RL problem.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.