Search Results for author: Zipeng Dai

Found 4 papers, 1 paper with code

HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

no code implementations • 29 Dec 2023 • Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

Online display advertising platforms serve numerous advertisers by providing real-time bidding (RTB) at the scale of billions of ad requests every day.

Data Augmentation

Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction

no code implementations • 2 Sep 2022 • Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew Taylor, Kun Shao, Jun Wang, David Mguni

Centralised training with decentralised execution (CT-DE) serves as the foundation of many leading multi-agent reinforcement learning (MARL) algorithms.

Multi-agent Reinforcement Learning • reinforcement-learning • +3

Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

no code implementations • 31 May 2022 • David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

In this paper, we introduce a reinforcement learning (RL) framework named Learnable Impulse Control Reinforcement Algorithm (LICRA), for learning to optimally select both when to act and which actions to take when actions incur costs.

Reinforcement Learning (RL)

Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach

1 code implementation • ICDE 2020 • Chi Harold Liu, Yinuo Zhao, Zipeng Dai, Ye Yuan, Guoren Wang, Dapeng Wu, Kin K. Leung

Spatial crowdsourcing (SC) utilizes the potential of a crowd to accomplish certain location based tasks.

Fairness • Reinforcement Learning (RL) • +1
