Search Results for author: Guojun Xiong

Found 11 papers, 1 papers with code

Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback

no code implementations • 2 May 2024 • Guojun Xiong, Jian Li

Restless multi-armed bandits (RMAB) play a central role in modeling sequential decision making problems under an instantaneous activation constraint that at most B arms can be activated at any decision epoch.

Multi-Armed Bandits

Paper
Add Code

Structured Reinforcement Learning for Delay-Optimal Data Transmission in Dense mmWave Networks

no code implementations • 25 Apr 2024 • Shufan Wang, Guojun Xiong, Shichen Zhang, Huacheng Zeng, Jian Li, Shivendra Panwar

We study the data packet transmission problem (mmDPT) in dense cell-free millimeter wave (mmWave) networks, i. e., users sending data packet requests to access points (APs) via uplinks and APs transmitting requested data packets to users via downlinks.

Fairness Multi-Armed Bandits +1

Paper
Add Code

The FinBen: An Holistic Financial Benchmark for Large Language Models

2 code implementations • 20 Feb 2024 • Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu, Jiajia Huang, Xiao-Yang Liu, Alejandro Lopez-Lira, Benyou Wang, Yanzhao Lai, Hao Wang, Min Peng, Sophia Ananiadou, Jimin Huang

This along with the rapid development of LLMs, highlights the urgent need for a systematic financial evaluation benchmark for LLMs.

401

Paper
Code

DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations

no code implementations • 17 Dec 2023 • Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

Decentralized learning has emerged as an alternative method to the popular parameter-server framework which suffers from high communication burden, single-point failure and scalability issues due to the need of a central server.

Learning Theory Representation Learning

Paper
Add Code

Online Restless Multi-Armed Bandits with Long-Term Fairness Constraints

no code implementations • 16 Dec 2023 • Shufan Wang, Guojun Xiong, Jian Li

Restless multi-armed bandits (RMAB) have been widely used to model sequential decision making problems with constraints.

Decision Making Fairness +2

Paper
Add Code

Straggler-Resilient Decentralized Learning via Adaptive Asynchronous Updates

no code implementations • 11 Jun 2023 • Guojun Xiong, Gang Yan, Shiqiang Wang, Jian Li

With the increasing demand for large-scale training of machine learning models, fully decentralized optimization methods have recently been advocated as alternatives to the popular parameter server framework.

Paper
Add Code

Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

no code implementations • 12 Dec 2022 • Guojun Xiong, Jian Li

Most research for this problem focuses exclusively on the settings that players have \textit{full access} to all arms and receive no reward when pulling the same arm.

Decision Making Distributed Optimization

Paper
Add Code

Whittle Index based Q-Learning for Wireless Edge Caching with Linear Function Approximation

no code implementations • 26 Feb 2022 • Guojun Xiong, Shufan Wang, Jian Li, Rahul Singh

Using this structural result, we establish the indexability of our problem, and employ the Whittle index policy to minimize average latency.

Edge-computing Q-Learning +1

Paper
Add Code

Reinforcement Learning for Finite-Horizon Restless Multi-Armed Multi-Action Bandits

no code implementations • 20 Sep 2021 • Guojun Xiong, Jian Li, Rahul Singh

We call it the R(MA)^2B-UCB algorithm.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Straggler-Resilient Distributed Machine Learning with Dynamic Backup Workers

no code implementations • 11 Feb 2021 • Guojun Xiong, Gang Yan, Rahul Singh, Jian Li

In this paradigm, each worker maintains a local estimate of the optimal parameter vector, and iteratively updates it by waiting and averaging all estimates obtained from its neighbors, and then corrects it on the basis of its local dataset.

BIG-bench Machine Learning Distributed Optimization

Paper
Add Code

Learning Augmented Index Policy for Optimal Service Placement at the Network Edge

no code implementations • 10 Jan 2021 • Guojun Xiong, Rahul Singh, Jian Li

We pose the problem as a Markov decision process (MDP) in which the system state is given by describing, for each service, the number of customers that are currently waiting at the edge to obtain the service.

Q-Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.