Search Results for author: Flint Xiaofeng Fan

Found 6 papers, 4 papers with code

CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

1 code implementation • 29 Mar 2024 • Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer

CAESAR is an aggregation strategy used by the server that combines convergence-aware sampling with a screening mechanism.

Paper
Code

Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence

1 code implementation • 7 Jan 2024 • Philip Jordan, Florian Grötschla, Flint Xiaofeng Fan, Roger Wattenhofer

We provide the first decentralized Byzantine fault-tolerant FRL method.

Paper
Code

Action and Trajectory Planning for Urban Autonomous Driving with Hierarchical Reinforcement Learning

no code implementations • 28 Jun 2023 • Xinyang Lu, Flint Xiaofeng Fan, Tianying Wang

In this work, we propose an action and trajectory planner using Hierarchical Reinforcement Learning (atHRL) method, which models the agent behavior in a hierarchical model by using the perception of the lidar and birdeye view.

Autonomous Driving Decision Making +4

Paper
Add Code

FedHQL: Federated Heterogeneous Q-Learning

no code implementations • 26 Jan 2023 • Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Cheston Tan, Bryan Kian Hsiang Low, Roger Wattenhofer

Federated Reinforcement Learning (FedRL) encourages distributed agents to learn collectively from each other's experience to improve their performance without exchanging their raw trajectories.

Q-Learning reinforcement-learning +1

Paper
Add Code

Federated Neural Bandits

1 code implementation • 28 May 2022 • Zhongxiang Dai, Yao Shu, Arun Verma, Flint Xiaofeng Fan, Bryan Kian Hsiang Low, Patrick Jaillet

To better exploit the federated setting, FN-UCB adopts a weighted combination of two UCBs: $\text{UCB}^{a}$ allows every agent to additionally use the observations from the other agents to accelerate exploration (without sharing raw observations), while $\text{UCB}^{b}$ uses an NN with aggregated parameters for reward prediction in a similar way to federated averaging for supervised learning.

Multi-Armed Bandits

Paper
Code

Fault-Tolerant Federated Reinforcement Learning with Theoretical Guarantee

2 code implementations • NeurIPS 2021 • Flint Xiaofeng Fan, Yining Ma, Zhongxiang Dai, Wei Jing, Cheston Tan, Bryan Kian Hsiang Low

The growing literature of Federated Learning (FL) has recently inspired Federated Reinforcement Learning (FRL) to encourage multiple agents to federatively build a better decision-making policy without sharing raw trajectories.

Decision Making Federated Learning +2

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.