1 code implementation • 5 Sep 2023 • Lingyue Fu, Huacan Chai, Shuang Luo, Kounianhua Du, Weiming Zhang, Longteng Fan, Jiayi Lei, Renting Rui, Jianghao Lin, Yuchen Fang, Yifan Liu, Jingkuan Wang, Siyuan Qi, Kangning Zhang, Weinan Zhang, Yong Yu
With the emergence of Large Language Models (LLMs), the programming capabilities of models have improved significantly, attracting growing attention from researchers.
no code implementations • 11 May 2023 • Yinchuan Li, Shuang Luo, Yunfeng Shao, Jianye Hao
We propose the GFlowNets with Human Feedback (GFlowHF) framework to improve the exploration ability when training AI models.
1 code implementation • 4 Mar 2023 • Yinchuan Li, Shuang Luo, Haozhi Wang, Jianye Hao
Generative flow networks (GFlowNets), as an emerging technique, can be used as an alternative to reinforcement learning for exploratory control tasks.
no code implementations • 20 Jun 2022 • Shuang Luo, Yinchuan Li, Jiahui Li, Kun Kuang, Furui Liu, Yunfeng Shao, Chao Wu
To this end, we propose a sparse state based MARL (S2RL) framework, which utilizes a sparse attention mechanism to discard irrelevant information in local observations.
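The core idea — using sparse attention to discard irrelevant parts of an agent's local observation — can be illustrated with a minimal sketch. This is not the S2RL implementation; the function name, top-k sparsification rule, and shapes are illustrative assumptions.

```python
import numpy as np

def sparse_attention(query, keys, values, k=2):
    """Illustrative sparse attention (hypothetical, not the S2RL code):
    score each observed entity, keep only the top-k most relevant,
    and mask the rest so irrelevant information is discarded."""
    scores = keys @ query / np.sqrt(query.shape[-1])  # relevance of each entity
    top_k = np.argsort(scores)[-k:]                   # indices of the k highest scores
    masked = np.full_like(scores, -np.inf)            # -inf => zero weight after softmax
    masked[top_k] = scores[top_k]
    exp = np.exp(masked - masked[top_k].max())
    weights = exp / exp.sum()                         # softmax over the kept entries only
    return weights @ values                           # aggregated, denoised observation
```

With `k` entities kept, all other entries receive exactly zero attention weight, which is the "discard irrelevant information" step the abstract describes.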
Tasks: Multi-agent Reinforcement Learning, Reinforcement Learning (RL), +2
no code implementations • 23 Mar 2022 • Zexi Li, Jiaxun Lu, Shuang Luo, Didi Zhu, Yunfeng Shao, Yinchuan Li, Zhimeng Zhang, Yongheng Wang, Chao Wu
In the literature, centralized clustered FL algorithms require assuming the number of clusters in advance and hence cannot effectively explore the latent relationships among clients.
1 code implementation • 22 Nov 2021 • Ye Liu, Huifang Li, Chao Hu, Shuang Luo, Yan Luo, Chang Wen Chen
The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid (SCP), and hierarchical region of interest extractor (HRoIE), to aggregate global visual context at feature, spatial, and instance domains, respectively.
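The general pattern behind aggregating global visual context can be sketched in a few lines. This is a generic illustration, not the paper's DenseFPN, SCP, or HRoIE modules: it simply pools a scene-level descriptor and broadcasts it back to every spatial location.

```python
import numpy as np

def global_context_augment(feature_map):
    """Generic global-context aggregation (illustrative only):
    pool a global descriptor over all spatial positions, then
    add it back so each location sees a scene-level summary."""
    # feature_map: (H, W, C)
    global_desc = feature_map.mean(axis=(0, 1))      # (C,) global context vector
    return feature_map + global_desc[None, None, :]  # broadcast to every location
```

The paper's modules operate at three domains (feature, spatial, instance); this sketch shows only the simplest feature-level variant of the idea.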
no code implementations • 26 Oct 2021 • Shuang Luo, Didi Zhu, Zexi Li, Chao Wu
Although federated learning endows distributed clients with a cooperative training mode under the premise of protecting data privacy and security, clients remain vulnerable to adversarial samples due to a lack of robustness.
no code implementations • 6 Feb 2020 • Zeyue Xue, Shuang Luo, Chao Wu, Pan Zhou, Kaigui Bian, Wei Du
Peer-to-peer knowledge transfer in distributed environments has emerged as a promising method in deep reinforcement learning, since it can accelerate learning and improve team-wide performance without relying on pre-trained teachers.