Search Results for author: Jianhong Wang

Found 17 papers, 10 papers with code

Open Ad Hoc Teamwork with Cooperative Game Theory

no code implementations • 23 Feb 2024 • Jianhong Wang, Yang Li, Yuan Zhang, Wei Pan, Samuel Kaski

Open ad hoc teamwork further complicates this challenge by considering environments with a changing number of teammates, referred to as open teams.

Paper
Add Code

Shapley Value Based Multi-Agent Reinforcement Learning: Theory, Method and Its Application to Energy Network

no code implementations • 23 Feb 2024 • Jianhong Wang

We first extend a game model called convex game and a payoff distribution scheme called Shapley value in cooperative game theory to Markov decision process, named as Markov convex game and Markov Shapley value respectively.

Learning Theory Multi-agent Reinforcement Learning +1

Paper
Add Code

Aligning Individual and Collective Objectives in Multi-Agent Cooperation

no code implementations • 19 Feb 2024 • Yang Li, WenHao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan

The visualization of learning dynamics effectively demonstrates that AgA successfully achieves alignment between individual and collective objectives.

SMAC+

Paper
Add Code

E2E-AT: A Unified Framework for Tackling Uncertainty in Task-aware End-to-end Learning

1 code implementation • 17 Dec 2023 • Wangkun Xu, Jianhong Wang, Fei Teng

Successful machine learning involves a complete pipeline of data, model, and downstream applications.

Decision Making Load Forecasting +1

Paper
Code

Invariant Learning via Probability of Sufficient and Necessary Causes

1 code implementation • NeurIPS 2023 • Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

To capture the information of sufficient and necessary causes, we employ a classical concept, the probability of sufficiency and necessary causes (PNS), which indicates the probability of whether one is the necessary and sufficient cause.

Paper
Code

Semi-Supervised Dual-Stream Self-Attentive Adversarial Graph Contrastive Learning for Cross-Subject EEG-based Emotion Recognition

no code implementations • 13 Aug 2023 • Weishan Ye, Zhiguo Zhang, Min Zhang, Fei Teng, Li Zhang, Linling Li, Gan Huang, Jianhong Wang, Dong Ni, Zhen Liang

In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the challenge of limited labeled data in cross-subject EEG-based emotion recognition.

Contrastive Learning Domain Adaptation +2

Paper
Add Code

Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction

no code implementations • 2 Sep 2022 • Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew Taylor, Kun Shao, Jun Wang, David Mguni

Centralised training with decentralised execution (CT-DE) serves as the foundation of many leading multi-agent reinforcement learning (MARL) algorithms.

Multi-agent Reinforcement Learning reinforcement-learning +3

Paper
Add Code

Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization

1 code implementation • 5 Jul 2022 • Yuan Zhang, Jianhong Wang, Joschka Boedecker

To deal with unknown uncertainty sets, we further propose a novel adversarial approach to generate them based on the value function.

Continuous Control reinforcement-learning +1

Paper
Code

Learning to Estimate and Refine Fluid Motion with Physical Dynamics

1 code implementation • 21 Jun 2022 • Mingrui Zhang, Jianhong Wang, James Tlhomole, Matthew D. Piggott

General optical flow methods are typically designed for rigid body motion, and thus struggle if applied to fluid motion estimation directly.

Motion Estimation Optical Flow Estimation

Paper
Code

Blending Data and Physics Against False Data Injection Attack: An Event-Triggered Moving Target Defence Approach

1 code implementation • 27 Apr 2022 • Wangkun Xu, Martin Higgins, Jianhong Wang, Imad M. Jaimoukha, Fei Teng

However, the uncontrollable false positive rate of the data-driven detector and the extra cost of frequent MTD usage limit their wide applications.

Paper
Code

Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks

1 code implementation • NeurIPS 2021 • Jianhong Wang, Wangkun Xu, Yunjie Gu, Wenbin Song, Tim C. Green

This paper presents a problem in power networks that creates an exciting and yet challenging real-world scenario for application of multi-agent reinforcement learning (MARL).

Multi-agent Reinforcement Learning reinforcement-learning +1

160

Paper
Code

SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning

1 code implementation • 31 May 2021 • Jianhong Wang, Yuan Zhang, Yunjie Gu, Tae-Kyun Kim

This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory.

Fairness Q-Learning +2

Paper
Code

Learning to Shape Rewards using a Game of Two Partners

no code implementations • 16 Mar 2021 • David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Tianpei Yang, Matthew Taylor, Wenbin Song, Feifei Tong, Hui Chen, Jiangcheng Zhu, Jun Wang, Yaodong Yang

Reward shaping (RS) is a powerful method in reinforcement learning (RL) for overcoming the problem of sparse or uninformative rewards.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System

2 code implementations • ICLR 2021 • Jianhong Wang, Yuan Zhang, Tae-Kyun Kim, Yunjie Gu

We test HDNO on MultiWoz 2. 0 and MultiWoz 2. 1, the datasets on multi-domain dialogues, in comparison with word-level E2E model trained with RL, LaRL and HDSA, showing improvements on the performance evaluated by automatic evaluation metrics and human evaluation.

Hierarchical Reinforcement Learning reinforcement-learning +2

Paper
Code

Shapley Q-value: A Local Reward Approach to Solve Global Reward Games

2 code implementations • 11 Jul 2019 • Jianhong Wang, Yuan Zhang, Tae-Kyun Kim, Yunjie Gu

To deal with this problem, we i) introduce a cooperative-game theoretical framework called extended convex game (ECG) that is a superset of global reward game, and ii) propose a local reward approach called Shapley Q-value.

Multi-agent Reinforcement Learning Policy Gradient Methods

105

Paper
Code

Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning

1 code implementation • NeurIPS 2018 • Rui Luo, Jianhong Wang, Yaodong Yang, Zhanxing Zhu, Jun Wang

We propose a new sampling method, the thermostat-assisted continuously-tempered Hamiltonian Monte Carlo, for Bayesian learning on large datasets and multimodal distributions.

Paper
Code

Spatio-temporal Aware Non-negative Component Representation for Action Recognition

no code implementations • 27 Aug 2016 • Jianhong Wang, Tian Lan, Xu Zhang, Limin Luo

This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR).

Action Recognition Temporal Action Localization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.