no code implementations • 23 Feb 2024 • Jianhong Wang, Yang Li, Yuan Zhang, Wei Pan, Samuel Kaski
Open ad hoc teamwork further complicates this challenge by considering environments with a changing number of teammates, referred to as open teams.
no code implementations • 23 Feb 2024 • Jianhong Wang
We first extend a game model called convex game and a payoff distribution scheme called Shapley value in cooperative game theory to Markov decision process, named as Markov convex game and Markov Shapley value respectively.
no code implementations • 19 Feb 2024 • Yang Li, WenHao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan
The visualization of learning dynamics effectively demonstrates that AgA successfully achieves alignment between individual and collective objectives.
1 code implementation • 17 Dec 2023 • Wangkun Xu, Jianhong Wang, Fei Teng
Successful machine learning involves a complete pipeline of data, model, and downstream applications.
1 code implementation • NeurIPS 2023 • Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang
To capture the information of sufficient and necessary causes, we employ a classical concept, the probability of sufficiency and necessary causes (PNS), which indicates the probability of whether one is the necessary and sufficient cause.
no code implementations • 13 Aug 2023 • Weishan Ye, Zhiguo Zhang, Min Zhang, Fei Teng, Li Zhang, Linling Li, Gan Huang, Jianhong Wang, Dong Ni, Zhen Liang
In this paper, a semi-supervised Dual-stream Self-Attentive Adversarial Graph Contrastive learning framework (termed as DS-AGC) is proposed to tackle the challenge of limited labeled data in cross-subject EEG-based emotion recognition.
no code implementations • 2 Sep 2022 • Taher Jafferjee, Juliusz Ziomek, Tianpei Yang, Zipeng Dai, Jianhong Wang, Matthew Taylor, Kun Shao, Jun Wang, David Mguni
Centralised training with decentralised execution (CT-DE) serves as the foundation of many leading multi-agent reinforcement learning (MARL) algorithms.
Multi-agent Reinforcement Learning reinforcement-learning +3
1 code implementation • 5 Jul 2022 • Yuan Zhang, Jianhong Wang, Joschka Boedecker
To deal with unknown uncertainty sets, we further propose a novel adversarial approach to generate them based on the value function.
1 code implementation • 21 Jun 2022 • Mingrui Zhang, Jianhong Wang, James Tlhomole, Matthew D. Piggott
General optical flow methods are typically designed for rigid body motion, and thus struggle if applied to fluid motion estimation directly.
1 code implementation • 27 Apr 2022 • Wangkun Xu, Martin Higgins, Jianhong Wang, Imad M. Jaimoukha, Fei Teng
However, the uncontrollable false positive rate of the data-driven detector and the extra cost of frequent MTD usage limit their wide applications.
1 code implementation • NeurIPS 2021 • Jianhong Wang, Wangkun Xu, Yunjie Gu, Wenbin Song, Tim C. Green
This paper presents a problem in power networks that creates an exciting and yet challenging real-world scenario for application of multi-agent reinforcement learning (MARL).
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 31 May 2021 • Jianhong Wang, Yuan Zhang, Yunjie Gu, Tae-Kyun Kim
This paper studies a theoretical framework for value factorisation with interpretability via Shapley value theory.
no code implementations • 16 Mar 2021 • David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez-Nieves, Tianpei Yang, Matthew Taylor, Wenbin Song, Feifei Tong, Hui Chen, Jiangcheng Zhu, Jun Wang, Yaodong Yang
Reward shaping (RS) is a powerful method in reinforcement learning (RL) for overcoming the problem of sparse or uninformative rewards.
2 code implementations • ICLR 2021 • Jianhong Wang, Yuan Zhang, Tae-Kyun Kim, Yunjie Gu
We test HDNO on MultiWoz 2. 0 and MultiWoz 2. 1, the datasets on multi-domain dialogues, in comparison with word-level E2E model trained with RL, LaRL and HDSA, showing improvements on the performance evaluated by automatic evaluation metrics and human evaluation.
Hierarchical Reinforcement Learning reinforcement-learning +2
2 code implementations • 11 Jul 2019 • Jianhong Wang, Yuan Zhang, Tae-Kyun Kim, Yunjie Gu
To deal with this problem, we i) introduce a cooperative-game theoretical framework called extended convex game (ECG) that is a superset of global reward game, and ii) propose a local reward approach called Shapley Q-value.
1 code implementation • NeurIPS 2018 • Rui Luo, Jianhong Wang, Yaodong Yang, Zhanxing Zhu, Jun Wang
We propose a new sampling method, the thermostat-assisted continuously-tempered Hamiltonian Monte Carlo, for Bayesian learning on large datasets and multimodal distributions.
no code implementations • 27 Aug 2016 • Jianhong Wang, Tian Lan, Xu Zhang, Limin Luo
This paper presents a novel mid-level representation for action recognition, named spatio-temporal aware non-negative component representation (STANNCR).