no code implementations • 27 Nov 2023 • Redwan Ahmed Rizvee, Raheeb Hasan, Md. Mosaddek Khan
We formulate and empirically evaluate the compatibility of the QUBO-formulated Hamiltonian as the generic reward function in the RL-based paradigm in the form of rewards.
1 code implementation • 22 Oct 2023 • Raheeb Hassan, K. M. Shadman Wadith, Md. Mamun or Rashid, Md. Mosaddek Khan
In this paper, we address the problem of multi-agent policy optimization in a decentralized setting, where agents communicate with their neighbors to maximize the sum of their cumulative rewards while also satisfying each agent's safety constraints.
no code implementations • 27 Aug 2023 • Redwan Ahmed Rizvee, Md. Mosaddek Khan
The Hamiltonian function is often used to formulate QUBO problems where it is used as the objective function in the context of optimization.
no code implementations • 28 Oct 2021 • Plabon Shaha, Talha Islam Zadid, Ismat Rahman, Md. Mosaddek Khan
Estimating causal effects from observational data informs us about which factors are important in an autonomous system, and enables us to take better decisions.
no code implementations • 16 Oct 2021 • Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan
Against this background, we introduce a simulation based online planning algorithm, that we call SiCLOP, for multi-agent cooperative environments.
no code implementations • 23 Feb 2021 • Md. Musfiqur Rahman, Ayman Rasheed, Md. Mosaddek Khan, Mohammad Ali Javidian, Pooyan Jamshidi, Md. Mamun-or-Rashid
This paper proposes a generic causal structure refinement strategy that can locate the undesired relations with a small number of CI-tests, thus speeding up the algorithm for large and complex problems.
no code implementations • 2 Dec 2020 • Md. Musfiqur Rahman, Mashrur Rashik, Md. Mamun-or-Rashid, Md. Mosaddek Khan
In particular, BMS algorithm is able to solve problems of this type having large search space at the expense of low computational cost.
no code implementations • 20 Oct 2020 • Moumita Choudhury, Amit Sarker, Md. Mosaddek Khan, William Yeoh
To address this issue, we propose a new C-DCOP algorithm, namely Particle Swarm Optimization Based C-DCOP (PCD), which is inspired by Particle Swarm Optimization (PSO), a well-known centralized population-based approach for solving continuous optimization problems.
no code implementations • 2 Sep 2020 • Saaduddin Mahmud, Md. Mosaddek Khan, Nicholas R. Jennings
The main characteristic of these algorithms is that they maintain a population of candidate solutions of a given problem and use this population to cover a large area of the search space and to avoid local-optima.
no code implementations • 27 Feb 2020 • Amit Sarker, Abdullahil Baki Arif, Moumita Choudhury, Md. Mosaddek Khan
To overcome this limitation, Functional DCOP (F-DCOP) model is proposed that is able to model problems with continuous variables.
no code implementations • 27 Feb 2020 • Saaduddin Mahmud, Md. Mosaddek Khan, Moumita Choudhury, Long Tran-Thanh, Nicholas R. Jennings
Distributed Constraint Optimization Problems (DCOPs) are an important framework for modeling coordinated decision-making problems in multi-agent systems with a set of discrete variables.
no code implementations • 14 Sep 2019 • Mashrur Rashik, Md. Musfiqur Rahman, Md. Mamun-or-Rashid, Md. Mosaddek Khan
Distributed Pseudo-tree Optimization Procedure (DPOP) is a well-known message passing algorithm that has been used to provide optimal solutions of Distributed Constraint Optimization Problems (DCOPs) -- a framework that is designed to optimize constraints in cooperative multi-agent systems.
no code implementations • 13 Sep 2019 • Saaduddin Mahmud, Moumita Choudhury, Md. Mosaddek Khan, Long Tran-Thanh, Nicholas R. Jennings
Evolutionary optimization is a generic population-based metaheuristic that can be adapted to solve a wide variety of optimization problems and has proven very effective for combinatorial optimization problems.
1 code implementation • 20 Nov 2018 • Md Mofijul Islam, Amar Debnath, Tahsin Al Sayeed, Jyotirmay Nag Setu, Md Mahmudur Rahman, Md Sadman Sakib, Md Abdur Razzaque, Md. Mosaddek Khan, Swakkhar Shatabda
In this work, we have developed a visual interactive web application, namely d-DeVIS, which helps to visualize the internal reasoning of the learning model which is trained on the audio data.