Search Results for author: Md. Mosaddek Khan

Found 14 papers, 2 papers with code

A Graph Neural Network-Based QUBO-Formulated Hamiltonian-Inspired Loss Function for Combinatorial Optimization using Reinforcement Learning

no code implementations • 27 Nov 2023 • Redwan Ahmed Rizvee, Raheeb Hasan, Md. Mosaddek Khan

We formulate and empirically evaluate the compatibility of the QUBO-formulated Hamiltonian as the generic reward function in the RL-based paradigm in the form of rewards.

Combinatorial Optimization Reinforcement Learning (RL)

Paper
Add Code

DePAint: A Decentralized Safe Multi-Agent Reinforcement Learning Algorithm considering Peak and Average Constraints

1 code implementation • 22 Oct 2023 • Raheeb Hassan, K. M. Shadman Wadith, Md. Mamun or Rashid, Md. Mosaddek Khan

In this paper, we address the problem of multi-agent policy optimization in a decentralized setting, where agents communicate with their neighbors to maximize the sum of their cumulative rewards while also satisfying each agent's safety constraints.

Multi-agent Reinforcement Learning Privacy Preserving +1

Paper
Code

A Graph Neural Network-Based QUBO-Formulated Hamiltonian-Inspired Loss Function for Combinatorial Optimization using Reinforcement Learning

no code implementations • 27 Aug 2023 • Redwan Ahmed Rizvee, Md. Mosaddek Khan

The Hamiltonian function is often used to formulate QUBO problems where it is used as the objective function in the context of optimization.

Combinatorial Optimization reinforcement-learning

Paper
Add Code

Improving Causal Effect Estimation of Weighted RegressionBased Estimator using Neural Networks

no code implementations • 28 Oct 2021 • Plabon Shaha, Talha Islam Zadid, Ismat Rahman, Md. Mosaddek Khan

Estimating causal effects from observational data informs us about which factors are important in an autonomous system, and enables us to take better decisions.

regression

Paper
Add Code

Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

no code implementations • 16 Oct 2021 • Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan

Against this background, we introduce a simulation based online planning algorithm, that we call SiCLOP, for multi-agent cooperative environments.

Behavioural cloning Decision Making +1

Paper
Add Code

Accelerating Recursive Partition-Based Causal Structure Learning

no code implementations • 23 Feb 2021 • Md. Musfiqur Rahman, Ayman Rasheed, Md. Mosaddek Khan, Mohammad Ali Javidian, Pooyan Jamshidi, Md. Mamun-or-Rashid

This paper proposes a generic causal structure refinement strategy that can locate the undesired relations with a small number of CI-tests, thus speeding up the algorithm for large and complex problems.

Causal Discovery Decision Making +1

Paper
Add Code

Improving Solution Quality of Bounded Max-Sum Algorithm to Solve DCOPs involving Hard and Soft Constraints

no code implementations • 2 Dec 2020 • Md. Musfiqur Rahman, Mashrur Rashik, Md. Mamun-or-Rashid, Md. Mosaddek Khan

In particular, BMS algorithm is able to solve problems of this type having large search space at the expense of low computational cost.

Paper
Add Code

A Particle Swarm Inspired Approach for Continuous Distributed Constraint Optimization Problems

no code implementations • 20 Oct 2020 • Moumita Choudhury, Amit Sarker, Md. Mosaddek Khan, William Yeoh

To address this issue, we propose a new C-DCOP algorithm, namely Particle Swarm Optimization Based C-DCOP (PCD), which is inspired by Particle Swarm Optimization (PSO), a well-known centralized population-based approach for solving continuous optimization problems.

Scheduling

Paper
Add Code

On Population-Based Algorithms for Distributed Constraint Optimization Problems

no code implementations • 2 Sep 2020 • Saaduddin Mahmud, Md. Mosaddek Khan, Nicholas R. Jennings

The main characteristic of these algorithms is that they maintain a population of candidate solutions of a given problem and use this population to cover a large area of the search space and to avoid local-optima.

Paper
Add Code

C-CoCoA: A Continuous Cooperative Constraint Approximation Algorithm to Solve Functional DCOPs

no code implementations • 27 Feb 2020 • Amit Sarker, Abdullahil Baki Arif, Moumita Choudhury, Md. Mosaddek Khan

To overcome this limitation, Functional DCOP (F-DCOP) model is proposed that is able to model problems with continuous variables.

Paper
Add Code

Learning Optimal Temperature Region for Solving Mixed Integer Functional DCOPs

no code implementations • 27 Feb 2020 • Saaduddin Mahmud, Md. Mosaddek Khan, Moumita Choudhury, Long Tran-Thanh, Nicholas R. Jennings

Distributed Constraint Optimization Problems (DCOPs) are an important framework for modeling coordinated decision-making problems in multi-agent systems with a set of discrete variables.

Decision Making

Paper
Add Code

Speeding Up Distributed Pseudo-tree Optimization Procedure with Cross Edge Consistency to Solve DCOPs

no code implementations • 14 Sep 2019 • Mashrur Rashik, Md. Musfiqur Rahman, Md. Mamun-or-Rashid, Md. Mosaddek Khan

Distributed Pseudo-tree Optimization Procedure (DPOP) is a well-known message passing algorithm that has been used to provide optimal solutions of Distributed Constraint Optimization Problems (DCOPs) -- a framework that is designed to optimize constraints in cooperative multi-agent systems.

Scheduling

Paper
Add Code

AED: An Anytime Evolutionary DCOP Algorithm

no code implementations • 13 Sep 2019 • Saaduddin Mahmud, Moumita Choudhury, Md. Mosaddek Khan, Long Tran-Thanh, Nicholas R. Jennings

Evolutionary optimization is a generic population-based metaheuristic that can be adapted to solve a wide variety of optimization problems and has proven very effective for combinatorial optimization problems.

Combinatorial Optimization

Paper
Add Code

A Gray Box Interpretable Visual Debugging Approach for Deep Sequence Learning Model

1 code implementation • 20 Nov 2018 • Md Mofijul Islam, Amar Debnath, Tahsin Al Sayeed, Jyotirmay Nag Setu, Md Mahmudur Rahman, Md Sadman Sakib, Md Abdur Razzaque, Md. Mosaddek Khan, Swakkhar Shatabda

In this work, we have developed a visual interactive web application, namely d-DeVIS, which helps to visualize the internal reasoning of the learning model which is trained on the audio data.

Decision Making

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.