Search Results for author: S. Rasoul Etesami

Found 12 papers, 0 papers with code

Learning How to Strategically Disclose Information

no code implementations • 13 Mar 2024 • Raj Kiriti Velicheti, Melih Bastopcu, S. Rasoul Etesami, Tamer Başar

In this work, we consider an online version of information design where a sender interacts with a receiver of an unknown type who is adversarially chosen at each round.

Paper
Add Code

Scalable and Independent Learning of Nash Equilibrium Policies in $n$-Player Stochastic Games with Unknown Independent Chains

no code implementations • 4 Dec 2023 • Tiancheng Qin, S. Rasoul Etesami

Specifically, under no assumptions on the reward functions, we show the proposed algorithm converges in polynomial time in a weaker distance (namely, the averaged Nikaido-Isoda gap) to the set of $\epsilon$-NE policies with arbitrarily high probability.

Paper
Add Code

Online Reinforcement Learning in Markov Decision Process Using Linear Programming

no code implementations • 31 Mar 2023 • Vincent Leon, S. Rasoul Etesami

We consider online reinforcement learning in episodic Markov decision process (MDP) with unknown transition function and stochastic rewards drawn from some fixed but unknown distribution.

reinforcement-learning

Paper
Add Code

Local Environment Poisoning Attacks on Federated Reinforcement Learning

no code implementations • 5 Mar 2023 • Evelyn Ma, Praneet Rathi, S. Rasoul Etesami

However, the federated mechanism also exposes the system to poisoning by malicious agents that can mislead the trained policy.

Federated Learning OpenAI Gym +2

Paper
Add Code

The Role of Local Steps in Local SGD

no code implementations • 14 Mar 2022 • Tiancheng Qin, S. Rasoul Etesami, César A. Uribe

Our main contribution is to characterize the convergence rate of Local SGD as a function of $\{H_i\}_{i=1}^R$ under various settings of strongly convex, convex, and nonconvex local functions, where $R$ is the total number of communication rounds.

Stochastic Optimization

Paper
Add Code

Faster Convergence of Local SGD for Over-Parameterized Models

no code implementations • 30 Jan 2022 • Tiancheng Qin, S. Rasoul Etesami, César A. Uribe

For general convex loss functions, we establish an error bound of $\O(1/T)$ under a mild data similarity assumption and an error bound of $\O(K/T)$ otherwise, where $K$ is the number of local steps.

Paper
Add Code

Learning Stationary Nash Equilibrium Policies in $n$-Player Stochastic Games with Independent Chains

no code implementations • 28 Jan 2022 • S. Rasoul Etesami

We consider a subclass of $n$-player stochastic games, in which players have their own internal state/action spaces while they are coupled through their payoff functions.

energy management Management

Paper
Add Code

Online Learning in Budget-Constrained Dynamic Colonel Blotto Games

no code implementations • 23 Mar 2021 • Vincent Leon, S. Rasoul Etesami

In this paper, we study the strategic allocation of limited resources using a Colonel Blotto game (CBG) under a dynamic setting and analyze the problem using an online learning approach.

Paper
Add Code

Maximizing Social Welfare Subject to Network Externalities: A Unifying Submodular Optimization Approach

no code implementations • 17 Feb 2021 • S. Rasoul Etesami

We consider the problem of allocating multiple indivisible items to a set of networked agents to maximize the social welfare subject to network externalities.

Computer Science and Game Theory Discrete Mathematics Multiagent Systems Systems and Control Systems and Control Optimization and Control

Paper
Add Code

Communication-efficient Decentralized Local SGD over Undirected Networks

no code implementations • 6 Nov 2020 • Tiancheng Qin, S. Rasoul Etesami, César A. Uribe

Agents have access to $F$ through noisy gradients, and they can locally communicate with their neighbors a network.

Paper
Add Code

Maximizing Convergence Time in Network Averaging Dynamics Subject to Edge Removal

no code implementations • 11 Sep 2020 • S. Rasoul Etesami

We consider the consensus interdiction problem (CIP), in which the goal is to maximize the convergence time of consensus averaging dynamics subject to removing a limited number of network edges.

Paper
Add Code

Toward Optimal Adversarial Policies in the Multiplicative Learning System with a Malicious Expert

no code implementations • 2 Jan 2020 • S. Rasoul Etesami, Negar Kiyavash, Vincent Leon, H. Vincent Poor

We consider a learning system based on the conventional multiplicative weight (MW) rule that combines experts' advice to predict a sequence of true outcomes.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.