Search Results for author: Atilla Eryilmaz

Found 8 papers, 0 papers with code

Convergence of Gradient Descent for Recurrent Neural Networks: A Nonasymptotic Analysis

no code implementations • 19 Feb 2024 • Semih Cayci, Atilla Eryilmaz

We analyze recurrent neural networks trained with gradient descent in the supervised learning setting for dynamical systems, and prove that gradient descent can achieve optimality \emph{without} massive overparameterization.

Paper
Add Code

A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback

no code implementations • 9 Jun 2021 • Semih Cayci, Yilin Zheng, Atilla Eryilmaz

In a wide variety of applications including online advertising, contractual hiring, and wireless scheduling, the controller is constrained by a stringent budget constraint on the available resources, which are consumed in a random amount by each action, and a stochastic feasibility constraint that may impose important operational limitations on decision-making.

Decision Making Scheduling

Paper
Add Code

Continuous-Time Multi-Armed Bandits with Controlled Restarts

no code implementations • 30 Jun 2020 • Semih Cayci, Atilla Eryilmaz, R. Srikant

Time-constrained decision processes have been ubiquitous in many fundamental applications in physics, biology and computer science.

Multi-Armed Bandits

Paper
Add Code

Group-Fair Online Allocation in Continuous Time

no code implementations • NeurIPS 2020 • Semih Cayci, Swati Gupta, Atilla Eryilmaz

Furthermore, as a consequence of certain ethical and economic concerns, the controller may impose deadlines on the completion of each task, and require fairness across different groups in the allocation of total time budget $B$.

Cloud Computing Decision Making +2

Paper
Add Code

Budget-Constrained Bandits over General Cost and Reward Distributions

no code implementations • 29 Feb 2020 • Semih Cayci, Atilla Eryilmaz, R. Srikant

We prove a regret lower bound for this problem, and show that the proposed algorithms achieve tight problem-dependent regret bounds, which are optimal up to a universal constant factor in the case of jointly Gaussian cost and reward pairs.

Paper
Add Code

Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks

no code implementations • 27 Nov 2018 • Semih Cayci, Atilla Eryilmaz

We study the problem of serving randomly arriving and delay-sensitive traffic over a multi-channel communication system with time-varying channel states and unknown statistics.

Thompson Sampling

Paper
Add Code

Combinatorial Multi-Objective Multi-Armed Bandit Problem

no code implementations • 11 Mar 2018 • Doruk Öner, Altuğ Karakurt, Atilla Eryilmaz, Cem Tekin

In this paper, we introduce the COmbinatorial Multi-Objective Multi-Armed Bandit (COMO-MAB) problem that captures the challenges of combinatorial and multi-objective online learning simultaneously.

Paper
Add Code

Reward Maximization Under Uncertainty: Leveraging Side-Observations on Networks

no code implementations • 26 Apr 2017 • Swapna Buccapatnam, Fang Liu, Atilla Eryilmaz, Ness B. Shroff

We study the stochastic multi-armed bandit (MAB) problem in the presence of side-observations across actions that occur as a result of an underlying network structure.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.