Search Results for author: Sahin Lale

Found 18 papers, 2 papers with code

EKGNet: A 10.96μW Fully Analog Neural Network for Intra-Patient Arrhythmia Classification

no code implementations • 24 Oct 2023 • Benyamin Haghi, Lin Ma, Sahin Lale, Anima Anandkumar, Azita Emami

We present an integrated approach that combines analog computing and deep learning for electrocardiogram (ECG) arrhythmia classification.

Classification

Forecasting subcritical cylinder wakes with Fourier Neural Operators

no code implementations • 19 Jan 2023 • Peter I Renn, Cong Wang, Sahin Lale, Zongyi Li, Anima Anandkumar, Morteza Gharib

The learned FNO solution operator can be evaluated in milliseconds, potentially enabling faster-than-real-time modeling for predictive flow control in physical systems.
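The speed comes from the architecture: each FNO layer applies a learned linear operator to a truncated set of Fourier modes, so evaluation cost is dominated by FFTs. A minimal 1-D sketch of such a spectral layer in PyTorch (illustrative only; the class name, shapes, and initialization are our assumptions, not the authors' implementation):

```python
import torch
import torch.nn as nn

class SpectralConv1d(nn.Module):
    """Minimal 1-D Fourier layer: FFT, keep the lowest `modes`
    frequencies, multiply by learned complex weights, inverse FFT."""
    def __init__(self, channels: int, modes: int):
        super().__init__()
        self.modes = modes
        scale = 1.0 / channels
        self.weights = nn.Parameter(
            scale * torch.randn(channels, channels, modes, dtype=torch.cfloat))

    def forward(self, x):            # x: (batch, channels, grid)
        x_ft = torch.fft.rfft(x)     # to Fourier space
        out_ft = torch.zeros_like(x_ft)
        out_ft[:, :, :self.modes] = torch.einsum(
            "bim,iom->bom", x_ft[:, :, :self.modes], self.weights)
        return torch.fft.irfft(out_ft, n=x.size(-1))  # back to physical space
```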

Operator learning

Thompson Sampling Achieves $\tilde O(\sqrt{T})$ Regret in Linear Quadratic Control

no code implementations • 17 Jun 2022 • Taylan Kargin, Sahin Lale, Kamyar Azizzadenesheli, Anima Anandkumar, Babak Hassibi

By carefully prescribing an early exploration strategy and a policy update rule, we show that TS achieves order-optimal regret in adaptive control of multidimensional stabilizable LQRs.
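At a high level, TS for LQR alternates between sampling a model from the posterior and acting greedily against that sample. A rough sketch (not the paper's algorithm, which additionally prescribes an early-exploration phase and a specific update rule; `sample_model`, `step`, and `epoch_len` are hypothetical placeholders):

```python
import numpy as np
from scipy.linalg import solve_discrete_are

def lqr_gain(A, B, Q, R):
    """Certainty-equivalent LQR gain for a sampled model."""
    P = solve_discrete_are(A, B, Q, R)
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

def thompson_sampling_lqr(sample_model, step, Q, R, x0, horizon, epoch_len):
    x, K = x0, None
    for t in range(horizon):
        if t % epoch_len == 0:             # lazy policy updates
            A_hat, B_hat = sample_model()  # draw (A, B) from the posterior
            K = lqr_gain(A_hat, B_hat, Q, R)
        u = -K @ x                         # act greedily w.r.t. the sample
        x = step(x, u)                     # apply input, observe next state
    return x
```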

Decision Making, Decision Making Under Uncertainty, +1

KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

no code implementations • 3 Jun 2022 • Sahin Lale, Yuanyuan Shi, Guannan Qu, Kamyar Azizzadenesheli, Adam Wierman, Anima Anandkumar

However, current reinforcement learning (RL) methods lack stabilization guarantees, which limits their applicability for the control of safety-critical systems.
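The stability constraint in KCRL draws on Krasovskii's method for constructing Lyapunov functions. For reference, the classical condition (stated in our notation, not necessarily the paper's exact constraint): for $\dot{x} = f(x)$, take $V(x) = f(x)^\top P f(x)$ with $P \succ 0$; then

```latex
\dot{V}(x) = f(x)^\top \left( J_f(x)^\top P + P\, J_f(x) \right) f(x),
\qquad J_f(x) = \frac{\partial f}{\partial x}(x),
```

so asymptotic stability follows whenever $J_f(x)^\top P + P\, J_f(x) \prec 0$ for all $x$.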

Reinforcement Learning (RL)

Optimal Competitive-Ratio Control

no code implementations • 3 Jun 2022 • Oron Sabag, Sahin Lale, Babak Hassibi

The key techniques that underpin our explicit solution are a reduction of the control problem to a Nehari problem and a novel factorization of the clairvoyant controller's cost.
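For context, the competitive ratio of a causal controller $\pi$ is the worst-case ratio of its cost to that of the clairvoyant (noncausal) controller $\pi_{\mathrm{nc}}$ over disturbance sequences $w$ (notation ours):

```latex
\mathrm{CR}(\pi) \;=\; \sup_{w \neq 0} \; \frac{J(\pi, w)}{J(\pi_{\mathrm{nc}}, w)}.
```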

Explicit Regularization via Regularizer Mirror Descent

no code implementations • 22 Feb 2022 • Navid Azizan, Sahin Lale, Babak Hassibi

RMD starts from a standard cost, the sum of the training loss and a convex regularizer of the weights.
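RMD builds on the stochastic mirror descent template. For context, the generic SMD update with a strictly convex potential $\psi$ (the paper's RMD recursion builds on, but is not identical to, this step) is

```latex
\nabla \psi(w_{t+1}) \;=\; \nabla \psi(w_t) \;-\; \eta \, \nabla \ell_t(w_t),
```

which reduces to SGD when $\psi(w) = \tfrac{1}{2}\|w\|^2$.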

Continual Learning

CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

1 code implementation • 14 Dec 2021 • Kevin Huang, Sahin Lale, Ugo Rosolia, Yuanyuan Shi, Anima Anandkumar

It then uses the top trajectories as initialization for gradient descent and applies gradient updates to each of these trajectories to find the optimal action sequence.
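A compact sketch of that two-stage planner, assuming a differentiable learned model (`dynamics`, `cost`, and all hyperparameter names here are our placeholders, not the released implementation):

```python
import torch

def rollout_cost(dynamics, cost, x0, actions):
    """Accumulate model-predicted cost along one action sequence."""
    x, total = x0, 0.0
    for u in actions:
        total = total + cost(x, u)
        x = dynamics(x, u)
    return total

def cem_gd_plan(dynamics, cost, x0, horizon, act_dim,
                n_samples=500, top_k=10, gd_steps=20, lr=0.01):
    # Stage 1 (CEM-style): sample action sequences, keep the top-k.
    actions = torch.randn(n_samples, horizon, act_dim)
    costs = torch.stack([rollout_cost(dynamics, cost, x0, a) for a in actions])
    elite = actions[costs.topk(top_k, largest=False).indices]

    # Stage 2: refine each elite trajectory by gradient descent
    # through the differentiable model.
    elite = elite.clone().requires_grad_(True)
    opt = torch.optim.Adam([elite], lr=lr)
    for _ in range(gd_steps):
        opt.zero_grad()
        torch.stack([rollout_cost(dynamics, cost, x0, a)
                     for a in elite]).sum().backward()
        opt.step()

    final = torch.stack([rollout_cost(dynamics, cost, x0, a) for a in elite])
    return elite[final.argmin()].detach()   # best refined action sequence
```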

Continuous Control, Model-based Reinforcement Learning, +1

Finite-time System Identification and Adaptive Control in Autoregressive Exogenous Systems

no code implementations • 26 Aug 2021 • Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

Using these guarantees, we design adaptive control algorithms for unknown ARX systems with arbitrary strongly convex or convex quadratic regulating costs.
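For reference, an ARX (autoregressive exogenous) system relates the output to past outputs and inputs (notation ours):

```latex
y_t \;=\; \sum_{i=1}^{n} a_i\, y_{t-i} \;+\; \sum_{j=1}^{m} b_j\, u_{t-j} \;+\; w_t,
```

with the coefficients $(a_i, b_j)$ unknown and identified from data.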

Regret-Optimal LQR Control

no code implementations • 4 May 2021 • Oron Sabag, Gautam Goel, Sahin Lale, Babak Hassibi

Motivated by competitive analysis in online learning, we introduce the dynamic regret as a criterion for controller design: the difference between the LQR cost of a causal controller (which has access only to past disturbances) and the LQR cost of the unique clairvoyant controller (which also has access to future disturbances), known to dominate all other controllers.
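In symbols (notation ours):

```latex
\mathrm{Regret}(T) \;=\; J_T(\pi_{\mathrm{causal}}) \;-\; J_T(\pi_{\mathrm{clairvoyant}}),
```

and the regret-optimal controller minimizes the worst case of this difference over disturbance sequences.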

Learning Theory

Stable Online Control of Linear Time-Varying Systems

no code implementations • 29 Apr 2021 • Guannan Qu, Yuanyuan Shi, Sahin Lale, Anima Anandkumar, Adam Wierman

In this work, we propose an efficient online control algorithm, COvariance Constrained Online Linear Quadratic (COCO-LQ) control, that guarantees input-to-state stability for a large class of LTV systems while also minimizing the control cost.
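Input-to-state stability here is the standard notion: the state is bounded by a decaying function of the initial condition plus a gain on the disturbance,

```latex
\|x_t\| \;\le\; \beta(\|x_0\|, t) \;+\; \gamma\!\left( \sup_{0 \le s \le t} \|w_s\| \right),
```

for some class-$\mathcal{KL}$ function $\beta$ and class-$\mathcal{K}$ function $\gamma$.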

Stability and Identification of Random Asynchronous Linear Time-Invariant Systems

no code implementations • 8 Dec 2020 • Sahin Lale, Oguzhan Teke, Babak Hassibi, Anima Anandkumar

In this model, each state variable is updated randomly and asynchronously with some probability according to the underlying system dynamics.
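That update model is easy to state in code; a minimal simulation sketch (the probability `p` and the system matrix are illustrative):

```python
import numpy as np

def async_lti_step(A, x, p, rng):
    """One step of a randomly asynchronous LTI system: each state
    coordinate is refreshed from the synchronous update A @ x with
    probability p, and otherwise holds its previous value."""
    x_sync = A @ x
    mask = rng.random(x.shape[0]) < p   # which coordinates update this step
    return np.where(mask, x_sync, x)

rng = np.random.default_rng(0)
A = np.array([[0.9, 0.2], [-0.1, 0.95]])
x = np.ones(2)
for _ in range(100):
    x = async_lti_step(A, x, p=0.5, rng=rng)
```

A point studied in this line of work is that such asynchrony can change stability properties relative to the synchronous system.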

Adaptive Control and Regret Minimization in Linear Quadratic Gaussian (LQG) Setting

no code implementations • 12 Mar 2020 • Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

We study the problem of adaptive control in partially observable linear quadratic Gaussian control systems, where the model dynamics are unknown a priori.
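The setting is the standard partially observable linear-Gaussian model (notation ours):

```latex
x_{t+1} = A x_t + B u_t + w_t, \qquad y_t = C x_t + z_t,
```

with Gaussian process and measurement noises $w_t, z_t$, quadratic cost $\mathbb{E}\big[\sum_t x_t^\top Q x_t + u_t^\top R u_t\big]$, and $(A, B, C)$ unknown a priori.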

Regret Minimization in Partially Observable Linear Quadratic Control

no code implementations • 31 Jan 2020 • Sahin Lale, Kamyar Azizzadenesheli, Babak Hassibi, Anima Anandkumar

We propose a novel way to decompose the regret and provide an end-to-end sublinear regret upper bound for partially observable linear quadratic control.
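Here regret compares accumulated cost against the best steady-state cost in hindsight (notation ours):

```latex
\mathrm{Regret}(T) \;=\; \sum_{t=1}^{T} c_t \;-\; T\, J_*,
```

where $c_t$ is the cost incurred at step $t$ and $J_*$ is the optimal average cost under known dynamics; sublinear regret means the time-averaged excess cost vanishes.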

Stochastic Mirror Descent on Overparameterized Nonlinear Models

no code implementations • 25 Sep 2019 • Navid Azizan, Sahin Lale, Babak Hassibi

On the theory side, we show that in the overparameterized nonlinear setting, if the initialization is close enough to the manifold of global optima, SMD with sufficiently small step size converges to a global minimum that is approximately the closest global minimum in Bregman divergence, thus attaining approximate implicit regularization.
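The Bregman divergence appearing in this guarantee, for a strictly convex potential $\psi$, is

```latex
D_{\psi}(w, w') \;=\; \psi(w) - \psi(w') - \nabla \psi(w')^{\top} (w - w'),
```

so the statement says SMD lands approximately at the interpolating solution closest to the initialization $w_0$ as measured by $D_\psi(\cdot\,, w_0)$.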

Stochastic Mirror Descent on Overparameterized Nonlinear Models: Convergence, Implicit Regularization, and Generalization

1 code implementation • 10 Jun 2019 • Navid Azizan, Sahin Lale, Babak Hassibi

Most modern learning problems are highly overparameterized, meaning that there are many more parameters than the number of training data points, and as a result, the training loss may have infinitely many global minima (parameter vectors that perfectly interpolate the training data).

Stochastic Linear Bandits with Hidden Low Rank Structure

no code implementations • 28 Jan 2019 • Sahin Lale, Kamyar Azizzadenesheli, Anima Anandkumar, Babak Hassibi

We recast the image classification task in the SLB setting and empirically show that, when a pre-trained DNN provides the high-dimensional feature representations, deploying PSLB yields a significant reduction in regret and faster convergence to an accurate model compared to state-of-the-art algorithms.

Decision Making, Dimensionality Reduction, +2
