Search Results for author: Kaige Yang

Found 6 papers, 0 papers with code

Learn Dynamic-Aware State Embedding for Transfer Learning

no code implementations6 Jan 2021 Kaige Yang

In this setting, the MDP dynamic is a good knowledge to transfer, which can be inferred by uniformly random policy.

Transfer Reinforcement Learning

Differentiable Linear Bandit Algorithm

no code implementations4 Jun 2020 Kaige Yang, Laura Toni

Theoretically, we show that the proposed algorithm achieves a $\tilde{\mathcal{O}}(\hat{\beta}\sqrt{dT})$ upper bound of $T$-round regret, where $d$ is the dimension of arm features and $\hat{\beta}$ is the learned size of confidence bound.

Laplacian-regularized graph bandits: Algorithms and theoretical analysis

no code implementations12 Jul 2019 Kaige Yang, Xiaowen Dong, Laura Toni

In terms of network regret (sum of cumulative regret over $n$ users), the proposed algorithm leads to a scaling as $\tilde{\mathcal{O}}(\Psi d\sqrt{nT})$, which is a significant improvement over $\tilde{\mathcal{O}}(nd\sqrt{T})$ in the state-of-the-art algorithm \algo{Gob. Lin} \Ccite{cesa2013gang}.

Error Analysis on Graph Laplacian Regularized Estimator

no code implementations11 Feb 2019 Kaige Yang, Xiaowen Dong, Laura Toni

We provide a theoretical analysis of the representation learning problem aimed at learning the latent variables (design matrix) $\Theta$ of observations $Y$ with the knowledge of the coefficient matrix $X$.

Representation Learning

Data Driven Chiller Plant Energy Optimization with Domain Knowledge

no code implementations3 Dec 2018 Hoang Dung Vu, Kok Soon Chai, Bryan Keating, Nurislam Tursynbek, Boyan Xu, Kaige Yang, Xiaoyan Yang, Zhenjie Zhang

Refrigeration and chiller optimization is an important and well studied topic in mechanical engineering, mostly taking advantage of physical models, designed on top of over-simplified assumptions, over the equipments.

BIG-bench Machine Learning

Graph-Based Recommendation System

no code implementations31 Jul 2018 Kaige Yang, Laura Toni

In this work, we study recommendation systems modelled as contextual multi-armed bandit (MAB) problems.

Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.