Search Results for author: Zaiyue Yang

Found 6 papers, 1 papers with code

Incentive-aware Electric Vehicle Routing Problem: a Bi-level Model and a Joint Solution Algorithm

no code implementations13 Oct 2021 Canqi Yao, Shibo Chen, Mauro Salazar, Zaiyue Yang

Specifically, we first devise a bi-level model whereby the fleet operator optimizes the routes and charging schedules of the fleet jointly with an incentive rate to reimburse the delivery delays experienced by the customers.

Cooperative Operation of the Fleet Operator and Incentive-aware Customers in an On-demand Delivery System: A Bi-level Approach

no code implementations8 Sep 2021 Canqi Yao, Shibo Chen, Zaiyue Yang

In this paper, we study the cooperative operation problem between the fleet operator and incentive-aware customers in an on-demand delivery system.

Optimal Estimator Design and Properties Analysis for Interconnected Systems with Asymmetric Information Structure

no code implementations21 May 2021 Yan Wang, Junlin Xiong, Zaiyue Yang, Rong Su

We found that there exists a critical probability such that the EEC is bounded if the delay probability is below the critical probability.

Finite-Sample Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation

no code implementations3 Nov 2019 Jun Sun, Gang Wang, Georgios B. Giannakis, Qinmin Yang, Zaiyue Yang

Motivated by the emerging use of multi-agent reinforcement learning (MARL) in engineering applications such as networked robotics, swarming drones, and sensor networks, we investigate the policy evaluation problem in a fully decentralized setting, using temporal-difference (TD) learning with linear function approximation to handle large state spaces in practice.

Multi-agent Reinforcement Learning

Communication-Efficient Distributed Learning via Lazily Aggregated Quantized Gradients

1 code implementation NeurIPS 2019 Jun Sun, Tianyi Chen, Georgios B. Giannakis, Zaiyue Yang

The present paper develops a novel aggregated gradient approach for distributed machine learning that adaptively compresses the gradient communication.

Cannot find the paper you are looking for? You can Submit a new open access paper.