no code implementations • 17 Jun 2020 • Amber Srivastava, Srinivasa M. Salapaka
The central idea underlying our framework is to quantify exploration in terms of the Shannon Entropy of the trajectories under the MDP and determine the stochastic policy that maximizes it while guaranteeing a low value of the expected cost along a trajectory.
no code implementations • 14 Apr 2016 • Mayank Baranwal, Brian Roehl, Srinivasa M. Salapaka
This paper presents a novel and efficient heuristic framework for approximating the solutions to the multiple traveling salesmen problem (m-TSP) and other variants on the TSP.