no code implementations • 16 Dec 2019 • Dan Schmidt, Nick Moran, Jonathan S. Rosenfeld, Jonathan Rosenthal, Jonathan Yedidia
The AlphaZero algorithm for the learning of strategy games via self-play, which has produced superhuman ability in the games of Go, chess, and shogi, uses a quantitative reward function for game outcomes, requiring the users of the algorithm to explicitly balance different components of the reward against each other, such as the game winner and margin of victory.
no code implementations • 10 May 2018 • Veit Elser, Dan Schmidt, Jonathan Yedidia
This constraint is simply that the value of the output node associated with the correct class should be zero.
no code implementations • 18 Jan 2016 • Patrick Eschenfeldt, Dan Schmidt, Stark Draper, Jonathan Yedidia
We introduce a new type of graphical model that we call a "memory factor network" (MFN).
no code implementations • 12 May 2015 • Charles Mathy, Nate Derbinsky, José Bento, Jonathan Rosenthal, Jonathan Yedidia
We describe a new instance-based learning algorithm called the Boundary Forest (BF) algorithm, that can be used for supervised and unsupervised learning.
no code implementations • NeurIPS 2013 • Jose Bento, Nate Derbinsky, Javier Alonso-Mora, Jonathan Yedidia
We describe a novel approach for computing collision-free \emph{global} trajectories for $p$ agents with specified initial and final configurations, based on an improved version of the alternating direction method of multipliers (ADMM).