no code implementations • 16 Jan 2014 • Peng Dai, Mausam, Daniel Sabby Weld, Judy Goldsmith
Value iteration is a powerful yet inefficient algorithm for Markov decision processes (MDPs) because it puts the majority of its effort into backing up the entire state space, which turns out to be unnecessary in many cases.