no code implementations • NeurIPS 2020 • Ruosong Wang, Peilin Zhong, Simon S. Du, Russ R. Salakhutdinov, Lin Yang
Standard sequential decision-making paradigms aim to maximize the cumulative reward when interacting with the unknown environment., i. e., maximize $\sum_{h = 1}^H r_h$ where $H$ is the planning horizon.
no code implementations • NeurIPS 2019 • Zhilin Yang, Thang Luong, Russ R. Salakhutdinov, Quoc V. Le
The softmax bottleneck has been shown to limit the expressiveness of neural language models.
1 code implementation • NeurIPS 2019 • Charlie Tang, Russ R. Salakhutdinov
Towards these goals, we introduce a probabilistic framework that efficiently learns latent variables to jointly model the multi-step future motions of agents in a scene.