Search Results for author: Yura Burda

Found 3 papers, 1 papers with code

Let's Verify Step by Step

3 code implementations • Preprint 2023 • Hunter Lightman, Vineet Kosaraju, Yura Burda, Harri Edwards, Bowen Baker, Teddy Lee, Jan Leike, John Schulman, Ilya Sutskever, Karl Cobbe

We conduct our own investigation, finding that process supervision significantly outperforms outcome supervision for training models to solve problems from the challenging MATH dataset.

Ranked #1 on Math Word Problem Solving on MATH minival (using extra training data)

Active Learning Math +2

1,282

Paper
Code

Learning Policy Representations in Multiagent Systems

no code implementations • ICML 2018 • Aditya Grover, Maruan Al-Shedivat, Jayesh K. Gupta, Yura Burda, Harrison Edwards

Modeling agent behavior is central to understanding the emergence of complex phenomena in multiagent systems.

Clustering Continuous Control +4

Paper
Add Code

Generative Models for Alignment and Data Efficiency in Language

no code implementations • ICLR 2018 • Dustin Tran, Yura Burda, Ilya Sutskever

We examine how learning from unaligned data can improve both the data efficiency of supervised tasks as well as enable alignments without any supervision.

Decipherment Translation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.