Search Results for author: Sydney Nguyen

Found 2 papers, 1 papers with code

StudentEval: A Benchmark of Student-Written Prompts for Large Language Models of Code

no code implementations • 7 Jun 2023 • Hannah McLean Babe, Sydney Nguyen, Yangtian Zi, Arjun Guha, Molly Q Feldman, Carolyn Jane Anderson

We use StudentEval to evaluate 5 Code LLMs and find that StudentEval is a better discriminator of model performance than existing benchmarks.

Code Generation

Paper
Add Code

MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

1 code implementation • 17 Aug 2022 • Federico Cassano, John Gouwar, Daniel Nguyen, Sydney Nguyen, Luna Phipps-Costin, Donald Pinckney, Ming-Ho Yee, Yangtian Zi, Carolyn Jane Anderson, Molly Q Feldman, Arjun Guha, Michael Greenberg, Abhinav Jangda

Using these new parallel benchmarks, we evaluate the multi-language performance of three state-of-the-art code generation models: Codex, CodeGen, and InCoder.

Benchmarking Code Generation

159

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.