Search Results for author: Shahin Honarvar

Found 1 papers, 1 papers with code

Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code

1 code implementation22 Dec 2023 Shahin Honarvar, Mark van der Wilk, Alastair Donaldson

Thus, from a single question template, it is possible to ask an LLM a $\textit{neighbourhood}$ of very similar programming questions, and assess the correctness of the result returned for each question.

Code Generation

Cannot find the paper you are looking for? You can Submit a new open access paper.