Code Generation
337 papers with code • 17 benchmarks • 43 datasets
Code Generation is the task of predicting explicit code or program structure from multimodal data sources such as incomplete code, programs in another programming language, natural-language descriptions, or execution examples. Code generation tools can assist the development of automatic programming tools and improve programming productivity.
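To make the task's interface concrete, here is a minimal toy sketch: a natural-language description goes in, a program comes out, and the result is checked by execution against input/output examples (the evaluation style used by execution-based benchmarks such as APPS). Real systems use large language models; this toy "generator" merely retrieves a hand-written snippet, and all names here (`generate_code`, `SNIPPETS`) are hypothetical.

```python
# Toy illustration of the code-generation task: description -> source code.
# A real system would query a trained model; this stub retrieves snippets.
SNIPPETS = {
    "reverse a string": "def solve(s):\n    return s[::-1]\n",
    "sum a list of numbers": "def solve(xs):\n    return sum(xs)\n",
}

def generate_code(description: str) -> str:
    """Return a program for the given natural-language description."""
    try:
        return SNIPPETS[description.lower().strip()]
    except KeyError:
        raise ValueError(f"no program known for: {description!r}")

# Execution-based evaluation: run the generated program on an example.
program = generate_code("reverse a string")
namespace = {}
exec(program, namespace)          # defines solve() in `namespace`
assert namespace["solve"]("abc") == "cba"
```

The point of the sketch is the evaluation loop at the bottom: rather than comparing generated text to a reference solution, execution-based benchmarks run the candidate program and compare its outputs, which is why they scale to many distinct correct solutions.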
Source: Deep Learning for Source Code Modeling and Generation
Image source: Measuring Coding Challenge Competence With APPS
Libraries
Use these libraries to find Code Generation models and implementations.
Latest papers
Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
To tackle this challenge, we propose Self-Organized multi-Agent framework (SoA), a novel multi-agent framework that enables the scalable and efficient generation and optimization of large-scale code.
EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories
Existing benchmarks demonstrate poor alignment with real-world code repositories and are insufficient to evaluate the coding abilities of LLMs.
CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks
To demonstrate the complexity and solvability of examples in Exec-CSN, we present a human study demonstrating that 81.3% of the examples can be solved by humans and 61% are rated as "requires effort to solve".
Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM
Such limitations inevitably lead us to inquire: Is the leaderboard performance on existing benchmarks reliable and comprehensive enough to measure the program synthesis ability of LLMs?
CYCLE: Learning to Self-Refine the Code Generation
Pre-trained code language models have achieved promising performance in code generation and improved the programming efficiency of human developers.
Diffusion-based Aesthetic QR Code Generation via Scanning-Robust Perceptual Guidance
In this paper, we introduce a novel diffusion-model-based aesthetic QR code generation pipeline, utilizing a pre-trained ControlNet and guided iterative refinement via a novel classifier guidance (SRG) based on the proposed Scanning-Robust Loss (SRL), tailored to QR code mechanisms, which ensures both aesthetics and scannability.
Exploring Language Model's Code Generation Ability with Auxiliary Functions
However, our analysis also reveals the models' underutilization of auxiliary function calls, suggesting a future direction: enhancing their implementation by eliciting the auxiliary function call ability encoded in the models.
DevBench: A Comprehensive Benchmark for Software Development
Recent advancements in large language models (LLMs) have significantly enhanced their coding capabilities.
CleanAgent: Automating Data Standardization with LLM-based Agents
Data standardization is a crucial part of the data science life cycle.
Bugs in Large Language Models Generated Code: An Empirical Study
The bug patterns are presented in the form of a taxonomy.