Code Search

49 papers with code • 5 benchmarks • 10 datasets

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search

Benchmarks

Add a Result

These leaderboards are used to track progress in Code Search

Dataset	Best Model	Compare
CodeSearchNet	cpt-code M	See all
CoDesc	Self-attention	See all
CodeXGLUE - AdvTest	CodeT5+ 770M	See all
CodeSearchNet - Ruby	Uni-SBT	See all
CodeXGLUE - WebQueryTest	CodeBERT	See all

Libraries

Use these libraries to find Code Search models and implementations

microsoft/CodeBERT

5 papers

1,973

facebookresearch/CodeGen

2 papers

672

Datasets

Subtasks

Annotated Code Search

Latest papers with no code

Most implemented Social Latest No code

Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation

no code yet • 26 Apr 2023

We further fine-tune UnixCoder, the best-performing model for zero-shot cross-programming language code search, for the Code Cloning task with the SBT IRs of C code-pairs, available in the CodeNet dataset.

Paper
Add Code

Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities

no code yet • 20 Dec 2022

These structures are fundamental to understanding code.

Paper
Add Code

You Don't Know Search: Helping Users Find Code by Automatically Evaluating Alternative Queries

no code yet • 7 Dec 2022

Our main result shows that relative to the control group, users are on average 22% more likely to click on a search result at all on any given day when AQE is active.

Paper
Add Code

ContraCLM: Contrastive Learning For Causal Language Model

no code yet • 3 Oct 2022

Specifically, we attain $44\%$ relative improvement on the Semantic Textual Similarity tasks and $34\%$ on Code-to-Code Search tasks.

Paper
Add Code

CodeDSI: Differentiable Code Search

no code yet • 1 Oct 2022

In an effort to improve the performance of code search, we have investigated docid representation strategies, impact of tokenization on docid structure, and dataset sizes on overall code search performance.

Paper
Add Code

CSSAM:Code Search via Attention Matching of Code Semantics and Structures

no code yet • 8 Aug 2022

By leveraging the residual interaction, a matching module is designed to preserve more code semantics and descriptive features, that enhances the adhesion between the code and its corresponding query text.

Paper
Add Code

CoCoSoDa: Effective Contrastive Learning for Code Search

no code yet • 7 Apr 2022

However, there is still a lot of room for improvement in using contrastive learning for code search.

Paper
Add Code

On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages

no code yet • 5 Apr 2022

Furthermore, some programming languages are inherently different and code written in one language usually cannot be interchanged with the others, i. e., Ruby and Java code possess very different structure.

Paper
Add Code

Accelerating Code Search with Deep Hashing and Code Classification

no code yet • ACL 2022

Code search is to search reusable code snippets from source code corpus based on natural languages queries.

Paper
Add Code

AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees

no code yet • 20 Jan 2022

Specifically, we collect a sheer number of source codes (both Java and Python) from the Alipay code repository and incorporate both syntactic and semantic code knowledge into our model through the help of code parsers, in which AST information of the source codes can be interpreted and integrated.

Paper
Add Code

Code Search

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result