Code Search

49 papers with code • 5 benchmarks • 10 datasets

The goal of Code Search is to retrieve code fragments from a large code corpus that most closely match a developer’s intent, which is expressed in natural language.

Source: When Deep Learning Met Code Search
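
To make the definition above concrete, here is a minimal sketch of the retrieval step: a natural-language query and each code fragment are turned into bag-of-words vectors and ranked by cosine similarity. This is a lexical baseline for illustration only, not the method of any paper listed here. The names `tokenize`, `cosine`, `search`, and the toy `CORPUS` are hypothetical.

```python
import math
import re
from collections import Counter

# Toy corpus (hypothetical): fragment name -> source text.
CORPUS = {
    "read_file": "def read_file(path):\n    with open(path) as f:\n        return f.read()",
    "sort_list": "def sort_list(items):\n    return sorted(items)",
    "http_get": "def http_get(url):\n    import urllib.request\n    return urllib.request.urlopen(url).read()",
}


def tokenize(text):
    """Return lowercased alphabetic tokens, splitting camelCase and snake_case."""
    # Insert a space at lower-to-upper boundaries, e.g. "readFile" -> "read File".
    text = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", text)
    # Underscores, digits, and punctuation act as separators.
    return [t.lower() for t in re.findall(r"[A-Za-z]+", text)]


def cosine(a, b):
    """Cosine similarity between two term-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, corpus, top_k=3):
    """Rank code fragments by similarity to the query, best match first."""
    q_vec = Counter(tokenize(query))
    scored = [
        (name, cosine(q_vec, Counter(tokenize(code))))
        for name, code in corpus.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    for name, score in search("read a file from a path", CORPUS):
        print(f"{name}: {score:.3f}")
```

Learned approaches replace the term-count vectors with embeddings from a neural encoder, but the ranking step has the same shape: score every candidate against the query and return the top results.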

Latest papers with no code

Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation

no code yet • 26 Apr 2023

We further fine-tune UnixCoder, the best-performing model for zero-shot cross-programming language code search, for the Code Cloning task with the SBT IRs of C code-pairs, available in the CodeNet dataset.

Unveiling Code Pre-Trained Models: Investigating Syntax and Semantics Capacities

no code yet • 20 Dec 2022

These structures are fundamental to understanding code.

You Don't Know Search: Helping Users Find Code by Automatically Evaluating Alternative Queries

no code yet • 7 Dec 2022

Our main result shows that relative to the control group, users are on average 22% more likely to click on a search result at all on any given day when AQE is active.

ContraCLM: Contrastive Learning For Causal Language Model

no code yet • 3 Oct 2022

Specifically, we attain 44% relative improvement on the Semantic Textual Similarity tasks and 34% on Code-to-Code Search tasks.

CodeDSI: Differentiable Code Search

no code yet • 1 Oct 2022

In an effort to improve the performance of code search, we have investigated docid representation strategies, impact of tokenization on docid structure, and dataset sizes on overall code search performance.

CSSAM: Code Search via Attention Matching of Code Semantics and Structures

no code yet • 8 Aug 2022

By leveraging the residual interaction, a matching module is designed to preserve more code semantics and descriptive features, which enhances the adhesion between the code and its corresponding query text.

CoCoSoDa: Effective Contrastive Learning for Code Search

no code yet • 7 Apr 2022

However, there is still a lot of room for improvement in using contrastive learning for code search.

On the Transferability of Pre-trained Language Models for Low-Resource Programming Languages

no code yet • 5 Apr 2022

Furthermore, some programming languages are inherently different, and code written in one language usually cannot be interchanged with the others, i.e., Ruby and Java code possess very different structure.

Accelerating Code Search with Deep Hashing and Code Classification

no code yet • ACL 2022

Code search aims to retrieve reusable code snippets from a source code corpus based on natural language queries.

AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees

no code yet • 20 Jan 2022

Specifically, we collect a large amount of source code (both Java and Python) from the Alipay code repository and incorporate both syntactic and semantic code knowledge into our model with the help of code parsers, through which AST information of the source code can be interpreted and integrated.