Source Code Summarization

37 papers with code • 9 benchmarks • 7 datasets

Code Summarization is a task that tries to comprehend code and automatically generate descriptions directly from the source code.

Source: Improving Automatic Source Code Summarization via Deep Reinforcement Learning

Libraries

Use these libraries to find Source Code Summarization models and implementations
2 papers
21

Assemble Foundation Models for Automatic Code Summarization

jianguda/afm4acs 13 Jan 2022

Thereby, we propose a flexible and robust approach for automatic code summarization, based on neural models.

15
13 Jan 2022

Leveraging Unsupervised Learning to Summarize APIs Discussed in Stack Overflow

scam2021-so/scam2021 27 Nov 2021

Automated source code summarization is a task that generates summarized information about the purpose, usage, and--or implementation of methods and classes to support understanding of these code entities.

1
27 Nov 2021

Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization

gjcexp/codescribe ACL ARR November 2021

In this paper, we propose CODESCRIBE to model the hierarchical syntax structure of code by introducing a novel triplet position for code summarization.

1
16 Nov 2021

GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code Search

shangqing-liu/graphsearchnet 4 Nov 2021

Specifically, we propose to construct graphs for the source code and queries with bidirectional GGNN (BiGGNN) to capture the local structural information of the source code and queries.

29
04 Nov 2021

On the Evaluation of Neural Code Summarization

DeepSoftwareAnalytics/CodeSumEvaluation 15 Jul 2021

To achieve a profound understanding of how far we are from solving this problem and provide suggestions to future research, in this paper, we conduct a systematic and in-depth analysis of 5 state-of-the-art neural code summarization models on 6 widely used BLEU variants, 4 pre-processing operations and their combinations, and 3 widely used datasets.

27
15 Jul 2021

Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors

SageSELab/CodeSumStudy ACL (NLP4Prog) 2021

Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions.

6
15 Jun 2021

CoDesc: A Large Code-Description Parallel Dataset

csebuetnlp/CoDesc 29 May 2021

In this study, we present CoDesc -- a large parallel dataset composed of 4. 2 million Java methods and natural language descriptions.

48
29 May 2021

CodeTrans: Towards Cracking the Language of Silicon's Code Through Self-Supervised Deep Learning and High Performance Computing

agemagician/CodeTrans 6 Apr 2021

Simultaneously, the transformer model, especially its combination with transfer learning, has been proven to be a powerful technique for natural language processing tasks.

242
06 Apr 2021

HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks

liubest/HAConvGNN Findings (EMNLP) 2021

Jupyter notebook allows data scientists to write machine learning code together with its documentation in cells.

6
31 Mar 2021

Project-Level Encoding for Neural Source Code Summarization of Subroutines

aakashba/projcon 22 Mar 2021

Source code summarization of a subroutine is the task of writing a short, natural language description of that subroutine.

9
22 Mar 2021