Latent Predictor Networks for Code Generation

Many language generation tasks require the production of text conditioned on both structured and unstructured inputs. We present a novel neural network architecture which generates an output sequence conditioned on an arbitrary number of input functions. Crucially, our approach allows both the choice of conditioning context and the granularity of generation, for example characters or tokens, to be marginalised, thus permitting scalable and effective training. Using this framework, we address the problem of generating programming code from a mixed natural language and structured specification. We create two new data sets for this paradigm derived from the collectible trading card games Magic the Gathering and Hearthstone. On these, and a third preexisting corpus, we demonstrate that marginalising multiple predictors allows our model to outperform strong benchmarks.
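
The core of the model is a latent choice among several output predictors at each generation step, with that choice marginalised out during training. The sketch below illustrates such a mixture in PyTorch; it is a simplified illustration rather than the paper's implementation: the class and variable names are hypothetical, and it assumes all predictors score the same output vocabulary, whereas the paper combines a character-level softmax with pointer-style copy predictors over the structured input.

```python
import torch
import torch.nn as nn


class LatentPredictorMixture(nn.Module):
    """Marginalises over several output predictors at one decoding step."""

    def __init__(self, hidden_size, predictors):
        super().__init__()
        self.predictors = nn.ModuleList(predictors)
        # A selector scores each predictor from the current decoder state.
        self.selector = nn.Linear(hidden_size, len(predictors))

    def forward(self, hidden):
        # hidden: (batch, hidden_size) decoder state for one timestep.
        # p(k | h): probability of choosing predictor k.
        choice = torch.softmax(self.selector(hidden), dim=-1)
        # p_k(y | h): each predictor's distribution over the shared vocabulary.
        dists = torch.stack(
            [torch.softmax(p(hidden), dim=-1) for p in self.predictors], dim=1
        )  # (batch, num_predictors, vocab)
        # Marginalise the latent choice: p(y | h) = sum_k p(k | h) * p_k(y | h).
        return (choice.unsqueeze(-1) * dists).sum(dim=1)


# Usage: two toy predictors over a shared 100-symbol vocabulary.
vocab, hidden_size = 100, 32
model = LatentPredictorMixture(
    hidden_size, [nn.Linear(hidden_size, vocab), nn.Linear(hidden_size, vocab)]
)
probs = model(torch.randn(4, hidden_size))  # shape (4, 100); each row sums to 1
```

Because the mixture weights and the predictor distributions are both differentiable, the marginal likelihood can be trained end to end without supervising which predictor produced each output symbol.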

Published at ACL 2016.

Datasets

Introduced in the paper: Hearthstone
Used in the paper: Django

Results from the Paper

Task             Dataset   Model                                        Metric       Value   Global Rank
Code Generation  Django    LPN (Ling et al., 2016)                      Accuracy     62.3    #10
Code Generation  Django    LPN (Ling et al., 2016)                      BLEU Score   77.6    #5
Code Generation  Django    Phrasal Statistical MT (Ling et al., 2016)   Accuracy     31.5    #11
Code Generation  Django    Phrasal Statistical MT (Ling et al., 2016)   BLEU Score   47.6    #8
