Search Results for author: Gabriel Recchia

Found 3 papers, 2 papers with code

Teaching Autoregressive Language Models Complex Tasks By Demonstration

1 code implementation • 5 Sep 2021 • Gabriel Recchia

This paper demonstrates that fine-tuning an autoregressive language model (GPT-Neo) on appropriately structured step-by-step demonstrations can teach it to execute longhand modulo operations, a mathematical task that has previously proved difficult for Transformers, with a relatively small number of examples.

Language Modelling
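
To make the idea of a "step-by-step demonstration" of a longhand modulo operation concrete, here is a minimal sketch of how one such demonstration could be generated as text. The function name `longhand_mod_steps` and the wording of each step are assumptions made for illustration only; the listing above does not specify the demonstration format used in the paper, and this is not taken from its code.

```python
def longhand_mod_steps(dividend: int, divisor: int) -> list[str]:
    """Return one text line per long-division step, ending with the remainder.

    Hypothetical format: the step wording is an illustrative assumption,
    not the format used in the paper.
    """
    if dividend < 0 or divisor <= 0:
        raise ValueError("expects dividend >= 0 and divisor > 0")

    steps = []
    remainder = 0
    # Process the dividend digit by digit, as in schoolbook long division.
    for digit in str(dividend):
        current = remainder * 10 + int(digit)
        quotient_digit, remainder = divmod(current, divisor)
        steps.append(
            f"bring down {digit}: {current} = {divisor} * {quotient_digit} + {remainder}"
        )
    steps.append(f"answer: {dividend} mod {divisor} = {remainder}")
    return steps


if __name__ == "__main__":
    # Print a single demonstration, one step per line.
    print("\n".join(longhand_mod_steps(1234, 7)))
```

Each line exposes one intermediate state of the computation, so a model trained on such text would see every step rather than only the final answer.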
