no code implementations • 7 Oct 2022 • Andrew J. Nam, Mustafa Abdool, Trevor Maxfield, James L. McClelland
As a step toward understanding how transformer-based systems generalize, we explore out-of-distribution generalization (OODG) in small-scale transformers trained with examples from a known distribution.
Out-of-Distribution Generalization • Systematic Generalization
no code implementations • 6 Oct 2022 • Andrew J. Nam, Mengye Ren, Chelsea Finn, James L. McClelland
Large language models have recently shown promising progress in mathematical reasoning when fine-tuned with human-generated sequences walking through a sequence of solution steps.
no code implementations • 10 Jul 2021 • Andrew J. Nam, James L. McClelland
We also find that most of those who master the task can describe a valid solution strategy, and such participants perform better on transfer puzzles than those whose strategy descriptions are vague or incomplete.