no code implementations • 5 May 2023 • Daniel Johnson, Trevor Maxfield, Yongxu Jin, Ronald Fedkiw
Various software efforts embrace the idea that object oriented programming enables a convenient implementation of the chain rule, facilitating so-called automatic differentiation via backpropagation.
no code implementations • 7 Oct 2022 • Andrew J. Nam, Mustafa Abdool, Trevor Maxfield, James L. McClelland
As a step toward understanding how transformer-based systems generalize, we explore the question of OODG in small scale transformers trained with examples from a known distribution.
Out-of-Distribution Generalization Systematic Generalization