1 code implementation • 18 Oct 2023 • Shaoxiong Duan, Yining Shi, Wei Xu
In this paper, we investigate the inherent capabilities of transformer models in learning arithmetic algorithms, such as addition and parity.
Position