Search Results for author: Shaoxiong Duan

Found 1 papers, 1 papers with code

From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers

1 code implementation18 Oct 2023 Shaoxiong Duan, Yining Shi, Wei Xu

In this paper, we investigate the inherent capabilities of transformer models in learning arithmetic algorithms, such as addition and parity.

Position

Cannot find the paper you are looking for? You can Submit a new open access paper.