Search Results for author: Arvind Sundararajan

Found 1 papers, 1 papers with code

Giraffe: Adventures in Expanding Context Lengths in LLMs

1 code implementation21 Aug 2023 Arka Pal, Deep Karkhanis, Manley Roberts, Samuel Dooley, Arvind Sundararajan, Siddartha Naidu

To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of which focus on modifying the system of positional encodings used in the attention mechanism to indicate where tokens or activations are located in the input sequence.

16k 4k

Cannot find the paper you are looking for? You can Submit a new open access paper.