Attention

Attention Sinks

Introduced by Xiao et al. in Efficient Streaming Language Models with Attention Sinks

Please enter a description about the method here

Source: Efficient Streaming Language Models with Attention Sinks

Papers


Paper Code Results Date Stars

Tasks


Task Papers Share
Language Modelling 1 100.00%

Components


Component Type
🤖 No Components Found You can add them if they exist; e.g. Mask R-CNN uses RoIAlign

Categories