1 code implementation • 4 Apr 2024 • Jerry Yao-Chieh Hu, Pei-Hsuan Chang, Robin Luo, Hong-Yu Chen, Weijian Li, Wei-Po Wang, Han Liu
Interestingly, this memory model manifests a model-based interpretation of an outlier-efficient attention mechanism ($\text{Softmax}_1$): it is an approximation of the memory retrieval process of $\mathtt{OutEffHop}$.
Ranked #1 on Quantization on Wiki-40B
no code implementations • 22 Mar 2021 • Karan Samel, Zelin Zhao, Binghong Chen, Kuan Wang, Robin Luo, Le Song
In multi-modal reasoning tasks, such as visual question answering (VQA), there have been many modeling and training paradigms tested.
no code implementations • 1 Jan 2021 • Karan Samel, Zelin Zhao, Kuan Wang, Robin Luo, Binghong Chen, Le Song
We present a differentiable end-to-end program executor (DePe), which addresses Visual Question Answering (VQA) in a sample and computationally efficient manner.