no code implementations • NeurIPS 2023 • Philip Sun, David Simcha, Dave Dopson, Ruiqi Guo, Sanjiv Kumar
This paper introduces SOAR: Spilling with Orthogonality-Amplified Residuals, a novel data indexing technique for approximate nearest neighbor (ANN) search.
1 code implementation • EMNLP 2021 • Xinying Song, Alex Salcianu, Yang Song, Dave Dopson, Denny Zhou
For general text, we further propose an algorithm that combines pre-tokenization (splitting the text into words) and our linear-time WordPiece method into a single pass.
no code implementations • 20 Mar 2019 • Xiang Wu, Ruiqi Guo, David Simcha, Dave Dopson, Sanjiv Kumar
In this paper, we propose a technique that approximates the inner product computation in hybrid vectors, leading to substantial speedup in search while maintaining high accuracy.