1 code implementation • 29 Apr 2024 • Sebastian Bruch, Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
In this work, we propose a novel organization of the inverted index that enables fast yet effective approximate retrieval over learned sparse embeddings.
1 code implementation • 3 Apr 2024 • Franco Maria Nardini, Cosimo Rulli, Rossano Venturini
This paper proposes "Efficient Multi-Vector dense retrieval with Bit vectors" (EMVB), a novel framework for efficient query processing in multi-vector dense retrieval.
no code implementations • 15 Jun 2023 • Franco Maria Nardini, Cosimo Rulli, Salvatore Trani, Rossano Venturini
Quantization and pruning are two effective model compression methods for Deep Neural Networks.
1 code implementation • 26 Nov 2021 • Lorenzo Beretta, Franco Maria Nardini, Roberto Trani, Rossano Venturini
In this paper, we address the problem of finding a champion of a tournament, also known as a Copeland winner, that is, a player that wins the highest number of matches.
1 code implementation • 13 May 2020 • Simon Gog, Giulio Ermanno Pibiri, Rossano Venturini
Query Auto-Completion (QAC) is a ubiquitous feature of modern textual search systems, suggesting possible ways of completing the query being typed by the user.
2 code implementations • 28 Aug 2019 • Giulio Ermanno Pibiri, Rossano Venturini
The aim of this article is twofold: first, surveying the encoding algorithms suitable for inverted index compression and, second, characterizing the performance of the inverted index through experimentation.
1 code implementation • 16 Apr 2019 • Raffaele Perego, Giulio Ermanno Pibiri, Rossano Venturini
The sheer increase in the volume of RDF data demands efficient solutions for the triple indexing problem, that is, devising a compressed data structure that compactly represents RDF triples while guaranteeing fast pattern matching operations.
1 code implementation • 25 Jun 2018 • Giulio Ermanno Pibiri, Rossano Venturini
This paper deals with the two fundamental problems concerning the handling of large n-gram language models: indexing, that is, compressing the n-gram strings and associated satellite data without compromising their retrieval speed; and estimation, that is, computing the probability distribution of the strings from a large textual source.
1 code implementation • 29 Apr 2018 • Giulio Ermanno Pibiri, Rossano Venturini
The ubiquitous Variable-Byte encoding is one of the fastest compressed representations for integer sequences.
1 code implementation • 2 Dec 2013 • Djamal Belazzougui, Paolo Boldi, Giuseppe Ottaviano, Rossano Venturini, Sebastiano Vigna
The computation of a peeling order in a randomly generated hypergraph is the most time-consuming step in a number of constructions, such as perfect hashing schemes, random $r$-SAT solvers, error-correcting codes, and approximate set encodings.
Data Structures and Algorithms
6 code implementations • 20 Dec 2007 • Paolo Ferragina, Rodrigo Gonzalez, Gonzalo Navarro, Rossano Venturini
A compressed full-text self-index represents a text in a compressed form and still answers queries efficiently.
Data Structures and Algorithms F.2.2; H.2.1; H.3.2; H.3.3