no code implementations • 15 Jun 2023 • Franco Maria Nardini, Cosimo Rulli, Salvatore Trani, Rossano Venturini
Quantization and pruning are two effective model compression methods for deep neural networks.
1 code implementation • 6 May 2021 • Francesco Busolin, Claudio Lucchese, Franco Maria Nardini, Salvatore Orlando, Raffaele Perego, Salvatore Trani
Modern search engine ranking pipelines are commonly based on large machine-learned ensembles of regression trees.
no code implementations • 30 Apr 2020 • Claudio Lucchese, Franco Maria Nardini, Salvatore Orlando, Raffaele Perego, Salvatore Trani
In this paper, we investigate the novel problem of query-level early exiting: deciding whether it is profitable to stop traversing the ranking ensemble early for all the candidate documents to be scored for a query, and to simply return a ranking based on the additive scores computed by a limited portion of the ensemble.
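To make the idea of query-level early exiting concrete, the following is a minimal sketch, not the authors' method. It assumes an additive ensemble in which each tree is a plain Python callable, a hypothetical `sentinel` position at which the exit decision is taken, and a hypothetical `top_gap_rule` standing in for whatever decision criterion the paper actually uses.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Assumption: a "tree" is any callable mapping a feature vector to a partial score.
Tree = Callable[[Sequence[float]], float]


def additive_scores(trees: Sequence[Tree], docs: Sequence[Sequence[float]],
                    start: int, stop: int,
                    scores: Optional[List[float]] = None) -> List[float]:
    """Add the outputs of trees[start:stop] to each document's running score."""
    if scores is None:
        scores = [0.0] * len(docs)
    return [s + sum(t(d) for t in trees[start:stop]) for s, d in zip(scores, docs)]


def ranking(scores: Sequence[float]) -> List[int]:
    """Document indices ordered by decreasing score."""
    return sorted(range(len(scores)), key=lambda i: -scores[i])


def top_gap_rule(scores: Sequence[float], margin: float = 1.0) -> bool:
    """Hypothetical exit rule: stop if the top document leads by more than `margin`."""
    if len(scores) < 2:
        return True
    ordered = sorted(scores, reverse=True)
    return ordered[0] - ordered[1] > margin


def score_query(trees: Sequence[Tree], docs: Sequence[Sequence[float]], sentinel: int,
                should_exit: Callable[[Sequence[float]], bool]) -> Tuple[List[int], int]:
    """Rank all candidates of one query, possibly using only the first `sentinel` trees.

    Returns the ranking and the number of trees actually traversed.
    """
    partial = additive_scores(trees, docs, 0, sentinel)
    if should_exit(partial):
        # Early exit applies to the whole query: every candidate keeps its partial score.
        return ranking(partial), sentinel
    full = additive_scores(trees, docs, sentinel, len(trees), partial)
    return ranking(full), len(trees)


# Toy data (illustrative only): three candidate documents, four "trees".
toy_trees: List[Tree] = [
    lambda d: d[0],
    lambda d: 2 * d[0],
    lambda d: -d[1],
    lambda d: 0.5 * d[0],
]
toy_docs = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]]

early_ranking, early_cost = score_query(toy_trees, toy_docs, 2, top_gap_rule)
full_ranking, full_cost = score_query(toy_trees, toy_docs, 2, lambda s: False)
```

In this toy run the early exit traverses half of the ensemble and yields the same ordering as the full traversal; in general the two rankings can differ, which is why the exit decision has to weigh the saved cost against the possible loss in ranking quality.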