The Neural Hype and Comparisons Against Weak Baselines

Recently, the machine learning community paused in a moment of self-reflection. In a widely discussed paper at ICLR 2018, Sculley et al. wrote: "We observe that the rate of empirical advancement may not have been matched by consistent increase in the level of empirical rigor across the field as a whole." Their primary complaint is the development of a "research and publication culture that emphasizes wins" (emphasis in original), which typically means "demonstrating that a new method beats previous methods on a given task or benchmark". An apt description might be "leaderboard chasing", and for many vision and NLP tasks this isn't a metaphor: there are literally centralized leaderboards that track incremental progress, down to the fifth decimal point, some persisting over years and accumulating dozens of entries. Sculley et al. remind us that "the goal of science is not wins, but knowledge". The structure of the scientific enterprise today (pressure to publish, pace of progress, etc.) means that "winning" and "doing good science" are often not fully aligned. To wit, they cite a number of papers showing that recent advances in neural networks could very well be attributed to mundane issues like better hyperparameter optimization. Many results can't be reproduced, and some observed improvements might just be noise.
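To make the "noise" point concrete, consider how one would check whether a small leaderboard delta is meaningful at all. The sketch below (an illustration, not from the paper, using synthetic per-topic scores) runs a paired randomization test on the per-topic difference between two systems; in practice the scores would come from something like trec_eval output.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical per-topic average-precision scores for two systems
# evaluated on the same 50 topics (synthetic data for illustration).
baseline = rng.uniform(0.1, 0.5, size=50)
contender = baseline + rng.normal(0.005, 0.05, size=50)  # tiny mean gain, large variance

def paired_randomization_test(a, b, trials=100_000, seed=0):
    """Two-sided paired randomization test on the mean per-topic difference."""
    rng = np.random.default_rng(seed)
    diffs = b - a
    observed = diffs.mean()
    # Under the null hypothesis that the systems are interchangeable,
    # the sign of each per-topic difference is equally likely to flip.
    signs = rng.choice([-1.0, 1.0], size=(trials, diffs.size))
    null = (signs * diffs).mean(axis=1)
    return np.mean(np.abs(null) >= abs(observed))

p = paired_randomization_test(baseline, contender)
print(f"MAP difference: {np.mean(contender - baseline):+.4f}, p = {p:.3f}")
```

A difference in the third or fourth decimal place of MAP routinely fails such a test, which is exactly why a leaderboard entry alone is weak evidence of progress.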
