Pre-training Transformer Models with Sentence-Level Objectives for Answer Sentence Selection
An important task in designing QA systems is answer sentence selection (AS2): selecting the sentence containing (or constituting) the answer to a question from a set of retrieved relevant documents. In this paper, we propose three novel sentence-level transformer pre-training objectives that incorporate paragraph-level semantics within and across documents, to improve the performance of transformers for AS2 and reduce the need for large labeled datasets. Specifically, the model is tasked to predict whether: (i) two sentences are extracted from the same paragraph, (ii) a given sentence is extracted from a given paragraph, and (iii) two paragraphs are extracted from the same document. Our experiments on three public and one industrial AS2 dataset demonstrate the empirical superiority of our pre-trained transformers over baseline models such as RoBERTa and ELECTRA for AS2.
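The three objectives above are binary classification tasks whose labels come for free from document structure. As a minimal sketch (not the authors' implementation; all function and field names are illustrative), the following shows how labeled pairs for each objective could be mined from a corpus organized as documents → paragraphs → sentences:

```python
import random

def build_pretraining_examples(documents, rng=None):
    """Mine labeled pairs for three sentence-level pre-training objectives.

    `documents` is a list of documents; each document is a list of
    paragraphs; each paragraph is a list of sentence strings.
    Returns a list of dicts: {"objective", "text_a", "text_b", "label"}.
    """
    rng = rng or random.Random(0)
    examples = []
    for doc in documents:
        for pi, para in enumerate(doc):
            if len(para) < 2:
                continue
            # (i) Are two sentences drawn from the same paragraph?
            s1, s2 = rng.sample(para, 2)
            examples.append({"objective": "sents-same-paragraph",
                             "text_a": s1, "text_b": s2, "label": 1})
            other = [p for pj, p in enumerate(doc) if pj != pi and p]
            if other:  # negative: second sentence from a different paragraph
                neg = rng.choice(rng.choice(other))
                examples.append({"objective": "sents-same-paragraph",
                                 "text_a": s1, "text_b": neg, "label": 0})
            # (ii) Is this sentence extracted from this paragraph?
            examples.append({"objective": "sent-in-paragraph",
                             "text_a": rng.choice(para),
                             "text_b": " ".join(para), "label": 1})
        # (iii) Are two paragraphs drawn from the same document?
        if len(doc) >= 2:
            p1, p2 = rng.sample(doc, 2)
            examples.append({"objective": "paras-same-document",
                             "text_a": " ".join(p1), "text_b": " ".join(p2),
                             "label": 1})
    # Negative for (iii): paragraphs taken from two different documents.
    if len(documents) >= 2:
        d1, d2 = rng.sample(documents, 2)
        if d1 and d2:
            examples.append({"objective": "paras-same-document",
                             "text_a": " ".join(rng.choice(d1)),
                             "text_b": " ".join(rng.choice(d2)), "label": 0})
    return examples
```

Each resulting pair can then be fed to a transformer with a standard sentence-pair classification head; because the labels are derived purely from document layout, no human annotation is required.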
Task | Dataset | Model | Metric Name | Metric Value | Global Rank
---|---|---|---|---|---
Answer Selection | ASNQ | ELECTRA-Base + SSP | MAP | 0.697 | # 2
Answer Selection | ASNQ | ELECTRA-Base + SSP | MRR | 0.757 | # 2
Answer Selection | ASNQ | DeBERTa-V3-Large + SSP | MAP | 0.743 | # 1
Answer Selection | ASNQ | DeBERTa-V3-Large + SSP | MRR | 0.800 | # 1
Question Answering | TrecQA | RoBERTa-Base + PSD | MAP | 0.903 | # 7
Question Answering | TrecQA | RoBERTa-Base + PSD | MRR | 0.951 | # 5
Question Answering | TrecQA | DeBERTa-V3-Large + SSP | MAP | 0.923 | # 3
Question Answering | TrecQA | DeBERTa-V3-Large + SSP | MRR | 0.946 | # 6
Question Answering | WikiQA | DeBERTa-Large + SSP | MAP | 0.901 | # 5
Question Answering | WikiQA | DeBERTa-Large + SSP | MRR | 0.914 | # 4
Question Answering | WikiQA | DeBERTa-V3-Large + ALL | MAP | 0.909 | # 4
Question Answering | WikiQA | DeBERTa-V3-Large + ALL | MRR | 0.920 | # 3
Question Answering | WikiQA | RoBERTa-Base + SSP | MAP | 0.887 | # 6
Question Answering | WikiQA | RoBERTa-Base + SSP | MRR | 0.899 | # 7