Text embeddings are useful features in many applications such as semantic search and computing text similarity. Previous work typically trains models customized for different use cases, varying in dataset choice, training objective, and model architecture. In this work, we show that contrastive pre-training on unsupervised data at scale leads to high-quality vector representations of text and code. The same unsupervised text embeddings that achieve new state-of-the-art results in linear-probe classification also display impressive semantic search capabilities, sometimes even performing competitively with fine-tuned models. On linear-probe classification accuracy averaged over 7 tasks, our best unsupervised model achieves relative improvements of 4% and 1.8% over the previous best unsupervised and supervised text embedding models, respectively. When evaluated on large-scale semantic search, the same text embeddings attain relative improvements of 23.4%, 14.7%, and 10.6% over previous best unsupervised methods on the MSMARCO, Natural Questions, and TriviaQA benchmarks, respectively. As with text embeddings, we train code embedding models on (text, code) pairs, obtaining a 20.8% relative improvement over the prior best result on code search.
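
The contrastive pre-training objective described above can be sketched as an InfoNCE-style loss with in-batch negatives: paired inputs are embedded, cosine similarities between all pairs in a batch are scaled by a temperature, and each example's true pair must score above every other in-batch candidate. The snippet below is a minimal illustration, not the authors' released code; the encoder is omitted, and the temperature value and symmetric averaging are placeholder assumptions.

```python
import torch
import torch.nn.functional as F

def in_batch_contrastive_loss(query_emb: torch.Tensor,
                              doc_emb: torch.Tensor,
                              temperature: float = 0.05) -> torch.Tensor:
    """Symmetric in-batch contrastive loss over (text, positive) pairs.

    query_emb, doc_emb: [batch, dim] embeddings of paired inputs.
    Every other example in the batch serves as a negative.
    The 0.05 temperature is a placeholder, not the paper's value.
    """
    # Cosine similarity via dot products of L2-normalized embeddings.
    q = F.normalize(query_emb, dim=-1)
    d = F.normalize(doc_emb, dim=-1)
    logits = q @ d.t() / temperature                  # [batch, batch]
    labels = torch.arange(q.size(0), device=q.device)
    # Matching pairs sit on the diagonal; score both directions.
    return (F.cross_entropy(logits, labels) +
            F.cross_entropy(logits.t(), labels)) / 2
```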


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Zero-shot Text Search | BEIR | cpt-text XL | Avg. Accuracy | 52.8 | #4 |
| Zero-shot Text Search | BEIR | BM25 (Robertson, 2009) | Avg. Accuracy | 47.6 | #13 |
| Zero-shot Text Search | BEIR | Contriever (Izacard et al., 2021) | Avg. Accuracy | 50.2 | #12 |
| Zero-shot Text Search | BEIR | Contriever (Izacard et al., 2021)-unsupervised | Avg. Accuracy | 40.9 | #18 |
| Zero-shot Text Search | BEIR | cpt-text L | Avg. Accuracy | 44.2 | #16 |
| Code Search | CodeSearchNet | cpt-code M | Overall | 93.5 | #1 |
| Code Search | CodeSearchNet | cpt-code M | Go | 97.5 | #2 |
| Code Search | CodeSearchNet | cpt-code M | Ruby | 85.5 | #2 |
| Code Search | CodeSearchNet | cpt-code M | Python | 99.9 | #1 |
| Code Search | CodeSearchNet | cpt-code M | Java | 94.4 | #1 |
| Code Search | CodeSearchNet | cpt-code M | JS | 86.5 | #1 |
| Code Search | CodeSearchNet | cpt-code M | PHP | 97.2 | #1 |
| Code Search | CodeSearchNet | cpt-code S | Overall | 93.4 | #2 |
| Code Search | CodeSearchNet | cpt-code S | Go | 97.7 | #1 |
| Code Search | CodeSearchNet | cpt-code S | Ruby | 86.3 | #1 |
| Code Search | CodeSearchNet | cpt-code S | Python | 99.8 | #2 |
| Code Search | CodeSearchNet | cpt-code S | Java | 94.0 | #2 |
| Code Search | CodeSearchNet | cpt-code S | JS | 86.0 | #2 |
| Code Search | CodeSearchNet | cpt-code S | PHP | 96.7 | #2 |
| Passage Ranking | MS MARCO | Fine-tuned SOTA | MRR@10 | 44.3 | #1 |
| Passage Ranking | MS MARCO | cpt-text XL | MRR@10 | 22.7 | #2 |
| Passage Ranking | MS MARCO | cpt-text L | MRR@10 | 21.5 | #3 |
| Passage Ranking | MS MARCO | BM25 | MRR@10 | 18.4 | #4 |
| Linear-Probe Classification | SentEval | cpt-text XL-unsupervised | Accuracy | 91.8 | #2 |
| Linear-Probe Classification | SentEval | cpt-text XL-supervised | Accuracy | 92.2 | #1 |
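
Both evaluation setups in the table reduce to simple operations on frozen embeddings: semantic search and passage ranking score candidates by cosine similarity to the query embedding, while linear-probe classification fits a logistic-regression classifier on top of the embeddings without updating the encoder. A minimal sketch, assuming a hypothetical embed() helper that maps a list of strings to L2-normalized embedding vectors:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# embed() is a hypothetical helper: list[str] -> [n, dim] np.ndarray of
# L2-normalized embeddings from a frozen cpt-style encoder.

def search(query: str, corpus_emb: np.ndarray, k: int = 10) -> np.ndarray:
    """Rank corpus passages by cosine similarity to the query.

    With unit-norm embeddings, cosine similarity is a dot product,
    so retrieval is a single matrix-vector multiply.
    """
    q = embed([query])[0]                  # [dim]
    scores = corpus_emb @ q                # [n_corpus]
    return np.argsort(-scores)[:k]         # indices of the top-k passages

def linear_probe(train_texts, train_labels, test_texts, test_labels) -> float:
    """Linear-probe classification: logistic regression on frozen embeddings."""
    clf = LogisticRegression(max_iter=1000)
    clf.fit(embed(train_texts), train_labels)
    return clf.score(embed(test_texts), test_labels)  # accuracy
```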

Methods


No methods listed for this paper.