Learning General Purpose Distributed Sentence Representations via Large Scale Multi-task Learning

ICLR 2018 Sandeep SubramanianAdam TrischlerYoshua BengioChristopher J Pal

A lot of the recent success in natural language processing (NLP) has been driven by distributed vector representations of words trained on large amounts of text in an unsupervised manner. These representations are typically used as general purpose features for words across a range of NLP problems... (read more)

PDF Abstract
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Semantic Textual Similarity MRPC GenSen Accuracy 78.6% # 18
F1 84.4% # 8
Natural Language Inference MultiNLI GenSen Matched 71.4 # 19
Mismatched 71.3 # 17
Paraphrase Identification Quora Question Pairs GenSen Accuracy 87.01 # 9
Semantic Textual Similarity SentEval GenSen MRPC 78.6/84.4 # 1
SICK-R 0.888 # 1
SICK-E 87.8 # 1
STS 78.9/78.6 # 1

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet