Attention Dropout

Attention Dropout is a type of dropout used in attention-based architectures, where elements are randomly dropped out of the softmax in the attention equation. For example, for scaled-dot product attention, we would drop elements from the first term:

$$ {\text{Attention}}(Q, K, V) = \text{softmax}\left(\frac{QK^{T}}{\sqrt{d_k}}\right)V $$

Latest Papers

PAPER DATE
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
Dongling XiaoYu-Kun LiHan ZhangYu SunHao TianHua WuHaifeng Wang
2020-10-23
Multilingual BERT Post-Pretraining Alignment
Lin PanChung-Wei HangHaode QiAbhishek ShahMo YuSaloni Potdar
2020-10-23
Pre-trained Model for Chinese Word Segmentation with Meta Learning
Zhen KeLiang ShiErli MengBin WangXipeng Qiu
2020-10-23
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding
Minjeong KimGyuwan KimSang-Woo LeeJung-Woo Ha
2020-10-23
BARThez: a Skilled Pretrained French Sequence-to-Sequence Model
Moussa Kamal EddineAntoine J. -P. TixierMichalis Vazirgiannis
2020-10-23
HateBERT: Retraining BERT for Abusive Language Detection in English
Tommaso CaselliValerio BasileJelena MitrovićMichael Granitzer
2020-10-23
GiBERT: Introducing Linguistic Knowledge into BERT through a Lightweight Gated Injection Method
Nicole PeineltMarek ReiMaria Liakata
2020-10-23
On the Transformer Growth for Progressive BERT Training
Xiaotao GuLiyuan LiuHongkun YuJing LiChen ChenJiawei Han
2020-10-23
Language Models are Open Knowledge Graphs
Chenguang WangXiao LiuDawn Song
2020-10-22
Investigating the True Performance of Transformers in Low-Resource Languages: A Case Study in Automatic Corpus Creation
Jan Christian Blaise CruzJose Kristian ResabalJames LinDan John VelascoCharibeth Cheng
2020-10-22
UniCase -- Rethinking Casing in Language Models
Rafal PowalskiTomasz Stanislawek
2020-10-22
Distilling Dense Representations for Ranking using Tightly-Coupled Teachers
Sheng-Chieh LinJheng-Hong YangJimmy Lin
2020-10-22
Knowledge Distillation for BERT Unsupervised Domain Adaptation
Minho RyuKichun Lee
2020-10-22
Towards Fully Bilingual Deep Language Modeling
Li-Hsin ChangSampo PyysaloJenna KanervaFilip Ginter
2020-10-22
Improving BERT Performance for Aspect-Based Sentiment Analysis
Akbar KarimiLeonardo RossiAndrea Prati
2020-10-22
Self-alignment Pre-training for Biomedical Entity Representations
Fangyu LiuEhsan ShareghiZaiqiao MengMarco BasaldellaNigel Collier
2020-10-22
Scientific Claim Verification with VERT5ERINI
Ronak PradeepXueguang MaRodrigo NogueiraJimmy Lin
2020-10-22
mT5: A massively multilingual pre-trained text-to-text transformer
| Linting XueNoah ConstantAdam RobertsMihir KaleRami Al-RfouAditya SiddhantAditya BaruaColin Raffel
2020-10-22
Detection of COVID-19 informative tweets using RoBERTa
Sirigireddy DhanalaxmiRohit AgarwalAman Sinha
2020-10-21
Latte-Mix: Measuring Sentence Semantic Similarity with Latent Categorical Mixtures
H. BaiL. TanK. XiongM. LiJ. Lin
2020-10-21
German's Next Language Model
Branden ChanStefan SchweterTimo Möller
2020-10-21
Generalized Conditioned Dialogue Generation Based on Pre-trained Language Model
Yan ZengJian-Yun Nie
2020-10-21
AutoMeTS: The Autocomplete for Medical Text Simplification
Hoang VanDavid KauchakGondy Leroy
2020-10-20
What makes multilingual BERT multilingual?
Chi-Liang LiuTsung-Yuan HsuYung-Sung ChuangHung-Yi Lee
2020-10-20
ConjNLI: Natural Language Inference Over Conjunctive Sentences
Swarnadeep SahaYixin NieMohit Bansal
2020-10-20
Language Representation in Multilingual BERTand its applications to improve Cross-lingual Generalization
Chi-Liang LiuTsung-Yuan HsuYung-Sung ChuangHung-Yi Lee
2020-10-20
PROP: Pre-training with Representative Words Prediction for Ad-hoc Retrieval
| Xinyu MaJiafeng GuoRuqing ZhangYixing FanXiang JiXueqi Cheng
2020-10-20
Text Classification of COVID-19 Press Briefings using BERT and Convolutional Neural Networks
Kakia Chatsiou
2020-10-20
BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search
Yunjiang JiangYue ShangZiyang LiuHongwei ShenYun XiaoWei XiongSulong XuWeipeng YanDi Jin
2020-10-20
Optimal Subarchitecture Extraction For BERT
| Adrian de WynterDaniel J. Perry
2020-10-20
CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters
| Hicham El BoukkouriOlivier FerretThomas LavergneHiroshi NojiPierre ZweigenbaumJunichi Tsujii
2020-10-20
Cross-Lingual Transfer in Zero-Shot Cross-Language Entity Linking
Elliot SchumacherJames MayfieldMark Dredze
2020-10-19
ColloQL: Robust Cross-Domain Text-to-SQL Over Search Queries
Karthik RadhakrishnanArvind SrikantanXi Victoria Lin
2020-10-19
BERTnesia: Investigating the capture and forgetting of knowledge in BERT
Jonas WallatJaspreet SinghAvishek Anand
2020-10-19
Cold-start Active Learning through Self-supervised Language Modeling
| Michelle YuanHsuan-Tien LinJordan Boyd-Graber
2020-10-19
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering
Jeroen OfferijnsSuzan VerberneTessa Verhoef
2020-10-19
Drug Repurposing for COVID-19 via Knowledge Graph Completion
Rui ZhangDimitar HristovskiDalton SchutteAndrej KastrinMarcelo FiszmanHalil Kilicoglu
2020-10-19
Parameter Norm Growth During Training of Transformers
William MerrillVivek RamanujanYoav GoldbergRoy SchwartzNoah Smith
2020-10-19
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification
| Abdullatif KöksalArzucan Özgür
2020-10-19
Towards Interpreting BERT for Reading Comprehension Based QA
| Sahana RamnathPreksha NemaDeep SahniMitesh M. Khapra
2020-10-18
Explaining and Improving Model Behavior with k Nearest Neighbor Representations
Nazneen Fatema RajaniBen KrauseWengpeng YinTong NiuRichard SocherCaiming Xiong
2020-10-18
TweetBERT: A Pretrained Language Representation Model for Twitter Text Analysis
| Mohiuddin Md Abdul QudarVijay Mago
2020-10-17
Answer-checking in Context: A Multi-modal FullyAttention Network for Visual Question Answering
Hantao HuangTao HanWei HanDeep YapCheng-Ming Chiang
2020-10-17
HABERTOR: An Efficient and Effective Deep Hatespeech Detector
Thanh TranYifan HuChangwei HuKevin YenFei TanKyumin LeeSerim Park
2020-10-17
Question Answering over Knowledge Base using Language Model Embeddings
Sai Sharath JapaRekabdar Banafsheh
2020-10-17
Linguistically-Informed Transformations (LIT): A Method forAutomatically Generating Contrast Sets
| Chuanrong LiLin ShengshuoLeo Z. LiuXinyi WuXuhui ZhouShane Steinert-Threlkeld
2020-10-16
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
Wissam SibliniMohamed ChallalCharlotte Pasqual
2020-10-16
It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT
Hila GonenShauli RavfogelYanai ElazarYoav Goldberg
2020-10-16
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge XueBowen YuZhenyu ZhangTingwen LiuYue ZhangBin Wang
2020-10-16
Neural Deepfake Detection with Factual Structure of Text
Wanjun ZhongDuyu TangZenan XuRuize WangNan DuanMing ZhouJiahai WangJian Yin
2020-10-15
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Zhengxuan WuDesmond C. Ong
2020-10-15
Does Chinese BERT Encode Word Structure?
| Yile WangLeyang CuiYue Zhang
2020-10-15
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Phillip KeungJulian SalazarYichao LuNoah A. Smith
2020-10-15
Response Selection for Multi-Party Conversations withDynamic Topic Tracking
Weishi Wang§Shafiq Joty§Steven C. H. Hoi
2020-10-15
DA-Transformer: Distance-aware Transformer
Chuhan WuFangzhao WuYongfeng Huang
2020-10-14
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models
Zihan ZhaoYuncong LiuLu ChenQi LiuRao MaKai Yu
2020-10-14
Geometry matters: Exploring language examples at the decision boundary
Debajyoti DattaShashwat KumarLaura BarnesTom Fletcher
2020-10-14
Decoding Methods for Neural Narrative Generation
| Alexandra DeLuciaAaron MuellerXiang Lisa LiJoão Sedoc
2020-10-14
No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection
| Debanjana KarMohit BhardwajSuranjana SamantaAmar Prakash Azad
2020-10-14
Probing for Multilingual Numerical Understanding in Transformer-Based Language Models
| Devin JohnsonDenise MakDrew BarkerLexi Loessberg-Zahl
2020-10-13
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
| Jianquan LiXiaokang LiuHonghong ZhaoRuifeng XuMin YangYaohong Jin
2020-10-13
Incorporating BERT into Parallel Sequence Decoding with Adapters
| Junliang GuoZhirui ZhangLinli XuHao-Ran WeiBoxing ChenEnhong Chen
2020-10-13
Improving Text Generation Evaluation with Batch Centering and Tempered Word Mover Distance
Xi ChenNan DingTomer LevinboimRadu Soricut
2020-10-13
The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation
Shai GretzYonatan BiluEdo Cohen-KarlikNoam Slonim
2020-10-13
CAPT: Contrastive Pre-Training for LearningDenoised Sequence Representations
Fuli LuoPengcheng YangShicheng LiXuancheng RenXu sun
2020-10-13
Aspect-based Document Similarity for Research Papers
| Malte OstendorffTerry RuasTill BlumeBela GippGeorg Rehm
2020-10-13
Multilingual Argument Mining: Datasets and Analysis
Orith Toledo-RonenMatan OrbachYonatan BiluArtem SpectorNoam Slonim
2020-10-13
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy LinRodrigo NogueiraAndrew Yates
2020-10-13
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. HwangChandra BhagavatulaRonan Le BrasJeff DaKeisuke SakaguchiAntoine BosselutYejin Choi
2020-10-12
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. BirdAnikó EkártDiego R. Faria
2020-10-12
Zero-shot Entity Linking with Efficient Long Range Sequence Modeling
| Zonghai YaoLiangliang CaoHuapu Pan
2020-10-12
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana KarSuranjana SamantaAmar Prakash Azad
2020-10-12
Counterfactual Variable Control for Robust and Interpretable Question Answering
| Sicheng YuYulei NiuShuohang WangJing JiangQianru Sun
2020-10-12
Improving Compositional Generalization in Semantic Parsing
| Inbar OrenJonathan HerzigNitish GuptaMatt GardnerJonathan Berant
2020-10-12
HUJI-KU at MRP~2020: Two Transition-based Neural Parsers
Ofir ArvivRuixiang CuiDaniel Hershcovich
2020-10-12
Probing Pretrained Language Models for Lexical Semantics
Ivan VulićEdoardo Maria PontiRobert LitschkoGoran GlavašAnna Korhonen
2020-10-12
EFSG: Evolutionary Fooling Sentences Generator
Marco Di GiovanniMarco Brambilla
2020-10-12
Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations
Nikolaos ManginasIlias ChalkidisProdromos Malakasiotis
2020-10-12
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks
| Steffen EgerYannik Benz
2020-10-12
Load What You Need: Smaller Versions of Multilingual BERT
| Amine AbdaouiCamille PradelGrégoire Sigel
2020-10-12
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Zuchao LiHai ZhaoRui WangKehai ChenMasao UtiyamaEiichiro Sumita
2020-10-11
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
| Alex WarstadtYian ZhangHaau-Sing LiHaokun LiuSamuel R. Bowman
2020-10-11
Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only
Ziyi LiuGiannis KaramanolakisDaniel HsuLuis Gravano
2020-10-11
Connecting the Dots Between Fact Verification and Fake News Detection
Qifei LiWangchunshu Zhou
2020-10-11
Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
Shauli RavfogelYanai ElazarJacob GoldbergerYoav Goldberg
2020-10-11
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
| Brielen MadureiraDavid Schlangen
2020-10-11
Data Agnostic RoBERTa-based Natural Language to SQL Query Generation
| Debaditya PalHarsh SharmaKaustubh Chaudhari
2020-10-11
SMYRF: Efficient Attention using Asymmetric Clustering
| Giannis DarasNikita KitaevAugustus OdenaAlexandros G. Dimakis
2020-10-11
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougue BiyongBo wangTerry LyonsAlejo J Nevado-Holgado
2020-10-10
Tag Recommendation for Online Q&A Communities based on BERT Pre-Training Technique
Navid KhezrianJafar HabibiIssa Annamoradnejad
2020-10-10
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings
Prafull PrakashSaurabh Kumar ShashidharWenlong ZhaoSubendhu RongaliHaidar KhanMichael Kayser
2020-10-10
Automated Concatenation of Embeddings for Structured Prediction
| Xinyu WangYong JiangNguyen BachTao WangZhongqiang HuangFei HuangKewei Tu
2020-10-10
Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training
| Xinyu WangKewei Tu
2020-10-10
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin CaoJun WangWael HamzaKelly VaneeShang-Wen Li
2020-10-09
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
| João A. LeiteDiego F. SilvaKalina BontchevaCarolina Scarton
2020-10-09
Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction
Zhen WuChengcan YingFei ZhaoZhifang FanXinyu DaiRui Xia
2020-10-09
NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training
| Priyanshu KumarAadarsh Singh
2020-10-09
Deep Learning Meets Projective Clustering
Alaa MaaloufHarry LangDaniela RusDan Feldman
2020-10-08
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models
Gregory SenayEmmanuelle Salin
2020-10-08
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui HuangHong-Kwang KuoSamuel ThomasZvi KonsKartik AudhkhasiBrian KingsburyRon HooryMichael Picheny
2020-10-08
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
| Yun HeZhuoer WangYin ZhangRuihong HuangJames Caverlee
2020-10-08
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
| Yun HeZiwei ZhuYin ZhangQin ChenJames Caverlee
2020-10-08
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference
| Xiaoan DingTianyu LiuBaobao ChangZhifang SuiKevin Gimpel
2020-10-08
Improving Attention Mechanism with Query-Value Interaction
Chuhan WuFangzhao WuTao QiYongfeng Huang
2020-10-08
TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Parker RileyNoah ConstantMandy GuoGirish KumarDavid UthusZarana Parekh
2020-10-08
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding
Dechuang TengLibo QinWanxiang CheSendong ZhaoTing Liu
2020-10-08
Automatic generation of reviews of scientific papers
| Anna NikiforovskayaNikolai KapralovAnna VlasovaOleg ShpynovAleksei Shpilman
2020-10-08
Optimizing Transformers with Approximate Computing for Faster, Smaller and more Accurate NLP Models
Amrit NagarajanSanchari SenJacob R. StevensAnand Raghunathan
2020-10-07
Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets
Mihaela GamanRadu Tudor Ionescu
2020-10-07
Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank
| Eleftheria BriakouMarine Carpuat
2020-10-07
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling
Jiecao ChenLiu YangKarthik RamanMichael BenderskyJung-Jung YehYun ZhouMarc NajorkDanyang CaiEhsan Emadzadeh
2020-10-07
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max GlocknerIvan HabernalIryna Gurevych
2020-10-07
ELMo and BERT in semantic change detection for Russian
Julia RodinaYuliya TrofimovaAndrey KutuzovEkaterina Artemova
2020-10-07
Investigating African-American Vernacular English in Transformer-Based Text Generation
Sophie GroenwoldLily OuAesha ParekhSamhita HonnavalliSharon LevyDiba MirzaWilliam Yang Wang
2020-10-06
Do Explicit Alignments Robustly Improve Multilingual Encoders?
Shijie WuMark Dredze
2020-10-06
LEGAL-BERT: The Muppets straight out of Law School
Ilias ChalkidisManos FergadiotisProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-06
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher
| Giannis KaramanolakisDaniel HsuLuis Gravano
2020-10-06
The Multilingual Amazon Reviews Corpus
Phillip KeungYichao LuGyörgy SzarvasNoah A. Smith
2020-10-06
Scene Graph Modification Based on Natural Language Commands
| Xuanli HeQuan Hung TranGholamreza HaffariWalter ChangTrung BuiZhe LinFranck DernoncourtNhan Dam
2020-10-06
Converting the Point of View of Messages Spoken to Virtual Assistants
| Isabelle G. LeeVera ZuSai Srujana BuddiDennis LiangJack G. M. FitzGerald
2020-10-06
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers
| Marius MosbachAnna KhokhlovaMichael A. HedderichDietrich Klakow
2020-10-06
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
| Sebastian HofstätterSophia AlthammerMichael SchröderMete SertkanAllan Hanbury
2020-10-06
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng ChenChia-Jung Lee
2020-10-06
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder
Alvin ChanYi TayYew-Soon OngAston Zhang
2020-10-06
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations
| Aina Garí SolerMarianna Apidianaki
2020-10-06
Analyzing Individual Neurons in Pre-trained Language Models
Nadir DurraniHassan SajjadFahim DalviYonatan Belinkov
2020-10-06
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
| Minki KangMoonsu HanSung Ju Hwang
2020-10-06
Intrinsic Probing through Dimension Selection
Lucas Torroba HennigenAdina WilliamsRyan Cotterell
2020-10-06
Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming
Kanishka MisraAllyson EttingerJulia Taylor Rayz
2020-10-06
PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation
Xinyu HuaLu Wang
2020-10-05
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Boxin WangShuohang WangYu ChengZhe GanRuoxi JiaBo LiJingjing Liu
2020-10-05
Mixup-Transfomer: Dynamic Data Augmentation for NLP Tasks
Lichao SunCongying XiaWenpeng YinTingTing LiangPhilip S. YuLifang He
2020-10-05
Self-training Improves Pre-training for Natural Language Understanding
Jingfei DuEdouard GraveBeliz GunelVishrav ChaudharyOnur CelebiMichael AuliVes StoyanovAlexis Conneau
2020-10-05
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne LongpreYu WangChristopher DuBois
2020-10-05
Improving AMR Parsing with Sequence-to-Sequence Pre-training
| Dongqin XuJunhui LiMuhua ZhuMin ZhangGuodong Zhou
2020-10-05
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning
Hanlu WuTengfei MaLingfei WuTariro ManyumwaShouling Ji
2020-10-05
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
Zi LinJeremiah Zhe LiuZi YangNan HuaDan Roth
2020-10-05
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. FengVarun GangalDongyeop KangTeruko MitamuraEduard Hovy
2020-10-05
PMI-Masking: Principled masking of correlated spans
Yoav LevineBarak LenzOpher LieberOmri AbendKevin Leyton-BrownMoshe TennenholtzYoav Shoham
2020-10-05
Linguistic Profiling of a Neural Language Model
Alessio MiaschiDominique BrunatoFelice Dell'OrlettaGiulia Venturi
2020-10-05
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition
Piotr JaniszewskiMateusz SkibaUrszula Walińska
2020-10-05
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
Angel DazaAnette Frank
2020-10-05
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias ChalkidisManos FergadiotisSotiris KotitsasProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-04
Inquisitive Question Generation for High Level Text Comprehension
Wei-Jen KoTe-Yuan ChenYiyan HuangGreg DurrettJunyi Jessy Li
2020-10-04
On Losses for Modern Language Models
Stephane Aroca-OuelletteFrank Rudzicz
2020-10-04
Mining Knowledge for Natural Language Inference from Wikipedia Categories
Mingda ChenZewei ChuKarl StratosKevin Gimpel
2020-10-03
Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles
Amirmohammad KazameiniSamin FatehiYash MehtaSauleh EetemadiErik Cambria
2020-10-03
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media
Xiang DaiSarvnaz KarimiBen HacheyCecile Paris
2020-10-02
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
2020-10-02
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
| Andreas RückléJonas PfeifferIryna Gurevych
2020-10-02
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya YamadaAkari AsaiHiroyuki ShindoHideaki TakedaYuji Matsumoto
2020-10-02
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan ShvartzshnaiderAnanth BalashankarVikas PatidarThomas WiesLakshminarayanan Subramanian
2020-10-01
Examining the rhetorical capacities of neural language models
Zining ZhuChuer PanMohamed AbdallaFrank Rudzicz
2020-10-01
Evaluating Multilingual BERT for Estonian
Claudia KittaskKirill MilintsevichKairit Sirts
2020-10-01
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Michael BenderskyHonglei ZhuangJi MaShuguang HanKeith HallRyan Mcdonald
2020-10-01
Understanding tables with intermediate pre-training
| Julian Martin EisenschlosSyrine KrichineThomas Müller
2020-10-01
Detecting White Supremacist Hate Speech using Domain Specific Word Embedding with Deep Learning and BERT
Hind Saleh AlatawiAreej Maatog AlhothaliKawthar Mustafa Moria
2020-10-01
CoLAKE: Contextualized Language and Knowledge Embedding
| Tianxiang SunYunfan ShaoXipeng QiuQipeng GuoYaru HuXuanjing HuangZheng Zhang
2020-10-01
AUBER: Automated BERT Regularization
Hyun Dong LeeSeongmin LeeU Kang
2020-09-30
BERT for Monolingual and Cross-Lingual Reverse Dictionary
| Hang YanXiaonan LiXipeng Qiu
2020-09-30
A Tale of Two Linkings: Dynamically Gating between Schema Linking and Structural Linking for Text-to-SQL Parsing
| Sanxing ChenAidan SanXiaodong LiuYangfeng Ji
2020-09-30
Gender prediction using limited Twitter Data
Maaike BurghoornMaaike H. T. de BoerStephan Raaijmakers
2020-09-29
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions
| Peter A. Jansen
2020-09-29
TEST_POSITIVE at W-NUT 2020 Shared Task-3: Joint Event Multi-task Learning for Slot Filling in Noisy Text
Chacha ChenChieh-Yang HuangYaqi HouYang ShiEnyan DaiJiaqi Wang
2020-09-29
Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study
Saurabh KulshreshthaJosé Luis Redondo-GarcíaChing-Yun Chang
2020-09-29
MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension
Huaishao LuoYu ShiMing GongLinjun ShouTianrui Li
2020-09-29
The design and implementation of Language Learning Chatbot with XAI using Ontology and Transfer Learning
Nuobei ShiQin ZengRaymond Lee
2020-09-29
Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation
Yinfei YangNing JinKuo LinMandy GuoDaniel Cer
2020-09-29
HINT3: Raising the bar for Intent Detection in the Wild
Gaurav AroraChirag JainManas ChaturvediKrupal Modi
2020-09-29
Contrastive Distillation on Intermediate Representations for Language Model Compression
Siqi SunZhe GanYu ChengYuwei FangShuohang WangJingjing Liu
2020-09-29
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Shikib MehriMihail EricDilek Hakkani-Tur
2020-09-28
Fancy Man Lauches Zippo at WNUT 2020 Shared Task-1: A Bert Case Model for Wet Lab Entity Extraction
Haoding MengQingcheng ZengXiaoyang FangZhexin Liang
2020-09-28
A Simple and Efficient Ensemble Classifier Combining Multiple Neural Network Models on Social Media Datasets in Vietnamese
Huy Duc HuynhHang Thi-Thuy DoKiet Van NguyenNgan Luu-Thuy Nguyen
2020-09-28
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Joo Seong JeongSoojeong KimGyeong-In YuYunseong LeeByung-Gon Chun
2020-09-28
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan ZhangXiubo GengTao QinYunfang WuDaxin Jiang
2020-09-28
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding
Peilin ZhouZhiqi HuangFenglin LiuYuexian Zou
2020-09-28
What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties
Rochelle ChoenniEkaterina Shutova
2020-09-27
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei ZhangLu HouYichun YinLifeng ShangXiao ChenXin JiangQun Liu
2020-09-27
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents
Chejui LiaoTabish ManiarSravanajyothi NAnantha Sharma
2020-09-26
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
| Yifan DingNicholas BotzerTim Weninger
2020-09-25
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
Jean-Philippe CorbeilHadi Abdi Ghadivel
2020-09-25
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Zhaojiang LinAndrea MadottoGenta Indra WinataPascale Fung
2020-09-25
An Unsupervised Sentence Embedding Method byMutual Information Maximization
Yan ZhangRuidan HeZuozhu LiuKwan Hui LimLidong Bing
2020-09-25
A little goes a long way: Improving toxic language classification despite data scarcity
Mika JuutiTommi GröndahlAdrian FlanaganN. Asokan
2020-09-25
A Comparative Study of Feature Types for Age-Based Text Classification
| Anna GlazkovaYury EgorovMaksim Glazkov
2020-09-24
Toward a Thermodynamics of Meaning
Jonathan Scott Enderle
2020-09-24
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and Generation
Huishuang TianKexin YangDayiheng LiuJiancheng Lv
2020-09-24
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences
| Boon Peng YapAndrew Koh Jin JieEng Siong Chng
2020-09-24
A Token-wise CNN-based Method for Sentence Compression
Weiwei HouHanna SuominenPiotr KoniuszSabrina CaldwellTom Gedeon
2020-09-23
On Data Augmentation for Extreme Multi-label Classification
Danqing ZhangTao LiHaiyang ZhangBing Yin
2020-09-22
AutoRC: Improving BERT Based Relation Classification Models via Architecture Search
Wei ZhuXiaoling WangXipeng QiuYuan NiGuotong Xie
2020-09-22
GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis
| Huaishao LuoLei JiTianrui LiNan DuanDaxin Jiang
2020-09-22
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Chris J. KennedyGeoff BaconAlexander SahnClaudia von Vacano
2020-09-22
"When they say weed causes depression, but it's your fav antidepressant": Knowledge-aware Attention Framework for Relationship Extraction
Shweta YadavUsha LokalaRaminta DaniulaityteKrishnaprasad ThirunarayanFrancois LamyAmit Sheth
2020-09-21
Profile Consistency Identification for Open-domain Dialogue Agents
Haoyu SongYan WangWei-Nan ZhangZhengyu ZhaoTing LiuXiaojiang Liu
2020-09-21
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
Congcong WangDavid Lillis
2020-09-21
Latin BERT: A Contextual Language Model for Classical Philology
David BammanPatrick J. Burns
2020-09-21
Dual-path CNN with Max Gated block for Text-Based Person Re-identification
Tinghuai MaMingming YangHuan RongYurong QianYurong QianYuan TianNajlaAl-Nabhan
2020-09-20
Longformer for MS MARCO Document Re-ranking Task
| Ivan SekulićAmir SoleimaniMohammad AliannejadiFabio Crestani
2020-09-20
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
| Ehsan DoostmohammadiMinoo NassajianAdel Rahimi
2020-09-20
VirtualFlow: Decoupling Deep Learning Model Execution from Underlying Hardware
Andrew OrHaoyu ZhangMichael J. Freedman
2020-09-20
Prior Art Search and Reranking for Generated Patent Text
Jieh-Sheng LeeJieh Hsiang
2020-09-19
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan PilaultAmine ElhattamiChristopher Pal
2020-09-19
Nominal Compound Chain Extraction: A New Task for Semantic-enriched Lexical Chain
Bobo LiHao FeiYafeng RenDonghong Ji
2020-09-19
Will it Unblend?
Yuval PinterCassandra L. JacobsJacob Eisenstein
2020-09-18
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon RohHuiseong GimSoo-Young Lee
2020-09-18
The birth of Romanian BERT
Stefan Daniel DumitrescuAndrei-Marius AvramSampo Pyysalo
2020-09-18
fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP
Zhichao GengHang YanXipeng QiuXuanjing Huang
2020-09-18
NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative
Kumud Chauhan
2020-09-18
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation
Po LiLei LiYan FuJun RongYu Zhang
2020-09-17
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing LiZhenglun KongTianyun ZhangJi LiZhengang LiHang LiuCaiwen Ding
2020-09-17
Multi^2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERT
Youngbin RoYukyung LeePilsung Kang
2020-09-17
DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction
| Aadarsh SinghPriyanshu KumarAman Sinha
2020-09-17
Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA
Ieva StaliūnaitėIgnacio Iacobacci
2020-09-17
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib AfridiAftab AlamMuhammad Numan KhanJawad KhanYoung-Koo Lee
2020-09-17
Solomon at SemEval-2020 Task 11: Ensemble Architecture for Fine-Tuned Propaganda Detection in News Articles
Mayank RajAjay JaiswalRohit R. RAnkita GuptaSudeep Kumar SahooVertika SrivastavaYeon Hyang Kim
2020-09-16
Simplified TinyBERT: Knowledge Distillation for Document Retrieval
Xuanang ChenBen HeKai HuiLe SunYingfei Sun
2020-09-16
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
| Jian GuanMinlie Huang
2020-09-16
Deep Learning Approaches for Extracting Adverse Events and Indications of Dietary Supplements from Clinical Text
Yadan FanSicheng ZhouYifan LiRui Zhang
2020-09-16
DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT
SungMin YangSoYeop YooOkRan Jeong
2020-09-15
Augmented Natural Language for Generative Sequence Labeling
Ben AthiwaratkunCicero Nogueira dos SantosJason KroneBing Xiang
2020-09-15
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffieAlex Newhouse
2020-09-15
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
| Xiang GaoYizhe ZhangMichel GalleyChris BrockettBill Dolan
2020-09-15
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
| Timo SchickHinrich Schütze
2020-09-15
Critical Thinking for Language Models
Gregor Betz
2020-09-15
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Wei NiuZhenglun KongGeng YuanWeiwen JiangJiexiong GuanCaiwen DingPu ZhaoSijia LiuBin RenYanzhi Wang
2020-09-15
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis ClouatrePhilippe TrempeAmal ZouaqSarath Chandar
2020-09-15
Event Presence Prediction Helps Trigger Detection Across Languages
Parul AwasthyTahira NaseemJian NiTaesun MoonRadu Florian
2020-09-15
Lessons Learned from Applying off-the-shelf BERT: There is no SilverBullet
Victor MakarenkovLior Rokach
2020-09-15
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi ZhengKai HuiBen HeXianpei HanLe SunAndrew Yates
2020-09-15
Efficient Transformers: A Survey
Yi TayMostafa DehghaniDara BahriDonald Metzler
2020-09-14
Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
Longxiang LiuZhuosheng ZhangHai ZhaoXi ZhouXiang Zhou
2020-09-14
GeDi: Generative Discriminator Guided Sequence Generation
Ben KrauseAkhilesh Deepak GotmareBryan McCannNitish Shirish KeskarShafiq JotyRichard SocherNazneen Fatema Rajani
2020-09-14
Can Fine-tuning Pre-trained Models Lead to Perfect NLP? A Study of the Generalizability of Relation Extraction
Ningyu ZhangLuoqiu LiShumin DengHaiyang YuXu ChengWei ZhangHuajun Chen
2020-09-14
Beyond Accuracy: ROI-driven Data Analytics of Empirical Data
Gouri DeshpandeGuenther Ruhe
2020-09-14
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang WangLuowei ZhouZhe GanYen-Chun ChenYuwei FangSiqi SunYu ChengJingjing Liu
2020-09-13
BoostingBERT:Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen HuangQingyun SheJunlin Zhang
2020-09-13
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash BabuRajagopal Eswari
2020-09-12
Country Image in COVID-19 Pandemic: A Case Study of China
Huimin ChenZeyu ZhuFanchao QiYining YeZhiyuan LiuMaosong SunJianbin Jin
2020-09-12
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua ChenHuyen Nguyen
2020-09-12
Unit Test Case Generation with Transformers
Michele TufanoDawn DrainAlexey SvyatkovskiyShao Kun DengNeel Sundaresan
2020-09-11
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
Murad TukanAlaa MaaloufMatan WekslerDan Feldman
2020-09-11
UPB at SemEval-2020 Task 6: Pretrained Language Models for DefinitionExtraction
Andrei-Marius AvramDumitru-Clementin CercelCostin-Gabriel Chiru
2020-09-11
UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT
Andrei ParaschivDumitru-Clementin CercelMihai Dascalu
2020-09-11
A Comparison of LSTM and BERT for Small Corpus
Aysu Ezen-Can
2020-09-11
FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
Yuwei FangShuohang WangZhe GanSiqi SunJingjing Liu
2020-09-10
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas AffolterBeni EgressyDamian PascualRoger Wattenhofer
2020-09-10
Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection
| Taesun WhangDongyub LeeDongsuk OhChanhee LeeKijong HanDong-hun LeeSaebyeok Lee
2020-09-10
Modern Methods for Text Generation
| Dimas Munoz Montesinos
2020-09-10
Investigating Gender Bias in BERT
Rishabh BhardwajNavonil MajumderSoujanya Poria
2020-09-10
Pay Attention when Required
Swetha MandavaSzymon MigaczAlex Fit Florea
2020-09-09
Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability
Mayank ChhipaHrushikesh Mahesh VazurkarAbhijeet KumarMridul Mishra
2020-09-09
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
Zhengjie HuangShikun FengWeiyue SuXuyi ChenShuohuan WangJiaxiang LiuXuan OuyangYu Sun
2020-09-08
Improving Language Generation with Sentence Coherence Objective
Ruixiao SunJie YangMehrdad Yousefzadeh
2020-09-07
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing
Josh KalinMatthew CiolinoDavid NoeverGerry Dozier
2020-09-07
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui ZhangZixuan YuanYanchi LiuFuzhen ZhuangHui Xiong
2020-09-07
Measuring Massive Multitask Language Understanding
Dan HendrycksCollin BurnsSteven BasartAndy ZouMantas MazeikaDawn SongJacob Steinhardt
2020-09-07
EdinburghNLP at WNUT-2020 Task 2: Leveraging Transformers with Generalized Augmentation for Identifying Informativeness in COVID-19 Tweets
Nickil Maveli
2020-09-06
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
2020-09-06
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan WilliamsPaul RodriguesValerie Novak
2020-09-05
Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading
Sasi Kiran GaddipatiDeebul NairPaul G. Plöger
2020-09-02
Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification
Bibek UpadhayayVahid Behzadan
2020-09-01
Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation
Wilson LauLaura AaltonenMartin GunnMeliha Yetisgen
2020-09-01
A Bidirectional Tree Tagging Scheme for Jointly Extracting Overlapping Entities and Relations
Xukun LuoWeijie LiuMeng MaPing Wang
2020-08-31
SocCogCom at SemEval-2020 Task 11: Characterizing and Detecting Propaganda using Sentence-Level Emotional Salience Features
Gangeshwar KrishnamurthyRaj Kumar GuptaYinping Yang
2020-08-29
Rethinking the objectives of extractive question answering
Martin FajcikJosef JonSantosh KesirajuPavel Smrz
2020-08-28
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
2020-08-28
DAVE: Deriving Automatically Verilog from English
Hammond PearceBenjamin TanRamesh Karri
2020-08-27
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong ZhangHang Li
2020-08-27
MultiGBS: A multi-layer graph approach to biomedical summarization
Ensieh DavoodijamNasser GhadiriMaryam Lotfi ShahrezaFabio Rinaldi
2020-08-27
Query Focused Multi-document Summarisation of Biomedical Texts
Diego MollaChristopher JonesVincent Nguyen
2020-08-27
GREEK-BERT: The Greeks visiting Sesame Street
John KoutsikakisIlias ChalkidisProdromos MalakasiotisIon Androutsopoulos
2020-08-27
Entity and Evidence Guided Relation Extraction for DocRED
Kevin HuangGuangtao WangTengyu MaJing Huang
2020-08-27
APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Hanlin TangShaoduo GanSamyam RajbhandariXiangru LianCe ZhangJi LiuYuxiong He
2020-08-26
Language Models and Word Sense Disambiguation: An Overview and Analysis
| Daniel LoureiroKiamehr RezaeeMohammad Taher PilehvarJose Camacho-Collados
2020-08-26
Discrete Word Embedding for Logical Natural Language Understanding
Masataro AsaiZilu Tang
2020-08-26
Conceptualized Representation Learning for Chinese Biomedical Text Mining
| Ningyu ZhangQianghuai JiaKangping YinLiang DongFeng GaoNengwei Hua
2020-08-25
syrapropa at SemEval-2020 Task 11: BERT-based Models Design For Propagandistic Technique and Span Detection
Jinfen LiLu Xiao
2020-08-24
Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources
Taolin ZhangChengyu WangMinghui QiuBite YangXiaofeng HeJun Huang
2020-08-24
Two Stages Approach for Tweet Engagement Prediction
Amine DadounIsmail HarrandoPasquale LisenaAlison ReboudRaphael Troncy
2020-08-24
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III
Brent BisedaGaurav DesaiHaifeng LinAnish Philip
2020-08-24
YNU-HPCC at SemEval-2020 Task 11: LSTM Network for Detection of Propaganda Techniques in News Articles
Jiaxu DaoJin WangXuejie Zhang
2020-08-24
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT
| Omar MossadAmgad AhmedAnandharaju RajuHari KarthikeyanZayed Ahmed
2020-08-22
Applications of BERT Based Sequence Tagging Models on Chinese Medical Text Attributes Extraction
Gang ZhaoTeng ZhangChenxiao WangPing LvJi Wu
2020-08-22
HinglishNLP: Fine-tuned Language Models for Hinglish Sentiment Detection
Meghana BhangeNirant Kasliwal
2020-08-22
CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection
| Verena BlaschkeMaxim KorniyenkoSam Tureski
2020-08-22
DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda Classification
Anastasios BairaktarisSymeon SymeonidisAvi Arampatzis
2020-08-22
Adapting Event Extractors to Medical Data: Bridging the Covariate Shift
Aakanksha NaikJill LehmanCarolyn Rose
2020-08-21
Abstractive Summarization of Spoken andWritten Instructions with BERT
Alexandra SavelievaBryan Au-YeungVasanth Ramani
2020-08-21
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
| Diedre CarmoMarcos PiauIsrael CampiottiRodrigo NogueiraRoberto Lotufo
2020-08-20
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension
Son T. LuuKiet Van NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-08-20
Lite Training Strategies for Portuguese-English and English-Portuguese Translation
Alexandre LopesRodrigo NogueiraRoberto LotufoHelio Pedrini
2020-08-20
UoB at SemEval-2020 Task 12: Boosting BERT with Corpus Level Information
Wah Meng LimHarish Tayyar Madabushi
2020-08-19
Ranking Clarification Questions via Natural Language Inference
Vaibhav KumarVikas RaunakJamie Callan
2020-08-18
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara BahriYi TayChe ZhengDonald MetzlerCliff BrunkAndrew Tomkins
2020-08-17
Narrative Interpolation for Generating and Understanding Stories
Su WangGreg DurrettKatrin Erk
2020-08-17
Stock Index Prediction with Multi-task Learning and Word Polarity Over Time
Yue ZhouKerstin Voigt
2020-08-17
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis YoshidaAllyson EttingerKevin Gimpel
2020-08-16
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
| Shengyu ZhangTan JiangTan WangKun KuangZhou ZhaoJianke ZhuJin YuHongxia YangFei Wu
2020-08-16
Jointly Fine-Tuning “BERT-like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition
| Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry TsaiJayden OoiChun-Sung FerngHyung Won ChungJason Riesa
2020-08-15
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
2020-08-14
Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
Marzieh MozafariReza FarahbakhshNoel Crespi
2020-08-14
MICE: Mining Idioms with Contextual Embeddings
Tadej ŠkvorcPolona GantarMarko Robnik-Šikonja
2020-08-13
ANDES at SemEval-2020 Task 12: A jointly-trained BERT multilingual model for offensive language detection
| Juan Manuel PérezAymé ArangoFranco Luque
2020-08-13
Variance-reduced Language Pretraining via a Mask Proposal Network
Liang Chen
2020-08-12
FireBERT: Hardening BERT-based classifiers against adversarial attack
Gunnar MeinKevin HartmanAndrew Morris
2020-08-10
Navigating Language Models with Synthetic Agents
Philip Feldman
2020-08-10
KR-BERT: A Small-Scale Korean-Specific Language Model
| Sangah LeeHansol JangYunmee BaikSuzi ParkHyopil Shin
2020-08-10
Does BERT Solve Commonsense Task via Commonsense Knowledge?
Leyang CuiSijie ChengYu WuYue Zhang
2020-08-10
Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine
Kuan FangLong ZhaoZhan ShenRuiXing WangRiKang ZhourLiWen Fan
2020-08-10
GANBERT: Generative Adversarial Networks with Bidirectional Encoder Representations from Transformers for MRI to PET synthesis
Hoo-Chang ShinAlvin IhsaniSwetha MandavaSharath Turuvekere SreenivasChristopher ForsterJiook ChaAlzheimer's Disease Neuroimaging Initiative
2020-08-10
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
| Hayato FutamiHirofumi InagumaSei UenoMasato MimuraShinsuke SakaiTatsuya Kawahara
2020-08-09
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-08-09
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
Amirreza ShiraniFranck DernoncourtNedim LipkaPaul AsenteJose EchevarriaThamar Solorio
2020-08-07
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang JiangWeihao YuDaquan ZhouYunpeng ChenJiashi FengShuicheng Yan
2020-08-06
DeText: A Deep Text Ranking Framework with BERT
| Weiwei GuoXiaowei LiuSida WangHuiji GaoAnanth SankarZimeng YangQi GuoLiang ZhangBo LongBee-Chung ChenDeepak Agarwal
2020-08-06
aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning
| Anton ChernyavskiyDmitry IlvovskyPreslav Nakov
2020-08-06
I-AID: Identifying Actionable Information from Disaster-related Tweets
Hamada M. ZaheraRricha JalotaMohamed A. SherifAxel N. Ngomo
2020-08-04
Taking Notes on the Fly Helps BERT Pre-training
Qiyu WuChen XingYatao LiGuolin KeDi HeTie-Yan Liu
2020-08-04
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer
Hwijeen AhnJimin SunChan Young ParkJungyun Seo
2020-08-04
Improving One-stage Visual Grounding by Recursive Sub-query Construction
| Zhengyuan YangTianlang ChenLiwei WangJiebo Luo
2020-08-03
[email protected] at SemEval-2020 Task 12: Multilingual or language-specific BERT?
Marc PàmiesEmily ÖhmanKaisla KajavaJörg Tiedemann
2020-08-03
Trojaning Language Models for Fun and Profit
Xinyang ZhangZheng ZhangTing Wang
2020-08-01
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang LinXin LiGennady Pekhimenko
2020-08-01
On Learning Universal Representations Across Languages
Xiangpeng WeiYue HuRongxiang WengLuxi XingHeng YuWeihua Luo
2020-07-31
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu GuRobert TinnHao ChengMichael LucasNaoto UsuyamaXiaodong LiuTristan NaumannJianfeng GaoHoifung Poon
2020-07-31
TweepFake: about Detecting Deepfake Tweets
Tiziano FagniFabrizio FalchiMargherita GambiniAntonio MartellaMaurizio Tesconi
2020-07-31
Model Reduction of Shallow CNN Model for Reliable Deployment of Information Extraction from Medical Reports
Abhishek K DubeyAlina PelusoJacob HinkleDevanshu AgarawalZilong Tan
2020-07-31
What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation
| Gustavo PenhaClaudia Hauff
2020-07-30
Depressive, Drug Abusive, or Informative: Knowledge-aware Study of News Exposure during COVID-19 Outbreak
Amanuel AlamboManas GaurKrishnaprasad Thirunarayan
2020-07-30
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
| Shayne LongpreYi LuJoachim Daiber
2020-07-30
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
TJ TsaiKevin Ji
2020-07-29
Improving Results on Russian Sentiment Datasets
| Anton GolubevNatalia Loukachevitch
2020-07-28
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models
Martin FajcikJosef JonMartin DocekalPavel Smrz
2020-07-28
Variants of BERT, Random Forests and SVM approach for Multimodal Emotion-Target Sub-challenge
Hoang Manh HungHyung-Jeong YangSoo-Hyung KimGuee-Sang Lee
2020-07-28
GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection
Sajad SotudehTong XiangHao-Ren YaoSean MacAvaneyEugene YangNazli GoharianOphir Frieder
2020-07-28
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets
Manoel Veríssimo dos Santos NetoAyrton Denner da Silva AmaralNádia Félix Felipe da SilvaAnderson da Silva Soares
2020-07-28
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media
| Ali SafayaMoutasem AbdullatifDeniz Yuret
2020-07-26
Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to Code-Mixed Sentiment Analysis
Vinay GopalanMark Hopkins
2020-07-26
To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection
Aparna BalagopalanBenjamin EyreFrank RudziczJekaterina Novikova
2020-07-26
MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning
Aina Garí SolerMarianna Apidianaki
2020-07-24
Product Title Generation for Conversational Systems using BERT
Mansi Ranjit ManeShashank KediaAditya ManthaStephen GuoKannan Achan
2020-07-23
The Lottery Ticket Hypothesis for Pre-trained BERT Networks
| Tianlong ChenJonathan FrankleShiyu ChangSijia LiuYang ZhangZhangyang WangMichael Carbin
2020-07-23
IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings
Vishal KeswaniSakshi SinghAshutosh Modi
2020-07-22
Multi-task learning for natural language processing in the 2020s: where are we going?
Joseph WorshamJugal Kalita
2020-07-22
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches
Karishma LaudJagriti SinghRandeep Kumar SahuAshutosh Modi
2020-07-21
newsSweeper at SemEval-2020 Task 11: Context-Aware Rich Feature Representations For Propaganda Classification
| Paramansh SinghSiraj SandhuSubham KumarAshutosh Modi
2020-07-21
Word Representation for Rhythms
Tongyu LuLyucheng YanGus Xia
2020-07-21
Understanding BERT Rankers Under Distillation
Luyu GaoZhuyun DaiJamie Callan
2020-07-21
A Comparison of Supervised Learning to Match Methods for Product Search
| Fatemeh SarviNikos VoskaridesLois MooimanSebastian SchelterMaarten de Rijke
2020-07-20
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks
Diego de Vargas FeijoViviane Pereira Moreira
2020-07-19
Multi-Perspective Semantic Information Retrieval in the Biomedical Domain
Samarth Rawal
2020-07-17
Investigating Pretrained Language Models for Graph-to-Text Generation
Leonardo F. R. RibeiroMartin SchmittHinrich SchützeIryna Gurevych
2020-07-16
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-16
Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT
Yang Jiao
2020-07-16
Hopfield Networks is All You Need
| Hubert RamsauerBernhard SchäflJohannes LehnerPhilipp SeidlMichael WidrichLukas GruberMarkus HolzleitnerMilena PavlovićGeir Kjetil SandveVictor GreiffDavid KreilMichael KoppGünter KlambauerJohannes BrandstetterSepp Hochreiter
2020-07-16
Fine-Tune Longformer for Jointly Predicting Rumor Stance and Veracity
Anant Khandelwal
2020-07-15
AdapterHub: A Framework for Adapting Transformers
| Jonas PfeifferAndreas RückléClifton PothAishwarya KamathIvan VulićSebastian RuderKyunghyun ChoIryna Gurevych
2020-07-15
Multimodal Word Sense Disambiguation in Creative Practice
Manuel Ladron de GuevaraChristopher GeorgeAkshat GuptaDaragh ByrneRamesh Krishnamurti
2020-07-15
Logic Constrained Pointer Networks for Interpretable Textual Similarity
| Subhadeep MajiRohan KumarManish BansalKalyani RoyPawan Goyal
2020-07-15
Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-based Neural Networks
Pavel BlinovManvel AvetisianVladimir KokhDmitry UmerenkovAlexander Tuzhilin
2020-07-15
Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media
| Alberto Barron-CedenoTamer ElsayedPreslav NakovGiovanni Da San MartinoMaram HasanainReem SuwailehFatima HaouariNikolay BabulkovBayan HamdanAlex NikolovShaden ShaarZien Sheikh Ali
2020-07-15
Deep Reinforced Query Reformulation for Information Retrieval
Xiao WangCraig MacdonaldIadh Ounis
2020-07-15
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-07-14
Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR
Balázs TarjánGyörgy SzaszákTibor FegyóPéter Mihajlik
2020-07-14
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-14
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu TuGarima LalwaniSpandana GellaHe He
2020-07-14
Can neural networks acquire a structural bias from raw linguistic data?
Alex WarstadtSamuel R. Bowman
2020-07-14
Emoji Prediction: Extensions and Benchmarking
Weicheng MaRuibo LiuLili WangSoroush Vosoughi
2020-07-14
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda KhadkaEstelle AflaloMattias MarderAvrech Ben-DavidSantiago MiretHanlin TangShie MannorTamir HazanSomdeb Majumdar
2020-07-14
Add a SideNet to your MainNet
Adrien Morisot
2020-07-14
An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets
Aarzoo DhimanDurga Toshniwal
2020-07-13
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Yi TayZhe ZhaoDara BahriDonald MetzlerDa-Cheng Juan
2020-07-12
Generative Graph Perturbations for Scene Graph Prediction
Boris KnyazevHarm de VriesCătălina CangeaGraham W. TaylorAaron CourvilleEugene Belilovsky
2020-07-11
BERT Learns (and Teaches) Chemistry
Josh PayneMario SroujiDian Ang YapVineet Kosaraju
2020-07-11
To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Kristian MiokBlaz SkrljDaniela ZaharieMarko Robnik-Sikonja
2020-07-10
BISON:BM25-weighted Self-Attention Framework for Multi-Fields Document Search
| Xuan ShanChuanjie LiuYiqian XiaQi ChenYusi ZhangAngen LuoYuxiang Luo
2020-07-10
Multi-Dialect Arabic BERT for Country-Level Dialect Identification
| Bashar TalafhaMohammad AliMuhy Eddin Za'terHaitham SeelawiIbraheem TuffahaMostafa SamirWael FarhanHussein T. Al-Natsheh
2020-07-10
Contrastive Code Representation Learning
| Paras JainAjay JainTianjun ZhangPieter AbbeelJoseph E. GonzalezIon Stoica
2020-07-09
Fast Transformers with Clustered Attention
| Apoorv VyasAngelos KatharopoulosFrançois Fleuret
2020-07-09
The Go Transformer: Natural Language Modeling for Game Play
Matthew CiolinoDavid NoeverJosh Kalin
2020-07-07
Continual BERT: Continual Learning for Adaptive Extractive Summarization of COVID-19 Literature
Jong Won Park
2020-07-07
Exploring Heterogeneous Information Networks via Pre-Training
Yang FangXiang ZhaoWeidong Xiao
2020-07-07
Deep Contextual Embeddings for Address Classification in E-commerce
Shreyas MangalgiLakshya KumarRavindra Babu Tallamraju
2020-07-06
You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion
Roei SchusterCongzheng SongEran TromerVitaly Shmatikov
2020-07-05
Text Data Augmentation: Towards better detection of spear-phishing emails
Mehdi ReginaMaxime MeyerSébastien Goutal
2020-07-04
Robust Prediction of Punctuation and Truecasing for Medical ASR
Monica SunkaraSrikanth RonankiKalpit DixitSravan BodapatiKatrin Kirchhoff
2020-07-04
Language-agnostic BERT Sentence Embedding
| Fangxiaoyu FengYinfei YangDaniel CerNaveen ArivazhaganWei Wang
2020-07-03
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Pavel DenisovNgoc Thang Vu
2020-07-03
Reading Comprehension in Czech via Machine Translation and Cross-lingual Transfer
Kateřina MackováMilan Straka
2020-07-03
Playing with Words at the National Library of Sweden -- Making a Swedish BERT
| Martin MalmstenLove BörjesonChris Haffenden
2020-07-03
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-03
MIRA: Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks
Yusi ZhangChuanjie LiuAngen LuoHui XueXuan ShanYuxiang LuoYiqian XiaYuanchi YanHaidong Wang
2020-07-03
Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey
Shivaji AlaparthiManit Mishra
2020-07-02
The Impact of Explanations on AI Competency Prediction in VQA
Kamran AlipourArijit RayXiao LinJurgen P. SchulzeYi YaoGiedrius T. Burachas
2020-07-02
Improving Event Detection using Contextual Word and Sentence Embeddings
Mariano MaisonnaveFernando DelbiancoFernando TohméAna MaguitmanEvangelos Milios
2020-07-02
Information Retrieval and Extraction on COVID-19 Clinical Articles Using Graph Community Detection and Bio-BERT Embeddings
Debasmita DasYatin KatyalJanu VermaShashank DubeyAakashDeep SinghKushagra AgarwalSourojit BhaduriRajeshKumar Ranjan
2020-07-01
Self-supervised context-aware COVID-19 document exploration through atlas grounding
| Dusan GrujicicGorjan RadevskiTinne TuytelaarsMatthew Blaschko
2020-07-01
Transformers on Sarcasm Detection with Context
2020-07-01
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapil{\'\i}kov{\'a}Mikel ArtetxeGorka LabakaEneko AgirreOnd{\v{r}}ej Bojar
2020-07-01
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-01
Unsupervised FAQ Retrieval with Question Generation and BERT
Yosi MassBoaz CarmeliHaggai RoitmanDavid Konopnicki
2020-07-01
GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples
| Danilo CroceGiuseppe CastellucciRoberto Basili
2020-07-01
Integrating Multimodal Information in Large Pretrained Transformers
Wasifur RahmanMd Kamrul HasanSangwu LeeAmirAli Bagher ZadehChengfeng MaoLouis-Philippe MorencyEhsan Hoque
2020-07-01
Modelling Context and Syntactical Features for Aspect-based Sentiment Analysis
Minh Hieu PhanPhilip O. Ogunbona
2020-07-01
Roles and Utilization of Attention Heads in Transformer-based Neural Language Models
Jae-young JoSung-Hyon Myaeng
2020-07-01
Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
Bo PangErik NijkampWenjuan HanLinqi ZhouYixian LiuKewei Tu
2020-07-01
Adversarial and Domain-Aware BERT for Cross-Domain Sentiment Analysis
Chunning DuHaifeng SunJingyu WangQi QiJianxin Liao
2020-07-01
How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope
Yiyun ZhaoSteven Bethard
2020-07-01
Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work?
Yada PruksachatkunJason PhangHaokun LiuPhu Mon HtutXiaoyi ZhangRichard Yuanzhe PangClara VaniaKatharina KannSamuel R. Bowman
2020-07-01
Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences
Yi TayDonovan OngJie FuAlvin ChanNancy ChenAnh Tuan LuuChris Pal
2020-07-01
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-01
Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study
Xinyu XingXiaosheng FanXiaojun Wan
2020-07-01
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fern{\'a}ndez-Gonz{\'a}lezCarlos G{\'o}mez-Rodr{\'\i}guez
2020-07-01
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection
Nicole PeineltDong NguyenMaria Liakata
2020-07-01
Understanding Advertisements with BERT
Kanika KalraBhargav KurmaSilpa Vadakkeeveetil SreelathaManasi PatwardhanKarShirish e
2020-07-01
Feature Projection for Improved Text Classification
Qi QinWenpeng HuBing Liu
2020-07-01
A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization
Dongfang XuZeyu ZhangSteven Bethard
2020-07-01
Revisiting Higher-Order Dependency Parsers
Erick FonsecaAndr{\'e} F. T. Martins
2020-07-01
SUPP.AI: finding evidence for supplement-drug interactions
Lucy WangOyvind TafjordArman CohanSarthak JainSam SkjonsbergCarissa SchoenickNick BotnerWaleed Ammar
2020-07-01
Why is penguin more similar to polar bear than to sea gull? Analyzing conceptual knowledge in distributional models
Pia Sommerauer
2020-07-01
A Simple and Effective Dependency Parser for Telugu
Sneha NallaniManish ShrivastavaDipti Sharma
2020-07-01
Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup
Jishnu Ray ChowdhuryCornelia CarageaDoina Caragea
2020-07-01
Should You Fine-Tune BERT for Automated Essay Scoring?
Elijah MayfieldAlan W Black
2020-07-01
A BERT-based One-Pass Multi-Task Model for Clinical Temporal Relation Extraction
Chen LinTimothy MillerDmitriy DligachFarig SadequeSteven BethardGuergana Savova
2020-07-01
Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity
Yuxia WangFei LiuKarin VerspoorTimothy Baldwin
2020-07-01
Item-based Collaborative Filtering with BERT
Tian WangYuyangzi Fu
2020-07-01
Sarcasm Identification and Detection in Conversion Context using BERT
Kalaivani A.Thenmozhi D.
2020-07-01
Neural Sarcasm Detection using Conversation Context
Nikhil Jaiswal
2020-07-01
Context-Aware Sarcasm Detection Using BERT
Arup BaruahKaushik DasFerdous BarbhuiyaKuntal Dey
2020-07-01
Character aware models with similarity learning for metaphor detection
Tarun KumarYashvardhan Sharma
2020-07-01
IlliniMet: Illinois System for Metaphor Detection with Contextual and Linguistic Information
Hongyu GongKshitij GuptaAkriti JainSuma Bhat
2020-07-01
Go Figure! Multi-task transformer-based architecture for metaphor detection using idioms: ETS team in 2020 metaphor shared task
Xianyang ChenChee Wee (Ben) LeongMichael FlorBeata Beigman Klebanov
2020-07-01
Metaphor Detection Using Contextual Word Embeddings From Transformers
Jerry LiuNathan O{'}HaraAlex RubinerRachel DraelosCynthia Rudin
2020-07-01
A Transformer Approach to Contextual Sarcasm Detection in Twitter
Hunter GregorySteven LiPouya MohammadiNatalie TarnRachel DraelosCynthia Rudin
2020-07-01
Turku Enhanced Parser Pipeline: From Raw Text to Enhanced Graphs in the IWPT 2020 Shared Task
Jenna KanervaFilip GinterSampo Pyysalo
2020-07-01
K\opsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Daniel HershcovichMiryam de LhoneuxArtur KulmizevElham PejhanJoakim Nivre
2020-07-01
RobertNLP at the IWPT 2020 Shared Task: Surprisingly Simple Enhanced UD Parsing for English
Stefan Gr{\"u}newaldAnnemarie Friedrich
2020-07-01
The HW-TSC Video Speech Translation System at IWSLT 2020
Minghan WangHao YangYao DengYing QinLizhi LeiDaimeng WeiHengchao ShangNing XieXiaochun LiJiaxian Guo
2020-07-01
CopyBERT: A Unified Approach to Question Generation with Self-Attention
Stalin VaranasiSaadullah AminGuenter Neumann
2020-07-01
Robust Prediction of Punctuation and Truecasing for Medical ASR
Monica SunkaraSrikanth RonankiKalpit DixitSravan BodapatiKatrin Kirchhoff
2020-07-01
Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT
Ashutosh AdhikariAchyudh RamRaphael TangWilliam L. HamiltonJimmy Lin
2020-07-01
Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference
Cemil CengizDeniz Yuret
2020-07-01
A Metric Learning Approach to Misogyny Categorization
Juan Manuel CoriaSahar GhannaySophie RossetHerv{\'e} Bredin
2020-07-01
Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation
Alessio MiaschiFelice Dell{'}Orletta
2020-07-01
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-01
Getting the \#\#life out of living: How Adequate Are Word-Pieces for Modelling Complex Morphology?
Stav KleinReut Tsarfaty
2020-07-01
SentiTel: TABSA for Twitter reviews on Uganda Telecoms
David KabiitoJoyce Nakatumba Nabende
2020-07-01
Adversarial Evaluation of BERT for Biomedical Named Entity Recognition
Vladimir AraujoAndr{\'e}s CarvalloDenis Parra
2020-07-01
Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer
Jianfei YuJing JiangLi YangRui Xia
2020-07-01
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions
Hannah CraigheadAndrew CainesPaula ButteryHelen Yannakoudakis
2020-07-01
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker Recognition to Overcome Data Scarcity
Jordan J. BirdDiego R. FariaAnikó EkártCristiano PremebidaPedro P. S. Ayrosa
2020-07-01
The Summary Loop: Learning to Write Abstractive Summaries Without Examples
| Philippe LabanAndrew HsiJohn CannyMarti A. Hearst
2020-07-01
Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Denny ZhouMao YeChen ChenTianjian MengMingxing TanXiaodan SongQuoc LeQiang LiuDale Schuurmans
2020-07-01
SE3M: A Model for Software Effort Estimation Using Pre-trained Embedding Models
Eliane M. De Bortoli FáveroDalcimar CasanovaAndrey Ricardo Pimentel
2020-06-30
Data Movement Is All You Need: A Case Study on Optimizing Transformers
Andrei IvanovNikoli DrydenTal Ben-NunShigang LiTorsten Hoefler
2020-06-30
Segmentation Approach for Coreference Resolution Task
Aref JafariAli Ghodsi
2020-06-30
Want to Identify, Extract and Normalize Adverse Drug Reactions in Tweets? Use RoBERTa
Katikapalli Subramanyam KalyanS. Sangeetha
2020-06-29
Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models
Viet Bui TheOanh Tran ThiPhuong Le-Hong
2020-06-29
Knowledge-Aware Language Model Pretraining
Corby RossetChenyan XiongMinh PhanXia SongPaul BennettSaurabh Tiwary
2020-06-29
Interpreting Hierarchical Linguistic Interactions in DNNs
Die ZhangHuilin ZhouXiaoyi BaoDa HuoRuizhao ChenXu ChengHao ZhangMengyue WuQuanshi Zhang
2020-06-29
Progressive Generation of Long Text
| Bowen TanZichao YangMaruan AI-ShedivatEric P. XingZhiting Hu
2020-06-28
Rethinking Positional Encoding in Language Pre-training
| Guolin KeDi HeTie-Yan Liu
2020-06-28
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
| Chen LiangYue YuHaoming JiangSiawpeng ErRuijia WangTuo ZhaoChao Zhang
2020-06-28
Video-Grounded Dialogues with Pretrained Generation Language Models
Hung LeSteven C. H. Hoi
2020-06-27
Normalizador Neural de Datas e Endereços
Gustavo PlensackPaulo Finardi
2020-06-27
FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings
| M. Caner TolKoray YurtsevenBerk GulmezogluBerk Sunar
2020-06-25
Normalizing Text using Language Modelling based on Phonetics and String Similarity
Fenil DoshiJimit GandhiDeep GosaliaSudhir Bagul
2020-06-25
LSBert: A Simple Framework for Lexical Simplification
| Jipeng QiangYun LiYi ZhuYunhao YuanXindong Wu
2020-06-25
Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes
Shuai ZhengHaibin LinSheng ZhaMu Li
2020-06-24
Efficient Constituency Parsing by Pointing
Thanh-Tung NguyenXuan-Phi NguyenShafiq JotyXiaoli Li
2020-06-24
ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion
| BingningWangTing YaoQi ZhangJingfang XuXiaochuan Wang
2020-06-22
Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage
Shijing SiRui WangJedrek WosikHao ZhangDavid DovGuoyin WangRicardo HenaoLawrence Carin
2020-06-22
Adaptive Learning Rates with Maximum Variation Averaging
| Chen ZhuYu ChengZhe GanFurong HuangJingjing LiuTom Goldstein
2020-06-21
Sarcasm Detection in Tweets with BERT and GloVe Embeddings
Akshay KhatriPranav PDr. Anand Kumar M
2020-06-20
New Vietnamese Corpus for Machine ReadingComprehension of Health News Articles
Kiet Van NguyenDuc-Vu NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-06-19
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19
| David OnianiYanshan Wang
2020-06-19
SqueezeBERT: What can computer vision teach NLP about efficient neural networks?
Forrest N. IandolaAlbert E. ShawRavi KrishnaKurt W. Keutzer
2020-06-19
Automatically Ranked Russian Paraphrase Corpus for Text Generation
Vadim GudkovOlga MitrofanovaElizaveta Filippskikh
2020-06-17
Exploring the BERT Cross-Lingual Transferability: a Case Study in Reading Comprehension
Konovalov V. P.Gulyaev P. A.Sorokin A. A.Kuratov Y. M.Burtsev M. S.
2020-06-17
Tagging and parsing of multidomain collections
| Alexey SorokinIvan SmurovDenis Kirianov
2020-06-17
Improving accuracy and speeding up Document Image Classification through parallel systems
| Javier FerrandoJuan Luis DominguezJordi TorresRaul GarciaDavid GarciaDaniel GarridoJordi CortadaMateo Valero
2020-06-16
PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models
| Eyal Ben-DavidCarmel RabinovitzRoi Reichart
2020-06-16
The SPPD System for Schema Guided Dialogue State Tracking Challenge
Miao LiHaoqi XiongYunbo Cao
2020-06-16
Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation
Kellie WebsterEmily Pitler
2020-06-16
End-to-End Code Switching Language Models for Automatic Speech Recognition
Ahan M. R.Shreyas Sunil Kulkarni
2020-06-16
Document Classification for COVID-19 Literature
Bernal Jiménez GutiérrezJuncheng ZengDongdong ZhangPing ZhangYu Su
2020-06-15
FinBERT: A Pretrained Language Model for Financial Communications
| Yi YangMark Christopher Siy UYAllen Huang
2020-06-15
Cooking Is All About People: Comment Classification On Cookery Channels Using BERT and Classification Models (Malayalam-English Mix-Code)
Subramaniam KazhuparambilAbhishek Kaushik
2020-06-15
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
Matej UlčarMarko Robnik-Šikonja
2020-06-14
Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya
Abrhalei TelaAbraham WoubieVille Hautamaki
2020-06-13
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
Pedro Javier Ortiz SuárezLaurent RomaryBenoît Sagot
2020-06-11
MC-BERT: Efficient Language Pre-Training via a Meta Controller
| Zhenhui XuLinyuan GongGuolin KeDi HeShuxin ZhengLiwei WangJiang BianTie-Yan Liu
2020-06-10
Revisiting Few-sample BERT Fine-tuning
| Tianyi ZhangFelix WuArzoo KatiyarKilian Q. WeinbergerYoav Artzi
2020-06-10
Unsupervised Paraphrase Generation using Pre-trained Language Models
Chaitra HegdeShrikumar Patil
2020-06-09
Few-Shot Generative Conversational Query Rewriting
| Shi YuJiahua LiuJingqin YangChenyan XiongPaul BennettJianfeng GaoZhiyuan Liu
2020-06-09
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
| Marius MosbachMaksym AndriushchenkoDietrich Klakow
2020-06-08
Pre-training Polish Transformer-based Language Models at Scale
| Sławomir DadasMichał PerełkiewiczRafał Poświata
2020-06-07
Medical Concept Normalization in User Generated Texts by Learning Target Concept Embeddings
Katikapalli Subramanyam KalyanS. Sangeetha
2020-06-07
GMAT: Global Memory Augmentation for Transformers
| Ankit GuptaJonathan Berant
2020-06-05
Accelerating Natural Language Understanding in Task-Oriented Dialog
Ojas AhujaShrey Desai
2020-06-05
UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings
Milan StrakaJana Straková
2020-06-05
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
| Pengcheng HeXiaodong LiuJianfeng GaoWeizhu Chen
2020-06-05
The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain
Annemarie FriedrichHeike AdelFederico TomazicJohannes HingerlRenou BenteauAnika MaruscykLukas Lange
2020-06-04
Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2
| Virapat KieuvongngamBowen TanYiming Niu
2020-06-03
WikiBERT models: deep transfer learning for many languages
Sampo PyysaloJenna KanervaAntti VirtanenFilip Ginter
2020-06-02
Question Answering on Scholarly Knowledge Graphs
Mohamad Yaser JaradehMarkus StockerSören Auer
2020-06-02
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension
Jie CaiZhengzhou ZhuPing NieQian Liu
2020-06-02
BERT Based Multilingual Machine Comprehension in English and Hindi
| Somil GuptaNilesh Khade
2020-06-02
Exploring Cross-sentence Contexts for Named Entity Recognition with BERT
Jouni LuomaSampo Pyysalo
2020-06-02
Position Masking for Language Models
Andy WagnerTiyasa MitraMrinal IyerGodfrey Da CostaMarc Tremblay
2020-06-02
R\'e-entra\^\iner ou entra\^\iner soi-m\^eme ? Strat\'egies de pr\'e-entra\^\inement de BERT en domaine m\'edical (Re-train or train from scratch ? Pre-training strategies for BERT in the medical domain )
Hicham El Boukkouri
2020-06-01
\'Etude des variations s\'emantiques \`a travers plusieurs dimensions (Studying semantic variations through several dimensions )
Syrielle MontariolAlex Allauzenre
2020-06-01
Qu'apporte BERT \`a l'analyse syntaxique en constituants discontinus ? Une suite de tests pour \'evaluer les pr\'edictions de structures syntaxiques discontinues en anglais (What does BERT contribute to discontinuous constituency parsing ? A test suite to evaluate discontinuous constituency structure predictions in English)
Maximin Coavoux
2020-06-01
Les mod\`eles de langue contextuels Camembert pour le fran\ccais : impact de la taille et de l'h\'et\'erog\'en\'eit\'e des donn\'ees d'entrainement (C AMEM BERT Contextual Language Models for French: Impact of Training Data Size and Heterogeneity )
Louis MartinBenjamin MullerPedro Javier Ortiz Su{\'a}rezYoann DupontLaurent Romary{\'E}ric Villemonte de la ClergerieBeno{\^\i}t SagotDjam{\'e} Seddah
2020-06-01
Introduction d'informations s\'emantiques dans un syst\`eme de reconnaissance de la parole (Despite spectacular advances in recent years, the Automatic Speech Recognition (ASR) systems still make mistakes, especially in noisy environments)
St{\'e}phane LevelIrina IllinaDominique Fohr
2020-06-01
Emergence of Separable Manifolds in Deep Language Representations
Jonathan MamouHang LeMiguel Del RioCory StephensonHanlin TangYoon KimSueYeon Chung
2020-06-01
Conversational Machine Comprehension: a Literature Review
Somil GuptaBhanu Pratap Singh Rawat
2020-06-01
When Bert Forgets How To POS: Amnesic Probing of Linguistic Properties and MLM Predictions
Yanai ElazarShauli RavfogelAlon JacoviYoav Goldberg
2020-06-01
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features
Shi-Yan WengTien-Hong LoBerlin Chen
2020-06-01
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi DaduKartikey PantRadhika Mamidi
2020-06-01
Neural Entity Linking: A Survey of Models based on Deep Learning
| Ozge SevgiliArtem ShelmanovMikhail ArkhipovAlexander PanchenkoChris Biemann
2020-05-31
"Judge me by my size (noun), do you?'' YodaLib: A Demographic-Aware Humor Generation Framework
Aparna GarimellaCarmen BaneaNabil HossainRada Mihalcea
2020-05-31
BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble Learning
Rajaswa PatilSomesh SinghSwati Agarwal
2020-05-31
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
Siddhant MahurkarRajaswa Patil
2020-05-31
Detecting Problem Statements in Peer Assessments
Yunkai XiaoGabriel ZingleQinjin JiaHarsh R. ShahYi ZhangTianyi LiMohsin KarovaliyaWeixiang ZhaoYang SongJie JiAshwin BalasubramaniamHarshit PatelPriyankha BhalasubbramanianVikram PatelEdward F. Gehringer
2020-05-30
First Neural Conjecturing Datasets and Experiments
Josef UrbanJan Jakubův
2020-05-29
Using Large Pretrained Language Models for Answering User Queries from Product Specifications
Kalyani RoySmit ShahNithish PaiJaidam RamtejPrajit Prashant NadkarnJyotirmoy BanerjeePawan GoyalSurender Kumar
2020-05-29
SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions
Mao YeChengyue GongQiang Liu
2020-05-29
A Comparative Study of Lexical Substitution Approaches based on Neural Language Models
Nikolay ArefyevBoris SheludkoAlexander PodolskiyAlexander Panchenko
2020-05-29
Stance Prediction for Contemporary Issues: Data and Experiments
| Marjan HosseiniaEduard DragutArjun Mukherjee
2020-05-29
On Incorporating Structural Information to improve Dialogue Response Generation
| Nikita MoghePriyesh VijayanBalaraman RavindranMitesh M. Khapra
2020-05-28
Language Models are Few-Shot Learners
| Tom B. BrownBenjamin MannNick RyderMelanie SubbiahJared KaplanPrafulla DhariwalArvind NeelakantanPranav ShyamGirish SastryAmanda AskellSandhini AgarwalAriel Herbert-VossGretchen KruegerTom HenighanRewon ChildAditya RameshDaniel M. ZieglerJeffrey WuClemens WinterChristopher HesseMark ChenEric SiglerMateusz LitwinScott GrayBenjamin ChessJack ClarkChristopher BernerSam McCandlishAlec RadfordIlya SutskeverDario Amodei
2020-05-28
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Adhiguna KuncoroLingpeng KongDaniel FriedDani YogatamaLaura RimellChris DyerPhil Blunsom
2020-05-27
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Amir FederNadav OvedUri ShalitRoi Reichart
2020-05-27
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fernández-GonzálezCarlos Gómez-Rodríguez
2020-05-27
Language Representation Models for Fine-Grained Sentiment Classification
Brian CheangBailey WeiDavid KoganHowey QiuMasud Ahmed
2020-05-27
Network Fusion for Content Creation with Conditional INNs
Robin RombachPatrick EsserBjörn Ommer
2020-05-27
A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction
Saadullah AminKatherine Ann DunfieldAnna VechkaevaGünter Neumann
2020-05-26
What Are People Asking About COVID-19? A Question Classification Dataset
| Jerry WeiChengyu HuangSoroush VosoughiJason Wei
2020-05-26
ParsBERT: Transformer-based Model for Persian Language Understanding
| Mehrdad FarahaniMohammad GharachorlooMarzieh FarahaniMohammad Manthouri
2020-05-26
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection
| Jihyung MoonWon Ik ChoJunbum Lee
2020-05-26
Comparing BERT against traditional machine learning text classification
Santiago González-CarvajalEduardo C. Garrido-Merchán
2020-05-26
BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining
Zachariah ZhangJingshu LiuNarges Razavian
2020-05-26
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Chia-Chih KuoShang-Bao LuoKuan-Yu Chen
2020-05-25
Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
| Daniel HershcovichMiryam de LhoneuxArtur KulmizevElham PejhanJoakim Nivre
2020-05-25
Pointwise Paraphrase Appraisal is Potentially Problematic
Hannah ChenYangfeng JiDavid Evans
2020-05-25
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
| Chen LiuSu ZhuZijian ZhaoRuisheng CaoLu ChenKai Yu
2020-05-24
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree PatelParam RavalRatnam ParikhYesha Shastri
2020-05-22
L2R2: Leveraging Ranking for Abductive Reasoning
| Yunchang ZhuLiang PangYanyan LanXueqi Cheng
2020-05-22
Living Machines: A study of atypical animacy
Mariona Coll ArdanuyFederico NanniKaspar BeelenKasra HosseiniRuth AhnertJon LawrenceKatherine McDonoughGiorgia TolfoDaniel CS WilsonBarbara McGillivray
2020-05-22
Robust Layout-aware IE for Visually Rich Documents with Pre-trained Language Models
Mengxi WeiYifan HeQiong Zhang
2020-05-22
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
Laila RasmyYang XiangZiqian XieCui TaoDegui Zhi
2020-05-22
Text-to-Text Pre-Training for Data-to-Text Tasks
| Mihir Kale
2020-05-21
BERTweet: A pre-trained language model for English Tweets
| Dat Quoc NguyenThanh VuAnh Tuan Nguyen
2020-05-20
Creative Artificial Intelligence -- Algorithms vs. humans in an incentivized writing competition
Nils KöbisLuca Mossink
2020-05-20
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
Dehong GaoLinbo JinBen ChenMinghui QiuPeng LiYi WeiYi HuHao Wang
2020-05-20
Cross-lingual Transfer Learning for Dialogue Act Recognition
Jiří MartínekChristophe CerisaraPavel KrálLadislav Lenc
2020-05-19
Table Search Using a Deep Contextualized Language Model
| Zhiyu ChenMohamed TrabelsiJeff HeflinYinan XuBrian D. Davison
2020-05-19
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu LinYanwei FuYu-Gang JiangXiangyang Xue
2020-05-19
Are All Languages Created Equal in Multilingual BERT?
Shijie WuMark Dredze
2020-05-18
Context-Based Quotation Recommendation
Ansel MacLaughlinTao ChenBurcu Karagol AyanDan Roth
2020-05-17
Support-BERT: Predicting Quality of Question-Answer Pairs in MSDN using Deep Bidirectional Transformer
Bhaskar SenNikhil GopalXinwei Xue
2020-05-17
Building a Hebrew Semantic Role Labeling Lexical Resource from Parallel Movie Subtitles
Ben EyalMichael Elhadad
2020-05-17
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
| Juntao LiChang LiuJian WangLidong BingHongsong LiXiaozhong LiuDongyan ZhaoRui Yan
2020-05-17
Adversarial Training for Commonsense Inference
Lis PereiraXiaodong LiuFei ChengMasayuki AsaharaIchiro Kobayashi
2020-05-17
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
| Pengcheng YinGraham NeubigWen-tau YihSebastian Riedel
2020-05-17
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao FangSicheng WangMeng ZhouJiayuan DingPengtao Xie
2020-05-16
Leveraging Affective Bidirectional Transformers for Offensive Language Detection
AbdelRahim ElmadanyChiyu ZhangMuhammad Abdul-MageedAzadeh Hashemi
2020-05-16
Spelling Error Correction with Soft-Masked BERT
| Shaohua ZhangHaoran HuangJicong LiuHang Li
2020-05-15
Neural Entity Linking on Technical Service Tickets
Nadja KurzFelix HamannAdrian Ulges
2020-05-15
Challenges in Emotion Style Transfer: An Exploration with a Lexical Substitution Pipeline
David HelbigEnrica TroianoRoman Klinger
2020-05-15
[email protected] at SemEval-2020 Task 12: Identifying Multilingual Offensive Tweets Using Weighted Ensemble and Fine-Tuned BERT
Saja Khaled TawalbehMahmoud HammadMohammad AL-Smadi
2020-05-15
NIT-Agartala-NLP-Team at SemEval-2020 Task 8: Building Multimodal Classifiers to tackle Internet Humor
Steve Durairaj SwamyShubham LaddhaBasil AbdussalamDebayan DattaAnupam Jamatia
2020-05-14
A pre-training technique to localize medical BERT and enhance BioBERT
| Shoya WadaToshihiro TakedaShiro ManabeShozo KonishiJun KamoharaYasushi Matsumura
2020-05-14
Parallel Corpus Filtering via Pre-trained Language Models
Boliang ZhangAjay NageshKevin Knight
2020-05-13
Large Scale Multi-Actor Generative Dialog Modeling
Alex BoydRaul PuriMohammad ShoeybiMostofa PatwaryBryan Catanzaro
2020-05-13
Entity-Enriched Neural Models for Clinical Question Answering
| Bhanu Pratap Singh RawatWei-Hung WengPreethi RaghavanPeter Szolovits
2020-05-13
On the Robustness of Language Encoders against Grammatical Errors
Fan YinQuanyu LongTao MengKai-Wei Chang
2020-05-12
On the Generation of Medical Dialogues for COVID-19
| Wenmian YangGuangtao ZengBowen TanZeqian JuSubrato ChakravortyXuehai HeShu ChenXingyi YangQingyang WuZhou YuEric XingPengtao Xie
2020-05-11
Detecting Adverse Drug Reactions from Twitter through Domain-Specific Preprocessing and BERT Ensembling
Amy BredenLee Moore
2020-05-11
How Context Affects Language Models' Factual Predictions
Fabio PetroniPatrick LewisAleksandra PiktusTim RocktäschelYuxiang WuAlexander H. MillerSebastian Riedel
2020-05-10
Transformer Based Language Models for Similar Text Retrieval and Ranking
Javed Qadrud-DinAshraf Bah RabiouRyan WalkerRavi SoniMartin GajekGabriel PackAkhil Rangaraj
2020-05-10
Finding Universal Grammatical Relations in Multilingual BERT
Ethan A. ChiJohn HewittChristopher D. Manning
2020-05-09
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
| Samson TanShafiq JotyMin-Yen KanRichard Socher
2020-05-09
LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation
Gustavo AguilarSudipta KarThamar Solorio
2020-05-09
schuBERT: Optimizing Elements of BERT
Ashish KhetanZohar Karnin
2020-05-09
SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics
| Da YinTao MengKai-Wei Chang
2020-05-08
Distilling Knowledge from Pre-trained Language Models via Text Smoothing
Xing WuYibing LiuXiangyang ZhouDianhai Yu
2020-05-08
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi ZadehAndreas Moshovos
2020-05-08
Temporal Common Sense Acquisition with Minimal Supervision
Ben ZhouQiang NingDaniel KhashabiDan Roth
2020-05-08
Comparative Analysis of Text Classification Approaches in Electronic Health Records
Aurelie MascioZeljko KraljevicDaniel BeanRichard DobsonRobert StewartRebecca BendayanAngus Roberts
2020-05-08
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
| Marco Tulio RibeiroTongshuang WuCarlos GuestrinSameer Singh
2020-05-08
LIIR at SemEval-2020 Task 12: A Cross-Lingual Augmentation Approach for Multilingual Offensive Language Identification
Erfan GhaderyMarie-Francine Moens
2020-05-07
Harvesting and Refining Question-Answer Pairs for Unsupervised QA
| Zhongli LiWenhui WangLi DongFuru WeiKe Xu
2020-05-06
An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining
Yifan PengQingyu ChenZhiyong Lu
2020-05-06
Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics
Guy Emerson
2020-05-06
Categorical Vector Space Semantics for Lambek Calculus with a Relevant Modality
Lachlan McPheatMehrnoosh SadrzadehHadi WazniGijs Wijnholds
2020-05-06
MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models
| Mandy GuoYinfei YangDaniel CerQinlan ShenNoah Constant
2020-05-05
Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Brendan KennedyXisen JinAida Mostafazadeh DavaniMorteza DehghaniXiang Ren
2020-05-05
Establishing Baselines for Text Classification in Low-Resource Languages
| Jan Christian Blaise CruzCharibeth Cheng
2020-05-05
ExpBERT: Representation Engineering with Natural Language Explanations
| Shikhar MurtyPang Wei KohPercy Liang
2020-05-05
ImpactCite: An XLNet-based method for Citation Impact Analysis
Dominique MercierSyed Tahseen Raza RizviVikas RajashekarAndreas DengelSheraz Ahmed
2020-05-05
Distributional Discrepancy: A Metric for Unconditional Text Generation
| Ping CaiXingyuan ChenPeng JinHongjun WangTianrui Li
2020-05-04
Robust Encodings: A Framework for Combating Adversarial Typos
Erik JonesRobin JiaAditi RaghunathanPercy Liang
2020-05-04
Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering
| Vikas YadavSteven BethardMihai Surdeanu
2020-05-04
Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words
Josef KlafkaAllyson Ettinger
2020-05-04
Code and Named Entity Recognition in StackOverflow
| Jeniya TabassumMounica MaddelaWei XuAlan Ritter
2020-05-04
Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction
| Masahiro KanekoMasato MitaShun KiyonoJun SuzukiKentaro Inui
2020-05-03
Transformer-based End-to-End Question Generation
| Luis Enrico LopezDiane Kathryn CruzJan Christian Blaise CruzCharibeth Cheng
2020-05-03
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA
Nora KassnerHinrich Schütze
2020-05-02
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
| Qingqing CaoHarsh TrivediAruna BalasubramanianNiranjan Balasubramanian
2020-05-02
Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-trained Language Models
Bill Yuchen LinSeyeon LeeRahul KhannaXiang Ren
2020-05-02
Generating Derivational Morphology with BERT
Valentin HofmannJanet B. Pierrehumbert