Weight Decay

Weight Decay, or $L_{2}$ Regularization, is a regularization technique applied to the weights of a neural network. We minimize a loss function compromising both the primary loss function and a penalty on the $L_{2}$ Norm of the weights:

$$L_{new}\left(w\right) = L_{original}\left(w\right) + \lambda{w^{T}w}$$

where $\lambda$ is a value determining the strength of the penalty (encouraging smaller weights).

Weight decay can be incorporated directly into the weight update rule, rather than just implicitly by defining it through to objective function. Often weight decay refers to the implementation where we specify it directly in the weight update rule (whereas L2 regularization is usually the implementation which is specified in the objective function).

Image Source: Deep Learning, Goodfellow et al

Latest Papers

PAPER DATE
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification
| Abdullatif KöksalArzucan Özgür
2020-10-19
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
Wissam SibliniMohamed ChallalCharlotte Pasqual
2020-10-16
It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT
Hila GonenShauli RavfogelYanai ElazarYoav Goldberg
2020-10-16
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge XueBowen YuZhenyu ZhangTingwen LiuYue ZhangBin Wang
2020-10-16
Neural Deepfake Detection with Factual Structure of Text
Wanjun ZhongDuyu TangZenan XuRuize WangNan DuanMing ZhouJiahai WangJian Yin
2020-10-15
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Zhengxuan WuDesmond C. Ong
2020-10-15
Does Chinese BERT Encode Word Structure?
| Yile WangLeyang CuiYue Zhang
2020-10-15
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Phillip KeungJulian SalazarYichao LuNoah A. Smith
2020-10-15
Response Selection for Multi-Party Conversations withDynamic Topic Tracking
Weishi Wang§Shafiq Joty§Steven C. H. Hoi
2020-10-15
DA-Transformer: Distance-aware Transformer
Chuhan WuFangzhao WuYongfeng Huang
2020-10-14
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models
Zihan ZhaoYuncong LiuLu ChenQi LiuRao MaKai Yu
2020-10-14
Geometry matters: Exploring language examples at the decision boundary
Debajyoti DattaShashwat KumarLaura BarnesTom Fletcher
2020-10-14
Decoding Methods for Neural Narrative Generation
| Alexandra DeLuciaAaron MuellerXiang Lisa LiJoão Sedoc
2020-10-14
No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection
| Debanjana KarMohit BhardwajSuranjana SamantaAmar Prakash Azad
2020-10-14
Probing for Multilingual Numerical Understanding in Transformer-Based Language Models
| Devin JohnsonDenise MakDrew BarkerLexi Loessberg-Zahl
2020-10-13
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
| Jianquan LiXiaokang LiuHonghong ZhaoRuifeng XuMin YangYaohong Jin
2020-10-13
Incorporating BERT into Parallel Sequence Decoding with Adapters
| Junliang GuoZhirui ZhangLinli XuHao-Ran WeiBoxing ChenEnhong Chen
2020-10-13
Improving Text Generation Evaluation with Batch Centering and Tempered Word Mover Distance
Xi ChenNan DingTomer LevinboimRadu Soricut
2020-10-13
The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation
Shai GretzYonatan BiluEdo Cohen-KarlikNoam Slonim
2020-10-13
CAPT: Contrastive Pre-Training for LearningDenoised Sequence Representations
Fuli LuoPengcheng YangShicheng LiXuancheng RenXu sun
2020-10-13
Aspect-based Document Similarity for Research Papers
| Malte OstendorffTerry RuasTill BlumeBela GippGeorg Rehm
2020-10-13
Multilingual Argument Mining: Datasets and Analysis
Orith Toledo-RonenMatan OrbachYonatan BiluArtem SpectorNoam Slonim
2020-10-13
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy LinRodrigo NogueiraAndrew Yates
2020-10-13
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. HwangChandra BhagavatulaRonan Le BrasJeff DaKeisuke SakaguchiAntoine BosselutYejin Choi
2020-10-12
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. BirdAnikó EkártDiego R. Faria
2020-10-12
Zero-shot Entity Linking with Efficient Long Range Sequence Modeling
| Zonghai YaoLiangliang CaoHuapu Pan
2020-10-12
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana KarSuranjana SamantaAmar Prakash Azad
2020-10-12
Counterfactual Variable Control for Robust and Interpretable Question Answering
| Sicheng YuYulei NiuShuohang WangJing JiangQianru Sun
2020-10-12
Improving Compositional Generalization in Semantic Parsing
| Inbar OrenJonathan HerzigNitish GuptaMatt GardnerJonathan Berant
2020-10-12
HUJI-KU at MRP~2020: Two Transition-based Neural Parsers
Ofir ArvivRuixiang CuiDaniel Hershcovich
2020-10-12
Probing Pretrained Language Models for Lexical Semantics
Ivan VulićEdoardo Maria PontiRobert LitschkoGoran GlavašAnna Korhonen
2020-10-12
EFSG: Evolutionary Fooling Sentences Generator
Marco Di GiovanniMarco Brambilla
2020-10-12
Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations
Nikolaos ManginasIlias ChalkidisProdromos Malakasiotis
2020-10-12
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks
| Steffen EgerYannik Benz
2020-10-12
Load What You Need: Smaller Versions of Multilingual BERT
| Amine AbdaouiCamille PradelGrégoire Sigel
2020-10-12
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
| Alex WarstadtYian ZhangHaau-Sing LiHaokun LiuSamuel R. Bowman
2020-10-11
Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only
Ziyi LiuGiannis KaramanolakisDaniel HsuLuis Gravano
2020-10-11
Connecting the Dots Between Fact Verification and Fake News Detection
Qifei LiWangchunshu Zhou
2020-10-11
Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
Shauli RavfogelYanai ElazarJacob GoldbergerYoav Goldberg
2020-10-11
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
| Brielen MadureiraDavid Schlangen
2020-10-11
Data Agnostic RoBERTa-based Natural Language to SQL Query Generation
| Debaditya PalHarsh SharmaKaustubh Chaudhari
2020-10-11
SMYRF: Efficient Attention using Asymmetric Clustering
| Giannis DarasNikita KitaevAugustus OdenaAlexandros G. Dimakis
2020-10-11
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougue BiyongBo wangTerry LyonsAlejo J Nevado-Holgado
2020-10-10
Tag Recommendation for Online Q&A Communities based on BERT Pre-Training Technique
Navid KhezrianJafar HabibiIssa Annamoradnejad
2020-10-10
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings
Prafull PrakashSaurabh Kumar ShashidharWenlong ZhaoSubendhu RongaliHaidar KhanMichael Kayser
2020-10-10
Automated Concatenation of Embeddings for Structured Prediction
| Xinyu WangYong JiangNguyen BachTao WangZhongqiang HuangFei HuangKewei Tu
2020-10-10
Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training
| Xinyu WangKewei Tu
2020-10-10
Hindsight Experience Replay with Kronecker Product Approximate Curvature
Dhuruva Priyan G MAbhik SinglaShalabh Bhatnagar
2020-10-09
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin CaoJun WangWael HamzaKelly VaneeShang-Wen Li
2020-10-09
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
| João A. LeiteDiego F. SilvaKalina BontchevaCarolina Scarton
2020-10-09
Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction
Zhen WuChengcan YingFei ZhaoZhifang FanXinyu DaiRui Xia
2020-10-09
NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training
| Priyanshu KumarAadarsh Singh
2020-10-09
Deep Learning Meets Projective Clustering
Alaa MaaloufHarry LangDaniela RusDan Feldman
2020-10-08
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models
Gregory SenayEmmanuelle Salin
2020-10-08
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui HuangHong-Kwang KuoSamuel ThomasZvi KonsKartik AudhkhasiBrian KingsburyRon HooryMichael Picheny
2020-10-08
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
| Yun HeZhuoer WangYin ZhangRuihong HuangJames Caverlee
2020-10-08
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
| Yun HeZiwei ZhuYin ZhangQin ChenJames Caverlee
2020-10-08
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference
| Xiaoan DingTianyu LiuBaobao ChangZhifang SuiKevin Gimpel
2020-10-08
Improving Attention Mechanism with Query-Value Interaction
Chuhan WuFangzhao WuTao QiYongfeng Huang
2020-10-08
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding
Dechuang TengLibo QinWanxiang CheSendong ZhaoTing Liu
2020-10-08
Automatic generation of reviews of scientific papers
| Anna NikiforovskayaNikolai KapralovAnna VlasovaOleg ShpynovAleksei Shpilman
2020-10-08
Optimizing Transformers with Approximate Computing for Faster, Smaller and more Accurate NLP Models
Amrit NagarajanSanchari SenJacob R. StevensAnand Raghunathan
2020-10-07
Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets
Mihaela GamanRadu Tudor Ionescu
2020-10-07
Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank
| Eleftheria BriakouMarine Carpuat
2020-10-07
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling
Jiecao ChenLiu YangKarthik RamanMichael BenderskyJung-Jung YehYun ZhouMarc NajorkDanyang CaiEhsan Emadzadeh
2020-10-07
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max GlocknerIvan HabernalIryna Gurevych
2020-10-07
ELMo and BERT in semantic change detection for Russian
Julia RodinaYuliya TrofimovaAndrey KutuzovEkaterina Artemova
2020-10-07
Investigating African-American Vernacular English in Transformer-Based Text Generation
Sophie GroenwoldLily OuAesha ParekhSamhita HonnavalliSharon LevyDiba MirzaWilliam Yang Wang
2020-10-06
Do Explicit Alignments Robustly Improve Multilingual Encoders?
Shijie WuMark Dredze
2020-10-06
LEGAL-BERT: The Muppets straight out of Law School
Ilias ChalkidisManos FergadiotisProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-06
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher
| Giannis KaramanolakisDaniel HsuLuis Gravano
2020-10-06
The Multilingual Amazon Reviews Corpus
Phillip KeungYichao LuGyörgy SzarvasNoah A. Smith
2020-10-06
Scene Graph Modification Based on Natural Language Commands
| Xuanli HeQuan Hung TranGholamreza HaffariWalter ChangTrung BuiZhe LinFranck DernoncourtNhan Dam
2020-10-06
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers
| Marius MosbachAnna KhokhlovaMichael A. HedderichDietrich Klakow
2020-10-06
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
| Sebastian HofstätterSophia AlthammerMichael SchröderMete SertkanAllan Hanbury
2020-10-06
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng ChenChia-Jung Lee
2020-10-06
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder
Alvin ChanYi TayYew-Soon OngAston Zhang
2020-10-06
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations
| Aina Garí SolerMarianna Apidianaki
2020-10-06
Analyzing Individual Neurons in Pre-trained Language Models
Nadir DurraniHassan SajjadFahim DalviYonatan Belinkov
2020-10-06
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
| Minki KangMoonsu HanSung Ju Hwang
2020-10-06
Intrinsic Probing through Dimension Selection
Lucas Torroba HennigenAdina WilliamsRyan Cotterell
2020-10-06
Reconciling Modern Deep Learning with Traditional Optimization Analyses: The Intrinsic Learning Rate
Zhiyuan LiKaifeng LyuSanjeev Arora
2020-10-06
Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming
Kanishka MisraAllyson EttingerJulia Taylor Rayz
2020-10-06
PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation
Xinyu HuaLu Wang
2020-10-05
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Boxin WangShuohang WangYu ChengZhe GanRuoxi JiaBo LiJingjing Liu
2020-10-05
Mixup-Transfomer: Dynamic Data Augmentation for NLP Tasks
Lichao SunCongying XiaWenpeng YinTingTing LiangPhilip S. YuLifang He
2020-10-05
Self-training Improves Pre-training for Natural Language Understanding
Jingfei DuEdouard GraveBeliz GunelVishrav ChaudharyOnur CelebiMichael AuliVes StoyanovAlexis Conneau
2020-10-05
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne LongpreYu WangChristopher DuBois
2020-10-05
Improving AMR Parsing with Sequence-to-Sequence Pre-training
| Dongqin XuJunhui LiMuhua ZhuMin ZhangGuodong Zhou
2020-10-05
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning
Hanlu WuTengfei MaLingfei WuTariro ManyumwaShouling Ji
2020-10-05
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
Zi LinJeremiah Zhe LiuZi YangNan HuaDan Roth
2020-10-05
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. FengVarun GangalDongyeop KangTeruko MitamuraEduard Hovy
2020-10-05
PMI-Masking: Principled masking of correlated spans
Yoav LevineBarak LenzOpher LieberOmri AbendKevin Leyton-BrownMoshe TennenholtzYoav Shoham
2020-10-05
Linguistic Profiling of a Neural Language Model
Alessio MiaschiDominique BrunatoFelice Dell'OrlettaGiulia Venturi
2020-10-05
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition
Piotr JaniszewskiMateusz SkibaUrszula Walińska
2020-10-05
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
Angel DazaAnette Frank
2020-10-05
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias ChalkidisManos FergadiotisSotiris KotitsasProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-04
Inquisitive Question Generation for High Level Text Comprehension
Wei-Jen KoTe-Yuan ChenYiyan HuangGreg DurrettJunyi Jessy Li
2020-10-04
On Losses for Modern Language Models
Stephane Aroca-OuelletteFrank Rudzicz
2020-10-04
Mining Knowledge for Natural Language Inference from Wikipedia Categories
Mingda ChenZewei ChuKarl StratosKevin Gimpel
2020-10-03
Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles
Amirmohammad KazameiniSamin FatehiYash MehtaSauleh EetemadiErik Cambria
2020-10-03
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media
Xiang DaiSarvnaz KarimiBen HacheyCecile Paris
2020-10-02
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
2020-10-02
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
| Andreas RückléJonas PfeifferIryna Gurevych
2020-10-02
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya YamadaAkari AsaiHiroyuki ShindoHideaki TakedaYuji Matsumoto
2020-10-02
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan ShvartzshnaiderAnanth BalashankarVikas PatidarThomas WiesLakshminarayanan Subramanian
2020-10-01
Examining the rhetorical capacities of neural language models
Zining ZhuChuer PanMohamed AbdallaFrank Rudzicz
2020-10-01
Evaluating Multilingual BERT for Estonian
Claudia KittaskKirill MilintsevichKairit Sirts
2020-10-01
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Michael BenderskyHonglei ZhuangJi MaShuguang HanKeith HallRyan Mcdonald
2020-10-01
Understanding tables with intermediate pre-training
| Julian Martin EisenschlosSyrine KrichineThomas Müller
2020-10-01
Detecting White Supremacist Hate Speech using Domain Specific Word Embedding with Deep Learning and BERT
Hind Saleh AlatawiAreej Maatog AlhothaliKawthar Mustafa Moria
2020-10-01
CoLAKE: Contextualized Language and Knowledge Embedding
| Tianxiang SunYunfan ShaoXipeng QiuQipeng GuoYaru HuXuanjing HuangZheng Zhang
2020-10-01
Bag of Tricks for Adversarial Training
| Tianyu PangXiao YangYinpeng DongHang SuJun Zhu
2020-10-01
AUBER: Automated BERT Regularization
Hyun Dong LeeSeongmin LeeU Kang
2020-09-30
BERT for Monolingual and Cross-Lingual Reverse Dictionary
| Hang YanXiaonan LiXipeng Qiu
2020-09-30
A Tale of Two Linkings: Dynamically Gating between Schema Linking and Structural Linking for Text-to-SQL Parsing
| Sanxing ChenAidan SanXiaodong LiuYangfeng Ji
2020-09-30
Gender prediction using limited Twitter Data
Maaike BurghoornMaaike H. T. de BoerStephan Raaijmakers
2020-09-29
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions
| Peter A. Jansen
2020-09-29
TEST_POSITIVE at W-NUT 2020 Shared Task-3: Joint Event Multi-task Learning for Slot Filling in Noisy Text
Chacha ChenChieh-Yang HuangYaqi HouYang ShiEnyan DaiJiaqi Wang
2020-09-29
Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study
Saurabh KulshreshthaJosé Luis Redondo-GarcíaChing-Yun Chang
2020-09-29
MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension
Huaishao LuoYu ShiMing GongLinjun ShouTianrui Li
2020-09-29
The design and implementation of Language Learning Chatbot with XAI using Ontology and Transfer Learning
Nuobei ShiQin ZengRaymond Lee
2020-09-29
Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation
Yinfei YangNing JinKuo LinMandy GuoDaniel Cer
2020-09-29
HINT3: Raising the bar for Intent Detection in the Wild
Gaurav AroraChirag JainManas ChaturvediKrupal Modi
2020-09-29
Contrastive Distillation on Intermediate Representations for Language Model Compression
Siqi SunZhe GanYu ChengYuwei FangShuohang WangJingjing Liu
2020-09-29
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Shikib MehriMihail EricDilek Hakkani-Tur
2020-09-28
Fancy Man Lauches Zippo at WNUT 2020 Shared Task-1: A Bert Case Model for Wet Lab Entity Extraction
Haoding MengQingcheng ZengXiaoyang FangZhexin Liang
2020-09-28
A Simple and Efficient Ensemble Classifier Combining Multiple Neural Network Models on Social Media Datasets in Vietnamese
Huy Duc HuynhHang Thi-Thuy DoKiet Van NguyenNgan Luu-Thuy Nguyen
2020-09-28
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Joo Seong JeongSoojeong KimGyeong-In YuYunseong LeeByung-Gon Chun
2020-09-28
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan ZhangXiubo GengTao QinYunfang WuDaxin Jiang
2020-09-28
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding
Peilin ZhouZhiqi HuangFenglin LiuYuexian Zou
2020-09-28
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei ZhangLu HouYichun YinLifeng ShangXiao ChenXin JiangQun Liu
2020-09-27
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents
Chejui LiaoTabish ManiarSravanajyothi NAnantha Sharma
2020-09-26
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
| Yifan DingNicholas BotzerTim Weninger
2020-09-25
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
Jean-Philippe CorbeilHadi Abdi Ghadivel
2020-09-25
An Unsupervised Sentence Embedding Method byMutual Information Maximization
Yan ZhangRuidan HeZuozhu LiuKwan Hui LimLidong Bing
2020-09-25
A little goes a long way: Improving toxic language classification despite data scarcity
Mika JuutiTommi GröndahlAdrian FlanaganN. Asokan
2020-09-25
A Comparative Study of Feature Types for Age-Based Text Classification
| Anna GlazkovaYury EgorovMaksim Glazkov
2020-09-24
Toward a Thermodynamics of Meaning
Jonathan Scott Enderle
2020-09-24
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and Generation
Huishuang TianKexin YangDayiheng LiuJiancheng Lv
2020-09-24
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences
| Boon Peng YapAndrew Koh Jin JieEng Siong Chng
2020-09-24
Pruning Convolutional Filters using Batch Bridgeout
Najeeb KhanIan Stavness
2020-09-23
A Token-wise CNN-based Method for Sentence Compression
Weiwei HouHanna SuominenPiotr KoniuszSabrina CaldwellTom Gedeon
2020-09-23
On Data Augmentation for Extreme Multi-label Classification
Danqing ZhangTao LiHaiyang ZhangBing Yin
2020-09-22
AutoRC: Improving BERT Based Relation Classification Models via Architecture Search
Wei ZhuXiaoling WangXipeng QiuYuan NiGuotong Xie
2020-09-22
GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis
| Huaishao LuoLei JiTianrui LiNan DuanDaxin Jiang
2020-09-22
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Chris J. KennedyGeoff BaconAlexander SahnClaudia von Vacano
2020-09-22
"When they say weed causes depression, but it's your fav antidepressant": Knowledge-aware Attention Framework for Relationship Extraction
Shweta YadavUsha LokalaRaminta DaniulaityteKrishnaprasad ThirunarayanFrancois LamyAmit Sheth
2020-09-21
Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias
Mufan SangWei XiaJohn H. L. Hansen
2020-09-21
Profile Consistency Identification for Open-domain Dialogue Agents
Haoyu SongYan WangWei-Nan ZhangZhengyu ZhaoTing LiuXiaojiang Liu
2020-09-21
Latin BERT: A Contextual Language Model for Classical Philology
David BammanPatrick J. Burns
2020-09-21
Dual-path CNN with Max Gated block for Text-Based Person Re-identification
Tinghuai MaMingming YangHuan RongYurong QianYurong QianYuan TianNajlaAl-Nabhan
2020-09-20
Longformer for MS MARCO Document Re-ranking Task
| Ivan SekulićAmir SoleimaniMohammad AliannejadiFabio Crestani
2020-09-20
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
| Ehsan DoostmohammadiMinoo NassajianAdel Rahimi
2020-09-20
VirtualFlow: Decoupling Deep Learning Model Execution from Underlying Hardware
Andrew OrHaoyu ZhangMichael J. Freedman
2020-09-20
Prior Art Search and Reranking for Generated Patent Text
Jieh-Sheng LeeJieh Hsiang
2020-09-19
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan PilaultAmine ElhattamiChristopher Pal
2020-09-19
Nominal Compound Chain Extraction: A New Task for Semantic-enriched Lexical Chain
Bobo LiHao FeiYafeng RenDonghong Ji
2020-09-19
Will it Unblend?
Yuval PinterCassandra L. JacobsJacob Eisenstein
2020-09-18
The birth of Romanian BERT
Stefan Daniel DumitrescuAndrei-Marius AvramSampo Pyysalo
2020-09-18
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon RohHuiseong GimSoo-Young Lee
2020-09-18
fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP
Zhichao GengHang YanXipeng QiuXuanjing Huang
2020-09-18
NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative
Kumud Chauhan
2020-09-18
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation
Po LiLei LiYan FuJun RongYu Zhang
2020-09-17
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing LiZhenglun KongTianyun ZhangJi LiZhengang LiHang LiuCaiwen Ding
2020-09-17
Knowledge-Assisted Deep Reinforcement Learning in 5G Scheduler Design: From Theoretical Framework to Implementation
Zhouyou GuChangyang SheWibowo HardjawanaSimon LumbDavid McKechnieTodd EsseryBranka Vucetic
2020-09-17
Multi^2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERT
Youngbin RoYukyung LeePilsung Kang
2020-09-17
DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction
| Aadarsh SinghPriyanshu KumarAman Sinha
2020-09-17
Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA
Ieva StaliūnaitėIgnacio Iacobacci
2020-09-17
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib AfridiAftab AlamMuhammad Numan KhanJawad KhanYoung-Koo Lee
2020-09-17
Solomon at SemEval-2020 Task 11: Ensemble Architecture for Fine-Tuned Propaganda Detection in News Articles
Mayank RajAjay JaiswalRohit R. RAnkita GuptaSudeep Kumar SahooVertika SrivastavaYeon Hyang Kim
2020-09-16
Simplified TinyBERT: Knowledge Distillation for Document Retrieval
Xuanang ChenBen HeKai HuiLe SunYingfei Sun
2020-09-16
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
| Jian GuanMinlie Huang
2020-09-16
Deep Learning Approaches for Extracting Adverse Events and Indications of Dietary Supplements from Clinical Text
Yadan FanSicheng ZhouYifan LiRui Zhang
2020-09-16
DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT
SungMin YangSoYeop YooOkRan Jeong
2020-09-15
Augmented Natural Language for Generative Sequence Labeling
Ben AthiwaratkunCicero Nogueira dos SantosJason KroneBing Xiang
2020-09-15
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
| Xiang GaoYizhe ZhangMichel GalleyChris BrockettBill Dolan
2020-09-15
Critical Thinking for Language Models
Gregor Betz
2020-09-15
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffieAlex Newhouse
2020-09-15
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Wei NiuZhenglun KongGeng YuanWeiwen JiangJiexiong GuanCaiwen DingPu ZhaoSijia LiuBin RenYanzhi Wang
2020-09-15
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis ClouatrePhilippe TrempeAmal ZouaqSarath Chandar
2020-09-15
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
| Timo SchickHinrich Schütze
2020-09-15
Event Presence Prediction Helps Trigger Detection Across Languages
Parul AwasthyTahira NaseemJian NiTaesun MoonRadu Florian
2020-09-15
Lessons Learned from Applying off-the-shelf BERT: There is no SilverBullet
Victor MakarenkovLior Rokach
2020-09-15
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi ZhengKai HuiBen HeXianpei HanLe SunAndrew Yates
2020-09-15
Efficient Transformers: A Survey
Yi TayMostafa DehghaniDara BahriDonald Metzler
2020-09-14
Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
Longxiang LiuZhuosheng ZhangHai ZhaoXi ZhouXiang Zhou
2020-09-14
GeDi: Generative Discriminator Guided Sequence Generation
Ben KrauseAkhilesh Deepak GotmareBryan McCannNitish Shirish KeskarShafiq JotyRichard SocherNazneen Fatema Rajani
2020-09-14
Can Fine-tuning Pre-trained Models Lead to Perfect NLP? A Study of the Generalizability of Relation Extraction
Ningyu ZhangLuoqiu LiShumin DengHaiyang YuXu ChengWei ZhangHuajun Chen
2020-09-14
Beyond Accuracy: ROI-driven Data Analytics of Empirical Data
Gouri DeshpandeGuenther Ruhe
2020-09-14
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang WangLuowei ZhouZhe GanYen-Chun ChenYuwei FangSiqi SunYu ChengJingjing Liu
2020-09-13
BoostingBERT:Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen HuangQingyun SheJunlin Zhang
2020-09-13
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash BabuRajagopal Eswari
2020-09-12
Country Image in COVID-19 Pandemic: A Case Study of China
Huimin ChenZeyu ZhuFanchao QiYining YeZhiyuan LiuMaosong SunJianbin Jin
2020-09-12
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua ChenHuyen Nguyen
2020-09-12
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
Murad TukanAlaa MaaloufMatan WekslerDan Feldman
2020-09-11
Unit Test Case Generation with Transformers
Michele TufanoDawn DrainAlexey SvyatkovskiyShao Kun DengNeel Sundaresan
2020-09-11
UPB at SemEval-2020 Task 6: Pretrained Language Models for DefinitionExtraction
Andrei-Marius AvramDumitru-Clementin CercelCostin-Gabriel Chiru
2020-09-11
UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT
Andrei ParaschivDumitru-Clementin CercelMihai Dascalu
2020-09-11
A Comparison of LSTM and BERT for Small Corpus
Aysu Ezen-Can
2020-09-11
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas AffolterBeni EgressyDamian PascualRoger Wattenhofer
2020-09-10
Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection
| Taesun WhangDongyub LeeDongsuk OhChanhee LeeKijong HanDong-hun LeeSaebyeok Lee
2020-09-10
Modern Methods for Text Generation
| Dimas Munoz Montesinos
2020-09-10
Investigating Gender Bias in BERT
Rishabh BhardwajNavonil MajumderSoujanya Poria
2020-09-10
Pay Attention when Required
Swetha MandavaSzymon MigaczAlex Fit Florea
2020-09-09
Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability
Mayank ChhipaHrushikesh Mahesh VazurkarAbhijeet KumarMridul Mishra
2020-09-09
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
Zhengjie HuangShikun FengWeiyue SuXuyi ChenShuohuan WangJiaxiang LiuXuan OuyangYu Sun
2020-09-08
Improving Language Generation with Sentence Coherence Objective
Ruixiao SunJie YangMehrdad Yousefzadeh
2020-09-07
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing
Josh KalinMatthew CiolinoDavid NoeverGerry Dozier
2020-09-07
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui ZhangZixuan YuanYanchi LiuFuzhen ZhuangHui Xiong
2020-09-07
Measuring Massive Multitask Language Understanding
Dan HendrycksCollin BurnsSteven BasartAndy ZouMantas MazeikaDawn SongJacob Steinhardt
2020-09-07
EdinburghNLP at WNUT-2020 Task 2: Leveraging Transformers with Generalized Augmentation for Identifying Informativeness in COVID-19 Tweets
Nickil Maveli
2020-09-06
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
2020-09-06
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan WilliamsPaul RodriguesValerie Novak
2020-09-05
Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading
Sasi Kiran GaddipatiDeebul NairPaul G. Plöger
2020-09-02
Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification
Bibek UpadhayayVahid Behzadan
2020-09-01
Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation
Wilson LauLaura AaltonenMartin GunnMeliha Yetisgen
2020-09-01
A Bidirectional Tree Tagging Scheme for Jointly Extracting Overlapping Entities and Relations
Xukun LuoWeijie LiuMeng MaPing Wang
2020-08-31
SocCogCom at SemEval-2020 Task 11: Characterizing and Detecting Propaganda using Sentence-Level Emotional Salience Features
Gangeshwar KrishnamurthyRaj Kumar GuptaYinping Yang
2020-08-29
Rethinking the objectives of extractive question answering
Martin FajcikJosef JonSantosh KesirajuPavel Smrz
2020-08-28
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
2020-08-28
DAVE: Deriving Automatically Verilog from English
Hammond PearceBenjamin TanRamesh Karri
2020-08-27
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong ZhangHang Li
2020-08-27
MultiGBS: A multi-layer graph approach to biomedical summarization
Ensieh DavoodijamNasser GhadiriMaryam Lotfi ShahrezaFabio Rinaldi
2020-08-27
Query Focused Multi-document Summarisation of Biomedical Texts
Diego MollaChristopher JonesVincent Nguyen
2020-08-27
GREEK-BERT: The Greeks visiting Sesame Street
John KoutsikakisIlias ChalkidisProdromos MalakasiotisIon Androutsopoulos
2020-08-27
Entity and Evidence Guided Relation Extraction for DocRED
Kevin HuangGuangtao WangTengyu MaJing Huang
2020-08-27
APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Hanlin TangShaoduo GanSamyam RajbhandariXiangru LianCe ZhangJi LiuYuxiong He
2020-08-26
Language Models and Word Sense Disambiguation: An Overview and Analysis
| Daniel LoureiroKiamehr RezaeeMohammad Taher PilehvarJose Camacho-Collados
2020-08-26
Discrete Word Embedding for Logical Natural Language Understanding
Masataro AsaiZilu Tang
2020-08-26
Conceptualized Representation Learning for Chinese Biomedical Text Mining
| Ningyu ZhangQianghuai JiaKangping YinLiang DongFeng GaoNengwei Hua
2020-08-25
syrapropa at SemEval-2020 Task 11: BERT-based Models Design For Propagandistic Technique and Span Detection
Jinfen LiLu Xiao
2020-08-24
Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources
Taolin ZhangChengyu WangMinghui QiuBite YangXiaofeng HeJun Huang
2020-08-24
Two Stages Approach for Tweet Engagement Prediction
Amine DadounIsmail HarrandoPasquale LisenaAlison ReboudRaphael Troncy
2020-08-24
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III
Brent BisedaGaurav DesaiHaifeng LinAnish Philip
2020-08-24
YNU-HPCC at SemEval-2020 Task 11: LSTM Network for Detection of Propaganda Techniques in News Articles
Jiaxu DaoJin WangXuejie Zhang
2020-08-24
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT
| Omar MossadAmgad AhmedAnandharaju RajuHari KarthikeyanZayed Ahmed
2020-08-22
Applications of BERT Based Sequence Tagging Models on Chinese Medical Text Attributes Extraction
Gang ZhaoTeng ZhangChenxiao WangPing LvJi Wu
2020-08-22
HinglishNLP: Fine-tuned Language Models for Hinglish Sentiment Detection
Meghana BhangeNirant Kasliwal
2020-08-22
CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection
| Verena BlaschkeMaxim KorniyenkoSam Tureski
2020-08-22
DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda Classification
Anastasios BairaktarisSymeon SymeonidisAvi Arampatzis
2020-08-22
Adapting Event Extractors to Medical Data: Bridging the Covariate Shift
Aakanksha NaikJill LehmanCarolyn Rose
2020-08-21
Abstractive Summarization of Spoken andWritten Instructions with BERT
Alexandra SavelievaBryan Au-YeungVasanth Ramani
2020-08-21
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension
Son T. LuuKiet Van NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-08-20
UoB at SemEval-2020 Task 12: Boosting BERT with Corpus Level Information
Wah Meng LimHarish Tayyar Madabushi
2020-08-19
Ranking Clarification Questions via Natural Language Inference
Vaibhav KumarVikas RaunakJamie Callan
2020-08-18
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara BahriYi TayChe ZhengDonald MetzlerCliff BrunkAndrew Tomkins
2020-08-17
Narrative Interpolation for Generating and Understanding Stories
Su WangGreg DurrettKatrin Erk
2020-08-17
Stock Index Prediction with Multi-task Learning and Word Polarity Over Time
Yue ZhouKerstin Voigt
2020-08-17
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis YoshidaAllyson EttingerKevin Gimpel
2020-08-16
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
| Shengyu ZhangTan JiangTan WangKun KuangZhou ZhaoJianke ZhuJin YuHongxia YangFei Wu
2020-08-16
Jointly Fine-Tuning “BERT-like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition
| Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry TsaiJayden OoiChun-Sung FerngHyung Won ChungJason Riesa
2020-08-15
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
Marzieh MozafariReza FarahbakhshNoel Crespi
2020-08-14
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea Madotto
2020-08-14
MICE: Mining Idioms with Contextual Embeddings
Tadej ŠkvorcPolona GantarMarko Robnik-Šikonja
2020-08-13
ANDES at SemEval-2020 Task 12: A jointly-trained BERT multilingual model for offensive language detection
| Juan Manuel PérezAymé ArangoFranco Luque
2020-08-13
Variance-reduced Language Pretraining via a Mask Proposal Network
Liang Chen
2020-08-12
FireBERT: Hardening BERT-based classifiers against adversarial attack
Gunnar MeinKevin HartmanAndrew Morris
2020-08-10
Navigating Language Models with Synthetic Agents
Philip Feldman
2020-08-10
KR-BERT: A Small-Scale Korean-Specific Language Model
| Sangah LeeHansol JangYunmee BaikSuzi ParkHyopil Shin
2020-08-10
Does BERT Solve Commonsense Task via Commonsense Knowledge?
Leyang CuiSijie ChengYu WuYue Zhang
2020-08-10
Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine
Kuan FangLong ZhaoZhan ShenRuiXing WangRiKang ZhourLiWen Fan
2020-08-10
GANBERT: Generative Adversarial Networks with Bidirectional Encoder Representations from Transformers for MRI to PET synthesis
Hoo-Chang ShinAlvin IhsaniSwetha MandavaSharath Turuvekere SreenivasChristopher ForsterJiook ChaAlzheimer's Disease Neuroimaging Initiative
2020-08-10
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
| Hayato FutamiHirofumi InagumaSei UenoMasato MimuraShinsuke SakaiTatsuya Kawahara
2020-08-09
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-08-09
Improve Generalization and Robustness of Neural Networks via Weight Scale Shifting Invariant Regularizations
Ziquan LiuYufei CuiAntoni B. Chan
2020-08-07
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
Amirreza ShiraniFranck DernoncourtNedim LipkaPaul AsenteJose EchevarriaThamar Solorio
2020-08-07
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang JiangWeihao YuDaquan ZhouYunpeng ChenJiashi FengShuicheng Yan
2020-08-06
DeText: A Deep Text Ranking Framework with BERT
| Weiwei GuoXiaowei LiuSida WangHuiji GaoAnanth SankarZimeng YangQi GuoLiang ZhangBo LongBee-Chung ChenDeepak Agarwal
2020-08-06
aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning
| Anton ChernyavskiyDmitry IlvovskyPreslav Nakov
2020-08-06
I-AID: Identifying Actionable Information from Disaster-related Tweets
Hamada M. ZaheraRricha JalotaMohamed A. SherifAxel N. Ngomo
2020-08-04
Taking Notes on the Fly Helps BERT Pre-training
Qiyu WuChen XingYatao LiGuolin KeDi HeTie-Yan Liu
2020-08-04
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer
Hwijeen AhnJimin SunChan Young ParkJungyun Seo
2020-08-04
Improving One-stage Visual Grounding by Recursive Sub-query Construction
| Zhengyuan YangTianlang ChenLiwei WangJiebo Luo
2020-08-03
[email protected] at SemEval-2020 Task 12: Multilingual or language-specific BERT?
Marc PàmiesEmily ÖhmanKaisla KajavaJörg Tiedemann
2020-08-03
Trojaning Language Models for Fun and Profit
Xinyang ZhangZheng ZhangTing Wang
2020-08-01
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang LinXin LiGennady Pekhimenko
2020-08-01
Finite Versus Infinite Neural Networks: an Empirical Study
Jaehoon LeeSamuel S. SchoenholzJeffrey PenningtonBen AdlamLechao XiaoRoman NovakJascha Sohl-Dickstein
2020-07-31
On Learning Universal Representations Across Languages
Xiangpeng WeiYue HuRongxiang WengLuxi XingHeng YuWeihua Luo
2020-07-31
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu GuRobert TinnHao ChengMichael LucasNaoto UsuyamaXiaodong LiuTristan NaumannJianfeng GaoHoifung Poon
2020-07-31
TweepFake: about Detecting Deepfake Tweets
Tiziano FagniFabrizio FalchiMargherita GambiniAntonio MartellaMaurizio Tesconi
2020-07-31
Model Reduction of Shallow CNN Model for Reliable Deployment of Information Extraction from Medical Reports
Abhishek K DubeyAlina PelusoJacob HinkleDevanshu AgarawalZilong Tan
2020-07-31
What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation
| Gustavo PenhaClaudia Hauff
2020-07-30
Depressive, Drug Abusive, or Informative: Knowledge-aware Study of News Exposure during COVID-19 Outbreak
Amanuel AlamboManas GaurKrishnaprasad Thirunarayan
2020-07-30
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
| Shayne LongpreYi LuJoachim Daiber
2020-07-30
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
TJ TsaiKevin Ji
2020-07-29
Improving Results on Russian Sentiment Datasets
| Anton GolubevNatalia Loukachevitch
2020-07-28
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models
Martin FajcikJosef JonMartin DocekalPavel Smrz
2020-07-28
Variants of BERT, Random Forests and SVM approach for Multimodal Emotion-Target Sub-challenge
Hoang Manh HungHyung-Jeong YangSoo-Hyung KimGuee-Sang Lee
2020-07-28
GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection
Sajad SotudehTong XiangHao-Ren YaoSean MacAvaneyEugene YangNazli GoharianOphir Frieder
2020-07-28
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets
Manoel Veríssimo dos Santos NetoAyrton Denner da Silva AmaralNádia Félix Felipe da SilvaAnderson da Silva Soares
2020-07-28
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media
| Ali SafayaMoutasem AbdullatifDeniz Yuret
2020-07-26
Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to Code-Mixed Sentiment Analysis
Vinay GopalanMark Hopkins
2020-07-26
To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection
Aparna BalagopalanBenjamin EyreFrank RudziczJekaterina Novikova
2020-07-26
MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning
Aina Garí SolerMarianna Apidianaki
2020-07-24
Product Title Generation for Conversational Systems using BERT
Mansi Ranjit ManeShashank KediaAditya ManthaStephen GuoKannan Achan
2020-07-23
The Lottery Ticket Hypothesis for Pre-trained BERT Networks
| Tianlong ChenJonathan FrankleShiyu ChangSijia LiuYang ZhangZhangyang WangMichael Carbin
2020-07-23
IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings
Vishal KeswaniSakshi SinghAshutosh Modi
2020-07-22
Multi-task learning for natural language processing in the 2020s: where are we going?
Joseph WorshamJugal Kalita
2020-07-22
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches
Karishma LaudJagriti SinghRandeep Kumar SahuAshutosh Modi
2020-07-21
newsSweeper at SemEval-2020 Task 11: Context-Aware Rich Feature Representations For Propaganda Classification
| Paramansh SinghSiraj SandhuSubham KumarAshutosh Modi
2020-07-21
Word Representation for Rhythms
Tongyu LuLyucheng YanGus Xia
2020-07-21
Understanding BERT Rankers Under Distillation
Luyu GaoZhuyun DaiJamie Callan
2020-07-21
A Comparison of Supervised Learning to Match Methods for Product Search
| Fatemeh SarviNikos VoskaridesLois MooimanSebastian SchelterMaarten de Rijke
2020-07-20
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks
Diego de Vargas FeijoViviane Pereira Moreira
2020-07-19
On regularization of gradient descent, layer imbalance and flat minima
Boris Ginsburg
2020-07-18
Generative Pretraining from Pixels
| Mark ChenAlec RadfordRewon ChildJeff WuHeewoo JunPrafulla DhariwalDavid LuanIlya Sutskever
2020-07-17
Multi-Perspective Semantic Information Retrieval in the Biomedical Domain
Samarth Rawal
2020-07-17
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-16
Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT
Yang Jiao
2020-07-16
Human-like Energy Management Based on Deep Reinforcement Learning and Historical Driving Experiences
Teng LiuXiaolin TangXiaosong HuWenhao TanJinwei Zhang
2020-07-16
Hopfield Networks is All You Need
| Hubert RamsauerBernhard SchäflJohannes LehnerPhilipp SeidlMichael WidrichLukas GruberMarkus HolzleitnerMilena PavlovićGeir Kjetil SandveVictor GreiffDavid KreilMichael KoppGünter KlambauerJohannes BrandstetterSepp Hochreiter
2020-07-16
Fine-Tune Longformer for Jointly Predicting Rumor Stance and Veracity
Anant Khandelwal
2020-07-15
Non-greedy Gradient-based Hyperparameter Optimization Over Long Horizons
| Paul MicaelliAmos Storkey
2020-07-15
AdapterHub: A Framework for Adapting Transformers
| Jonas PfeifferAndreas RückléClifton PothAishwarya KamathIvan VulićSebastian RuderKyunghyun ChoIryna Gurevych
2020-07-15
Multimodal Word Sense Disambiguation in Creative Practice
Manuel Ladron de GuevaraChristopher GeorgeAkshat GuptaDaragh ByrneRamesh Krishnamurti
2020-07-15
Logic Constrained Pointer Networks for Interpretable Textual Similarity
| Subhadeep MajiRohan KumarManish BansalKalyani RoyPawan Goyal
2020-07-15
Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-based Neural Networks
Pavel BlinovManvel AvetisianVladimir KokhDmitry UmerenkovAlexander Tuzhilin
2020-07-15
Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media
| Alberto Barron-CedenoTamer ElsayedPreslav NakovGiovanni Da San MartinoMaram HasanainReem SuwailehFatima HaouariNikolay BabulkovBayan HamdanAlex NikolovShaden ShaarZien Sheikh Ali
2020-07-15
Deep Reinforced Query Reformulation for Information Retrieval
Xiao WangCraig MacdonaldIadh Ounis
2020-07-15
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-07-14
Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR
Balázs TarjánGyörgy SzaszákTibor FegyóPéter Mihajlik
2020-07-14
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-14
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu TuGarima LalwaniSpandana GellaHe He
2020-07-14
Can neural networks acquire a structural bias from raw linguistic data?
Alex WarstadtSamuel R. Bowman
2020-07-14
Emoji Prediction: Extensions and Benchmarking
Weicheng MaRuibo LiuLili WangSoroush Vosoughi
2020-07-14
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda KhadkaEstelle AflaloMattias MarderAvrech Ben-DavidSantiago MiretHanlin TangShie MannorTamir HazanSomdeb Majumdar
2020-07-14
Add a SideNet to your MainNet
Adrien Morisot
2020-07-14
An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets
Aarzoo DhimanDurga Toshniwal
2020-07-13
Generative Graph Perturbations for Scene Graph Prediction
Boris KnyazevHarm de VriesCătălina CangeaGraham W. TaylorAaron CourvilleEugene Belilovsky
2020-07-11
BERT Learns (and Teaches) Chemistry
Josh PayneMario SroujiDian Ang YapVineet Kosaraju
2020-07-11
To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Kristian MiokBlaz SkrljDaniela ZaharieMarko Robnik-Sikonja
2020-07-10
BISON:BM25-weighted Self-Attention Framework for Multi-Fields Document Search
| Xuan ShanChuanjie LiuYiqian XiaQi ChenYusi ZhangAngen LuoYuxiang Luo
2020-07-10
Multi-Dialect Arabic BERT for Country-Level Dialect Identification
| Bashar TalafhaMohammad AliMuhy Eddin Za'terHaitham SeelawiIbraheem TuffahaMostafa SamirWael FarhanHussein T. Al-Natsheh
2020-07-10
Contrastive Code Representation Learning
| Paras JainAjay JainTianjun ZhangPieter AbbeelJoseph E. GonzalezIon Stoica
2020-07-09
Fast Transformers with Clustered Attention
| Apoorv VyasAngelos KatharopoulosFrançois Fleuret
2020-07-09
The Go Transformer: Natural Language Modeling for Game Play
Matthew CiolinoDavid NoeverJosh Kalin
2020-07-07
Continual BERT: Continual Learning for Adaptive Extractive Summarization of COVID-19 Literature
Jong Won Park
2020-07-07
Exploring Heterogeneous Information Networks via Pre-Training
Yang FangXiang ZhaoWeidong Xiao
2020-07-07
Deep Contextual Embeddings for Address Classification in E-commerce
Shreyas MangalgiLakshya KumarRavindra Babu Tallamraju
2020-07-06
You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion
Roei SchusterCongzheng SongEran TromerVitaly Shmatikov
2020-07-05
Text Data Augmentation: Towards better detection of spear-phishing emails
Mehdi ReginaMaxime MeyerSébastien Goutal
2020-07-04
Robust Prediction of Punctuation and Truecasing for Medical ASR
Monica SunkaraSrikanth RonankiKalpit DixitSravan BodapatiKatrin Kirchhoff
2020-07-04
Language-agnostic BERT Sentence Embedding
| Fangxiaoyu FengYinfei YangDaniel CerNaveen ArivazhaganWei Wang
2020-07-03
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Pavel DenisovNgoc Thang Vu
2020-07-03
Reading Comprehension in Czech via Machine Translation and Cross-lingual Transfer
Kateřina MackováMilan Straka
2020-07-03
Playing with Words at the National Library of Sweden -- Making a Swedish BERT
| Martin MalmstenLove BörjesonChris Haffenden
2020-07-03
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-03
MIRA: Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks
Yusi ZhangChuanjie LiuAngen LuoHui XueXuan ShanYuxiang LuoYiqian XiaYuanchi YanHaidong Wang
2020-07-03
Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey
Shivaji AlaparthiManit Mishra
2020-07-02
The Impact of Explanations on AI Competency Prediction in VQA
Kamran AlipourArijit RayXiao LinJurgen P. SchulzeYi YaoGiedrius T. Burachas
2020-07-02
On Dropout, Overfitting, and Interaction Effects in Deep Neural Networks
Benjamin LengerichEric P. XingRich Caruana
2020-07-02
Improving Event Detection using Contextual Word and Sentence Embeddings
Mariano MaisonnaveFernando DelbiancoFernando TohméAna MaguitmanEvangelos Milios
2020-07-02
Transformers on Sarcasm Detection with Context
2020-07-01
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-01
Unsupervised FAQ Retrieval with Question Generation and BERT
Yosi MassBoaz CarmeliHaggai RoitmanDavid Konopnicki
2020-07-01
GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples
| Danilo CroceGiuseppe CastellucciRoberto Basili
2020-07-01
Integrating Multimodal Information in Large Pretrained Transformers
Wasifur RahmanMd Kamrul HasanSangwu LeeAmirAli Bagher ZadehChengfeng MaoLouis-Philippe MorencyEhsan Hoque
2020-07-01
Modelling Context and Syntactical Features for Aspect-based Sentiment Analysis
Minh Hieu PhanPhilip O. Ogunbona
2020-07-01
Roles and Utilization of Attention Heads in Transformer-based Neural Language Models
Jae-young JoSung-Hyon Myaeng
2020-07-01
Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
Bo PangErik NijkampWenjuan HanLinqi ZhouYixian LiuKewei Tu
2020-07-01
Adversarial and Domain-Aware BERT for Cross-Domain Sentiment Analysis
Chunning DuHaifeng SunJingyu WangQi QiJianxin Liao
2020-07-01
How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope
Yiyun ZhaoSteven Bethard
2020-07-01
Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work?
Yada PruksachatkunJason PhangHaokun LiuPhu Mon HtutXiaoyi ZhangRichard Yuanzhe PangClara VaniaKatharina KannSamuel R. Bowman
2020-07-01
Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences
Yi TayDonovan OngJie FuAlvin ChanNancy ChenAnh Tuan LuuChris Pal
2020-07-01
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-01
Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study
Xinyu XingXiaosheng FanXiaojun Wan
2020-07-01
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fern{\'a}ndez-Gonz{\'a}lezCarlos G{\'o}mez-Rodr{\'\i}guez
2020-07-01
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection
Nicole PeineltDong NguyenMaria Liakata
2020-07-01
Understanding Advertisements with BERT
Kanika KalraBhargav KurmaSilpa Vadakkeeveetil SreelathaManasi PatwardhanKarShirish e
2020-07-01
Feature Projection for Improved Text Classification
Qi QinWenpeng HuBing Liu
2020-07-01
A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization
Dongfang XuZeyu ZhangSteven Bethard
2020-07-01
Revisiting Higher-Order Dependency Parsers
Erick FonsecaAndr{\'e} F. T. Martins
2020-07-01
SUPP.AI: finding evidence for supplement-drug interactions
Lucy WangOyvind TafjordArman CohanSarthak JainSam SkjonsbergCarissa SchoenickNick BotnerWaleed Ammar
2020-07-01
Why is penguin more similar to polar bear than to sea gull? Analyzing conceptual knowledge in distributional models
Pia Sommerauer
2020-07-01
A Simple and Effective Dependency Parser for Telugu
Sneha NallaniManish ShrivastavaDipti Sharma
2020-07-01
Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup
Jishnu Ray ChowdhuryCornelia CarageaDoina Caragea
2020-07-01
Should You Fine-Tune BERT for Automated Essay Scoring?
Elijah MayfieldAlan W Black
2020-07-01
A BERT-based One-Pass Multi-Task Model for Clinical Temporal Relation Extraction
Chen LinTimothy MillerDmitriy DligachFarig SadequeSteven BethardGuergana Savova
2020-07-01
Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity
Yuxia WangFei LiuKarin VerspoorTimothy Baldwin
2020-07-01
Item-based Collaborative Filtering with BERT
Tian WangYuyangzi Fu
2020-07-01
Sarcasm Identification and Detection in Conversion Context using BERT
Kalaivani A.Thenmozhi D.
2020-07-01
Neural Sarcasm Detection using Conversation Context
Nikhil Jaiswal
2020-07-01
Context-Aware Sarcasm Detection Using BERT
Arup BaruahKaushik DasFerdous BarbhuiyaKuntal Dey
2020-07-01
Character aware models with similarity learning for metaphor detection
Tarun KumarYashvardhan Sharma
2020-07-01
IlliniMet: Illinois System for Metaphor Detection with Contextual and Linguistic Information
Hongyu GongKshitij GuptaAkriti JainSuma Bhat
2020-07-01
Go Figure! Multi-task transformer-based architecture for metaphor detection using idioms: ETS team in 2020 metaphor shared task
Xianyang ChenChee Wee (Ben) LeongMichael FlorBeata Beigman Klebanov
2020-07-01
Metaphor Detection Using Contextual Word Embeddings From Transformers
Jerry LiuNathan O{'}HaraAlex RubinerRachel DraelosCynthia Rudin
2020-07-01
A Transformer Approach to Contextual Sarcasm Detection in Twitter
Hunter GregorySteven LiPouya MohammadiNatalie TarnRachel DraelosCynthia Rudin
2020-07-01
Turku Enhanced Parser Pipeline: From Raw Text to Enhanced Graphs in the IWPT 2020 Shared Task
Jenna KanervaFilip GinterSampo Pyysalo
2020-07-01
K\opsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Daniel HershcovichMiryam de LhoneuxArtur KulmizevElham PejhanJoakim Nivre
2020-07-01
RobertNLP at the IWPT 2020 Shared Task: Surprisingly Simple Enhanced UD Parsing for English
Stefan Gr{\"u}newaldAnnemarie Friedrich
2020-07-01
The HW-TSC Video Speech Translation System at IWSLT 2020
Minghan WangHao YangYao DengYing QinLizhi LeiDaimeng WeiHengchao ShangNing XieXiaochun LiJiaxian Guo
2020-07-01
CopyBERT: A Unified Approach to Question Generation with Self-Attention
Stalin VaranasiSaadullah AminGuenter Neumann
2020-07-01
Robust Prediction of Punctuation and Truecasing for Medical ASR
Monica SunkaraSrikanth RonankiKalpit DixitSravan BodapatiKatrin Kirchhoff
2020-07-01
Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT
Ashutosh AdhikariAchyudh RamRaphael TangWilliam L. HamiltonJimmy Lin
2020-07-01
Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference
Cemil CengizDeniz Yuret
2020-07-01
A Metric Learning Approach to Misogyny Categorization
Juan Manuel CoriaSahar GhannaySophie RossetHerv{\'e} Bredin
2020-07-01
Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation
Alessio MiaschiFelice Dell{'}Orletta
2020-07-01
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-01
Getting the \#\#life out of living: How Adequate Are Word-Pieces for Modelling Complex Morphology?
Stav KleinReut Tsarfaty
2020-07-01
SentiTel: TABSA for Twitter reviews on Uganda Telecoms
David KabiitoJoyce Nakatumba Nabende
2020-07-01
Adversarial Evaluation of BERT for Biomedical Named Entity Recognition
Vladimir AraujoAndr{\'e}s CarvalloDenis Parra
2020-07-01
Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer
Jianfei YuJing JiangLi YangRui Xia
2020-07-01
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions
Hannah CraigheadAndrew CainesPaula ButteryHelen Yannakoudakis
2020-07-01
Regularly Updated Deterministic Policy Gradient Algorithm
Shuai HanWenbo ZhouShuai LüJiayu Yu
2020-07-01
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker Recognition to Overcome Data Scarcity
Jordan J. BirdDiego R. FariaAnikó EkártCristiano PremebidaPedro P. S. Ayrosa
2020-07-01
The Summary Loop: Learning to Write Abstractive Summaries Without Examples
| Philippe LabanAndrew HsiJohn CannyMarti A. Hearst
2020-07-01
Go Wide, Then Narrow: Efficient Training of Deep Thin Networks
Denny ZhouMao YeChen ChenTianjian MengMingxing TanXiaodan SongQuoc LeQiang LiuDale Schuurmans
2020-07-01
SE3M: A Model for Software Effort Estimation Using Pre-trained Embedding Models
Eliane M. De Bortoli FáveroDalcimar CasanovaAndrey Ricardo Pimentel
2020-06-30
Data Movement Is All You Need: A Case Study on Optimizing Transformers
Andrei IvanovNikoli DrydenTal Ben-NunShigang LiTorsten Hoefler
2020-06-30
Segmentation Approach for Coreference Resolution Task
Aref JafariAli Ghodsi
2020-06-30
Want to Identify, Extract and Normalize Adverse Drug Reactions in Tweets? Use RoBERTa
Katikapalli Subramanyam KalyanS. Sangeetha
2020-06-29
Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models
Viet Bui TheOanh Tran ThiPhuong Le-Hong
2020-06-29
Knowledge-Aware Language Model Pretraining
Corby RossetChenyan XiongMinh PhanXia SongPaul BennettSaurabh Tiwary
2020-06-29
Interpreting Hierarchical Linguistic Interactions in DNNs
Die ZhangHuilin ZhouXiaoyi BaoDa HuoRuizhao ChenXu ChengHao ZhangMengyue WuQuanshi Zhang
2020-06-29
Progressive Generation of Long Text
| Bowen TanZichao YangMaruan AI-ShedivatEric P. XingZhiting Hu
2020-06-28
Rethinking Positional Encoding in Language Pre-training
| Guolin KeDi HeTie-Yan Liu
2020-06-28
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
| Chen LiangYue YuHaoming JiangSiawpeng ErRuijia WangTuo ZhaoChao Zhang
2020-06-28
Video-Grounded Dialogues with Pretrained Generation Language Models
Hung LeSteven C. H. Hoi
2020-06-27
Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning
Firas FredjYasser Al-EryaniSetareh MaghsudiMohamed AkroutEkram Hossain
2020-06-26
Noise, overestimation and exploration in Deep Reinforcement Learning
Rafael Stekolshchik
2020-06-25
FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings
| M. Caner TolKoray YurtsevenBerk GulmezogluBerk Sunar
2020-06-25
Normalizing Text using Language Modelling based on Phonetics and String Similarity
Fenil DoshiJimit GandhiDeep GosaliaSudhir Bagul
2020-06-25
LSBert: A Simple Framework for Lexical Simplification
| Jipeng QiangYun LiYi ZhuYunhao YuanXindong Wu
2020-06-25
Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes
Shuai ZhengHaibin LinSheng ZhaMu Li
2020-06-24
Efficient Constituency Parsing by Pointing
Thanh-Tung NguyenXuan-Phi NguyenShafiq JotyXiaoli Li
2020-06-24
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng MengRob GorbetDana Kulić
2020-06-23
ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion
| BingningWangTing YaoQi ZhangJingfang XuXiaochuan Wang
2020-06-22
Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage
Shijing SiRui WangJedrek WosikHao ZhangDavid DovGuoyin WangRicardo HenaoLawrence Carin
2020-06-22
Sarcasm Detection in Tweets with BERT and GloVe Embeddings
Akshay KhatriPranav PDr. Anand Kumar M
2020-06-20
New Vietnamese Corpus for Machine ReadingComprehension of Health News Articles
Kiet Van NguyenDuc-Vu NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-06-19
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19
| David OnianiYanshan Wang
2020-06-19
SqueezeBERT: What can computer vision teach NLP about efficient neural networks?
Forrest N. IandolaAlbert E. ShawRavi KrishnaKurt W. Keutzer
2020-06-19
Reducing Estimation Bias via Weighted Delayed Deep Deterministic Policy Gradient
Qiang HeXinwen Hou
2020-06-18
Exploring the BERT Cross-Lingual Transferability: a Case Study in Reading Comprehension
Konovalov V. P.Gulyaev P. A.Sorokin A. A.Kuratov Y. M.Burtsev M. S.
2020-06-17
Tagging and parsing of multidomain collections
| Alexey SorokinIvan SmurovDenis Kirianov
2020-06-17
Improving accuracy and speeding up Document Image Classification through parallel systems
| Javier FerrandoJuan Luis DominguezJordi TorresRaul GarciaDavid GarciaDaniel GarridoJordi CortadaMateo Valero
2020-06-16
PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models
| Eyal Ben-DavidCarmel RabinovitzRoi Reichart
2020-06-16
The SPPD System for Schema Guided Dialogue State Tracking Challenge
Miao LiHaoqi XiongYunbo Cao
2020-06-16
Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation
Kellie WebsterEmily Pitler
2020-06-16
End-to-End Code Switching Language Models for Automatic Speech Recognition
Ahan M. R.Shreyas Sunil Kulkarni
2020-06-16
Spherical Motion Dynamics of Deep Neural Networks with Batch Normalization and Weight Decay
Ruosi WanZhanxing ZhuXiangyu ZhangJian Sun
2020-06-15
An online evolving framework for advancing reinforcement-learning based automated vehicle control
Teawon HanSubramanya NageshraoDimitar P. FilevUmit Ozguner
2020-06-15
Document Classification for COVID-19 Literature
Bernal Jiménez GutiérrezJuncheng ZengDongdong ZhangPing ZhangYu Su
2020-06-15
FinBERT: A Pretrained Language Model for Financial Communications
| Yi YangMark Christopher Siy UYAllen Huang
2020-06-15
Cooking Is All About People: Comment Classification On Cookery Channels Using BERT and Classification Models (Malayalam-English Mix-Code)
Subramaniam KazhuparambilAbhishek Kaushik
2020-06-15
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
Matej UlčarMarko Robnik-Šikonja
2020-06-14
Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya
Abrhalei TelaAbraham WoubieVille Hautamaki
2020-06-13
Human and Multi-Agent collaboration in a human-MARL teaming framework
Neda NavidiFrancois ChabotSagar KurandwadIrv LustigmanVincent RobertGregory SzriftgiserAndrea Schuch
2020-06-12
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
Pedro Javier Ortiz SuárezLaurent RomaryBenoît Sagot
2020-06-11
All Local Minima are Global for Two-Layer ReLU Neural Networks: The Hidden Convex Optimization Landscape
Jonathan LacotteMert Pilanci
2020-06-10
MC-BERT: Efficient Language Pre-Training via a Meta Controller
| Zhenhui XuLinyuan GongGuolin KeDi HeShuxin ZhengLiwei WangJiang BianTie-Yan Liu
2020-06-10
Neural Networks, Ridge Splines, and TV Regularization in the Radon Domain
Rahul ParhiRobert D. Nowak
2020-06-10
Revisiting Few-sample BERT Fine-tuning
| Tianyi ZhangFelix WuArzoo KatiyarKilian Q. WeinbergerYoav Artzi
2020-06-10
Unsupervised Paraphrase Generation using Pre-trained Language Models
Chaitra HegdeShrikumar Patil
2020-06-09
Few-Shot Generative Conversational Query Rewriting
| Shi YuJiahua LiuJingqin YangChenyan XiongPaul BennettJianfeng GaoZhiyuan Liu
2020-06-09
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
| Marius MosbachMaksym AndriushchenkoDietrich Klakow
2020-06-08
Pre-training Polish Transformer-based Language Models at Scale
| Sławomir DadasMichał PerełkiewiczRafał Poświata
2020-06-07
Medical Concept Normalization in User Generated Texts by Learning Target Concept Embeddings
Katikapalli Subramanyam KalyanS. Sangeetha
2020-06-07
GMAT: Global Memory Augmentation for Transformers
| Ankit GuptaJonathan Berant
2020-06-05
Accelerating Natural Language Understanding in Task-Oriented Dialog
Ojas AhujaShrey Desai
2020-06-05
UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings
Milan StrakaJana Straková
2020-06-05
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
| Pengcheng HeXiaodong LiuJianfeng GaoWeizhu Chen
2020-06-05
The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain
Annemarie FriedrichHeike AdelFederico TomazicJohannes HingerlRenou BenteauAnika MaruscykLukas Lange
2020-06-04
Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2
| Virapat KieuvongngamBowen TanYiming Niu
2020-06-03
WikiBERT models: deep transfer learning for many languages
Sampo PyysaloJenna KanervaAntti VirtanenFilip Ginter
2020-06-02
Question Answering on Scholarly Knowledge Graphs
Mohamad Yaser JaradehMarkus StockerSören Auer
2020-06-02
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension
Jie CaiZhengzhou ZhuPing NieQian Liu
2020-06-02
BERT Based Multilingual Machine Comprehension in English and Hindi
| Somil GuptaNilesh Khade
2020-06-02
Exploring Cross-sentence Contexts for Named Entity Recognition with BERT
Jouni LuomaSampo Pyysalo
2020-06-02
Position Masking for Language Models
Andy WagnerTiyasa MitraMrinal IyerGodfrey Da CostaMarc Tremblay
2020-06-02
R\'e-entra\^\iner ou entra\^\iner soi-m\^eme ? Strat\'egies de pr\'e-entra\^\inement de BERT en domaine m\'edical (Re-train or train from scratch ? Pre-training strategies for BERT in the medical domain )
Hicham El Boukkouri
2020-06-01
\'Etude des variations s\'emantiques \`a travers plusieurs dimensions (Studying semantic variations through several dimensions )
Syrielle MontariolAlex Allauzenre
2020-06-01
Qu'apporte BERT \`a l'analyse syntaxique en constituants discontinus ? Une suite de tests pour \'evaluer les pr\'edictions de structures syntaxiques discontinues en anglais (What does BERT contribute to discontinuous constituency parsing ? A test suite to evaluate discontinuous constituency structure predictions in English)
Maximin Coavoux
2020-06-01
Les mod\`eles de langue contextuels Camembert pour le fran\ccais : impact de la taille et de l'h\'et\'erog\'en\'eit\'e des donn\'ees d'entrainement (C AMEM BERT Contextual Language Models for French: Impact of Training Data Size and Heterogeneity )
Louis MartinBenjamin MullerPedro Javier Ortiz Su{\'a}rezYoann DupontLaurent Romary{\'E}ric Villemonte de la ClergerieBeno{\^\i}t SagotDjam{\'e} Seddah
2020-06-01
Introduction d'informations s\'emantiques dans un syst\`eme de reconnaissance de la parole (Despite spectacular advances in recent years, the Automatic Speech Recognition (ASR) systems still make mistakes, especially in noisy environments)
St{\'e}phane LevelIrina IllinaDominique Fohr
2020-06-01
Emergence of Separable Manifolds in Deep Language Representations
Jonathan MamouHang LeMiguel Del RioCory StephensonHanlin TangYoon KimSueYeon Chung
2020-06-01
Conversational Machine Comprehension: a Literature Review
Somil GuptaBhanu Pratap Singh Rawat
2020-06-01
When Bert Forgets How To POS: Amnesic Probing of Linguistic Properties and MLM Predictions
Yanai ElazarShauli RavfogelAlon JacoviYoav Goldberg
2020-06-01
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features
Shi-Yan WengTien-Hong LoBerlin Chen
2020-06-01
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi DaduKartikey PantRadhika Mamidi
2020-06-01
Neural Entity Linking: A Survey of Models based on Deep Learning
| Ozge SevgiliArtem ShelmanovMikhail ArkhipovAlexander PanchenkoChris Biemann
2020-05-31
"Judge me by my size (noun), do you?'' YodaLib: A Demographic-Aware Humor Generation Framework
Aparna GarimellaCarmen BaneaNabil HossainRada Mihalcea
2020-05-31
BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble Learning
Rajaswa PatilSomesh SinghSwati Agarwal
2020-05-31
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
Siddhant MahurkarRajaswa Patil
2020-05-31
Detecting Problem Statements in Peer Assessments
Yunkai XiaoGabriel ZingleQinjin JiaHarsh R. ShahYi ZhangTianyi LiMohsin KarovaliyaWeixiang ZhaoYang SongJie JiAshwin BalasubramaniamHarshit PatelPriyankha BhalasubbramanianVikram PatelEdward F. Gehringer
2020-05-30
First Neural Conjecturing Datasets and Experiments
Josef UrbanJan Jakubův
2020-05-29
Using Large Pretrained Language Models for Answering User Queries from Product Specifications
Kalyani RoySmit ShahNithish PaiJaidam RamtejPrajit Prashant NadkarnJyotirmoy BanerjeePawan GoyalSurender Kumar
2020-05-29
SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions
Mao YeChengyue GongQiang Liu
2020-05-29
A Comparative Study of Lexical Substitution Approaches based on Neural Language Models
Nikolay ArefyevBoris SheludkoAlexander PodolskiyAlexander Panchenko
2020-05-29
Stance Prediction for Contemporary Issues: Data and Experiments
| Marjan HosseiniaEduard DragutArjun Mukherjee
2020-05-29
On Incorporating Structural Information to improve Dialogue Response Generation
| Nikita MoghePriyesh VijayanBalaraman RavindranMitesh M. Khapra
2020-05-28
Language Models are Few-Shot Learners
| Tom B. BrownBenjamin MannNick RyderMelanie SubbiahJared KaplanPrafulla DhariwalArvind NeelakantanPranav ShyamGirish SastryAmanda AskellSandhini AgarwalAriel Herbert-VossGretchen KruegerTom HenighanRewon ChildAditya RameshDaniel M. ZieglerJeffrey WuClemens WinterChristopher HesseMark ChenEric SiglerMateusz LitwinScott GrayBenjamin ChessJack ClarkChristopher BernerSam McCandlishAlec RadfordIlya SutskeverDario Amodei
2020-05-28
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Adhiguna KuncoroLingpeng KongDaniel FriedDani YogatamaLaura RimellChris DyerPhil Blunsom
2020-05-27
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Amir FederNadav OvedUri ShalitRoi Reichart
2020-05-27
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fernández-GonzálezCarlos Gómez-Rodríguez
2020-05-27
Language Representation Models for Fine-Grained Sentiment Classification
Brian CheangBailey WeiDavid KoganHowey QiuMasud Ahmed
2020-05-27
Network Fusion for Content Creation with Conditional INNs
Robin RombachPatrick EsserBjörn Ommer
2020-05-27
A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction
Saadullah AminKatherine Ann DunfieldAnna VechkaevaGünter Neumann
2020-05-26
What Are People Asking About COVID-19? A Question Classification Dataset
| Jerry WeiChengyu HuangSoroush VosoughiJason Wei
2020-05-26
ParsBERT: Transformer-based Model for Persian Language Understanding
| Mehrdad FarahaniMohammad GharachorlooMarzieh FarahaniMohammad Manthouri
2020-05-26
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection
| Jihyung MoonWon Ik ChoJunbum Lee
2020-05-26
Comparing BERT against traditional machine learning text classification
Santiago González-CarvajalEduardo C. Garrido-Merchán
2020-05-26
BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining
Zachariah ZhangJingshu LiuNarges Razavian
2020-05-26
Optimization-driven Deep Reinforcement Learning for Robust Beamforming in IRS-assisted Wireless Communications
Jiaye LinYuze ZouXiaoru DongShimin GongDinh Thai HoangDusit Niyato
2020-05-25
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Chia-Chih KuoShang-Bao LuoKuan-Yu Chen
2020-05-25
Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
| Daniel HershcovichMiryam de LhoneuxArtur KulmizevElham PejhanJoakim Nivre
2020-05-25
Pointwise Paraphrase Appraisal is Potentially Problematic
Hannah ChenYangfeng JiDavid Evans
2020-05-25
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
| Chen LiuSu ZhuZijian ZhaoRuisheng CaoLu ChenKai Yu
2020-05-24
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree PatelParam RavalRatnam ParikhYesha Shastri
2020-05-22
L2R2: Leveraging Ranking for Abductive Reasoning
| Yunchang ZhuLiang PangYanyan LanXueqi Cheng
2020-05-22
Living Machines: A study of atypical animacy
Mariona Coll ArdanuyFederico NanniKaspar BeelenKasra HosseiniRuth AhnertJon LawrenceKatherine McDonoughGiorgia TolfoDaniel CS WilsonBarbara McGillivray
2020-05-22
Robust Layout-aware IE for Visually Rich Documents with Pre-trained Language Models
Mengxi WeiYifan HeQiong Zhang
2020-05-22
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
Laila RasmyYang XiangZiqian XieCui TaoDegui Zhi
2020-05-22
Supervised Learning in the Presence of Concept Drift: A modelling framework
Michiel StraatFthi AbadiZhuoyun KanChristina GöpfertBarbara HammerMichael Biehl
2020-05-21
BERTweet: A pre-trained language model for English Tweets
| Dat Quoc NguyenThanh VuAnh Tuan Nguyen
2020-05-20
Creative Artificial Intelligence -- Algorithms vs. humans in an incentivized writing competition
Nils KöbisLuca Mossink
2020-05-20
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
Dehong GaoLinbo JinBen ChenMinghui QiuPeng LiYi WeiYi HuHao Wang
2020-05-20
Cross-lingual Transfer Learning for Dialogue Act Recognition
Jiří MartínekChristophe CerisaraPavel KrálLadislav Lenc
2020-05-19
Table Search Using a Deep Contextualized Language Model
| Zhiyu ChenMohamed TrabelsiJeff HeflinYinan XuBrian D. Davison
2020-05-19
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning
Zhenhui YeYining ChenGuanghua SongBowei YangShen Fan
2020-05-19
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu LinYanwei FuYu-Gang JiangXiangyang Xue
2020-05-19
Are All Languages Created Equal in Multilingual BERT?
Shijie WuMark Dredze
2020-05-18
Context-Based Quotation Recommendation
Ansel MacLaughlinTao ChenBurcu Karagol AyanDan Roth
2020-05-17
Support-BERT: Predicting Quality of Question-Answer Pairs in MSDN using Deep Bidirectional Transformer
Bhaskar SenNikhil GopalXinwei Xue
2020-05-17
Building a Hebrew Semantic Role Labeling Lexical Resource from Parallel Movie Subtitles
Ben EyalMichael Elhadad
2020-05-17
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
| Juntao LiChang LiuJian WangLidong BingHongsong LiXiaozhong LiuDongyan ZhaoRui Yan
2020-05-17
Adversarial Training for Commonsense Inference
Lis PereiraXiaodong LiuFei ChengMasayuki AsaharaIchiro Kobayashi
2020-05-17
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
| Pengcheng YinGraham NeubigWen-tau YihSebastian Riedel
2020-05-17
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao FangSicheng WangMeng ZhouJiayuan DingPengtao Xie
2020-05-16
Leveraging Affective Bidirectional Transformers for Offensive Language Detection
AbdelRahim ElmadanyChiyu ZhangMuhammad Abdul-MageedAzadeh Hashemi
2020-05-16
Spelling Error Correction with Soft-Masked BERT
| Shaohua ZhangHaoran HuangJicong LiuHang Li
2020-05-15
Neural Entity Linking on Technical Service Tickets
Nadja KurzFelix HamannAdrian Ulges
2020-05-15
Challenges in Emotion Style Transfer: An Exploration with a Lexical Substitution Pipeline
David HelbigEnrica TroianoRoman Klinger
2020-05-15
[email protected] at SemEval-2020 Task 12: Identifying Multilingual Offensive Tweets Using Weighted Ensemble and Fine-Tuned BERT
Saja Khaled TawalbehMahmoud HammadMohammad AL-Smadi
2020-05-15
NIT-Agartala-NLP-Team at SemEval-2020 Task 8: Building Multimodal Classifiers to tackle Internet Humor
Steve Durairaj SwamyShubham LaddhaBasil AbdussalamDebayan DattaAnupam Jamatia
2020-05-14
A pre-training technique to localize medical BERT and enhance BioBERT
| Shoya WadaToshihiro TakedaShiro ManabeShozo KonishiJun KamoharaYasushi Matsumura
2020-05-14
Parallel Corpus Filtering via Pre-trained Language Models
Boliang ZhangAjay NageshKevin Knight
2020-05-13
Large Scale Multi-Actor Generative Dialog Modeling
Alex BoydRaul PuriMohammad ShoeybiMostofa PatwaryBryan Catanzaro
2020-05-13
Entity-Enriched Neural Models for Clinical Question Answering
| Bhanu Pratap Singh RawatWei-Hung WengPreethi RaghavanPeter Szolovits
2020-05-13
On the Robustness of Language Encoders against Grammatical Errors
Fan YinQuanyu LongTao MengKai-Wei Chang
2020-05-12
On the Generation of Medical Dialogues for COVID-19
| Wenmian YangGuangtao ZengBowen TanZeqian JuSubrato ChakravortyXuehai HeShu ChenXingyi YangQingyang WuZhou YuEric XingPengtao Xie
2020-05-11
Detecting Adverse Drug Reactions from Twitter through Domain-Specific Preprocessing and BERT Ensembling
Amy BredenLee Moore
2020-05-11
How Context Affects Language Models' Factual Predictions
Fabio PetroniPatrick LewisAleksandra PiktusTim RocktäschelYuxiang WuAlexander H. MillerSebastian Riedel
2020-05-10
Transformer Based Language Models for Similar Text Retrieval and Ranking
Javed Qadrud-DinAshraf Bah RabiouRyan WalkerRavi SoniMartin GajekGabriel PackAkhil Rangaraj
2020-05-10
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning
Hirohisa WatanabeMineto TsukadaHiroki Matsutani
2020-05-10
Finding Universal Grammatical Relations in Multilingual BERT
Ethan A. ChiJohn HewittChristopher D. Manning
2020-05-09
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
| Samson TanShafiq JotyMin-Yen KanRichard Socher
2020-05-09
LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation
Gustavo AguilarSudipta KarThamar Solorio
2020-05-09
schuBERT: Optimizing Elements of BERT
Ashish KhetanZohar Karnin
2020-05-09
SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics
| Da YinTao MengKai-Wei Chang
2020-05-08
Distilling Knowledge from Pre-trained Language Models via Text Smoothing
Xing WuYibing LiuXiangyang ZhouDianhai Yu
2020-05-08
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi ZadehAndreas Moshovos
2020-05-08
Temporal Common Sense Acquisition with Minimal Supervision
Ben ZhouQiang NingDaniel KhashabiDan Roth
2020-05-08
Comparative Analysis of Text Classification Approaches in Electronic Health Records
Aurelie MascioZeljko KraljevicDaniel BeanRichard DobsonRobert StewartRebecca BendayanAngus Roberts
2020-05-08
LIIR at SemEval-2020 Task 12: A Cross-Lingual Augmentation Approach for Multilingual Offensive Language Identification
Erfan GhaderyMarie-Francine Moens
2020-05-07
Harvesting and Refining Question-Answer Pairs for Unsupervised QA
| Zhongli LiWenhui WangLi DongFuru WeiKe Xu
2020-05-06
An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining
Yifan PengQingyu ChenZhiyong Lu
2020-05-06
Autoencoding Pixies: Amortised Variational Inference with Graph Convolutions for Functional Distributional Semantics
Guy Emerson
2020-05-06
Categorical Vector Space Semantics for Lambek Calculus with a Relevant Modality
Lachlan McPheatMehrnoosh SadrzadehHadi WazniGijs Wijnholds
2020-05-06
MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models
| Mandy GuoYinfei YangDaniel CerQinlan ShenNoah Constant
2020-05-05
Contextualizing Hate Speech Classifiers with Post-hoc Explanation
Brendan KennedyXisen JinAida Mostafazadeh DavaniMorteza DehghaniXiang Ren
2020-05-05
Establishing Baselines for Text Classification in Low-Resource Languages
| Jan Christian Blaise CruzCharibeth Cheng
2020-05-05
ExpBERT: Representation Engineering with Natural Language Explanations
| Shikhar MurtyPang Wei KohPercy Liang
2020-05-05
ImpactCite: An XLNet-based method for Citation Impact Analysis
Dominique MercierSyed Tahseen Raza RizviVikas RajashekarAndreas DengelSheraz Ahmed
2020-05-05
Distributional Discrepancy: A Metric for Unconditional Text Generation
| Ping CaiXingyuan ChenPeng JinHongjun WangTianrui Li
2020-05-04
Spying on your neighbors: Fine-grained probing of contextual embeddings for information about surrounding words
Josef KlafkaAllyson Ettinger
2020-05-04
Robust Encodings: A Framework for Combating Adversarial Typos
Erik JonesRobin JiaAditi RaghunathanPercy Liang
2020-05-04
Unsupervised Alignment-based Iterative Evidence Retrieval for Multi-hop Question Answering
| Vikas YadavSteven BethardMihai Surdeanu
2020-05-04
Code and Named Entity Recognition in StackOverflow
| Jeniya TabassumMounica MaddelaWei XuAlan Ritter
2020-05-04
Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction
| Masahiro KanekoMasato MitaShun KiyonoJun SuzukiKentaro Inui
2020-05-03
Transformer-based End-to-End Question Generation
| Luis Enrico LopezDiane Kathryn CruzJan Christian Blaise CruzCharibeth Cheng
2020-05-03
BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA
Nora KassnerHinrich Schütze
2020-05-02
DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
| Qingqing CaoHarsh TrivediAruna BalasubramanianNiranjan Balasubramanian
2020-05-02
Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-trained Language Models
Bill Yuchen LinSeyeon LeeRahul KhannaXiang Ren
2020-05-02
Generating Derivational Morphology with BERT
Valentin HofmannJanet B. PierrehumbertHinrich Schütze
2020-05-02
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization
| Wenxuan ZhouBill Yuchen LinXiang Ren
2020-05-02
A Simple Language Model for Task-Oriented Dialogue
| Ehsan Hosseini-AslBryan McCannChien-Sheng WuSemih YavuzRichard Socher
2020-05-02
Contrastive Self-Supervised Learning for Commonsense Reasoning
| Tassilo KleinMoin Nabi
2020-05-02
Understanding Generalization in Recurrent Neural Networks
Zhuozhuo TuFengxiang HeDacheng Tao
2020-05-01
Improving Neural Language Generation with Spectrum Control
Lingxiao WangJing HuangKevin HuangZiniu HuGuangtao WangQuanquan Gu
2020-05-01
Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension
Xinyun ChenChen LiangAdams Wei YuDenny ZhouDawn SongQuoc V. Le
2020-05-01
A Controllable Model of Grounded Response Generation
Zeqiu WuMichel GalleyChris BrockettYizhe ZhangXiang GaoChris QuirkRik Koncel-KedziorskiJianfeng GaoHannaneh HajishirziMari OstendorfBill Dolan
2020-05-01
HipoRank: Incorporating Hierarchical and Positional Information into Graph-based Unsupervised Long Document Extractive Summarization
Yue DongAndrei RomascanuJackie C. K. Cheung
2020-05-01
Identifying Necessary Elements for BERT's Multilinguality
| Philipp DufterHinrich Schütze
2020-05-01
Hitachi at SemEval-2020 Task 12: Offensive Language Identification with Noisy Labels using Statistical Sampling and Post-Processing
Manikandan RavikiranAmin Ekant MuljibhaiToshinori MiyoshiHiroaki OzakiYuta KoreedaSakata Masayuki
2020-05-01
Cross-Linguistic Syntactic Evaluation of Word Prediction Models
| Aaron MuellerGarrett NicolaiPanayiota Petrou-ZeniouNatalia TalminaTal Linzen
2020-05-01
Intermediate-Task Transfer Learning with Pretrained Models for Natural Language Understanding: When and Why Does It Work?
Yada PruksachatkunJason PhangHaokun LiuPhu Mon HtutXiaoyi ZhangRichard Yuanzhe PangClara VaniaKatharina KannSamuel R. Bowman
2020-05-01
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset
Xiang YueBernal Jimenez GutierrezHuan Sun
2020-05-01
When BERT Plays the Lottery, All Tickets Are Winning
Sai PrasannaAnna RogersAnna Rumshisky
2020-05-01
POINTER: Constrained Text Generation via Insertion-based Generative Pre-training
| Yizhe ZhangGuoyin WangChunyuan LiZhe GanChris BrockettBill Dolan
2020-05-01
Probing Text Models for Common Ground with Visual Representations
Gabriel IlharcoRowan ZellersAli FarhadiHannaneh Hajishirzi
2020-05-01
Multilingual Corpus Creation for Multilingual Semantic Similarity Task
Mahtab AhmedChahna DixitRobert E. MercerAtif KhanMuhammad Rifayat SameeFelipe Urra
2020-05-01
Text Categorization for Conflict Event Annotation
Fredrik OlssonMagnus SahlgrenFehmi ben AbdesslemAriel EkgrenKristine Eck
2020-05-01
TF-IDF Character N-grams versus Word Embedding-based Models for Fine-grained Event Classification: A Preliminary Study
Jakub PiskorskiGuillaume Jacquet
2020-05-01
TermEval 2020: TALN-LS2N System for Automatic Term Extraction
Amir HazemBouhM{\'e}rieme iFlorian BoudinBeatrice Daille
2020-05-01
FrameNet Annotations Alignment using Attention-based Machine Translation
Gabriel Marzinotto
2020-05-01
Implementation of Supervised Training Approaches for Monolingual Word Sense Alignment: ACDH-CH System Description for the MWSA Shared Task at GlobaLex 2020
Lenka BajceticSeung-bin Yim
2020-05-01
Transfer learning applied to text classification in Spanish radiological reports
Pilar L{\'o}pez {\'U}bedaManuel Carlos D{\'\i}az-GalianoL. Alfonso Urena LopezMaite MartinTeodoro Mart{\'\i}n-NoguerolAntonio Luna
2020-05-01
Aggression Identification in Social Media: a Transfer Learning Based Approach
RamiFaneva risoaJosiane Mothe
2020-05-01
IRIT at TRAC 2020
RamiFaneva risoaJosiane Mothe
2020-05-01
Bagging BERT Models for Robust Aggression Identification
Julian RischRalf Krestel
2020-05-01
Scmhl5 at TRAC-2 Shared Task on Aggression Identification: Bert Based Ensemble Learning Approach
Han LiuPete BurnapWafa AlorainyMatthew Williams
2020-05-01
Aggression Identification in English, Hindi and Bangla Text using BERT, RoBERTa and SVM
| Arup BaruahKaushik DasFerdous BarbhuiyaKuntal Dey
2020-05-01
Aggression and Misogyny Detection using BERT: A Multi-Task Approach
| Niloofar Safi SamghabadiParth PatwaSrinivas PYKLPrerana MukherjeeAmitava DasThamar Solorio
2020-05-01
From Web Crawl to Clean Register-Annotated Corpora
Veronika LaippalaSamuel R{\"o}nnqvistSaara Hellstr{\"o}mJuhani LuotolahtiLiina RepoAnna SalmelaValtteri SkantsiSampo Pyysalo
2020-05-01
Cross-lingual Zero Pronoun Resolution
Abdulrahman AlorainiMassimo Poesio
2020-05-01
Understanding User Utterances in a Dialog System for Caregiving
Yoshihiko AsaoJulien KloetzerJunta MizunoDai SaikiKazuma KadowakiKentaro Torisawa
2020-05-01
Joint Learning of Syntactic Features Helps Discourse Segmentation
Takshak DesaiParag Pravin DakleDan Moldovan
2020-05-01
Adapting BERT to Implicit Discourse Relation Classification with a Focus on Discourse Connectives
Yudai KishimotoYugo MurawakiSadao Kurohashi
2020-05-01
Automated Essay Scoring System for Nonnative Japanese Learners
Reo HiraoMio AraiHiroki ShimanakaSatoru KatsumataMamoru Komachi
2020-05-01
Development and Validation of a Corpus for Machine Humor Comprehension
Yuen-Hsien TsengWun-Syuan WuChia-Yueh ChangHsueh-Chih ChenWei-Lun Hsu
2020-05-01