Dropout is a regularization technique for neural networks that drops a unit (along with connections) at training time with a specified probability $p$ (a common value is $p=0.5$). At test time, all units are present, but with weights scaled by $p$ (i.e. $w$ becomes $pw$).

The idea is to prevent co-adaptation, where the neural network becomes too reliant on particular connections, as this could be symptomatic of overfitting. Intuitively, dropout can be thought of as creating an implicit ensemble of neural networks.

Source: Dropout: A Simple Way to Prevent Neural Networks from Overfitting

Latest Papers

PAPER DATE
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification
| Abdullatif KöksalArzucan Özgür
2020-10-19
Capturing Longer Context for Document-level Neural Machine Translation: A Multi-resolutional Approach
| Zewei SunMingxuan WangHao ZhouChengqi ZhaoShuJian HuangJiajun ChenLei LI
2020-10-18
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
Wissam SibliniMohamed ChallalCharlotte Pasqual
2020-10-16
It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT
Hila GonenShauli RavfogelYanai ElazarYoav Goldberg
2020-10-16
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge XueBowen YuZhenyu ZhangTingwen LiuYue ZhangBin Wang
2020-10-16
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei ChenWeiping WangLi LiuMichael S. Lew
2020-10-16
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Shengkui ZhaoTrung Hieu NguyenHao WangBin Ma
2020-10-16
Latent Vector Recovery of Audio GANs
Andrew KeyesNicky BayatVahid Reza KhazaieYalda Mohsenzadeh
2020-10-16
DiDi's Machine Translation System for WMT2020
Tanfang ChenWeiwei WangWenyang WeiXing ShiXiangang LiJieping YeKevin Knight
2020-10-16
Generating Diverse Translation from Model Distribution with Dropout
Xuanfu WuYang FengChenze Shao
2020-10-16
Modeling Token-level Uncertainty to Learn Unknown Concepts in SLU via Calibrated Dirichlet Prior RNN
Yilin ShenWenhu ChenHongxia Jin
2020-10-16
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
| Nandan ThakurNils ReimersJohannes DaxenbergerIryna Gurevych
2020-10-16
Revisiting Optical Flow Estimation in 360 Videos
Keshav BhandariZiliang ZongYan Yan
2020-10-15
Empirical Study of Transformers for Source Code
Nadezhda ChirkovaSergey Troshin
2020-10-15
Neural Deepfake Detection with Factual Structure of Text
Wanjun ZhongDuyu TangZenan XuRuize WangNan DuanMing ZhouJiahai WangJian Yin
2020-10-15
AI-based BMI Inference from Facial Images: An Application to Weight Monitoring
Hera SiddiquiAjita RattaniDakshina Ranjan KiskuTanner Dean
2020-10-15
Multi-Task Learning for Cross-Lingual Abstractive Summarization
Sho TakaseNaoaki Okazaki
2020-10-15
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Zhengxuan WuDesmond C. Ong
2020-10-15
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
| Ana MarasovićChandra BhagavatulaJae Sung ParkRonan Le BrasNoah A. SmithYejin Choi
2020-10-15
GMH: A General Multi-hop Reasoning Model for KG Completion
Yao ZhangXu ZhangJun WangHongru LiangAdam JatowtWenqiang LeiZhenglu Yang
2020-10-15
DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation
Yuzhao MaoQi SunGuang LiuXiaojie WangWeiguo GaoXuan LiJianping Shen
2020-10-15
Does Chinese BERT Encode Word Structure?
| Yile WangLeyang CuiYue Zhang
2020-10-15
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Phillip KeungJulian SalazarYichao LuNoah A. Smith
2020-10-15
[email protected]: Sentiment Analysis of Code-Mixed Dravidian text using XLNet
Shubhanker BanerjeeArun JayapalSajeetha Thavareesan
2020-10-15
Response Selection for Multi-Party Conversations withDynamic Topic Tracking
Weishi Wang§Shafiq Joty§Steven C. H. Hoi
2020-10-15
Compressive Summarization with Plausibility and Salience Modeling
| Shrey DesaiJiacheng XuGreg Durrett
2020-10-15
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua ZhuYingce XiaLijun WuJiajun DengWengang ZhouTao QinHouqiang Li
2020-10-15
Understanding Neural Abstractive Summarization Models via Uncertainty
| Jiacheng XuShrey DesaiGreg Durrett
2020-10-15
Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout
| Zhao ChenJiquan NgiamYanping HuangThang LuongHenrik KretzschmarYuning ChaiDragomir Anguelov
2020-10-14
Semantic Segmentation for Partially Occluded Apple Trees Based on Deep Learning
Zijue ChenDavid TingRhys NewburyChao Chen
2020-10-14
Memformer: The Memory-Augmented Transformer
Qingyang WuZhenzhong LanJing GuZhou Yu
2020-10-14
DA-Transformer: Distance-aware Transformer
Chuhan WuFangzhao WuYongfeng Huang
2020-10-14
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
| Gyuwan KimKyunghyun Cho
2020-10-14
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models
Zihan ZhaoYuncong LiuLu ChenQi LiuRao MaKai Yu
2020-10-14
Geometry matters: Exploring language examples at the decision boundary
Debajyoti DattaShashwat KumarLaura BarnesTom Fletcher
2020-10-14
Decoding Methods for Neural Narrative Generation
| Alexandra DeLuciaAaron MuellerXiang Lisa LiJoão Sedoc
2020-10-14
No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection
| Debanjana KarMohit BhardwajSuranjana SamantaAmar Prakash Azad
2020-10-14
Probing for Multilingual Numerical Understanding in Transformer-Based Language Models
| Devin JohnsonDenise MakDrew BarkerLexi Loessberg-Zahl
2020-10-13
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
| Jianquan LiXiaokang LiuHonghong ZhaoRuifeng XuMin YangYaohong Jin
2020-10-13
Incorporating BERT into Parallel Sequence Decoding with Adapters
| Junliang GuoZhirui ZhangLinli XuHao-Ran WeiBoxing ChenEnhong Chen
2020-10-13
Improving Text Generation Evaluation with Batch Centering and Tempered Word Mover Distance
Xi ChenNan DingTomer LevinboimRadu Soricut
2020-10-13
The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation
Shai GretzYonatan BiluEdo Cohen-KarlikNoam Slonim
2020-10-13
Context-Aware Drive-thru Recommendation Service at Fast Food Restaurants
Luyang WangKai HuangJiao WangShengsheng HuangJason DaiYue Zhuang
2020-10-13
CAPT: Contrastive Pre-Training for LearningDenoised Sequence Representations
Fuli LuoPengcheng YangShicheng LiXuancheng RenXu sun
2020-10-13
Aspect-based Document Similarity for Research Papers
| Malte OstendorffTerry RuasTill BlumeBela GippGeorg Rehm
2020-10-13
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension
Ekta SoodSimon TannertDiego FrassinelliAndreas BullingNgoc Thang Vu
2020-10-13
Multilingual Argument Mining: Datasets and Analysis
Orith Toledo-RonenMatan OrbachYonatan BiluArtem SpectorNoam Slonim
2020-10-13
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy LinRodrigo NogueiraAndrew Yates
2020-10-13
Pagsusuri ng RNN-based Transfer Learning Technique sa Low-Resource Language
| Dan John Velasco
2020-10-13
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. HwangChandra BhagavatulaRonan Le BrasJeff DaKeisuke SakaguchiAntoine BosselutYejin Choi
2020-10-12
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. BirdAnikó EkártDiego R. Faria
2020-10-12
Zero-shot Entity Linking with Efficient Long Range Sequence Modeling
| Zonghai YaoLiangliang CaoHuapu Pan
2020-10-12
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana KarSuranjana SamantaAmar Prakash Azad
2020-10-12
Counterfactual Variable Control for Robust and Interpretable Question Answering
| Sicheng YuYulei NiuShuohang WangJing JiangQianru Sun
2020-10-12
Improving Compositional Generalization in Semantic Parsing
| Inbar OrenJonathan HerzigNitish GuptaMatt GardnerJonathan Berant
2020-10-12
HUJI-KU at MRP~2020: Two Transition-based Neural Parsers
Ofir ArvivRuixiang CuiDaniel Hershcovich
2020-10-12
Probing Pretrained Language Models for Lexical Semantics
Ivan VulićEdoardo Maria PontiRobert LitschkoGoran GlavašAnna Korhonen
2020-10-12
EFSG: Evolutionary Fooling Sentences Generator
Marco Di GiovanniMarco Brambilla
2020-10-12
Dynamic Memory Enhanced Transformer for End-to-end Task-Oriented Dialogue System
Yanjie GouYinjie LeiLingqiao Liu
2020-10-12
Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations
Nikolaos ManginasIlias ChalkidisProdromos Malakasiotis
2020-10-12
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks
| Steffen EgerYannik Benz
2020-10-12
Load What You Need: Smaller Versions of Multilingual BERT
| Amine AbdaouiCamille PradelGrégoire Sigel
2020-10-12
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
| Alex WarstadtYian ZhangHaau-Sing LiHaokun LiuSamuel R. Bowman
2020-10-11
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Zuchao LiHai ZhaoRui WangKehai ChenMasao UtiyamaEiichiro Sumita
2020-10-11
Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only
Ziyi LiuGiannis KaramanolakisDaniel HsuLuis Gravano
2020-10-11
Connecting the Dots Between Fact Verification and Fake News Detection
Qifei LiWangchunshu Zhou
2020-10-11
Machine Translation of Mathematical Text
Aditya OhriTanya Schmah
2020-10-11
Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization
Jiyang XieZhanyu MaGuoqiang ZhangJing-Hao XueZheng-Hua TanJun Guo
2020-10-11
Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
Shauli RavfogelYanai ElazarJacob GoldbergerYoav Goldberg
2020-10-11
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
| Brielen MadureiraDavid Schlangen
2020-10-11
Data Agnostic RoBERTa-based Natural Language to SQL Query Generation
| Debaditya PalHarsh SharmaKaustubh Chaudhari
2020-10-11
SMYRF: Efficient Attention using Asymmetric Clustering
| Giannis DarasNikita KitaevAugustus OdenaAlexandros G. Dimakis
2020-10-11
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougue BiyongBo wangTerry LyonsAlejo J Nevado-Holgado
2020-10-10
Structured Self-Attention Weights Encode Semantics in Sentiment Analysis
| Zhengxuan WuThanh-Son NguyenDesmond C. Ong
2020-10-10
Tag Recommendation for Online Q&A Communities based on BERT Pre-Training Technique
Navid KhezrianJafar HabibiIssa Annamoradnejad
2020-10-10
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings
Prafull PrakashSaurabh Kumar ShashidharWenlong ZhaoSubendhu RongaliHaidar KhanMichael Kayser
2020-10-10
Automated Concatenation of Embeddings for Structured Prediction
| Xinyu WangYong JiangNguyen BachTao WangZhongqiang HuangFei HuangKewei Tu
2020-10-10
Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training
| Xinyu WangKewei Tu
2020-10-10
On Task-Level Dialogue Composition of Generative Transformer Model
| Prasanna ParthasarathiArvind NeelakantanSharan Narang
2020-10-09
Attaining Real-Time Super-Resolution for Microscopic Images Using GAN
| Vibhu BhatiaYatender Kumar
2020-10-09
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin HuangPatrick Lumban TobingYi-Chiao WuKazuhiro KobayashiTomoki Toda
2020-10-09
Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN
Patrick Lumban TobingYi-Chiao WuTomoki Toda
2020-10-09
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin CaoJun WangWael HamzaKelly VaneeShang-Wen Li
2020-10-09
Online Back-Parsing for AMR-to-Text Generation
Xuefeng BaiLinfeng SongYue Zhang
2020-10-09
What Have We Achieved on Text Summarization?
Dandan HuangLeyang CuiSen yangGuangsheng BaoKun WangJun XieYue Zhang
2020-10-09
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
| João A. LeiteDiego F. SilvaKalina BontchevaCarolina Scarton
2020-10-09
Neural Random Projection: From the Initial Task To the Input Similarity Problem
Alan SavushkinNikita BenkovichDmitry Golubev
2020-10-09
Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction
Zhen WuChengcan YingFei ZhaoZhifang FanXinyu DaiRui Xia
2020-10-09
NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training
| Priyanshu KumarAadarsh Singh
2020-10-09
Deep Learning Meets Projective Clustering
Alaa MaaloufHarry LangDaniela RusDan Feldman
2020-10-08
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Jonathan ShenYe JiaMike ChrzanowskiYu ZhangIsaac EliasHeiga ZenYonghui Wu
2020-10-08
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models
Gregory SenayEmmanuelle Salin
2020-10-08
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui HuangHong-Kwang KuoSamuel ThomasZvi KonsKartik AudhkhasiBrian KingsburyRon HooryMichael Picheny
2020-10-08
Energy-based Out-of-distribution Detection
| Weitang LiuXiaoYun WangJohn D. OwensYixuan Li
2020-10-08
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
| Yun HeZhuoer WangYin ZhangRuihong HuangJames Caverlee
2020-10-08
Shallow-to-Deep Training for Neural Machine Translation
Bei LiZiyang WangHui LiuYufan JiangQuan DuTong XiaoHuizhen WangJingbo Zhu
2020-10-08
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
| Yun HeZiwei ZhuYin ZhangQin ChenJames Caverlee
2020-10-08
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference
| Xiaoan DingTianyu LiuBaobao ChangZhifang SuiKevin Gimpel
2020-10-08
Improving Attention Mechanism with Query-Value Interaction
Chuhan WuFangzhao WuTao QiYongfeng Huang
2020-10-08
TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Parker RileyNoah ConstantMandy GuoGirish KumarDavid UthusZarana Parekh
2020-10-08
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection
| Libo QinTailu LiuWanxiang CheBingbing KangSendong ZhaoTing Liu
2020-10-08
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding
Dechuang TengLibo QinWanxiang CheSendong ZhaoTing Liu
2020-10-08
Prediction intervals for Deep Neural Networks
Tullio ManciniHector Calvo-PardoJose Olmo
2020-10-08
Interlocking Backpropagation: Improving depthwise model-parallelism
Aidan N. GomezOscar KeyStephen GouNick FrosstJeff DeanYarin Gal
2020-10-08
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou ZhuWeijie SuLewei LuBin LiXiaogang WangJifeng Dai
2020-10-08
Automatic generation of reviews of scientific papers
| Anna NikiforovskayaNikolai KapralovAnna VlasovaOleg ShpynovAleksei Shpilman
2020-10-08
Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets
Mihaela GamanRadu Tudor Ionescu
2020-10-07
Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank
| Eleftheria BriakouMarine Carpuat
2020-10-07
Optimizing Transformers with Approximate Computing for Faster, Smaller and more Accurate NLP Models
Amrit NagarajanSanchari SenJacob R. StevensAnand Raghunathan
2020-10-07
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling
Jiecao ChenLiu YangKarthik RamanMichael BenderskyJung-Jung YehYun ZhouMarc NajorkDanyang CaiEhsan Emadzadeh
2020-10-07
Transformer-GCRF: Recovering Chinese Dropped Pronouns with General Conditional Random Fields
Jingxuan YangKerui XuJun XuSi LiSheng GaoJun GuoJi-Rong WenNianwen Xue
2020-10-07
Don't Trigger Me! A Triggerless Backdoor Attack Against Deep Neural Networks
Ahmed SalemMichael BackesYang Zhang
2020-10-07
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max GlocknerIvan HabernalIryna Gurevych
2020-10-07
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Thai-Son NguyenSebastian StuekerAlex Waibel
2020-10-07
ELMo and BERT in semantic change detection for Russian
Julia RodinaYuliya TrofimovaAndrey KutuzovEkaterina Artemova
2020-10-07
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
Xilun ChenAsish GhoshalYashar MehdadLuke ZettlemoyerSonal Gupta
2020-10-07
Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications
Matthew KhouryRumen DangovskiLongwu OuPreslav NakovYichen ShenLi Jing
2020-10-06
Parallax Motion Effect Generation Through Instance Segmentation And Depth Estimation
Allan PintoManuel A. CórdovaLuis G. L. DeckerJose L. Flores-CampanaMarcos R. SouzaAndreza A. dos SantosJhonatas S. ConceiçãoHenrique F. GagliardiDiogo C. LuvizonRicardo da S. TorresHelio Pedrini
2020-10-06
Adversarial Grammatical Error Correction
Vipul RahejaDimitrios Alikaniotis
2020-10-06
Investigating African-American Vernacular English in Transformer-Based Text Generation
Sophie GroenwoldLily OuAesha ParekhSamhita HonnavalliSharon LevyDiba MirzaWilliam Yang Wang
2020-10-06
Do Explicit Alignments Robustly Improve Multilingual Encoders?
Shijie WuMark Dredze
2020-10-06
LEGAL-BERT: The Muppets straight out of Law School
Ilias ChalkidisManos FergadiotisProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-06
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher
| Giannis KaramanolakisDaniel HsuLuis Gravano
2020-10-06
The Multilingual Amazon Reviews Corpus
Phillip KeungYichao LuGyörgy SzarvasNoah A. Smith
2020-10-06
Scene Graph Modification Based on Natural Language Commands
| Xuanli HeQuan Hung TranGholamreza HaffariWalter ChangTrung BuiZhe LinFranck DernoncourtNhan Dam
2020-10-06
Converting the Point of View of Messages Spoken to Virtual Assistants
| Isabelle G. LeeVera ZuSai Srujana BuddiDennis LiangJack G. M. FitzGerald
2020-10-06
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers
| Marius MosbachAnna KhokhlovaMichael A. HedderichDietrich Klakow
2020-10-06
On the Sub-Layer Functionalities of Transformer Decoder
Yilin YangLongyue WangShuming ShiPrasad TadepalliStefan LeeZhaopeng Tu
2020-10-06
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
| Sebastian HofstätterSophia AlthammerMichael SchröderMete SertkanAllan Hanbury
2020-10-06
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng ChenChia-Jung Lee
2020-10-06
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder
Alvin ChanYi TayYew-Soon OngAston Zhang
2020-10-06
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations
| Aina Garí SolerMarianna Apidianaki
2020-10-06
Analyzing Individual Neurons in Pre-trained Language Models
Nadir DurraniHassan SajjadFahim DalviYonatan Belinkov
2020-10-06
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
| Minki KangMoonsu HanSung Ju Hwang
2020-10-06
Intrinsic Probing through Dimension Selection
Lucas Torroba HennigenAdina WilliamsRyan Cotterell
2020-10-06
Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming
Kanishka MisraAllyson EttingerJulia Taylor Rayz
2020-10-06
Resource-Enhanced Neural Model for Event Argument Extraction
Jie MaShuai WangRishita AnubhaiMiguel BallesterosYaser Al-Onaizan
2020-10-06
Beyond [CLS] through Ranking by Generation
Cicero Nogueira dos santosXiaofei MaRamesh NallapatiZhiheng HuangBing Xiang
2020-10-06
Efficient Inference For Neural Machine Translation
Yi-Te HsuSarthak GargYi-Hsiu LiaoIlya Chatsviorkin
2020-10-06
Using Bayesian deep learning approaches for uncertainty-aware building energy surrogate models
Paul WestermannRalph Evins
2020-10-05
PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation
Xinyu HuaLu Wang
2020-10-05
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Boxin WangShuohang WangYu ChengZhe GanRuoxi JiaBo LiJingjing Liu
2020-10-05
Mixup-Transfomer: Dynamic Data Augmentation for NLP Tasks
Lichao SunCongying XiaWenpeng YinTingTing LiangPhilip S. YuLifang He
2020-10-05
[email protected]: Pre-training ULMFiT on Synthetically Generated Code-Mixed Data for Hate Speech Detection
Gaurav Arora
2020-10-05
Self-training Improves Pre-training for Natural Language Understanding
Jingfei DuEdouard GraveBeliz GunelVishrav ChaudharyOnur CelebiMichael AuliVes StoyanovAlexis Conneau
2020-10-05
D3Net: Densely connected multidilated DenseNet for music source separation
Naoya TakahashiYuki Mitsufuji
2020-10-05
Transformer-Based Neural Text Generation with Syntactic Guidance
Yinghao LiRui FengIsaac RehgChao Zhang
2020-10-05
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne LongpreYu WangChristopher DuBois
2020-10-05
Improving AMR Parsing with Sequence-to-Sequence Pre-training
| Dongqin XuJunhui LiMuhua ZhuMin ZhangGuodong Zhou
2020-10-05
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning
Hanlu WuTengfei MaLingfei WuTariro ManyumwaShouling Ji
2020-10-05
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
Zi LinJeremiah Zhe LiuZi YangNan HuaDan Roth
2020-10-05
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. FengVarun GangalDongyeop KangTeruko MitamuraEduard Hovy
2020-10-05
DCT-SNN: Using DCT to Distribute Spatial Information over Time for Learning Low-Latency Spiking Neural Networks
Isha GargSayeed Shafayet ChowdhuryKaushik Roy
2020-10-05
PMI-Masking: Principled masking of correlated spans
Yoav LevineBarak LenzOpher LieberOmri AbendKevin Leyton-BrownMoshe TennenholtzYoav Shoham
2020-10-05
Linguistic Profiling of a Neural Language Model
Alessio MiaschiDominique BrunatoFelice Dell'OrlettaGiulia Venturi
2020-10-05
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition
Piotr JaniszewskiMateusz SkibaUrszula Walińska
2020-10-05
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
Angel DazaAnette Frank
2020-10-05
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
Dayiheng LiuYeyun GongJie FuYu YanJiusheng ChenJiancheng LvNan DuanMing Zhou
2020-10-04
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias ChalkidisManos FergadiotisSotiris KotitsasProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-04
Inquisitive Question Generation for High Level Text Comprehension
Wei-Jen KoTe-Yuan ChenYiyan HuangGreg DurrettJunyi Jessy Li
2020-10-04
On Losses for Modern Language Models
Stephane Aroca-OuelletteFrank Rudzicz
2020-10-04
Mining Knowledge for Natural Language Inference from Wikipedia Categories
Mingda ChenZewei ChuKarl StratosKevin Gimpel
2020-10-03
Nonconvex Regularization for Network Slimming:Compressing CNNs Even More
Kevin BuiFredrick ParkShuai ZhangYingyong QiJack Xin
2020-10-03
Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness
Lingjuan LyuXuanli HeYitong Li
2020-10-03
Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles
Amirmohammad KazameiniSamin FatehiYash MehtaSauleh EetemadiErik Cambria
2020-10-03
End-to-End Training of CNN Ensembles for Person Re-Identification
Ayse SerbetciYusuf Sinan Akgul
2020-10-03
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media
Xiang DaiSarvnaz KarimiBen HacheyCecile Paris
2020-10-02
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
2020-10-02
Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis
katsuhiko IshiguroKazuya UjiharaRyohto SawadaHirotaka AkitaMasaaki Kotera
2020-10-02
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
| Andreas RückléJonas PfeifferIryna Gurevych
2020-10-02
Beyond Chemical 1D knowledge using Transformers
Ruud Van DeursenIgor V. TetkoGuillaume Godin
2020-10-02
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya YamadaAkari AsaiHiroyuki ShindoHideaki TakedaYuji Matsumoto
2020-10-02
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan ShvartzshnaiderAnanth BalashankarVikas PatidarThomas WiesLakshminarayanan Subramanian
2020-10-01
Evaluating Multilingual BERT for Estonian
Claudia KittaskKirill MilintsevichKairit Sirts
2020-10-01
Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID Twitter BERT and Bagging Ensemble Technique based on Plurality Voting
| Anshul Wadhawan
2020-10-01
A Compare Aggregate Transformer for Understanding Document-grounded Dialogue
Longxuan MaWei-Nan ZhangRunxin SunTing Liu
2020-10-01
Examining the rhetorical capacities of neural language models
Zining ZhuChuer PanMohamed AbdallaFrank Rudzicz
2020-10-01
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Michael BenderskyHonglei ZhuangJi MaShuguang HanKeith HallRyan Mcdonald
2020-10-01
Understanding tables with intermediate pre-training
| Julian Martin EisenschlosSyrine KrichineThomas Müller
2020-10-01
Detecting White Supremacist Hate Speech using Domain Specific Word Embedding with Deep Learning and BERT
Hind Saleh AlatawiAreej Maatog AlhothaliKawthar Mustafa Moria
2020-10-01
CoLAKE: Contextualized Language and Knowledge Embedding
| Tianxiang SunYunfan ShaoXipeng QiuQipeng GuoYaru HuXuanjing HuangZheng Zhang
2020-10-01
WeChat Neural Machine Translation Systems for WMT20
Fandong MengJianhao YanYijin LiuYuan GaoXianfeng ZengQinsong ZengPeng LiMing ChenJie zhouSifan LiuHao Zhou
2020-10-01
AUBER: Automated BERT Regularization
Hyun Dong LeeSeongmin LeeU Kang
2020-09-30
Accurate and Robust Feature Importance Estimation under Distribution Shifts
Jayaraman J. ThiagarajanVivek NarayanaswamyRushil AnirudhPeer-Timo BremerAndreas Spanias
2020-09-30
Learning Hard Retrieval Cross Attention for Transformer
Hongfei XuQiuhui Liu
2020-09-30
Measuring Systematic Generalization in Neural Proof Generation with Transformers
Nicolas GontierKoustuv SinhaSiva ReddyChristopher Pal
2020-09-30
BERT for Monolingual and Cross-Lingual Reverse Dictionary
| Hang YanXiaonan LiXipeng Qiu
2020-09-30
Rethinking Attention with Performers
| Krzysztof ChoromanskiValerii LikhosherstovDavid DohanXingyou SongAndreea GaneTamas SarlosPeter HawkinsJared DavisAfroz MohiuddinLukasz KaiserDavid BelangerLucy ColwellAdrian Weller
2020-09-30
MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Carson EisenachYagna PatelDhruv Madeka
2020-09-30
A Tale of Two Linkings: Dynamically Gating between Schema Linking and Structural Linking for Text-to-SQL Parsing
| Sanxing ChenAidan SanXiaodong LiuYangfeng Ji
2020-09-30
Gender prediction using limited Twitter Data
Maaike BurghoornMaaike H. T. de BoerStephan Raaijmakers
2020-09-29
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions
| Peter A. Jansen
2020-09-29
Attention-Driven Body Pose Encoding for Human Activity Recognition
B DebnathM O'brienS. KumarA Behera
2020-09-29
TEST_POSITIVE at W-NUT 2020 Shared Task-3: Joint Event Multi-task Learning for Slot Filling in Noisy Text
Chacha ChenChieh-Yang HuangYaqi HouYang ShiEnyan DaiJiaqi Wang
2020-09-29
Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study
Saurabh KulshreshthaJosé Luis Redondo-GarcíaChing-Yun Chang
2020-09-29
Attention that does not Explain Away
Nan DingXinjie FanZhenzhong LanDale SchuurmansRadu Soricut
2020-09-29
MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension
Huaishao LuoYu ShiMing GongLinjun ShouTianrui Li
2020-09-29
The design and implementation of Language Learning Chatbot with XAI using Ontology and Transfer Learning
Nuobei ShiQin ZengRaymond Lee
2020-09-29
Self-grouping Convolutional Neural Networks
Qingbei GuoXiao-Jun WuJosef KittlerZhiquan Feng
2020-09-29
Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation
Yinfei YangNing JinKuo LinMandy GuoDaniel Cer
2020-09-29
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation
Dinghan ShenMingzhi ZhengYelong ShenYanru QuWeizhu Chen
2020-09-29
HINT3: Raising the bar for Intent Detection in the Wild
Gaurav AroraChirag JainManas ChaturvediKrupal Modi
2020-09-29
Sequence-to-Sequence Learning for Indonesian Automatic Question Generator
Ferdiant Joshua MuisAyu Purwarianti
2020-09-29
Contrastive Distillation on Intermediate Representations for Language Model Compression
Siqi SunZhe GanYu ChengYuwei FangShuohang WangJingjing Liu
2020-09-29
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Shikib MehriMihail EricDilek Hakkani-Tur
2020-09-28
VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training
Xiaowei HuXi YinKevin LinLijuan WangLei ZhangJianfeng GaoZicheng Liu
2020-09-28
Detecting soccer balls with reduced neural networks: a comparison of multiple architectures under constrained hardware scenarios
Douglas De Rizzo MeneghettiThiago Pedro Donadon HomemJonas Henrique Renolfi de OliveiraIsaac Jesus da SilvaDanilo Hernani PericoReinaldo Augusto da Costa Bianchi
2020-09-28
Fancy Man Lauches Zippo at WNUT 2020 Shared Task-1: A Bert Case Model for Wet Lab Entity Extraction
Haoding MengQingcheng ZengXiaoyang FangZhexin Liang
2020-09-28
A Simple and Efficient Ensemble Classifier Combining Multiple Neural Network Models on Social Media Datasets in Vietnamese
Huy Duc HuynhHang Thi-Thuy DoKiet Van NguyenNgan Luu-Thuy Nguyen
2020-09-28
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Joo Seong JeongSoojeong KimGyeong-In YuYunseong LeeByung-Gon Chun
2020-09-28
Deep Transformers with Latent Depth
Xian LiAsa Cooper SticklandYuqing TangXiang Kong
2020-09-28
Quantal synaptic dilution enhances sparse encoding and dropout regularisation in deep networks
Gardave S Bhumbra
2020-09-28
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan ZhangXiubo GengTao QinYunfang WuDaxin Jiang
2020-09-28
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding
Peilin ZhouZhiqi HuangFenglin LiuYuexian Zou
2020-09-28
Arabic Handwritten Character Recognition based on Convolution Neural Networks and Support Vector Machine
Mahmoud ShamsAmira. A. ElsonbatyWael. Z. ElSawy
2020-09-28
What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties
Rochelle ChoenniEkaterina Shutova
2020-09-27
Classification and understanding of cloud structures via satellite images with EfficientUNet
Tashin AhmedNoor Hossain Nuri Sabab
2020-09-27
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei ZhangLu HouYichun YinLifeng ShangXiao ChenXin JiangQun Liu
2020-09-27
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents
Chejui LiaoTabish ManiarSravanajyothi NAnantha Sharma
2020-09-26
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
| Ye LiuYao WanLifang HeHao PengPhilip S. Yu
2020-09-26
SEMI: Self-supervised Exploration via Multisensory Incongruity
Jianren WangZiwen ZhuangHang Zhao
2020-09-26
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
| Yifan DingNicholas BotzerTim Weninger
2020-09-25
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
Jean-Philippe CorbeilHadi Abdi Ghadivel
2020-09-25
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Zhaojiang LinAndrea MadottoGenta Indra WinataPascale Fung
2020-09-25
An Unsupervised Sentence Embedding Method byMutual Information Maximization
Yan ZhangRuidan HeZuozhu LiuKwan Hui LimLidong Bing
2020-09-25
Weird AI Yankovic: Generating Parody Lyrics
Mark Riedl
2020-09-25
A little goes a long way: Improving toxic language classification despite data scarcity
Mika JuutiTommi GröndahlAdrian FlanaganN. Asokan
2020-09-25
A Comparative Study of Feature Types for Age-Based Text Classification
| Anna GlazkovaYury EgorovMaksim Glazkov
2020-09-24
Toward a Thermodynamics of Meaning
Jonathan Scott Enderle
2020-09-24
Brain Tumor Segmentation using 3D-CNNs with Uncertainty Estimation
Laura Mora BallestarVeronica Vilaplana
2020-09-24
ECOVNet: An Ensemble of Deep Convolutional Neural Networks Based on EfficientNet to Detect COVID-19 From Chest X-rays
Nihad Karim ChowdhuryMd. Muhtadir RahmanNoortaz RezoanaMuhammad Ashad Kabir
2020-09-24
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and Generation
Huishuang TianKexin YangDayiheng LiuJiancheng Lv
2020-09-24
Interpreting and Boosting Dropout from a Game-Theoretic View
Hao ZhangSen LiYinchao MaMingjie LiYichen XieQuanshi Zhang
2020-09-24
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences
| Boon Peng YapAndrew Koh Jin JieEng Siong Chng
2020-09-24
Schizophrenia-mimicking layers outperform conventional neural network layers
Ryuta MizutaniSenta NoguchiRino SaigaMitsuhiro MiyashitaMakoto AraiMasanari Itokawa
2020-09-23
Multi-Pass Transformer for Machine Translation
Peng GaoChiori HoriShijie GengTakaaki HoriJonathan Le Roux
2020-09-23
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition
Bingcong LiXin TangXianbiao QiYihao ChenRong Xiao
2020-09-23
Pruning Convolutional Filters using Batch Bridgeout
Najeeb KhanIan Stavness
2020-09-23
Automatic Breast Lesion Classification by Joint Neural Analysis of Mammography and Ultrasound
Gavriel HabibNahum KiryatiMiri Sklair-LevyAnat ShalmonOsnat Halshtok NeimanRenata Faermann WeidenfeldYael YagilEli KonenArnaldo Mayer
2020-09-23
Robustification of Segmentation Models Against Adversarial Perturbations In Medical Imaging
Hanwool ParkAmirhossein BayatMohammad SabokrouJan S. KirschkeBjoern H. Menze
2020-09-23
FastSecAgg: Scalable Secure Aggregation for Privacy-Preserving Federated Learning
Swanand KadheNived RajaramanO. Ozan KoyluogluKannan Ramchandran
2020-09-23
A Token-wise CNN-based Method for Sentence Compression
Weiwei HouHanna SuominenPiotr KoniuszSabrina CaldwellTom Gedeon
2020-09-23
On Data Augmentation for Extreme Multi-label Classification
Danqing ZhangTao LiHaiyang ZhangBing Yin
2020-09-22
Design of Efficient Deep Learning models for Determining Road Surface Condition from Roadside Camera Images and Weather Data
Juan CarrilloMark CrowleyGuangyuan PanLiping Fu
2020-09-22
AutoRC: Improving BERT Based Relation Classification Models via Architecture Search
Wei ZhuXiaoling WangXipeng QiuYuan NiGuotong Xie
2020-09-22
GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis
| Huaishao LuoLei JiTianrui LiNan DuanDaxin Jiang
2020-09-22
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Chris J. KennedyGeoff BaconAlexander SahnClaudia von Vacano
2020-09-22
TSV Extrusion Morphology Classification Using Deep Convolutional Neural Networks
Brendan ReidyGolareh JalilvandTengfei JiangRamtin Zand
2020-09-22
CCBlock: An Effective Use of Deep Learning for Automatic Diagnosis of COVID-19 Using X-Ray Images
Ali Al-BawiKarrar Ali Al-KaabiMohammed JeryoAhmad Al-Fatlawi
2020-09-21
Impact of lung segmentation on the diagnosis and explanation of COVID-19 in chest X-ray images
Lucas O. TeixeiraRodolfo M. PereiraDiego BertoliniLuiz S. OliveiraLoris NanniYandre M. G. Costa
2020-09-21
Evolutionary Architecture Search for Graph Neural Networks
Min ShiDavid A. WilsonXingquan ZhuYu HuangYuan ZhuangJianxun LiuYufei Tang
2020-09-21
"When they say weed causes depression, but it's your fav antidepressant": Knowledge-aware Attention Framework for Relationship Extraction
Shweta YadavUsha LokalaRaminta DaniulaityteKrishnaprasad ThirunarayanFrancois LamyAmit Sheth
2020-09-21
Alleviating the Inequality of Attention Heads for Neural Machine Translation
Zewei SunShujian HuangXinyu DaiJiajun Chen
2020-09-21
Profile Consistency Identification for Open-domain Dialogue Agents
Haoyu SongYan WangWei-Nan ZhangZhengyu ZhaoTing LiuXiaojiang Liu
2020-09-21
Empathetic Dialogue Generation via Knowledge Enhancing and Emotion Dependency Modeling
Qintong LiPiji LiZhumin ChenZhaochun Ren
2020-09-21
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
Congcong WangDavid Lillis
2020-09-21
Latin BERT: A Contextual Language Model for Classical Philology
David BammanPatrick J. Burns
2020-09-21
Dual-path CNN with Max Gated block for Text-Based Person Re-identification
Tinghuai MaMingming YangHuan RongYurong QianYurong QianYuan TianNajlaAl-Nabhan
2020-09-20
Longformer for MS MARCO Document Re-ranking Task
| Ivan SekulićAmir SoleimaniMohammad AliannejadiFabio Crestani
2020-09-20
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
| Ehsan DoostmohammadiMinoo NassajianAdel Rahimi
2020-09-20
VirtualFlow: Decoupling Deep Learning Model Execution from Underlying Hardware
Andrew OrHaoyu ZhangMichael J. Freedman
2020-09-20
Prior Art Search and Reranking for Generated Patent Text
Jieh-Sheng LeeJieh Hsiang
2020-09-19
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan PilaultAmine ElhattamiChristopher Pal
2020-09-19
Nominal Compound Chain Extraction: A New Task for Semantic-enriched Lexical Chain
Bobo LiHao FeiYafeng RenDonghong Ji
2020-09-19
Adversarial Exposure Attack on Diabetic Retinopathy Imagery
Yupeng ChengFelix Juefei-XuQing GuoHuazhu FuXiaofei XieShang-Wei LinWeisi LinYang Liu
2020-09-19
Towards Computational Linguistics in Minangkabau Language: Studies on Sentiment Analysis and Machine Translation
Fajri KotoIkhwan Koto
2020-09-19
Will it Unblend?
Yuval PinterCassandra L. JacobsJacob Eisenstein
2020-09-18
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
Wonchul SonJaemin NaWonjun Hwang
2020-09-18
The birth of Romanian BERT
Stefan Daniel DumitrescuAndrei-Marius AvramSampo Pyysalo
2020-09-18
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon RohHuiseong GimSoo-Young Lee
2020-09-18
fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP
Zhichao GengHang YanXipeng QiuXuanjing Huang
2020-09-18
NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative
Kumud Chauhan
2020-09-18
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation
Po LiLei LiYan FuJun RongYu Zhang
2020-09-17
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing LiZhenglun KongTianyun ZhangJi LiZhengang LiHang LiuCaiwen Ding
2020-09-17
Label Smoothing and Adversarial Robustness
Chaohao FuHongbin ChenNa RuanWeijia Jia
2020-09-17
Towards Fully 8-bit Integer Inference for the Transformer Model
Ye LinYanyang LiTengbo LiuTong XiaoTongran LiuJingbo Zhu
2020-09-17
Multi^2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERT
Youngbin RoYukyung LeePilsung Kang
2020-09-17
DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction
| Aadarsh SinghPriyanshu KumarAman Sinha
2020-09-17
Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA
Ieva StaliūnaitėIgnacio Iacobacci
2020-09-17
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya GuoShuo RenShuai LuZhangyin FengDuyu TangShujie LiuLong ZhouNan DuanJian YinDaxin JiangMing Zhou
2020-09-17
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib AfridiAftab AlamMuhammad Numan KhanJawad KhanYoung-Koo Lee
2020-09-17
Document-level Neural Machine Translation with Document Embeddings
Shu JiangHai ZhaoZuchao LiBao-Liang Lu
2020-09-16
Solomon at SemEval-2020 Task 11: Ensemble Architecture for Fine-Tuned Propaganda Detection in News Articles
Mayank RajAjay JaiswalRohit R. RAnkita GuptaSudeep Kumar SahooVertika SrivastavaYeon Hyang Kim
2020-09-16
Geometric Uncertainty in Patient-Specific Cardiovascular Modeling with Convolutional Dropout Networks
Gabriel MaherCasey FleeterDaniele SchiavazziAlison Marsden
2020-09-16
Retrofitting Structure-aware Transformer Language Model for End Tasks
Hao FeiYafeng RenDonghong Ji
2020-09-16
EfficientNet-eLite: Extremely Lightweight and Efficient CNN Models for Edge Devices by Network Candidate Search
| Ching-Chen WangChing-Te ChiuJheng-Yi Chang
2020-09-16
Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation
Insoo ChungByeongwook KimYoonjung ChoiSe Jung KwonYongkweon JeonBaeseong ParkSangha KimDongsoo Lee
2020-09-16
Graph-to-Sequence Neural Machine Translation
Sufeng DuanHai ZhaoRui Wang
2020-09-16
Simplified TinyBERT: Knowledge Distillation for Document Retrieval
Xuanang ChenBen HeKai HuiLe SunYingfei Sun
2020-09-16
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
| Jian GuanMinlie Huang
2020-09-16
NABU -- Multilingual Graph-based Neural RDF Verbalizer
Diego MoussallemDwaraknath GnaneshwarThiago Castro FerreiraAxel-Cyrille Ngonga Ngomo
2020-09-16
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-BenitoSanjay VishwakarmaFrancisco Martin-FernandezIsmael Faro
2020-09-16
Deep Learning Approaches for Extracting Adverse Events and Indications of Dietary Supplements from Clinical Text
Yadan FanSicheng ZhouYifan LiRui Zhang
2020-09-16
Activation Functions: Do They Represent A Trade-Off Between Modular Nature of Neural Networks And Task Performance
Himanshu Pradeep AswaniAmit Sethi
2020-09-16
DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT
SungMin YangSoYeop YooOkRan Jeong
2020-09-15
Augmented Natural Language for Generative Sequence Labeling
Ben AthiwaratkunCicero Nogueira dos SantosJason KroneBing Xiang
2020-09-15
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffieAlex Newhouse
2020-09-15
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
| Xiang GaoYizhe ZhangMichel GalleyChris BrockettBill Dolan
2020-09-15
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
| Timo SchickHinrich Schütze
2020-09-15
Critical Thinking for Language Models
Gregor Betz
2020-09-15
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Wei NiuZhenglun KongGeng YuanWeiwen JiangJiexiong GuanCaiwen DingPu ZhaoSijia LiuBin RenYanzhi Wang
2020-09-15
Attention-Aware Inference for Neural Abstractive Summarization
Ye MaLu Zong
2020-09-15
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis ClouatrePhilippe TrempeAmal ZouaqSarath Chandar
2020-09-15
Event Presence Prediction Helps Trigger Detection Across Languages
Parul AwasthyTahira NaseemJian NiTaesun MoonRadu Florian
2020-09-15
Lessons Learned from Applying off-the-shelf BERT: There is no SilverBullet
Victor MakarenkovLior Rokach
2020-09-15
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi ZhengKai HuiBen HeXianpei HanLe SunAndrew Yates
2020-09-15
Controllable neural text-to-speech synthesis using intuitive prosodic features
Tuomo RaitioRamya RasipuramDan Castellani
2020-09-14
Efficient Transformers: A Survey
Yi TayMostafa DehghaniDara BahriDonald Metzler
2020-09-14
Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
Longxiang LiuZhuosheng ZhangHai ZhaoXi ZhouXiang Zhou
2020-09-14
GeDi: Generative Discriminator Guided Sequence Generation
Ben KrauseAkhilesh Deepak GotmareBryan McCannNitish Shirish KeskarShafiq JotyRichard SocherNazneen Fatema Rajani
2020-09-14
Can Fine-tuning Pre-trained Models Lead to Perfect NLP? A Study of the Generalizability of Relation Extraction
Ningyu ZhangLuoqiu LiShumin DengHaiyang YuXu ChengWei ZhangHuajun Chen
2020-09-14
Unsupervised Domain Adaptation by Uncertain Feature Alignment
Tobias RingwaldRainer Stiefelhagen
2020-09-14
Beyond Accuracy: ROI-driven Data Analytics of Empirical Data
Gouri DeshpandeGuenther Ruhe
2020-09-14
Machine Learning's Dropout Training is Distributionally Robust Optimal
Jose BlanchetYang KangJose Luis Montiel OleaViet Anh NguyenXuhui Zhang
2020-09-13
Pairwise-GAN: Pose-based View Synthesis through Pair-Wise Training
Xuyang ShenJo PlestedYue YaoTom Gedeon
2020-09-13
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang WangLuowei ZhouZhe GanYen-Chun ChenYuwei FangSiqi SunYu ChengJingjing Liu
2020-09-13
BoostingBERT:Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen HuangQingyun SheJunlin Zhang
2020-09-13
Corrective feedback, emphatic speech synthesis, visual-speech exaggeration, pronunciation learning
Yaohua BuWeijun LiTianyi MaShengqi ChenJia JiaKun LiXiaobo Lu
2020-09-12
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash BabuRajagopal Eswari
2020-09-12
Country Image in COVID-19 Pandemic: A Case Study of China
Huimin ChenZeyu ZhuFanchao QiYining YeZhiyuan LiuMaosong SunJianbin Jin
2020-09-12
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua ChenHuyen Nguyen
2020-09-12
Unit Test Case Generation with Transformers
Michele TufanoDawn DrainAlexey SvyatkovskiyShao Kun DengNeel Sundaresan
2020-09-11
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
Murad TukanAlaa MaaloufMatan WekslerDan Feldman
2020-09-11
UPB at SemEval-2020 Task 6: Pretrained Language Models for DefinitionExtraction
Andrei-Marius AvramDumitru-Clementin CercelCostin-Gabriel Chiru
2020-09-11
SoFAr: Shortcut-based Fractal Architectures for Binary Convolutional Neural Networks
Zhu BaozhouPeter HofsteeJinho LeeZaid Al-Ars
2020-09-11
GTEA: Representation Learning for Temporal Interaction Graphs via Edge Aggregation
Yiming LiDa Sun Handason TamSiyue XieXiaxin LiuQiu Fang YingWing Cheong LauDah Ming ChiuShou Zhi Chen
2020-09-11
Attribute-conditioned Layout GAN for Automatic Graphic Design
Jianan LiJimei YangJianming ZhangChang LiuChristina WangTingfa Xu
2020-09-11
UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT
Andrei ParaschivDumitru-Clementin CercelMihai Dascalu
2020-09-11
A Comparison of LSTM and BERT for Small Corpus
Aysu Ezen-Can
2020-09-11
Comprehensive Comparison of Deep Learning Models for Lung and COVID-19 Lesion Segmentation in CT scans
Paschalis BizopoulosNicholas VretosPetros Daras
2020-09-10
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
Amir Atapour-AbarghoueiStephen BonnerAndrew Stephen McGough
2020-09-10
FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
Yuwei FangShuohang WangZhe GanSiqi SunJingjing Liu
2020-09-10
Sparsifying Transformer Models with Differentiable Representation Pooling
Michał PietruszkaŁukasz BorchmannFilip Graliński
2020-09-10
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas AffolterBeni EgressyDamian PascualRoger Wattenhofer
2020-09-10
Learning Universal Representations from Word to Sentence
Yian LiHai Zhao
2020-09-10
Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection
| Taesun WhangDongyub LeeDongsuk OhChanhee LeeKijong HanDong-hun LeeSaebyeok Lee
2020-09-10
Modern Methods for Text Generation
| Dimas Munoz Montesinos
2020-09-10
Investigating Gender Bias in BERT
Rishabh BhardwajNavonil MajumderSoujanya Poria
2020-09-10
Pay Attention when Required
Swetha MandavaSzymon MigaczAlex Fit Florea
2020-09-09
Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability
Mayank ChhipaHrushikesh Mahesh VazurkarAbhijeet KumarMridul Mishra
2020-09-09
A Deep Neural Network Tool for Automatic Segmentation of Human Body Parts in Natural Scenes
Patrick McClureGabrielle ReimannMichal RamotFrancisco Pereira
2020-09-08
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
Zhengjie HuangShikun FengWeiyue SuXuyi ChenShuohuan WangJiaxiang LiuXuan OuyangYu Sun
2020-09-08
TanhSoft -- a family of activation functions combining Tanh and Softplus
Koushik BiswasSandeep KumarShilpak BanerjeeAshish Kumar Pandey
2020-09-08
Masked Label Prediction: Unified Massage Passing Model for Semi-Supervised Classification
Yunsheng ShiZhengjie HuangWenjin WangHui ZhongShikun FengYu Sun
2020-09-08
Improving Language Generation with Sentence Coherence Objective
Ruixiao SunJie YangMehrdad Yousefzadeh
2020-09-07
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation
Sicheng ZhaoYezhen WangBo LiBichen WuYang GaoPengfei XuTrevor DarrellKurt Keutzer
2020-09-07
Robust Conversational AI with Grounded Text Generation
Jianfeng GaoBaolin PengChunyuan LiJinchao LiShahin ShayandehLars LidenHeung-Yeung Shum
2020-09-07
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing
Josh KalinMatthew CiolinoDavid NoeverGerry Dozier
2020-09-07
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui ZhangZixuan YuanYanchi LiuFuzhen ZhuangHui Xiong
2020-09-07
TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis
Zilong WangZhaohong WanXiaojun Wan
2020-09-07
Stochastic-YOLO: Efficient Probabilistic Object Detection under Dataset Shifts
Tiago AzevedoRené de JongPartha Maji
2020-09-07
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar AbdelnabiMario Fritz
2020-09-07
Active deep learning method for the discovery of objects of interest in large spectroscopic surveys
Petr ŠkodaOndřej PodsztavekPavel Tvrdík
2020-09-07
Measuring Massive Multitask Language Understanding
Dan HendrycksCollin BurnsSteven BasartAndy ZouMantas MazeikaDawn SongJacob Steinhardt
2020-09-07
EdinburghNLP at WNUT-2020 Task 2: Leveraging Transformers with Generalized Augmentation for Identifying Informativeness in COVID-19 Tweets
Nickil Maveli
2020-09-06
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
2020-09-06
CalciumGAN: A Generative Adversarial Network Model for Synthesising Realistic Calcium Imaging Data of Neuronal Populations
Bryan M. Li.Theoklitos AmvrosiadisNathalie RochefortArno Onken
2020-09-06
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan WilliamsPaul RodriguesValerie Novak
2020-09-05
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis
Jiawei ChenXu TanJian LuanTao QinTie-Yan Liu
2020-09-03
Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading
Sasi Kiran GaddipatiDeebul NairPaul G. Plöger
2020-09-02
Transform Quantization for CNN Compression
Sean I. YoungWang ZheDavid TaubmanBernd Girod
2020-09-02
Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification
Bibek UpadhayayVahid Behzadan
2020-09-01
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
2020-09-01
Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation
Wilson LauLaura AaltonenMartin GunnMeliha Yetisgen
2020-09-01
A Bidirectional Tree Tagging Scheme for Jointly Extracting Overlapping Entities and Relations
Xukun LuoWeijie LiuMeng MaPing Wang
2020-08-31
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei LiJames QinChung-Cheng ChiuRuoming PangYanzhang He
2020-08-30
SocCogCom at SemEval-2020 Task 11: Characterizing and Detecting Propaganda using Sentence-Level Emotional Salience Features
Gangeshwar KrishnamurthyRaj Kumar GuptaYinping Yang
2020-08-29
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing ChenXiaodong LiuJianfeng GaoJian JiaoRuofei ZhangYangfeng Ji
2020-08-28
TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets
Anh Tuan Nguyen
2020-08-28
Rethinking the objectives of extractive question answering
Martin FajcikJosef JonSantosh KesirajuPavel Smrz
2020-08-28
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
2020-08-28
A free web service for fast COVID-19 classification of chest X-Ray images
Jose David Bermudez CastroRicardo ReiJose E. RuizPedro Achanccaray DiazSmith Arauco CanchumuniCristian Muñoz VillalobosFelipe Borges CoelhoLeonardo Forero MendozaMarco Aurelio C. Pacheco
2020-08-27
DAVE: Deriving Automatically Verilog from English
Hammond PearceBenjamin TanRamesh Karri
2020-08-27
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong ZhangHang Li
2020-08-27
MultiGBS: A multi-layer graph approach to biomedical summarization
Ensieh DavoodijamNasser GhadiriMaryam Lotfi ShahrezaFabio Rinaldi
2020-08-27
Improvement of a dedicated model for open domain persona-aware dialogue generation
Qiang Han
2020-08-27
Query Focused Multi-document Summarisation of Biomedical Texts
Diego MollaChristopher JonesVincent Nguyen
2020-08-27
GREEK-BERT: The Greeks visiting Sesame Street
John KoutsikakisIlias ChalkidisProdromos MalakasiotisIon Androutsopoulos
2020-08-27
Entity and Evidence Guided Relation Extraction for DocRED
Kevin HuangGuangtao WangTengyu MaJing Huang
2020-08-27
APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Hanlin TangShaoduo GanSamyam RajbhandariXiangru LianCe ZhangJi LiuYuxiong He
2020-08-26
Language Models and Word Sense Disambiguation: An Overview and Analysis
| Daniel LoureiroKiamehr RezaeeMohammad Taher PilehvarJose Camacho-Collados
2020-08-26
Discrete Word Embedding for Logical Natural Language Understanding
Masataro AsaiZilu Tang
2020-08-26
Uncertainty-Aware Surrogate Model For Oilfield Reservoir Simulation
Ajitabh Kumar
2020-08-26
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo
Yiding WangZhenyi WangChenghao LiYilin ZhangHaizhou Wang
2020-08-26
Conceptualized Representation Learning for Chinese Biomedical Text Mining
| Ningyu ZhangQianghuai JiaKangping YinLiang DongFeng GaoNengwei Hua
2020-08-25
PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization
Zhize LiHongyan BaoXiangliang ZhangPeter Richtárik
2020-08-25
FastSal: a Computationally Efficient Network for Visual Saliency Prediction
Feiyan HuKevin McGuinness
2020-08-25
syrapropa at SemEval-2020 Task 11: BERT-based Models Design For Propagandistic Technique and Span Detection
Jinfen LiLu Xiao
2020-08-24
Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources
Taolin ZhangChengyu WangMinghui QiuBite YangXiaofeng HeJun Huang
2020-08-24
End to End Dialogue Transformer
Ondřej MěkotaMemduh GökırmakPetr Laitoch
2020-08-24
Two Stages Approach for Tweet Engagement Prediction
Amine DadounIsmail HarrandoPasquale LisenaAlison ReboudRaphael Troncy
2020-08-24
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III
Brent BisedaGaurav DesaiHaifeng LinAnish Philip
2020-08-24
YNU-HPCC at SemEval-2020 Task 11: LSTM Network for Detection of Propaganda Techniques in News Articles
Jiaxu DaoJin WangXuejie Zhang
2020-08-24
Variational Inference-Based Dropout in Recurrent Neural Networks for Slot Filling in Spoken Language Understanding
Jun QiXu LiuJavier Tejedor
2020-08-23
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT
| Omar MossadAmgad AhmedAnandharaju RajuHari KarthikeyanZayed Ahmed
2020-08-22
Applications of BERT Based Sequence Tagging Models on Chinese Medical Text Attributes Extraction
Gang ZhaoTeng ZhangChenxiao WangPing LvJi Wu
2020-08-22
Identity-Aware Multi-Sentence Video Description
| Jae Sung ParkTrevor DarrellAnna Rohrbach
2020-08-22
HinglishNLP: Fine-tuned Language Models for Hinglish Sentiment Detection
Meghana BhangeNirant Kasliwal
2020-08-22
CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection
| Verena BlaschkeMaxim KorniyenkoSam Tureski
2020-08-22
DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda Classification
Anastasios BairaktarisSymeon SymeonidisAvi Arampatzis
2020-08-22
Adapting Event Extractors to Medical Data: Bridging the Covariate Shift
Aakanksha NaikJill LehmanCarolyn Rose
2020-08-21
An Improved Person Re-identification Method by light-weight convolutional neural network
Sajad Amouei SheshkalKazim Fouladi-GhalehHossein Aghababa
2020-08-21
Abstractive Summarization of Spoken andWritten Instructions with BERT
Alexandra SavelievaBryan Au-YeungVasanth Ramani
2020-08-21
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
| Diedre CarmoMarcos PiauIsrael CampiottiRodrigo NogueiraRoberto Lotufo
2020-08-20
Uncertainty Estimation in Medical Image Denoising with Bayesian Deep Image Prior
| Max-Heinrich LavesMalte TölleTobias Ortmaier
2020-08-20
Lite Training Strategies for Portuguese-English and English-Portuguese Translation
| Alexandre LopesRodrigo NogueiraRoberto LotufoHelio Pedrini
2020-08-20
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension
Son T. LuuKiet Van NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-08-20
UoB at SemEval-2020 Task 12: Boosting BERT with Corpus Level Information
Wah Meng LimHarish Tayyar Madabushi
2020-08-19
Glancing Transformer for Non-Autoregressive Neural Machine Translation
Lihua QianHao ZhouYu BaoMingxuan WangLin QiuWeinan ZhangYong YuLei Li
2020-08-18
Very Deep Transformers for Neural Machine Translation
Xiaodong LiuKevin DuhLiyuan LiuJianfeng Gao
2020-08-18
Ranking Clarification Questions via Natural Language Inference
Vaibhav KumarVikas RaunakJamie Callan
2020-08-18
Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes
Liangyuan HuChenyang Gu
2020-08-18
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study
Karthik GopalakrishnanBehnam HedayatniaLongshaokan WangYang LiuDilek Hakkani-Tur
2020-08-18
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara BahriYi TayChe ZhengDonald MetzlerCliff BrunkAndrew Tomkins
2020-08-17
Narrative Interpolation for Generating and Understanding Stories
Su WangGreg DurrettKatrin Erk
2020-08-17
Spatial Temporal Transformer Network for Skeleton-based Action Recognition
| Chiara PlizzariMarco CanniciMatteo Matteucci
2020-08-17
Stock Index Prediction with Multi-task Learning and Word Polarity Over Time
Yue ZhouKerstin Voigt
2020-08-17
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis YoshidaAllyson EttingerKevin Gimpel
2020-08-16
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo QinWanxiang CheYangming LiMinheng NiTing Liu
2020-08-16
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
| Shengyu ZhangTan JiangTan WangKun KuangZhou ZhaoJianke ZhuJin YuHongxia YangFei Wu
2020-08-16
TopicBERT: A Transformer transfer learning based memory-graph approach for multimodal streaming social media topic detection
Meysam Asgari-ChenaghluMohammad-Reza Feizi-DerakhshiLeili farzinvashMohammad-Ali BalafarCina Motamed
2020-08-16
Jointly Fine-Tuning “BERT-like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition
| Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry TsaiJayden OoiChun-Sung FerngHyung Won ChungJason Riesa
2020-08-15
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea MadottoZihan LiuZhaojiang LinPascale Fung
2020-08-14
Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
Marzieh MozafariReza FarahbakhshNoel Crespi
2020-08-14
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo LeeMin-Joong LeeTae Gyoon KangSeokyeoung JungMinseok KwonYeona HongJungin LeeKyoung-Gu WooHo-Gyeong KimJiseung JeongJihyun LeeHosik LeeYoung Sang Choi
2020-08-14
A Hybrid BERT and LightGBM based Model for Predicting Emotion GIF Categories on Twitter
Ye BiShuo WangZhongrui Fan
2020-08-14
End-to-end Contextual Perception and Prediction with Interaction Transformer
Lingyun Luke LiBin YangMing LiangWenyuan ZengMengye RenSean SegalRaquel Urtasun
2020-08-13
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
| Dipjyoti PaulMuhammed PV ShifasYannis PantazisYannis Stylianou
2020-08-13
MICE: Mining Idioms with Contextual Embeddings
Tadej ŠkvorcPolona GantarMarko Robnik-Šikonja
2020-08-13
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong HuangWenchao HuYu Ting YeungXiao Chen
2020-08-13
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
Xueli JiaJianzong WangZhiyong ZhangNing ChengJing Xiao
2020-08-13
ANDES at SemEval-2020 Task 12: A jointly-trained BERT multilingual model for offensive language detection
| Juan Manuel PérezAymé ArangoFranco Luque
2020-08-13
MMM : Exploring Conditional Multi-Track Music Generation with the Transformer
Jeff EnsPhilippe Pasquier
2020-08-13
Leveraging Automated Mixed-Low-Precision Quantization for tiny edge microcontrollers
Manuele RusciMarco FariselliAlessandro CapotondiLuca Benini
2020-08-12
Variance-reduced Language Pretraining via a Mask Proposal Network
Liang Chen
2020-08-12
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders
| Nicola MessinaGiuseppe AmatoAndrea EsuliFabrizio FalchiClaudio GennaroStéphane Marchand-Maillet
2020-08-12
Compression of Deep Learning Models for Text: A Survey
Manish GuptaPuneet Agrawal
2020-08-12
Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models
| Isaiah Onando Mulang'Kuldeep SinghChaitali PrabhuAbhishek NadgeriJohannes HoffartJens Lehmann
2020-08-12
Predicting MOOCs Dropout Using Only Two Easily Obtainable Features from the First Week's Activities
Ahmed AlamriMohammad AlshehriAlexandra I. CristeaFilipe D. PereiraElaine OliveiraLei ShiCraig Stewart
2020-08-12
Facial Expression Recognition Under Partial Occlusion from Virtual Reality Headsets based on Transfer Learning
Bita HoushmandNaimul Khan
2020-08-12
Multi-modal segmentation of 3D brain scans using neural networks
Jonathan ZopesMoritz PlatscherSilvio PaganucciChristian Federau
2020-08-11
Modeling Prosodic Phrasing with Multi-Task Learning in Tacotron-based TTS
Rui LiuBerrak SismanFeilong BaoGuanglai GaoHaizhou Li
2020-08-11
PneumoXttention: A CNN compensating for Human Fallibility when Detecting Pneumonia through CXR images with Attention
Sanskriti Singh
2020-08-11
Informative Dropout for Robust Representation Learning: A Shape-bias Perspective
| Baifeng ShiDinghuai ZhangQi DaiZhanxing ZhuYadong MuJingdong Wang
2020-08-10
KR-BERT: A Small-Scale Korean-Specific Language Model
| Sangah LeeHansol JangYunmee BaikSuzi ParkHyopil Shin
2020-08-10
Do ideas have shape? Plato's theory of forms as the continuous limit of artificial neural networks
Houman Owhadi
2020-08-10
FireBERT: Hardening BERT-based classifiers against adversarial attack
Gunnar MeinKevin HartmanAndrew Morris
2020-08-10
Navigating Language Models with Synthetic Agents
Philip Feldman
2020-08-10
Does BERT Solve Commonsense Task via Commonsense Knowledge?
Leyang CuiSijie ChengYu WuYue Zhang
2020-08-10
Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine
Kuan FangLong ZhaoZhan ShenRuiXing WangRiKang ZhourLiWen Fan
2020-08-10
GANBERT: Generative Adversarial Networks with Bidirectional Encoder Representations from Transformers for MRI to PET synthesis
Hoo-Chang ShinAlvin IhsaniSwetha MandavaSharath Turuvekere SreenivasChristopher ForsterJiook ChaAlzheimer's Disease Neuroimaging Initiative
2020-08-10
SpeedySpeech: Efficient Neural Speech Synthesis
| Jan VainerOndřej Dušek
2020-08-09
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-08-09
DIET-SNN: Direct Input Encoding With Leakage and Threshold Optimization in Deep Spiking Neural Networks
Nitin RathiKaushik Roy
2020-08-09
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
| Hayato FutamiHirofumi InagumaSei UenoMasato MimuraShinsuke SakaiTatsuya Kawahara
2020-08-09
Forming Local Intersections of Projections for Classifying and Searching Histopathology Images
Aditya SriramShivam KalraMorteza BabaieBrady KiefferWaddah Al DrobiShahryar RahnamayanHany KashaniHamid R. Tizhoosh
2020-08-08
Towards Lossless Binary Convolutional Neural Networks Using Piecewise Approximation
Baozhou Zhu Zaid Al-ArsWei Pan
2020-08-08
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
Amirreza ShiraniFranck DernoncourtNedim LipkaPaul AsenteJose EchevarriaThamar Solorio
2020-08-07
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin HuangTomoki HayashiYi-Chiao WuHirokazu KameokaTomoki Toda
2020-08-07
The Ensemble Method for Thorax Diseases Classification
Bayu A. Nugroho
2020-08-07
Notes on the Behavior of MC Dropout
Francesco VerdojaVille Kyrki
2020-08-06
Noisy Student Training using Body Language Dataset Improves Facial Expression Recognition
Vikas KumarShivansh RaoLi Yu
2020-08-06
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
| Patrick LewisPontus StenetorpSebastian Riedel
2020-08-06
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang JiangWeihao YuDaquan ZhouYunpeng ChenJiashi FengShuicheng Yan
2020-08-06
DeText: A Deep Text Ranking Framework with BERT
| Weiwei GuoXiaowei LiuSida WangHuiji GaoAnanth SankarZimeng YangQi GuoLiang ZhangBo LongBee-Chung ChenDeepak Agarwal
2020-08-06
Structured Convolutions for Efficient Neural Network Design
Yash BhalgatYizhe ZhangJamie LinFatih Porikli
2020-08-06
aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning
| Anton ChernyavskiyDmitry IlvovskyPreslav Nakov
2020-08-06
6VecLM: Language Modeling in Vector Space for IPv6 Target Generation
Tianyu CuiGang XiongGaopeng GouJunzheng ShiWei Xia
2020-08-05
I-AID: Identifying Actionable Information from Disaster-related Tweets
Hamada M. ZaheraRricha JalotaMohamed A. SherifAxel N. Ngomo
2020-08-04
Land Use and Land Cover Classification using a Human Group based Particle Swarm Optimization Algorithm with a LSTM classifier on hybrid-pre-processing Remote Sensing Images
T. KowsalyaS. L. UlloC. ZarroK. L. HemalathaB. D. Parameshachari
2020-08-04
Peer-inspired Student Performance Prediction in Interactive Online Question Pools with Graph Neural Network
Haotian LiHuan WeiYong WangYangqiu SongHuamin Qu
2020-08-04
Taking Notes on the Fly Helps BERT Pre-training
Qiyu WuChen XingYatao LiGuolin KeDi HeTie-Yan Liu
2020-08-04
Learning from a Complementary-label Source Domain: Theory and Algorithms
Yiyang ZhangFeng LiuZhen FangBo YuanGuangquan ZhangJie Lu
2020-08-04
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer
Hwijeen AhnJimin SunChan Young ParkJungyun Seo
2020-08-04
The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-composed Music through Quantitative Measures
| Shih-Lun WuYi-Hsuan Yang
2020-08-04
Automatic Composition of Guitar Tabs by Transformers and Groove Modeling
Yu-Hua ChenYu-Hsiang HuangWen-Yi HsiaoYi-Hsuan Yang
2020-08-04
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
| Tomáš NekvindaOndřej Dušek
2020-08-03
Deep Bayesian Bandits: Exploring in Online Personalized Recommendations
Dalin GuoSofia Ira KtenaFerenc HuszarPranay Kumar MyanaWenzhe ShiAlykhan Tejani
2020-08-03
Improving One-stage Visual Grounding by Recursive Sub-query Construction
| Zhengyuan YangTianlang ChenLiwei WangJiebo Luo
2020-08-03
[email protected] at SemEval-2020 Task 12: Multilingual or language-specific BERT?
Marc PàmiesEmily ÖhmanKaisla KajavaJörg Tiedemann
2020-08-03
Self-attention encoding and pooling for speaker recognition
Pooyan SafariMiquel IndiaJavier Hernando
2020-08-03
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space
Liu YangFanqi MengMing-Kuang Daniel WuVicent YingXianchao Xu
2020-08-02
The Chess Transformer: Mastering Play using Generative Language Models
David NoeverMatt CiolinoJosh Kalin
2020-08-02
Trojaning Language Models for Fun and Profit
Xinyang ZhangZheng ZhangTing Wang
2020-08-01
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang LinXin LiGennady Pekhimenko
2020-08-01
A Novel Global Spatial Attention Mechanism in Convolutional Neural Network for Medical Image Classification
Linchuan XuJun HuangAtsushi NitandaRyo AsaokaKenji Yamanishi
2020-07-31
Learning the Distribution: A Unified Distillation Paradigm for Fast Uncertainty Estimation in Computer Vision
Yichen ShenZhilu ZhangMert R. SabuncuLin Sun
2020-07-31
On Learning Universal Representations Across Languages
Xiangpeng WeiYue HuRongxiang WengLuxi XingHeng YuWeihua Luo
2020-07-31
Resist : Reconstruction of irises from templates
Sohaib AhmadBenjamin Fuller
2020-07-31
Language Modelling for Source Code with Transformer-XL
| Thomas DowdellHongyu Zhang
2020-07-31
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu GuRobert TinnHao ChengMichael LucasNaoto UsuyamaXiaodong LiuTristan NaumannJianfeng GaoHoifung Poon
2020-07-31
TweepFake: about Detecting Deepfake Tweets
Tiziano FagniFabrizio FalchiMargherita GambiniAntonio MartellaMaurizio Tesconi
2020-07-31
Model Reduction of Shallow CNN Model for Reliable Deployment of Information Extraction from Medical Reports
Abhishek K DubeyAlina PelusoJacob HinkleDevanshu AgarawalZilong Tan
2020-07-31
Generalization Comparison of Deep Neural Networks via Output Sensitivity
| Mahsa ForouzeshFarnood SalehiPatrick Thiran
2020-07-30
Deep Multi-View Spatiotemporal Virtual Graph Neural Network for Significant Citywide Ride-hailing Demand Prediction
Guangyin JinZhexu XiHengyu ShaYanghe FengJincai Huang
2020-07-30
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Jinhyeok YangJunmo LeeYoungik KimHoonyoung ChoInjung Kim
2020-07-30
What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation
| Gustavo PenhaClaudia Hauff
2020-07-30
Interpretable Contextual Team-aware Item Recommendation: Application in Multiplayer Online Battle Arena Games
| Andrés VillaVladimir AraujoFrancisca CattanDenis Parra
2020-07-30
Depressive, Drug Abusive, or Informative: Knowledge-aware Study of News Exposure during COVID-19 Outbreak
Amanuel AlamboManas GaurKrishnaprasad Thirunarayan
2020-07-30
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
| Shayne LongpreYi LuJoachim Daiber
2020-07-30
Adversarial Robustness for Machine Learning Cyber Defenses Using Log Data
Kai SteversonJonathan MullinMetin Ahiskali
2020-07-29
Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization
Tawsifur RahmanAmith KhandakarMuhammad Abdul KadirKhandaker R. IslamKhandaker F. IslamRashid MazharTahir HamidMohammad T. IslamZaid B. MahbubMohamed Arselene AyariMuhammad E. H. Chowdhury
2020-07-29
Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation
| Yiyang ZhangFeng LiuZhen FangBo YuanGuangquan ZhangJie Lu
2020-07-29
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
TJ TsaiKevin Ji
2020-07-29
Improving Results on Russian Sentiment Datasets
| Anton GolubevNatalia Loukachevitch
2020-07-28
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models
Martin FajcikJosef JonMartin DocekalPavel Smrz
2020-07-28
Variants of BERT, Random Forests and SVM approach for Multimodal Emotion-Target Sub-challenge
Hoang Manh HungHyung-Jeong YangSoo-Hyung KimGuee-Sang Lee
2020-07-28
GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection
Sajad SotudehTong XiangHao-Ren YaoSean MacAvaneyEugene YangNazli GoharianOphir Frieder
2020-07-28
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets
Manoel Veríssimo dos Santos NetoAyrton Denner da Silva AmaralNádia Félix Felipe da SilvaAnderson da Silva Soares
2020-07-28
TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling
Shuai ZhangPeng ZhangXindian MaJunqiu WeiNingning WangQun Liu
2020-07-28
MaxDropout: Deep Neural Network Regularization Based on Maximum Output Values
| Claudio Filipi Goncalves do SantosDanilo ColomboMateus RoderJoão Paulo Papa
2020-07-27
Self-Attentive Multi-Layer Aggregation with Feature Recalibration and Normalization for End-to-End Speaker Verification System
Soonshin SeoJi-Hwan Kim
2020-07-27
From Sound Representation to Model Robustness
Mohammad EsmaeilpourPatrick CardinalAlessandro Lameiras Koerich
2020-07-27
Receptive-Field Regularized CNNs for Music Classification and Tagging
Khaled KoutiniHamid Eghbal-ZadehVerena HaunschmidPaul PrimusShreyan ChowdhuryGerhard Widmer
2020-07-27
Semi-Supervised Learning with Data Augmentation for End-to-End ASR
Felix WeningerFranco ManaRoberto GemelloJesús Andrés-FerrerPuming Zhan
2020-07-27
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media
| Ali SafayaMoutasem AbdullatifDeniz Yuret
2020-07-26
Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to Code-Mixed Sentiment Analysis
Vinay GopalanMark Hopkins
2020-07-26
To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection
Aparna BalagopalanBenjamin EyreFrank RudziczJekaterina Novikova
2020-07-26
Quasi-Periodic Parallel WaveGAN: A Non-autoregressive Raw Waveform Generative Model with Pitch-dependent Dilated Convolution Neural Network
| Yi-Chiao WuTomoki HayashiTakuma OkamotoHisashi KawaiTomoki Toda
2020-07-25
Self-supervised Learning for Deep Models in Recommendations
Tiansheng YaoXinyang YiDerek Zhiyuan ChengFelix YuAditya MenonLichan HongEd H. ChiSteve TjoaJieqiKangEvan Ettinger
2020-07-25
FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings
| Bertelt BraaksmaRichard ScholtensStan van SuijlekomRemy WangAhmet Üstün
2020-07-24
MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning
Aina Garí SolerMarianna Apidianaki
2020-07-24
Product Title Generation for Conversational Systems using BERT
Mansi Ranjit ManeShashank KediaAditya ManthaStephen GuoKannan Achan
2020-07-23
PareCO: Pareto-aware Channel Optimization for Slimmable Neural Networks
Ting-Wu ChinAri S. MorcosDiana Marculescu
2020-07-23
The Lottery Ticket Hypothesis for Pre-trained BERT Networks
| Tianlong ChenJonathan FrankleShiyu ChangSijia LiuYang ZhangZhangyang WangMichael Carbin
2020-07-23
Exploring Swedish & English fastText Embeddings with the Transformer
| Tosin P. AdewumiFoteini LiwickiMarcus Liwicki
2020-07-23
CrossTransformers: spatially-aware few-shot transfer
Carl DoerschAnkush GuptaAndrew Zisserman
2020-07-22
IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings
Vishal KeswaniSakshi SinghAshutosh Modi
2020-07-22
Rethinking CNN Models for Audio Classification
| Kamalesh PalanisamyDipika SinghaniaAngela Yao
2020-07-22
Analogical Reasoning for Visually Grounded Language Acquisition
Bo WuHaoyu QinAlireza ZareianCarl VondrickShih-Fu Chang
2020-07-22
Multi-task learning for natural language processing in the 2020s: where are we going?
Joseph WorshamJugal Kalita
2020-07-22
SliceOut: Training Transformers and CNNs faster while using less memory
Pascal NotinAidan N. GomezJoanna YooYarin Gal
2020-07-21
Neural Machine Translation with Error Correction
Kaitao SongXu TanJianfeng Lu
2020-07-21
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches
Karishma LaudJagriti SinghRandeep Kumar SahuAshutosh Modi
2020-07-21
newsSweeper at SemEval-2020 Task 11: Context-Aware Rich Feature Representations For Propaganda Classification
| Paramansh SinghSiraj SandhuSubham KumarAshutosh Modi
2020-07-21
Word Representation for Rhythms
Tongyu LuLyucheng YanGus Xia
2020-07-21
Understanding BERT Rankers Under Distillation
Luyu GaoZhuyun DaiJamie Callan
2020-07-21
Learning Joint Spatial-Temporal Transformations for Video Inpainting
| Yanhong ZengJianlong FuHongyang Chao
2020-07-20
Monte Carlo Dropout Ensembles for Robust Illumination Estimation
Firas LaakomJenni RaitoharjuAlexandros IosifidisJarno NikkanenMoncef Gabbouj
2020-07-20
A Comparison of Supervised Learning to Match Methods for Product Search
| Fatemeh SarviNikos VoskaridesLois MooimanSebastian SchelterMaarten de Rijke
2020-07-20
Learning Sparse Filters in Deep Convolutional Neural Networks with a l1/l2 Pseudo-Norm
Anthony BerthelierYongzhe YanThierry ChateauChristophe BlancStefan DuffnerChristophe Garcia
2020-07-20
Effects of Approximate Multiplication on Convolutional Neural Networks
Min Soo KimAlberto A. Del BarrioHyunJin KimNader Bagherzadeh
2020-07-20
Conformer-Kernel with Query Term Independence for Document Retrieval
| Bhaskar MitraSebastian HofstatterHamed ZamaniNick Craswell
2020-07-20
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks
Diego de Vargas FeijoViviane Pereira Moreira
2020-07-19
Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit
| Emma RocheteauPietro LiòStephanie Hyland
2020-07-18
Feature Pyramid Transformer
| Dong ZhangHanwang ZhangJinhui TangMeng WangXiansheng HuaQianru Sun
2020-07-18
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection
K. V. KadambariVishnu Vardhan Nimmalapudi
2020-07-17
Hybrid Discriminative-Generative Training via Contrastive Learning
Hao LiuPieter Abbeel
2020-07-17
CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition
Ludwig KürzingerDominik WinkelbauerLujun LiTobias WatzelGerhard Rigoll
2020-07-17
Multi-Perspective Semantic Information Retrieval in the Biomedical Domain
Samarth Rawal
2020-07-17
Investigating Pretrained Language Models for Graph-to-Text Generation
Leonardo F. R. RibeiroMartin SchmittHinrich SchützeIryna Gurevych
2020-07-16
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-16
EfficientHRNet: Efficient Scaling for Lightweight High-Resolution Multi-Person Pose Estimation
Christopher NeffAneri ShethSteven FurgursonHamed Tabkhi
2020-07-16
Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT
Yang Jiao
2020-07-16
SqueezeFacePoseNet: Lightweight Face Verification Across Different Poses for Mobile Platforms
Fernando Alonso-FernandezJavier BarrachinaKevin Hernandez-DiazJosef Bigun
2020-07-16
Hopfield Networks is All You Need
| Hubert RamsauerBernhard SchäflJohannes LehnerPhilipp SeidlMichael WidrichLukas GruberMarkus HolzleitnerMilena PavlovićGeir Kjetil SandveVictor GreiffDavid KreilMichael KoppGünter KlambauerJohannes BrandstetterSepp Hochreiter
2020-07-16
Fine-Tune Longformer for Jointly Predicting Rumor Stance and Veracity
Anant Khandelwal
2020-07-15
AdapterHub: A Framework for Adapting Transformers
| Jonas PfeifferAndreas RückléClifton PothAishwarya KamathIvan VulićSebastian RuderKyunghyun ChoIryna Gurevych
2020-07-15
Multimodal Word Sense Disambiguation in Creative Practice
Manuel Ladron de GuevaraChristopher GeorgeAkshat GuptaDaragh ByrneRamesh Krishnamurti
2020-07-15
Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes
Marcelo Gennari do NascimentoTheo W. CostainVictor Adrian Prisacariu
2020-07-15
Logic Constrained Pointer Networks for Interpretable Textual Similarity
| Subhadeep MajiRohan KumarManish BansalKalyani RoyPawan Goyal
2020-07-15
Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-based Neural Networks
Pavel BlinovManvel AvetisianVladimir KokhDmitry UmerenkovAlexander Tuzhilin
2020-07-15
Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media
| Alberto Barron-CedenoTamer ElsayedPreslav NakovGiovanni Da San MartinoMaram HasanainReem SuwailehFatima HaouariNikolay BabulkovBayan HamdanAlex NikolovShaden ShaarZien Sheikh Ali
2020-07-15
Deep Reinforced Query Reformulation for Information Retrieval
Xiao WangCraig MacdonaldIadh Ounis
2020-07-15
The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction
Alice MartinCharles OllionFlorian StrubSylvain Le CorffOlivier Pietquin
2020-07-15
InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training
| Zewen ChiLi DongFuru WeiNan YangSaksham SinghalWenhui WangXia SongXian-Ling MaoHeyan HuangMing Zhou
2020-07-15
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-07-14
An Uncertainty-based Human-in-the-loop System for Industrial Tool Wear Analysis
Alexander TreissJannis WalkNiklas Kühl
2020-07-14
Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR
Balázs TarjánGyörgy SzaszákTibor FegyóPéter Mihajlik
2020-07-14
Contextualized Code Representation Learning for Commit Message Generation
Lun Yiu NieCuiyun GaoZhicong ZhongWai LamYang LiuZenglin Xu
2020-07-14
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-14
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu TuGarima LalwaniSpandana GellaHe He
2020-07-14
Can neural networks acquire a structural bias from raw linguistic data?
Alex WarstadtSamuel R. Bowman
2020-07-14
Emoji Prediction: Extensions and Benchmarking
Weicheng MaRuibo LiuLili WangSoroush Vosoughi
2020-07-14
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda KhadkaEstelle AflaloMattias MarderAvrech Ben-DavidSantiago MiretHanlin TangShie MannorTamir HazanSomdeb Majumdar
2020-07-14
Add a SideNet to your MainNet
Adrien Morisot
2020-07-14
Uncertain-DeepSSM: From Images to Probabilistic Shape Models
Jadie AdamsRiddhish BhalodiaShireen Elhabian
2020-07-13
Paranoid Transformer: Reading Narrative of Madness as Computational Approach to Creativity
Yana AgafonovaAlexey TikhonovIvan P. Yamshchikov
2020-07-13
Transformer with Depth-Wise LSTM
Hongfei XuQiuhui LiuDeyi XiongJosef van Genabith
2020-07-13
An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets
Aarzoo DhimanDurga Toshniwal
2020-07-13
Regularized linear autoencoders recover the principal components, eventually
| Xuchan BaoJames LucasSushant SachdevaRoger Grosse
2020-07-13
VINNAS: Variational Inference-based Neural Network Architecture Search
Martin FeriancHongxiang FanMiguel Rodrigues
2020-07-12
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
Aditya MogadalaMarius MosbachDietrich Klakow
2020-07-12
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
| Andy T. LiuShang-Wen LiHung-yi Lee
2020-07-12
Locality Guided Neural Networks for Explainable Artificial Intelligence
Randy TanNaimul KhanLing Guan
2020-07-12
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Yi TayZhe ZhaoDara BahriDonald MetzlerDa-Cheng Juan
2020-07-12
To filter prune, or to layer prune, that is the question
Sara ElkerdawyMostafa ElhoushiAbhineet SinghHong ZhangNilanjan Ray
2020-07-11
Sequence Generation with Mixed Representations
| Lijun Wu Shufang Xie Yingce Xia Fan Yang Tao Qin Jianhuang Lai Tie-Yan Liu
2020-07-11
Generative Graph Perturbations for Scene Graph Prediction
Boris KnyazevHarm de VriesCătălina CangeaGraham W. TaylorAaron CourvilleEugene Belilovsky
2020-07-11
BERT Learns (and Teaches) Chemistry
Josh PayneMario SroujiDian Ang YapVineet Kosaraju
2020-07-11
Characteristics of Monte Carlo Dropout in Wide Neural Networks
Joachim SickingMaram AkilaTim WirtzSebastian HoubenAsja Fischer
2020-07-10
To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Kristian MiokBlaz SkrljDaniela ZaharieMarko Robnik-Sikonja
2020-07-10
BISON:BM25-weighted Self-Attention Framework for Multi-Fields Document Search
| Xuan ShanChuanjie LiuYiqian XiaQi ChenYusi ZhangAngen LuoYuxiang Luo
2020-07-10
Multi-Dialect Arabic BERT for Country-Level Dialect Identification
| Bashar TalafhaMohammad AliMuhy Eddin Za'terHaitham SeelawiIbraheem TuffahaMostafa SamirWael FarhanHussein T. Al-Natsheh
2020-07-10
Blockchain-Federated-Learning and Deep Learning Models for COVID-19 detection using CT Imaging
| Rajesh KumarAbdullah Aman KhanSinmin ZhangWenYong WangYousif AbuidrisWaqas AminJay Kumar
2020-07-10
Uncertainty Quantification in Deep Residual Neural Networks
Lukasz WandzikRaul Vicente GarciaJörg Krüger
2020-07-09
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
Yi RenXu TanTao QinJian LuanZhou ZhaoTie-Yan Liu
2020-07-09
Contrastive Code Representation Learning
| Paras JainAjay JainTianjun ZhangPieter AbbeelJoseph E. GonzalezIon Stoica
2020-07-09
Single architecture and multiple task deep neural network for altered fingerprint analysis
Oliver GiudiceMattia LitricoSebastiano Battiato
2020-07-09
Fast Transformers with Clustered Attention
| Apoorv VyasAngelos KatharopoulosFrançois Fleuret
2020-07-09
Advances of Transformer-Based Models for News Headline Generation
| Alexey BukhtiyarovIlya Gusev
2020-07-09
Few Is Enough: Task-Augmented Active Meta-Learning for Brain Cell Classification
Pengyu YuanAryan MobinyJahandar JahanipourXiaoyang LiPietro Antonio CicaleseBadrinath RoysamVishal PatelMaric DraganHien Van Nguyen
2020-07-09
MCU-Net: A framework towards uncertainty representations for decision support system patient referrals in healthcare contexts
Nabeel Seedat
2020-07-08
Journey Towards Tiny Perceptual Super-Resolution
Royson LeeŁukasz DudziakMohamed AbdelfattahStylianos I. VenierisHyeji KimHongkai WenNicholas D. Lane
2020-07-08
3D Topology Transformation with Generative Adversarial Networks
Luca StornaiuoloNima DehmamyAlbert-László BarabásiMauro Martino
2020-07-07
The Go Transformer: Natural Language Modeling for Game Play
Matthew CiolinoDavid NoeverJosh Kalin
2020-07-07
Continual BERT: Continual Learning for Adaptive Extractive Summarization of COVID-19 Literature
Jong Won Park
2020-07-07
Do Transformers Need Deep Long-Range Memory
Jack W. RaeAli Razavi
2020-07-07
RIFLE: Backpropagation in Depth for Deep Transfer Learning through Re-Initializing the Fully-connected LayEr
Xingjian LiHaoyi XiongHaozhe AnChengzhong XuDejing Dou
2020-07-07
Single Shot MC Dropout Approximation
Kai BrachBeate SickOliver Dürr
2020-07-07
Exploring Heterogeneous Information Networks via Pre-Training
Yang FangXiang ZhaoWeidong Xiao
2020-07-07
Relevance Transformer: Generating Concise Code Snippets with Relevance Feedback
Carlos GemmellFederico RossettoJeffrey Dalton
2020-07-06
Learning to Segment Anatomical Structures Accurately from One Exemplar
Yuhang LuWeijian LiKang ZhengYirui WangAdam P. HarrisonChihung LinSong WangJing XiaoLe LuChang-Fu KuoShun Miao