Residual Connection

Introduced by He et al. in Deep Residual Learning for Image Recognition

Residual connections are a type of skip connection that lets a stack of layers learn a residual function with reference to the layer inputs, instead of learning an unreferenced function.

Formally, denoting the desired underlying mapping as $\mathcal{H}({x})$, we let the stacked nonlinear layers fit another mapping of $\mathcal{F}({x}):=\mathcal{H}({x})-{x}$. The original mapping is recast into $\mathcal{F}({x})+{x}$.

The intuition is that it is easier to optimize the residual mapping than to optimize the original, unreferenced mapping. To the extreme, if an identity mapping were optimal, it would be easier to push the residual to zero than to fit an identity mapping by a stack of nonlinear layers.
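The recast mapping $\mathcal{F}({x})+{x}$ can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the implementation from the paper: the helper names (`linear`, `make_residual_branch`, `residual_block`) and the two-layer form of $\mathcal{F}$ are assumptions made for the example, and the ReLU that the original ResNet block applies after the addition is left out to keep the identity case visible.

```python
from typing import Callable, List

Vector = List[float]
Matrix = List[List[float]]


def relu(v: Vector) -> Vector:
    """Element-wise max(0, a)."""
    return [max(0.0, a) for a in v]


def linear(weights: Matrix, bias: Vector, v: Vector) -> Vector:
    """Compute W v + b for a row-major weight matrix W."""
    return [sum(w * a for w, a in zip(row, v)) + b
            for row, b in zip(weights, bias)]


def make_residual_branch(w1: Matrix, b1: Vector,
                         w2: Matrix, b2: Vector) -> Callable[[Vector], Vector]:
    """Build F(x) = W2 relu(W1 x + b1) + b2 (hypothetical two-layer branch)."""
    def branch(x: Vector) -> Vector:
        return linear(w2, b2, relu(linear(w1, b1, x)))
    return branch


def residual_block(branch: Callable[[Vector], Vector], x: Vector) -> Vector:
    """Return F(x) + x: the branch output added to the unchanged input."""
    fx = branch(x)
    if len(fx) != len(x):
        raise ValueError("the branch must preserve the input dimension")
    return [f + a for f, a in zip(fx, x)]


dim = 3
zeros_w = [[0.0] * dim for _ in range(dim)]
zeros_b = [0.0] * dim
eye = [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]

x = [1.5, -2.0, 0.25]

# With all branch weights at zero, F(x) = 0 and the block is the identity.
zero_branch = make_residual_branch(zeros_w, zeros_b, zeros_w, zeros_b)
identity_output = residual_block(zero_branch, x)

# With identity weights, F(x) = relu(x), so the block returns relu(x) + x.
relu_branch = make_residual_branch(eye, zeros_b, eye, zeros_b)
relu_output = residual_block(relu_branch, x)
```

Setting every weight in the branch to zero makes the block return its input unchanged, which is the "push the residual to zero" case described above. A plain stack of the same layers would instead have to learn weights that reproduce the identity through the nonlinearity.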

Source: Deep Residual Learning for Image Recognition

Latest Papers

PAPER DATE
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification
Abdullatif Köksal, Arzucan Özgür
2020-10-19
Capturing Longer Context for Document-level Neural Machine Translation: A Multi-resolutional Approach
Zewei Sun, Mingxuan Wang, Hao Zhou, Chengqi Zhao, ShuJian Huang, Jiajun Chen, Lei LI
2020-10-18
Delaying Interaction Layers in Transformer-based Encoders for Efficient Open Domain Question Answering
Wissam Siblini, Mohamed Challal, Charlotte Pasqual
2020-10-16
It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT
Hila Gonen, Shauli Ravfogel, Yanai Elazar, Yoav Goldberg
2020-10-16
Coarse-to-Fine Pre-training for Named Entity Recognition
Mengge Xue, Bowen Yu, Zhenyu Zhang, Tingwen Liu, Yue Zhang, Bin Wang
2020-10-16
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Shengkui Zhao, Trung Hieu Nguyen, Hao Wang, Bin Ma
2020-10-16
DiDi's Machine Translation System for WMT2020
Tanfang Chen, Weiwei Wang, Wenyang Wei, Xing Shi, Xiangang Li, Jieping Ye, Kevin Knight
2020-10-16
Modeling Token-level Uncertainty to Learn Unknown Concepts in SLU via Calibrated Dirichlet Prior RNN
Yilin Shen, Wenhu Chen, Hongxia Jin
2020-10-16
Revisiting Optical Flow Estimation in 360 Videos
Keshav Bhandari, Ziliang Zong, Yan Yan
2020-10-15
Empirical Study of Transformers for Source Code
Nadezhda Chirkova, Sergey Troshin
2020-10-15
Neural Deepfake Detection with Factual Structure of Text
Wanjun Zhong, Duyu Tang, Zenan Xu, Ruize Wang, Nan Duan, Ming Zhou, Jiahai Wang, Jian Yin
2020-10-15
Multi-Task Learning for Cross-Lingual Abstractive Summarization
Sho Takase, Naoaki Okazaki
2020-10-15
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Zhengxuan Wu, Desmond C. Ong
2020-10-15
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
Ana Marasović, Chandra Bhagavatula, Jae Sung Park, Ronan Le Bras, Noah A. Smith, Yejin Choi
2020-10-15
DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation
Yuzhao Mao, Qi Sun, Guang Liu, Xiaojie Wang, Weiguo Gao, Xuan Li, Jianping Shen
2020-10-15
Does Chinese BERT Encode Word Structure?
Yile Wang, Leyang Cui, Yue Zhang
2020-10-15
A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation
Mingshuo Ding, Yinghao Ma
2020-10-15
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Phillip Keung, Julian Salazar, Yichao Lu, Noah A. Smith
2020-10-15
[email protected]: Sentiment Analysis of Code-Mixed Dravidian text using XLNet
Shubhanker Banerjee, Arun Jayapal, Sajeetha Thavareesan
2020-10-15
Response Selection for Multi-Party Conversations with Dynamic Topic Tracking
Weishi Wang, Shafiq Joty, Steven C. H. Hoi
2020-10-15
Compressive Summarization with Plausibility and Salience Modeling
Shrey Desai, Jiacheng Xu, Greg Durrett
2020-10-15
Masked Contrastive Representation Learning for Reinforcement Learning
Jinhua Zhu, Yingce Xia, Lijun Wu, Jiajun Deng, Wengang Zhou, Tao Qin, Houqiang Li
2020-10-15
Understanding Neural Abstractive Summarization Models via Uncertainty
Jiacheng Xu, Shrey Desai, Greg Durrett
2020-10-15
Self-Supervised Ranking for Representation Learning
Ali Varamesh, Ali Diba, Tinne Tuytelaars, Luc van Gool
2020-10-14
Viewmaker Networks: Learning Views for Unsupervised Representation Learning
Alex Tamkin, Mike Wu, Noah Goodman
2020-10-14
Do End-to-end Stereo Algorithms Under-utilize Information?
Changjiang Cai, Philippos Mordohai
2020-10-14
Memformer: The Memory-Augmented Transformer
Qingyang Wu, Zhenzhong Lan, Jing Gu, Zhou Yu
2020-10-14
DA-Transformer: Distance-aware Transformer
Chuhan Wu, Fangzhao Wu, Yongfeng Huang
2020-10-14
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
Gyuwan Kim, Kyunghyun Cho
2020-10-14
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models
Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu
2020-10-14
Geometry matters: Exploring language examples at the decision boundary
Debajyoti Datta, Shashwat Kumar, Laura Barnes, Tom Fletcher
2020-10-14
Decoding Methods for Neural Narrative Generation
Alexandra DeLucia, Aaron Mueller, Xiang Lisa Li, João Sedoc
2020-10-14
No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection
Debanjana Kar, Mohit Bhardwaj, Suranjana Samanta, Amar Prakash Azad
2020-10-14
COVID-CT-Mask-Net: Prediction of COVID-19 from CT Scans Using Regional Features
Aram Ter-Sarkisov
2020-10-14
Probing for Multilingual Numerical Understanding in Transformer-Based Language Models
Devin Johnson, Denise Mak, Drew Barker, Lexi Loessberg-Zahl
2020-10-13
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
Jianquan Li, Xiaokang Liu, Honghong Zhao, Ruifeng Xu, Min Yang, Yaohong Jin
2020-10-13
Incorporating BERT into Parallel Sequence Decoding with Adapters
Junliang Guo, Zhirui Zhang, Linli Xu, Hao-Ran Wei, Boxing Chen, Enhong Chen
2020-10-13
Improving Text Generation Evaluation with Batch Centering and Tempered Word Mover Distance
Xi Chen, Nan Ding, Tomer Levinboim, Radu Soricut
2020-10-13
The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation
Shai Gretz, Yonatan Bilu, Edo Cohen-Karlik, Noam Slonim
2020-10-13
Context-Aware Drive-thru Recommendation Service at Fast Food Restaurants
Luyang Wang, Kai Huang, Jiao Wang, Shengsheng Huang, Jason Dai, Yue Zhuang
2020-10-13
CAPT: Contrastive Pre-Training for Learning Denoised Sequence Representations
Fuli Luo, Pengcheng Yang, Shicheng Li, Xuancheng Ren, Xu sun
2020-10-13
Aspect-based Document Similarity for Research Papers
Malte Ostendorff, Terry Ruas, Till Blume, Bela Gipp, Georg Rehm
2020-10-13
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension
Ekta Sood, Simon Tannert, Diego Frassinelli, Andreas Bulling, Ngoc Thang Vu
2020-10-13
Multilingual Argument Mining: Datasets and Analysis
Orith Toledo-Ronen, Matan Orbach, Yonatan Bilu, Artem Spector, Noam Slonim
2020-10-13
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy Lin, Rodrigo Nogueira, Andrew Yates
2020-10-13
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Jeff Da, Keisuke Sakaguchi, Antoine Bosselut, Yejin Choi
2020-10-12
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. Bird, Anikó Ekárt, Diego R. Faria
2020-10-12
Zero-shot Entity Linking with Efficient Long Range Sequence Modeling
Zonghai Yao, Liangliang Cao, Huapu Pan
2020-10-12
Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers
Christoph Kamann, Burkhard Güssefeld, Robin Hutmacher, Jan Hendrik Metzen, Carsten Rother
2020-10-12
Conditioning Trick for Training Stable GANs
Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich
2020-10-12
Automatic Quantification of Settlement Damage using Deep Learning of Satellite Images
Lili Lu, Weisi Guo
2020-10-12
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana Kar, Suranjana Samanta, Amar Prakash Azad
2020-10-12
Counterfactual Variable Control for Robust and Interpretable Question Answering
Sicheng Yu, Yulei Niu, Shuohang Wang, Jing Jiang, Qianru Sun
2020-10-12
Improving Compositional Generalization in Semantic Parsing
Inbar Oren, Jonathan Herzig, Nitish Gupta, Matt Gardner, Jonathan Berant
2020-10-12
HUJI-KU at MRP 2020: Two Transition-based Neural Parsers
Ofir Arviv, Ruixiang Cui, Daniel Hershcovich
2020-10-12
Probing Pretrained Language Models for Lexical Semantics
Ivan Vulić, Edoardo Maria Ponti, Robert Litschko, Goran Glavaš, Anna Korhonen
2020-10-12
EFSG: Evolutionary Fooling Sentences Generator
Marco Di Giovanni, Marco Brambilla
2020-10-12
Dynamic Memory Enhanced Transformer for End-to-end Task-Oriented Dialogue System
Yanjie Gou, Yinjie Lei, Lingqiao Liu
2020-10-12
Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations
Nikolaos Manginas, Ilias Chalkidis, Prodromos Malakasiotis
2020-10-12
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks
Steffen Eger, Yannik Benz
2020-10-12
Load What You Need: Smaller Versions of Multilingual BERT
Amine Abdaoui, Camille Pradel, Grégoire Sigel
2020-10-12
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Zuchao Li, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita
2020-10-11
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
Alex Warstadt, Yian Zhang, Haau-Sing Li, Haokun Liu, Samuel R. Bowman
2020-10-11
Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only
Ziyi Liu, Giannis Karamanolakis, Daniel Hsu, Luis Gravano
2020-10-11
Connecting the Dots Between Fact Verification and Fake News Detection
Qifei Li, Wangchunshu Zhou
2020-10-11
Machine Translation of Mathematical Text
Aditya Ohri, Tanya Schmah
2020-10-11
Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
Shauli Ravfogel, Yanai Elazar, Jacob Goldberger, Yoav Goldberg
2020-10-11
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
Brielen Madureira, David Schlangen
2020-10-11
Data Agnostic RoBERTa-based Natural Language to SQL Query Generation
Debaditya Pal, Harsh Sharma, Kaustubh Chaudhari
2020-10-11
SMYRF: Efficient Attention using Asymmetric Clustering
Giannis Daras, Nikita Kitaev, Augustus Odena, Alexandros G. Dimakis
2020-10-11
An Empirical Study on Detecting COVID-19 in Chest X-ray Images Using Deep Learning Based Methods
Ramtin Babaeipour, Elham Azizi, Hassan Khotanlou
2020-10-10
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougue Biyong, Bo wang, Terry Lyons, Alejo J Nevado-Holgado
2020-10-10
Structured Self-Attention Weights Encode Semantics in Sentiment Analysis
Zhengxuan Wu, Thanh-Son Nguyen, Desmond C. Ong
2020-10-10
Tag Recommendation for Online Q&A Communities based on BERT Pre-Training Technique
Navid Khezrian, Jafar Habibi, Issa Annamoradnejad
2020-10-10
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings
Prafull Prakash, Saurabh Kumar Shashidhar, Wenlong Zhao, Subendhu Rongali, Haidar Khan, Michael Kayser
2020-10-10
Meta-Aggregating Networks for Class-Incremental Learning
Yaoyao Liu, Bernt Schiele, Qianru Sun
2020-10-10
Automated Concatenation of Embeddings for Structured Prediction
Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu
2020-10-10
Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training
Xinyu Wang, Kewei Tu
2020-10-10
Explaining Clinical Decision Support Systems in Medical Imaging using Cycle-Consistent Activation Maximization
Alexander Katzmann, Oliver Taubmann, Stephen Ahmad, Alexander Mühlberg, Michael Sühling, Horst-Michael Groß
2020-10-09
On Task-Level Dialogue Composition of Generative Transformer Model
Prasanna Parthasarathi, Arvind Neelakantan, Sharan Narang
2020-10-09
Long-distance tiny face detection based on enhanced YOLOv3 for unmanned system
Jia-Yi Chang, Yan-Feng Lu, Ya-Jun Liu, Bo Zhou, Hong Qiao
2020-10-09
Attaining Real-Time Super-Resolution for Microscopic Images Using GAN
Vibhu Bhatia, Yatender Kumar
2020-10-09
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda
2020-10-09
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin Cao, Jun Wang, Wael Hamza, Kelly Vanee, Shang-Wen Li
2020-10-09
Online Back-Parsing for AMR-to-Text Generation
Xuefeng Bai, Linfeng Song, Yue Zhang
2020-10-09
What Have We Achieved on Text Summarization?
Dandan Huang, Leyang Cui, Sen yang, Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang
2020-10-09
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
João A. Leite, Diego F. Silva, Kalina Bontcheva, Carolina Scarton
2020-10-09
Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction
Zhen Wu, Chengcan Ying, Fei Zhao, Zhifang Fan, Xinyu Dai, Rui Xia
2020-10-09
NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training
Priyanshu Kumar, Aadarsh Singh
2020-10-09
Deep Learning Meets Projective Clustering
Alaa Maalouf, Harry Lang, Daniela Rus, Dan Feldman
2020-10-08
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Jonathan Shen, Ye Jia, Mike Chrzanowski, Yu Zhang, Isaac Elias, Heiga Zen, Yonghui Wu
2020-10-08
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models
Gregory Senay, Emmanuelle Salin
2020-10-08
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny
2020-10-08
Energy-based Out-of-distribution Detection
Weitang Liu, XiaoYun Wang, John D. Owens, Yixuan Li
2020-10-08
TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Parker Riley, Noah Constant, Mandy Guo, Girish Kumar, David Uthus, Zarana Parekh
2020-10-08
IRX-1D: A Simple Deep Learning Architecture for Remote Sensing Classifications
Mahesh Pal, Akshay, B. Charan Teja
2020-10-08
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
Yun He, Zhuoer Wang, Yin Zhang, Ruihong Huang, James Caverlee
2020-10-08
Shallow-to-Deep Training for Neural Machine Translation
Bei Li, Ziyang Wang, Hui Liu, Yufan Jiang, Quan Du, Tong Xiao, Huizhen Wang, Jingbo Zhu
2020-10-08
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun He, Ziwei Zhu, Yin Zhang, Qin Chen, James Caverlee
2020-10-08
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference
Xiaoan Ding, Tianyu Liu, Baobao Chang, Zhifang Sui, Kevin Gimpel
2020-10-08
Improving Attention Mechanism with Query-Value Interaction
Chuhan Wu, Fangzhao Wu, Tao Qi, Yongfeng Huang
2020-10-08
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection
Libo Qin, Tailu Liu, Wanxiang Che, Bingbing Kang, Sendong Zhao, Ting Liu
2020-10-08
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding
Dechuang Teng, Libo Qin, Wanxiang Che, Sendong Zhao, Ting Liu
2020-10-08
Interlocking Backpropagation: Improving depthwise model-parallelism
Aidan N. Gomez, Oscar Key, Stephen Gou, Nick Frosst, Jeff Dean, Yarin Gal
2020-10-08
Deformable DETR: Deformable Transformers for End-to-End Object Detection
Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai
2020-10-08
Automatic generation of reviews of scientific papers
Anna Nikiforovskaya, Nikolai Kapralov, Anna Vlasova, Oleg Shpynov, Aleksei Shpilman
2020-10-08
Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets
Mihaela Gaman, Radu Tudor Ionescu
2020-10-07
Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank
Eleftheria Briakou, Marine Carpuat
2020-10-07
Optimizing Transformers with Approximate Computing for Faster, Smaller and more Accurate NLP Models
Amrit Nagarajan, Sanchari Sen, Jacob R. Stevens, Anand Raghunathan
2020-10-07
YOdar: Uncertainty-based Sensor Fusion for Vehicle Detection with Camera and Radar Sensors
Kamil Kowol, Matthias Rottmann, Stefan Bracke, Hanno Gottschalk
2020-10-07
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling
Jiecao Chen, Liu Yang, Karthik Raman, Michael Bendersky, Jung-Jung Yeh, Yun Zhou, Marc Najork, Danyang Cai, Ehsan Emadzadeh
2020-10-07
Transformer-GCRF: Recovering Chinese Dropped Pronouns with General Conditional Random Fields
Jingxuan Yang, Kerui Xu, Jun Xu, Si Li, Sheng Gao, Jun Guo, Ji-Rong Wen, Nianwen Xue
2020-10-07
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max Glockner, Ivan Habernal, Iryna Gurevych
2020-10-07
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Thai-Son Nguyen, Sebastian Stueker, Alex Waibel
2020-10-07
ELMo and BERT in semantic change detection for Russian
Julia Rodina, Yuliya Trofimova, Andrey Kutuzov, Ekaterina Artemova
2020-10-07
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
Xilun Chen, Asish Ghoshal, Yashar Mehdad, Luke Zettlemoyer, Sonal Gupta
2020-10-07
Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications
Matthew Khoury, Rumen Dangovski, Longwu Ou, Preslav Nakov, Yichen Shen, Li Jing
2020-10-06
MH-COVIDNet: Diagnosis of COVID-19 using Deep Neural Networks and Meta-heuristic-based Feature Selection on X-ray Images
Murat CANAYAZ
2020-10-06
Parallax Motion Effect Generation Through Instance Segmentation And Depth Estimation
Allan Pinto, Manuel A. Córdova, Luis G. L. Decker, Jose L. Flores-Campana, Marcos R. Souza, Andreza A. dos Santos, Jhonatas S. Conceição, Henrique F. Gagliardi, Diogo C. Luvizon, Ricardo da S. Torres, Helio Pedrini
2020-10-06
LOGAN: Local Group Bias Detection by Clustering
Jieyu Zhao, Kai-Wei Chang
2020-10-06
Investigating African-American Vernacular English in Transformer-Based Text Generation
Sophie Groenwold, Lily Ou, Aesha Parekh, Samhita Honnavalli, Sharon Levy, Diba Mirza, William Yang Wang
2020-10-06
Converting the Point of View of Messages Spoken to Virtual Assistants
Isabelle G. Lee, Vera Zu, Sai Srujana Buddi, Dennis Liang, Jack G. M. FitzGerald
2020-10-06
Adversarial Grammatical Error Correction
Vipul Raheja, Dimitrios Alikaniotis
2020-10-06
Pretrained Language Model Embryology: The Birth of ALBERT
David C. Chiang, Sung-Feng Huang, Hung-Yi Lee
2020-10-06
Do Explicit Alignments Robustly Improve Multilingual Encoders?
Shijie Wu, Mark Dredze
2020-10-06
LEGAL-BERT: The Muppets straight out of Law School
Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos
2020-10-06
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher
Giannis Karamanolakis, Daniel Hsu, Luis Gravano
2020-10-06
The Multilingual Amazon Reviews Corpus
Phillip Keung, Yichao Lu, György Szarvas, Noah A. Smith
2020-10-06
Scene Graph Modification Based on Natural Language Commands
Xuanli He, Quan Hung Tran, Gholamreza Haffari, Walter Chang, Trung Bui, Zhe Lin, Franck Dernoncourt, Nhan Dam
2020-10-06
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers
Marius Mosbach, Anna Khokhlova, Michael A. Hedderich, Dietrich Klakow
2020-10-06
On the Sub-Layer Functionalities of Transformer Decoder
Yilin Yang, Longyue Wang, Shuming Shi, Prasad Tadepalli, Stefan Lee, Zhaopeng Tu
2020-10-06
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
Sebastian Hofstätter, Sophia Althammer, Michael Schröder, Mete Sertkan, Allan Hanbury
2020-10-06
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng Chen, Chia-Jung Lee
2020-10-06
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder
Alvin Chan, Yi Tay, Yew-Soon Ong, Aston Zhang
2020-10-06
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations
Aina Garí Soler, Marianna Apidianaki
2020-10-06
Analyzing Individual Neurons in Pre-trained Language Models
Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Yonatan Belinkov
2020-10-06
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
Minki Kang, Moonsu Han, Sung Ju Hwang
2020-10-06
Intrinsic Probing through Dimension Selection
Lucas Torroba Hennigen, Adina Williams, Ryan Cotterell
2020-10-06
Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming
Kanishka Misra, Allyson Ettinger, Julia Taylor Rayz
2020-10-06
Resource-Enhanced Neural Model for Event Argument Extraction
Jie Ma, Shuai Wang, Rishita Anubhai, Miguel Ballesteros, Yaser Al-Onaizan
2020-10-06
Beyond [CLS] through Ranking by Generation
Cicero Nogueira dos santos, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang
2020-10-06
Efficient Inference For Neural Machine Translation
Yi-Te Hsu, Sarthak Garg, Yi-Hsiu Liao, Ilya Chatsviorkin
2020-10-06
PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation
Xinyu Hua, Lu Wang
2020-10-05
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu
2020-10-05
Mixup-Transformer: Dynamic Data Augmentation for NLP Tasks
Lichao Sun, Congying Xia, Wenpeng Yin, TingTing Liang, Philip S. Yu, Lifang He
2020-10-05
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. Feng, Varun Gangal, Dongyeop Kang, Teruko Mitamura, Eduard Hovy
2020-10-05
Joint Pruning & Quantization for Extremely Sparse Neural Networks
Po-Hsiang Yu, Sih-Sian Wu, Jan P. Klopp, Liang-Gee Chen, Shao-Yi Chien
2020-10-05
Self-training Improves Pre-training for Natural Language Understanding
Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau
2020-10-05
Transformer-Based Neural Text Generation with Syntactic Guidance
Yinghao Li, Rui Feng, Isaac Rehg, Chao Zhang
2020-10-05
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne Longpre, Yu Wang, Christopher DuBois
2020-10-05
Improving AMR Parsing with Sequence-to-Sequence Pre-training
Dongqin Xu, Junhui Li, Muhua Zhu, Min Zhang, Guodong Zhou
2020-10-05
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning
Hanlu Wu, Tengfei Ma, Lingfei Wu, Tariro Manyumwa, Shouling Ji
2020-10-05
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
Zi Lin, Jeremiah Zhe Liu, Zi Yang, Nan Hua, Dan Roth
2020-10-05
PMI-Masking: Principled masking of correlated spans
Yoav Levine, Barak Lenz, Opher Lieber, Omri Abend, Kevin Leyton-Brown, Moshe Tennenholtz, Yoav Shoham
2020-10-05
Linguistic Profiling of a Neural Language Model
Alessio Miaschi, Dominique Brunato, Felice Dell'Orletta, Giulia Venturi
2020-10-05
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition
Piotr Janiszewski, Mateusz Skiba, Urszula Walińska
2020-10-05
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
Angel Daza, Anette Frank
2020-10-05
LEGAN: Disentangled Manipulation of Directional Lighting and Facial Expressions by Leveraging Human Perceptual Judgements
Sandipan Banerjee, Ajjen Joshi, Prashant Mahajan, Sneha Bhattacharya, Survi Kyal, Taniya Mishra
2020-10-04
Inquisitive Question Generation for High Level Text Comprehension
Wei-Jen Ko, Te-Yuan Chen, Yiyan Huang, Greg Durrett, Junyi Jessy Li
2020-10-04
MetaDetect: Uncertainty Quantification and Prediction Quality Estimates for Object Detection
Marius Schubert, Karsten Kahl, Matthias Rottmann
2020-10-04
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
Dayiheng Liu, Yeyun Gong, Jie Fu, Yu Yan, Jiusheng Chen, Jiancheng Lv, Nan Duan, Ming Zhou
2020-10-04
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias Chalkidis, Manos Fergadiotis, Sotiris Kotitsas, Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos
2020-10-04
On Losses for Modern Language Models
Stephane Aroca-Ouellette, Frank Rudzicz
2020-10-04
UCP: Uniform Channel Pruning for Deep Convolutional Neural Networks Compression and Acceleration
Jingfei Chang, Yang Lu, Ping Xue, Xing Wei, Zhen Wei
2020-10-03
Mining Knowledge for Natural Language Inference from Wikipedia Categories
Mingda Chen, Zewei Chu, Karl Stratos, Kevin Gimpel
2020-10-03
Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles
Amirmohammad Kazameini, Samin Fatehi, Yash Mehta, Sauleh Eetemadi, Erik Cambria
2020-10-03
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media
Xiang Dai, Sarvnaz Karimi, Ben Hachey, Cecile Paris
2020-10-02
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
Andreas Rücklé, Jonas Pfeiffer, Iryna Gurevych
2020-10-02
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto
2020-10-02
Dynamic Graph: Learning Instance-aware Connectivity for Neural Networks
Kun Yuan, Quanquan Li, Dapeng Chen, Aojun Zhou, Junjie Yan
2020-10-02
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
2020-10-02
Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis
katsuhiko Ishiguro, Kazuya Ujihara, Ryohto Sawada, Hirotaka Akita, Masaaki Kotera
2020-10-02
Beyond Chemical 1D knowledge using Transformers
Ruud Van Deursen, Igor V. Tetko, Guillaume Godin
2020-10-02
Pix2Prof: fast extraction of sequential information from galaxy imagery via deep learning
Michael J. Smith, Nikhil Arora, Connor Stone, Stéphane Courteau, James E. Geach
2020-10-01
Automatic Deep Learning System for COVID-19 Infection Quantification in chest CT
Omar Ibrahim Alirr
2020-10-01
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan Shvartzshnaider, Ananth Balashankar, Vikas Patidar, Thomas Wies, Lakshminarayanan Subramanian
2020-10-01
Understanding Self-supervised Learning with Dual Deep Networks
Yuandong Tian, Lantao Yu, Xinlei Chen, Surya Ganguli
2020-10-01
Evaluating Multilingual BERT for Estonian
Claudia Kittask, Kirill Milintsevich, Kairit Sirts
2020-10-01
Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID Twitter BERT and Bagging Ensemble Technique based on Plurality Voting
Anshul Wadhawan
2020-10-01
A Compare Aggregate Transformer for Understanding Document-grounded Dialogue
Longxuan Ma, Wei-Nan Zhang, Runxin Sun, Ting Liu
2020-10-01
Examining the rhetorical capacities of neural language models
Zining Zhu, Chuer Pan, Mohamed Abdalla, Frank Rudzicz
2020-10-01
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Michael Bendersky, Honglei Zhuang, Ji Ma, Shuguang Han, Keith Hall, Ryan Mcdonald
2020-10-01
Understanding tables with intermediate pre-training
Julian Martin Eisenschlos, Syrine Krichine, Thomas Müller
2020-10-01
Detecting White Supremacist Hate Speech using Domain Specific Word Embedding with Deep Learning and BERT
Hind Saleh Alatawi, Areej Maatog Alhothali, Kawthar Mustafa Moria
2020-10-01
CoLAKE: Contextualized Language and Knowledge Embedding
Tianxiang Sun, Yunfan Shao, Xipeng Qiu, Qipeng Guo, Yaru Hu, Xuanjing Huang, Zheng Zhang
2020-10-01
WeChat Neural Machine Translation Systems for WMT20
Fandong Meng, Jianhao Yan, Yijin Liu, Yuan Gao, Xianfeng Zeng, Qinsong Zeng, Peng Li, Ming Chen, Jie zhou, Sifan Liu, Hao Zhou
2020-10-01
Dissected 3D CNNs: Temporal Skip Connections for Efficient Online Video Processing
Okan Köpüklü, Stefan Hörmann, Fabian Herzog, Hakan Cevikalp, Gerhard Rigoll
2020-09-30
AUBER: Automated BERT Regularization
Hyun Dong Lee, Seongmin Lee, U Kang
2020-09-30
Learning Hard Retrieval Cross Attention for Transformer
Hongfei Xu, Qiuhui Liu
2020-09-30
Measuring Systematic Generalization in Neural Proof Generation with Transformers
Nicolas Gontier, Koustuv Sinha, Siva Reddy, Christopher Pal
2020-09-30
BERT for Monolingual and Cross-Lingual Reverse Dictionary
Hang Yan, Xiaonan Li, Xipeng Qiu
2020-09-30
Rethinking Attention with Performers
Krzysztof Choromanski, Valerii Likhosherstov, David Dohan, Xingyou Song, Andreea Gane, Tamas Sarlos, Peter Hawkins, Jared Davis, Afroz Mohiuddin, Lukasz Kaiser, David Belanger, Lucy Colwell, Adrian Weller
2020-09-30
MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Carson Eisenach, Yagna Patel, Dhruv Madeka
2020-09-30
A Tale of Two Linkings: Dynamically Gating between Schema Linking and Structural Linking for Text-to-SQL Parsing
Sanxing Chen, Aidan San, Xiaodong Liu, Yangfeng Ji
2020-09-30
Gender prediction using limited Twitter Data
Maaike Burghoorn, Maaike H. T. de Boer, Stephan Raaijmakers
2020-09-29
Attention-Driven Body Pose Encoding for Human Activity Recognition
B Debnath, M O'brien, S. Kumar, A Behera
2020-09-29
A Multi-term and Multi-task Analyzing Framework for Affective Analysis in-the-wild
Sachihiro Youoku, Yuushi Toyoda, Takahisa Yamamoto, Junya Saito, Ryosuke Kawamura, Xiaoyu Mi, Kentaro Murase
2020-09-29
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions
Peter A. Jansen
2020-09-29
TEST_POSITIVE at W-NUT 2020 Shared Task-3: Joint Event Multi-task Learning for Slot Filling in Noisy Text
Chacha Chen, Chieh-Yang Huang, Yaqi Hou, Yang Shi, Enyan Dai, Jiaqi Wang
2020-09-29
Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study
Saurabh Kulshreshtha, José Luis Redondo-García, Ching-Yun Chang
2020-09-29
Attention that does not Explain Away
Nan Ding, Xinjie Fan, Zhenzhong Lan, Dale Schuurmans, Radu Soricut
2020-09-29
MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension
Huaishao Luo, Yu Shi, Ming Gong, Linjun Shou, Tianrui Li
2020-09-29
Self-grouping Convolutional Neural Networks
Qingbei Guo, Xiao-Jun Wu, Josef Kittler, Zhiquan Feng
2020-09-29
TinyGAN: Distilling BigGAN for Conditional Image Generation
Ting-Yun Chang, Chi-Jen Lu
2020-09-29
The design and implementation of Language Learning Chatbot with XAI using Ontology and Transfer Learning
Nuobei Shi, Qin Zeng, Raymond Lee
2020-09-29
MARA-Net: Single Image Deraining Network with Multi-level connection and Adaptive Regional Attention
Yeachan Park, Myeongho Jeon, Junho Lee, Myungjoo Kan
2020-09-29
Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation
Yinfei Yang, Ning Jin, Kuo Lin, Mandy Guo, Daniel Cer
2020-09-29
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation
Dinghan Shen, Mingzhi Zheng, Yelong Shen, Yanru Qu, Weizhu Chen
2020-09-29
HINT3: Raising the bar for Intent Detection in the Wild
Gaurav Arora, Chirag Jain, Manas Chaturvedi, Krupal Modi
2020-09-29
Sequence-to-Sequence Learning for Indonesian Automatic Question Generator
Ferdiant Joshua Muis, Ayu Purwarianti
2020-09-29
Contrastive Distillation on Intermediate Representations for Language Model Compression
Siqi Sun, Zhe Gan, Yu Cheng, Yuwei Fang, Shuohang Wang, Jingjing Liu
2020-09-29
Detecting soccer balls with reduced neural networks: a comparison of multiple architectures under constrained hardware scenarios
Douglas De Rizzo MeneghettiThiago Pedro Donadon HomemJonas Henrique Renolfi de OliveiraIsaac Jesus da SilvaDanilo Hernani PericoReinaldo Augusto da Costa Bianchi
2020-09-28
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
Shikib MehriMihail EricDilek Hakkani-Tur
2020-09-28
VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training
Xiaowei HuXi YinKevin LinLijuan WangLei ZhangJianfeng GaoZicheng Liu
2020-09-28
Adversarial Robustness of Stabilized NeuralODEs Might be from Obfuscated Gradients
Yifei HuangYaodong YuHongyang ZhangYi MaYuan Yao
2020-09-28
Group Whitening: Balancing Learning Efficiency and Representational Capacity
Lei HuangLi LiuFan ZhuLing Shao
2020-09-28
Fancy Man Lauches Zippo at WNUT 2020 Shared Task-1: A Bert Case Model for Wet Lab Entity Extraction
Haoding MengQingcheng ZengXiaoyang FangZhexin Liang
2020-09-28
A Simple and Efficient Ensemble Classifier Combining Multiple Neural Network Models on Social Media Datasets in Vietnamese
Huy Duc HuynhHang Thi-Thuy DoKiet Van NguyenNgan Luu-Thuy Nguyen
2020-09-28
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Joo Seong JeongSoojeong KimGyeong-In YuYunseong LeeByung-Gon Chun
2020-09-28
Deep Transformers with Latent Depth
Xian LiAsa Cooper SticklandYuqing TangXiang Kong
2020-09-28
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan ZhangXiubo GengTao QinYunfang WuDaxin Jiang
2020-09-28
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding
Peilin ZhouZhiqi HuangFenglin LiuYuexian Zou
2020-09-28
G-SimCLR: Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling
Souradip ChakrabortyAritra Roy GosthipatySayak Paul
2020-09-28
What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties
Rochelle ChoenniEkaterina Shutova
2020-09-27
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei ZhangLu HouYichun YinLifeng ShangXiao ChenXin JiangQun Liu
2020-09-27
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents
Chejui LiaoTabish ManiarSravanajyothi NAnantha Sharma
2020-09-26
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
Ye LiuYao WanLifang HeHao PengPhilip S. Yu
2020-09-26
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
Yifan DingNicholas BotzerTim Weninger
2020-09-25
G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling
Souradip ChakrabortyAritra Roy GosthipatySayak Paul
2020-09-25
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
Jean-Philippe CorbeilHadi Abdi Ghadivel
2020-09-25
DPN: Detail-Preserving Network with High Resolution Representation for Efficient Segmentation of Retinal Vessels
Song Guo
2020-09-25
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
Zhaojiang LinAndrea MadottoGenta Indra WinataPascale Fung
2020-09-25
An Unsupervised Sentence Embedding Method by Mutual Information Maximization
Yan ZhangRuidan HeZuozhu LiuKwan Hui LimLidong Bing
2020-09-25
Weird AI Yankovic: Generating Parody Lyrics
Mark Riedl
2020-09-25
A little goes a long way: Improving toxic language classification despite data scarcity
Mika JuutiTommi GröndahlAdrian FlanaganN. Asokan
2020-09-25
A Comparative Study of Feature Types for Age-Based Text Classification
Anna GlazkovaYury EgorovMaksim Glazkov
2020-09-24
Toward a Thermodynamics of Meaning
Jonathan Scott Enderle
2020-09-24
Automatic identification of fossils and abiotic grains during carbonate microfacies analysis using deep convolutional neural networks
Xiaokang LiuHaijun Song
2020-09-24
Residual Feature Distillation Network for Lightweight Image Super-Resolution
Jie LiuJie TangGangshan Wu
2020-09-24
AnchiBERT: A Pre-Trained Model for Ancient Chinese Language Understanding and Generation
Huishuang TianKexin YangDayiheng LiuJiancheng Lv
2020-09-24
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences
Boon Peng YapAndrew Koh Jin JieEng Siong Chng
2020-09-24
End-to-End Prediction of Parcel Delivery Time with Deep Learning for Smart-City Applications
Arthur Cruz de AraujoAli Etemad
2020-09-23
Multi-Pass Transformer for Machine Translation
Peng GaoChiori HoriShijie GengTakaaki HoriJonathan Le Roux
2020-09-23
Pruning Convolutional Filters using Batch Bridgeout
Najeeb KhanIan Stavness
2020-09-23
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition
Bingcong LiXin TangXianbiao QiYihao ChenRong Xiao
2020-09-23
A Token-wise CNN-based Method for Sentence Compression
Weiwei HouHanna SuominenPiotr KoniuszSabrina CaldwellTom Gedeon
2020-09-23
On Data Augmentation for Extreme Multi-label Classification
Danqing ZhangTao LiHaiyang ZhangBing Yin
2020-09-22
Anomalous diffusion dynamics of learning in deep neural networks
Guozhang ChenCheng Kevin QuPulin Gong
2020-09-22
AutoRC: Improving BERT Based Relation Classification Models via Architecture Search
Wei ZhuXiaoling WangXipeng QiuYuan NiGuotong Xie
2020-09-22
GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis
Huaishao LuoLei JiTianrui LiNan DuanDaxin Jiang
2020-09-22
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
Chris J. KennedyGeoff BaconAlexander SahnClaudia von Vacano
2020-09-22
Impact of lung segmentation on the diagnosis and explanation of COVID-19 in chest X-ray images
Lucas O. TeixeiraRodolfo M. PereiraDiego BertoliniLuiz S. OliveiraLoris NanniYandre M. G. Costa
2020-09-21
Kernel-Based Smoothness Analysis of Residual Networks
Tom TirerJoan BrunaRaja Giryes
2020-09-21
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
Congcong WangDavid Lillis
2020-09-21
"When they say weed causes depression, but it's your fav antidepressant": Knowledge-aware Attention Framework for Relationship Extraction
Shweta YadavUsha LokalaRaminta DaniulaityteKrishnaprasad ThirunarayanFrancois LamyAmit Sheth
2020-09-21
Alleviating the Inequality of Attention Heads for Neural Machine Translation
Zewei SunShujian HuangXinyu DaiJiajun Chen
2020-09-21
Profile Consistency Identification for Open-domain Dialogue Agents
Haoyu SongYan WangWei-Nan ZhangZhengyu ZhaoTing LiuXiaojiang Liu
2020-09-21
Empathetic Dialogue Generation via Knowledge Enhancing and Emotion Dependency Modeling
Qintong LiPiji LiZhumin ChenZhaochun Ren
2020-09-21
Latin BERT: A Contextual Language Model for Classical Philology
David BammanPatrick J. Burns
2020-09-21
Dual-path CNN with Max Gated block for Text-Based Person Re-identification
Tinghuai MaMingming YangHuan RongYurong QianYuan TianNajla Al-Nabhan
2020-09-20
Longformer for MS MARCO Document Re-ranking Task
Ivan SekulićAmir SoleimaniMohammad AliannejadiFabio Crestani
2020-09-20
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
Ehsan DoostmohammadiMinoo NassajianAdel Rahimi
2020-09-20
VirtualFlow: Decoupling Deep Learning Model Execution from Underlying Hardware
Andrew OrHaoyu ZhangMichael J. Freedman
2020-09-20
Gated Res2Net for Multivariate Time Series Analysis
Chao YangMingxing JiangZhongwen GuoYuan Liu
2020-09-19
Prior Art Search and Reranking for Generated Patent Text
Jieh-Sheng LeeJieh Hsiang
2020-09-19
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan PilaultAmine ElhattamiChristopher Pal
2020-09-19
Nominal Compound Chain Extraction: A New Task for Semantic-enriched Lexical Chain
Bobo LiHao FeiYafeng RenDonghong Ji
2020-09-19
BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition
Usman NaseemMatloob KhushiVinay ReddySakthivel RajendranImran RazzakJinman Kim
2020-09-19
Towards Computational Linguistics in Minangkabau Language: Studies on Sentiment Analysis and Machine Translation
Fajri KotoIkhwan Koto
2020-09-19
Will it Unblend?
Yuval PinterCassandra L. JacobsJacob Eisenstein
2020-09-18
Residual Spatial Attention Network for Retinal Vessel Segmentation
Changlu GuoMárton SzemenyeiYugen YiWei ZhouHaodong Bian
2020-09-18
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
Wonchul SonJaemin NaWonjun Hwang
2020-09-18
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon RohHuiseong GimSoo-Young Lee
2020-09-18
The birth of Romanian BERT
Stefan Daniel DumitrescuAndrei-Marius AvramSampo Pyysalo
2020-09-18
fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP
Zhichao GengHang YanXipeng QiuXuanjing Huang
2020-09-18
NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative
Kumud Chauhan
2020-09-18
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation
Po LiLei LiYan FuJun RongYu Zhang
2020-09-17
AAG: Self-Supervised Representation Learning by Auxiliary Augmentation with GNT-Xent Loss
Yanlun TuJianxing FengYang Yang
2020-09-17
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing LiZhenglun KongTianyun ZhangJi LiZhengang LiHang LiuCaiwen Ding
2020-09-17
Distributional Generalization: A New Kind of Generalization
Preetum NakkiranYamini Bansal
2020-09-17
Label Smoothing and Adversarial Robustness
Chaohao FuHongbin ChenNa RuanWeijia Jia
2020-09-17
Towards Fully 8-bit Integer Inference for the Transformer Model
Ye LinYanyang LiTengbo LiuTong XiaoTongran LiuJingbo Zhu
2020-09-17
Multi^2OIE: Multilingual Open Information Extraction based on Multi-Head Attention with BERT
Youngbin RoYukyung LeePilsung Kang
2020-09-17
DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction
Aadarsh SinghPriyanshu KumarAman Sinha
2020-09-17
Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA
Ieva StaliūnaitėIgnacio Iacobacci
2020-09-17
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya GuoShuo RenShuai LuZhangyin FengDuyu TangShujie LiuLong ZhouNan DuanJian YinDaxin JiangMing Zhou
2020-09-17
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib AfridiAftab AlamMuhammad Numan KhanJawad KhanYoung-Koo Lee
2020-09-17
Document-level Neural Machine Translation with Document Embeddings
Shu JiangHai ZhaoZuchao LiBao-Liang Lu
2020-09-16
Solomon at SemEval-2020 Task 11: Ensemble Architecture for Fine-Tuned Propaganda Detection in News Articles
Mayank RajAjay JaiswalRohit R. RAnkita GuptaSudeep Kumar SahooVertika SrivastavaYeon Hyang Kim
2020-09-16
Retrofitting Structure-aware Transformer Language Model for End Tasks
Hao FeiYafeng RenDonghong Ji
2020-09-16
Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation
Insoo ChungByeongwook KimYoonjung ChoiSe Jung KwonYongkweon JeonBaeseong ParkSangha KimDongsoo Lee
2020-09-16
Graph-to-Sequence Neural Machine Translation
Sufeng DuanHai ZhaoRui Wang
2020-09-16
Simplified TinyBERT: Knowledge Distillation for Document Retrieval
Xuanang ChenBen HeKai HuiLe SunYingfei Sun
2020-09-16
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
Jian GuanMinlie Huang
2020-09-16
NABU -- Multilingual Graph-based Neural RDF Verbalizer
Diego MoussallemDwaraknath GnaneshwarThiago Castro FerreiraAxel-Cyrille Ngonga Ngomo
2020-09-16
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
Juan Cruz-BenitoSanjay VishwakarmaFrancisco Martin-FernandezIsmael Faro
2020-09-16
Deep Learning Approaches for Extracting Adverse Events and Indications of Dietary Supplements from Clinical Text
Yadan FanSicheng ZhouYifan LiRui Zhang
2020-09-16
DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT
SungMin YangSoYeop YooOkRan Jeong
2020-09-15
Augmented Natural Language for Generative Sequence Labeling
Ben AthiwaratkunCicero Nogueira dos SantosJason KroneBing Xiang
2020-09-15
A Mobile App for Wound Localization using Deep Learning
D. M. AnisuzzamanYash PatelJeffrey NiezgodaSandeep GopalakrishnanZeyun Yu
2020-09-15
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffieAlex Newhouse
2020-09-15
Learning Functors using Gradient Descent
Bruno Gavranović
2020-09-15
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
Xiang GaoYizhe ZhangMichel GalleyChris BrockettBill Dolan
2020-09-15
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
Timo SchickHinrich Schütze
2020-09-15
Critical Thinking for Language Models
Gregor Betz
2020-09-15
ResNet-like Architecture with Low Hardware Requirements
Elena LimonovaDaniil AlfonsoDmitry NikolaevVladimir V. Arlazarov
2020-09-15
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Wei NiuZhenglun KongGeng YuanWeiwen JiangJiexiong GuanCaiwen DingPu ZhaoSijia LiuBin RenYanzhi Wang
2020-09-15
Attention-Aware Inference for Neural Abstractive Summarization
Ye MaLu Zong
2020-09-15
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis ClouatrePhilippe TrempeAmal ZouaqSarath Chandar
2020-09-15
Event Presence Prediction Helps Trigger Detection Across Languages
Parul AwasthyTahira NaseemJian NiTaesun MoonRadu Florian
2020-09-15
Lessons Learned from Applying off-the-shelf BERT: There is no Silver Bullet
Victor MakarenkovLior Rokach
2020-09-15
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi ZhengKai HuiBen HeXianpei HanLe SunAndrew Yates
2020-09-15
Data Augmentation and Clustering for Vehicle Make/Model Classification
Mohamed NafziMichael BrauckmannTobias Glasmachers
2020-09-14
Controllable neural text-to-speech synthesis using intuitive prosodic features
Tuomo RaitioRamya RasipuramDan Castellani
2020-09-14
Efficient Transformers: A Survey
Yi TayMostafa DehghaniDara BahriDonald Metzler
2020-09-14
Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
Longxiang LiuZhuosheng ZhangHai ZhaoXi ZhouXiang Zhou
2020-09-14
GeDi: Generative Discriminator Guided Sequence Generation
Ben KrauseAkhilesh Deepak GotmareBryan McCannNitish Shirish KeskarShafiq JotyRichard SocherNazneen Fatema Rajani
2020-09-14
Can Fine-tuning Pre-trained Models Lead to Perfect NLP? A Study of the Generalizability of Relation Extraction
Ningyu ZhangLuoqiu LiShumin DengHaiyang YuXu ChengWei ZhangHuajun Chen
2020-09-14
Beyond Accuracy: ROI-driven Data Analytics of Empirical Data
Gouri DeshpandeGuenther Ruhe
2020-09-14
Pairwise-GAN: Pose-based View Synthesis through Pair-Wise Training
Xuyang ShenJo PlestedYue YaoTom Gedeon
2020-09-13
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang WangLuowei ZhouZhe GanYen-Chun ChenYuwei FangSiqi SunYu ChengJingjing Liu
2020-09-13
BoostingBERT: Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen HuangQingyun SheJunlin Zhang
2020-09-13
Corrective feedback, emphatic speech synthesis, visual-speech exaggeration, pronunciation learning
Yaohua BuWeijun LiTianyi MaShengqi ChenJia JiaKun LiXiaobo Lu
2020-09-12
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Yuxuan CaiHongjia LiGeng YuanWei NiuYanyu LiXulong TangBin RenYanzhi Wang
2020-09-12
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash BabuRajagopal Eswari
2020-09-12
Country Image in COVID-19 Pandemic: A Case Study of China
Huimin ChenZeyu ZhuFanchao QiYining YeZhiyuan LiuMaosong SunJianbin Jin
2020-09-12
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua ChenHuyen Nguyen
2020-09-12
Inverse mapping of face GANs
Nicky BayatVahid Reza KhazaieYalda Mohsenzadeh
2020-09-11
Unit Test Case Generation with Transformers
Michele TufanoDawn DrainAlexey SvyatkovskiyShao Kun DengNeel Sundaresan
2020-09-11
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
Murad TukanAlaa MaaloufMatan WekslerDan Feldman
2020-09-11
UPB at SemEval-2020 Task 6: Pretrained Language Models for Definition Extraction
Andrei-Marius AvramDumitru-Clementin CercelCostin-Gabriel Chiru
2020-09-11
Optimizing Convolutional Neural Network Architecture via Information Field
Yuke WangBoyuan FengXueqiao PengYufei Ding
2020-09-11
Enabling Image Recognition on Constrained Devices Using Neural Network Pruning and a CycleGAN
August LidfeltDaniel IsakssonLudwig HedlundSimon ÅbergMarkus BorgErik Larsson
2020-09-11
SoFAr: Shortcut-based Fractal Architectures for Binary Convolutional Neural Networks
Zhu BaozhouPeter HofsteeJinho LeeZaid Al-Ars
2020-09-11
GTEA: Representation Learning for Temporal Interaction Graphs via Edge Aggregation
Yiming LiDa Sun Handason TamSiyue XieXiaxin LiuQiu Fang YingWing Cheong LauDah Ming ChiuShou Zhi Chen
2020-09-11
UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT
Andrei ParaschivDumitru-Clementin CercelMihai Dascalu
2020-09-11
A Comparison of LSTM and BERT for Small Corpus
Aysu Ezen-Can
2020-09-11
Comprehensive Comparison of Deep Learning Models for Lung and COVID-19 Lesion Segmentation in CT scans
Paschalis BizopoulosNicholas VretosPetros Daras
2020-09-10
FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
Yuwei FangShuohang WangZhe GanSiqi SunJingjing Liu
2020-09-10
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
Amir Atapour-AbarghoueiStephen BonnerAndrew Stephen McGough
2020-09-10
Sparsifying Transformer Models with Differentiable Representation Pooling
Michał PietruszkaŁukasz BorchmannFilip Graliński
2020-09-10
Brain2Word: Decoding Brain Activity for Language Generation
Nicolas AffolterBeni EgressyDamian PascualRoger Wattenhofer
2020-09-10
Unsupervised Domain Adaptation via CycleGAN for White Matter Hyperintensity Segmentation in Multicenter MR Images
Julian Alberto PalladinoDiego Fernandez SlezakEnzo Ferrante
2020-09-10
Learning Universal Representations from Word to Sentence
Yian LiHai Zhao
2020-09-10
Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection
Taesun WhangDongyub LeeDongsuk OhChanhee LeeKijong HanDong-hun LeeSaebyeok Lee
2020-09-10
Modern Methods for Text Generation
Dimas Munoz Montesinos
2020-09-10
Investigating Gender Bias in BERT
Rishabh BhardwajNavonil MajumderSoujanya Poria
2020-09-10
Pay Attention when Required
Swetha MandavaSzymon MigaczAlex Fit Florea
2020-09-09
not-so-BigGAN: Generating High-Fidelity Images on a Small Compute Budget
Seungwook HanAkash SrivastavaCole HurwitzPrasanna SattigeriDavid D. Cox
2020-09-09
Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability
Mayank ChhipaHrushikesh Mahesh VazurkarAbhijeet KumarMridul Mishra
2020-09-09
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
Zhengjie HuangShikun FengWeiyue SuXuyi ChenShuohuan WangJiaxiang LiuXuan OuyangYu Sun
2020-09-08
Masked Label Prediction: Unified Massage Passing Model for Semi-Supervised Classification
Yunsheng ShiZhengjie HuangWenjin WangHui ZhongShikun FengYu Sun
2020-09-08
Improving Language Generation with Sentence Coherence Objective
Ruixiao SunJie YangMehrdad Yousefzadeh
2020-09-07
Stochastic-YOLO: Efficient Probabilistic Object Detection under Dataset Shifts
Tiago AzevedoRené de JongPartha Maji
2020-09-07
Robust Conversational AI with Grounded Text Generation
Jianfeng GaoBaolin PengChunyuan LiJinchao LiShahin ShayandehLars LidenHeung-Yeung Shum
2020-09-07
Deep Cyclic Generative Adversarial Residual Convolutional Networks for Real Image Super-Resolution
Rao Muhammad UmerChristian Micheloni
2020-09-07
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing
Josh KalinMatthew CiolinoDavid NoeverGerry Dozier
2020-09-07
Deepfake detection: humans vs. machines
Pavel KorshunovSébastien Marcel
2020-09-07
Measuring Massive Multitask Language Understanding
Dan HendrycksCollin BurnsSteven BasartAndy ZouMantas MazeikaDawn SongJacob Steinhardt
2020-09-07
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui ZhangZixuan YuanYanchi LiuFuzhen ZhuangHui Xiong
2020-09-07
TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis
Zilong WangZhaohong WanXiaojun Wan
2020-09-07
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar AbdelnabiMario Fritz
2020-09-07
EdinburghNLP at WNUT-2020 Task 2: Leveraging Transformers with Generalized Augmentation for Identifying Informativeness in COVID-19 Tweets
Nickil Maveli
2020-09-06
UPB at SemEval-2020 Task 8: Joint Textual and Visual Modeling in a Multi-Task Learning Architecture for Memotion Analysis
George-Alexandru VladGeorge-Eduard ZahariaDumitru-Clementin CercelCostin-Gabriel ChiruStefan Trausan-Matu
2020-09-06
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
2020-09-06
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan WilliamsPaul RodriguesValerie Novak
2020-09-05
Looking for change? Roll the Dice and demand Attention
Foivos I. DiakogiannisFrançois WaldnerPeter Caccetta
2020-09-04
WaveGrad: Estimating Gradients for Waveform Generation
Nanxin ChenYu ZhangHeiga ZenRon J. WeissMohammad NorouziWilliam Chan
2020-09-02
Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading
Sasi Kiran GaddipatiDeebul NairPaul G. Plöger
2020-09-02
Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations
Wei XiaJohn H. L. Hansen
2020-09-02
Neural Crossbreed: Neural Based Image Metamorphosis
Sanghun ParkKwanggyoon SeoJunyong Noh
2020-09-02
Transform Quantization for CNN Compression
Sean I. YoungWang ZheDavid TaubmanBernd Girod
2020-09-02
Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification
Bibek UpadhayayVahid Behzadan
2020-09-01
RangeRCNN: Towards Fast and Accurate 3D Object Detection with Range Image Representation
Zhidong LiangMing ZhangZehan ZhangXian ZhaoShiliang Pu
2020-09-01
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
2020-09-01
Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation
Wilson LauLaura AaltonenMartin GunnMeliha Yetisgen
2020-09-01
A Framework For Contrastive Self-Supervised Learning And Designing A New Approach
William FalconKyunghyun Cho
2020-08-31
A Bidirectional Tree Tagging Scheme for Jointly Extracting Overlapping Entities and Relations
Xukun LuoWeijie LiuMeng MaPing Wang
2020-08-31
Langevin Cooling for Domain Translation
Vignesh SrinivasanKlaus-Robert MüllerWojciech SamekShinichi Nakajima
2020-08-31
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei LiJames QinChung-Cheng ChiuRuoming PangYanzhang He
2020-08-30
AKHCRNet: Bengali Handwritten Character Recognition Using Deep Learning
Akash Roy
2020-08-29
SocCogCom at SemEval-2020 Task 11: Characterizing and Detecting Propaganda using Sentence-Level Emotional Salience Features
Gangeshwar KrishnamurthyRaj Kumar GuptaYinping Yang
2020-08-29
Rethinking the objectives of extractive question answering
Martin FajcikJosef JonSantosh KesirajuPavel Smrz
2020-08-28
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing ChenXiaodong LiuJianfeng GaoJian JiaoRuofei ZhangYangfeng Ji
2020-08-28
TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets
Anh Tuan Nguyen
2020-08-28
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
2020-08-28
Predicting Training Time Without Training
Luca ZancatoAlessandro AchilleAvinash RavichandranRahul BhotikaStefano Soatto
2020-08-28
DAVE: Deriving Automatically Verilog from English
Hammond PearceBenjamin TanRamesh Karri
2020-08-27
Entity and Evidence Guided Relation Extraction for DocRED
Kevin HuangGuangtao WangTengyu MaJing Huang
2020-08-27
GREEK-BERT: The Greeks visiting Sesame Street
John KoutsikakisIlias ChalkidisProdromos MalakasiotisIon Androutsopoulos
2020-08-27
Query Focused Multi-document Summarisation of Biomedical Texts
Diego MollaChristopher JonesVincent Nguyen
2020-08-27
Improvement of a dedicated model for open domain persona-aware dialogue generation
Qiang Han
2020-08-27
MultiGBS: A multi-layer graph approach to biomedical summarization
Ensieh DavoodijamNasser GhadiriMaryam Lotfi ShahrezaFabio Rinaldi
2020-08-27
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong ZhangHang Li
2020-08-27
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo
Yiding WangZhenyi WangChenghao LiYilin ZhangHaizhou Wang
2020-08-26
Discrete Word Embedding for Logical Natural Language Understanding
Masataro AsaiZilu Tang
2020-08-26
Language Models and Word Sense Disambiguation: An Overview and Analysis
Daniel LoureiroKiamehr RezaeeMohammad Taher PilehvarJose Camacho-Collados
2020-08-26
APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Hanlin TangShaoduo GanSamyam RajbhandariXiangru LianJi LiuYuxiong HeCe Zhang
2020-08-26
On estimating gaze by self-attention augmented convolutions
Gabriel LefundesLuciano Oliveira
2020-08-25
Conceptualized Representation Learning for Chinese Biomedical Text Mining
Ningyu ZhangQianghuai JiaKangping YinLiang DongFeng GaoNengwei Hua
2020-08-25
YNU-HPCC at SemEval-2020 Task 11: LSTM Network for Detection of Propaganda Techniques in News Articles
Jiaxu DaoJin WangXuejie Zhang
2020-08-24
TORNADO-Net: mulTiview tOtal vaRiatioN semAntic segmentation with Diamond inceptiOn module
Martin GerdzhevRyan RazaniEhsan TaghaviBingbing Liu
2020-08-24
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III
Brent BisedaGaurav DesaiHaifeng LinAnish Philip
2020-08-24
Two Stages Approach for Tweet Engagement Prediction
Amine DadounIsmail HarrandoPasquale LisenaAlison ReboudRaphael Troncy
2020-08-24
End to End Dialogue Transformer
Ondřej MěkotaMemduh GökırmakPetr Laitoch
2020-08-24
Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources
Taolin ZhangChengyu WangMinghui QiuBite YangXiaofeng HeJun Huang
2020-08-24
syrapropa at SemEval-2020 Task 11: BERT-based Models Design For Propagandistic Technique and Span Detection
Jinfen LiLu Xiao
2020-08-24
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT
Omar MossadAmgad AhmedAnandharaju RajuHari KarthikeyanZayed Ahmed
2020-08-22
DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda Classification
Anastasios BairaktarisSymeon SymeonidisAvi Arampatzis
2020-08-22
CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection
Verena BlaschkeMaxim KorniyenkoSam Tureski
2020-08-22
HinglishNLP: Fine-tuned Language Models for Hinglish Sentiment Detection
Meghana BhangeNirant Kasliwal
2020-08-22
Identity-Aware Multi-Sentence Video Description
Jae Sung ParkTrevor DarrellAnna Rohrbach
2020-08-22
PNEN: Pyramid Non-Local Enhanced Networks
Feida ZhuChaowei FangKai-Kuang Ma
2020-08-22
Applications of BERT Based Sequence Tagging Models on Chinese Medical Text Attributes Extraction
Gang ZhaoTeng ZhangChenxiao WangPing LvJi Wu
2020-08-22
Abstractive Summarization of Spoken and Written Instructions with BERT
Alexandra SavelievaBryan Au-YeungVasanth Ramani
2020-08-21
Adapting Event Extractors to Medical Data: Bridging the Covariate Shift
Aakanksha NaikJill LehmanCarolyn Rose
2020-08-21
Monocular Expressive Body Regression through Body-Driven Attention
Vasileios ChoutasGeorgios PavlakosTimo BolkartDimitrios TzionasMichael J. Black
2020-08-20
Lite Training Strategies for Portuguese-English and English-Portuguese Translation
Alexandre LopesRodrigo NogueiraRoberto LotufoHelio Pedrini
2020-08-20
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension
Son T. LuuKiet Van NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-08-20
AWNet: Attentive Wavelet Network for Image ISP
Linhui DaiXiaohong LiuChengqi LiJun Chen
2020-08-20
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
Diedre CarmoMarcos PiauIsrael CampiottiRodrigo NogueiraRoberto Lotufo
2020-08-20
Blur-Attention: A boosting mechanism for non-uniform blurred image restoration
Xiaoguang LiFeifan YangKin Man LamLi ZhuoJiafeng Li
2020-08-19
"Name that manufacturer". Relating image acquisition bias with task complexity when training deep learning models: experiments on head CT
Giorgio Pietro BiondettiRomane GauriauChristopher P. BridgeCharles LuKatherine P. Andriole
2020-08-19
Slide-free MUSE Microscopy to H&E Histology Modality Conversion via Unpaired Image-to-Image Translation GAN Models
Tanishq AbrahamAndrew ShawDaniel O'ConnorAustin ToddRichard Levenson
2020-08-19
UoB at SemEval-2020 Task 12: Boosting BERT with Corpus Level Information
Wah Meng LimHarish Tayyar Madabushi
2020-08-19
Multilanguage Number Plate Detection using Convolutional Neural Networks
Jatin GuptaVandana SainiKamaldeep Garg
2020-08-18
Glancing Transformer for Non-Autoregressive Neural Machine Translation
Lihua QianHao ZhouYu BaoMingxuan WangLin QiuWeinan ZhangYong YuLei Li
2020-08-18
CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion
Maitreya PatelMirali PurohitJui ShahHemant A. Patil
2020-08-18
Very Deep Transformers for Neural Machine Translation
Xiaodong LiuKevin DuhLiyuan LiuJianfeng Gao
2020-08-18
Ranking Clarification Questions via Natural Language Inference
Vaibhav KumarVikas RaunakJamie Callan
2020-08-18
Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes
Liangyuan HuChenyang Gu
2020-08-18
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study
Karthik GopalakrishnanBehnam HedayatniaLongshaokan WangYang LiuDilek Hakkani-Tur
2020-08-18
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara BahriYi TayChe ZhengDonald MetzlerCliff BrunkAndrew Tomkins
2020-08-17
DeepGIN: Deep Generative Inpainting Network for Extreme Image Inpainting
Chu-Tak LiWan-Chi SiuZhi-Song LiuLi-Wen WangDaniel Pak-Kong Lun
2020-08-17
Narrative Interpolation for Generating and Understanding Stories
Su WangGreg DurrettKatrin Erk
2020-08-17
Spatial Temporal Transformer Network for Skeleton-based Action Recognition
Chiara PlizzariMarco CanniciMatteo Matteucci
2020-08-17
How to Train Your Robust Human Pose Estimator: Pay Attention to the Constraint Cue
Junjie HuangZheng ZhuGuan HuangDalong Du
2020-08-17
Stock Index Prediction with Multi-task Learning and Word Polarity Over Time
Yue ZhouKerstin Voigt
2020-08-17
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis YoshidaAllyson EttingerKevin Gimpel
2020-08-16
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo QinWanxiang CheYangming LiMinheng NiTing Liu
2020-08-16
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
Shengyu ZhangTan JiangTan WangKun KuangZhou ZhaoJianke ZhuJin YuHongxia YangFei Wu
2020-08-16
TopicBERT: A Transformer transfer learning based memory-graph approach for multimodal streaming social media topic detection
Meysam Asgari-ChenaghluMohammad-Reza Feizi-DerakhshiLeili farzinvashMohammad-Ali BalafarCina Motamed
2020-08-16
GLOD: Gaussian Likelihood Out of Distribution Detector
Guy AmitMoshe LevyIshai RosenbergAsaf ShabtaiYuval Elovici
2020-08-16
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry TsaiJayden OoiChun-Sung FerngHyung Won ChungJason Riesa
2020-08-15
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
Karan GoelAlbert GuYixuan LiChristopher Ré
2020-08-15
I-AID: Identifying Actionable Information from Disaster-related Tweets
Hamada M. ZaheraRricha JalotaMohamed A. SherifAxel N. Ngomo
2020-08-04
Physics-Informed Deep Neural Networks for Transient Electromagnetic Analysis
Oameed NoakoasteenShu WangZhen PengChristos Christodoulou
2020-08-04
Mining Inter-Video Proposal Relations for Video Object Detection
Mingfei Han, Yali Wang, Xiaojun Chang, Yu Qiao
2020-08-01
Transformers on Sarcasm Detection with Context
2020-07-01
A Chain Graph Interpretation of Real-World Neural Networks
Yuesong Shen, Daniel Cremers
2020-06-30
Semantic Segmentation With Multi Scale Spatial Attention For Self Driving Cars
Abhinav Sagar, RajKumar Soundrapandiyan
2020-06-30
Streaming Transformer ASR with Blockwise Synchronous Inference
Emiru Tsunoo, Yosuke Kashiwagi, Shinji Watanabe
2020-06-25
Deep Investing in Kyle's Single Period Model
Paul Friedrich, Josef Teichmann
2020-06-24
Cross-lingual Retrieval for Iterative Self-Supervised Training
Chau Tran, Yuqing Tang, Xi-An Li, Jiatao Gu
2020-06-16
Implicit Kernel Attention
Kyungwoo Song, Yohan Jung, Dongjun Kim, Il-Chul Moon
2020-06-11
Deceiving computers in Reverse Turing Test through Deep Learning
Jimut Bahan Pal
2020-06-01
Camouflaged Object Detection
Deng-Ping Fan, Ge-Peng Ji, Guolei Sun, Ming-Ming Cheng, Jianbing Shen, Ling Shao
2020-06-01
Approche de génération de réponse à base de transformers (Transformer based approach for answer generation)
Imen Akermi, Johannes Heinecke, Frédéric Herledan
2020-06-01
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps
Tri Dao, Nimit Sohoni, Albert Gu, Matthew Eichhorn, Amit Blonder, Megan Leszczynski, Atri Rudra, Christopher Ré
2020-05-01
Neural Machine Translation with Universal Visual Representation
Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao
2020-05-01
Kernel of CycleGAN as a principal homogeneous space
Nikita Moriakov, Jonas Adler, Jonas Teuwen
2020-05-01
Linear Symmetric Quantization of Neural Networks for Low-precision Integer Hardware
Xiandong Zhao, Ying Wang, Xuyi Cai, Cheng Liu, Lei Zhang
2020-05-01
Global Relational Models of Source Code
Vincent J. Hellendoorn, Charles Sutton, Rishabh Singh, Petros Maniatis, David Bieber
2020-05-01
Logic and the 2-Simplicial Transformer
James Clift, Dmitry Doryn, Daniel Murfet, James Wallbridge
2020-05-01
Improving Neural Language Generation with Spectrum Control
Lingxiao Wang, Jing Huang, Kevin Huang, Ziniu Hu, Guangtao Wang, Quanquan Gu
2020-05-01
Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension
Xinyun Chen, Chen Liang, Adams Wei Yu, Denny Zhou, Dawn Song, Quoc V. Le
2020-05-01
Few-Shot Learning for Opinion Summarization
Arthur Bražinskas, Mirella Lapata, Ivan Titov
2020-04-30
TAVAT: Token-Aware Virtual Adversarial Training for Language Understanding
Linyang Li, Xipeng Qiu
2020-04-30
GigaBERT: A Bilingual BERT for English and Arabic
Wuwei Lan, Yang Chen, Wei Xu, Alan Ritter
2020-04-30
$R^3$: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge
Tuhin Chakrabarty, Debanjan Ghosh, Smaranda Muresan, Nanyun Peng
2020-04-28
Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness
Hyunwoo Kim, Byeongchang Kim, Gunhee Kim
2020-04-13
Low Latency End-to-End Streaming Speech Recognition with a Scout Network
Chengyi Wang, Yu Wu, Shujie Liu, Jinyu Li, Liang Lu, Guoli Ye, Ming Zhou
2020-03-23
State-of-the-Art Augmented NLP Transformer models for direct and single-step retrosynthesis
Igor V. Tetko, Pavel Karpov, Ruud Van Deursen, Guillaume Godin
2020-03-05
Determination of the Semion Code Threshold using Neural Decoders
Santiago Varona, Miguel Angel Martin-Delgado
2020-02-20
Sparse Weight Activation Training
Md Aamir Raihan, Tor M. Aamodt
2020-01-07
Enhancing Relation Extraction Using Syntactic Indicators and Sentential Contexts
Qiongxing Tao, Xiangfeng Luo, Hao Wang
2019-12-04
Shadow Removal via Shadow Image Decomposition
Hieu Le, Dimitris Samaras
2019-08-23
Online Normalization for Training Neural Networks
Vitaliy Chiley, Ilya Sharapov, Atli Kosson, Urs Koster, Ryan Reece, Sofia Samaniego de la Fuente, Vishal Subbiah, Michael James
2019-05-15
Guidelines and Benchmarks for Deployment of Deep Learning Models on Smartphones as Real-Time Apps
Abhishek Sehgal, Nasser Kehtarnavaz
2019-01-08
Biologically-plausible learning algorithms can scale to large datasets
Will Xiao, Honglin Chen, Qianli Liao, Tomaso Poggio
2018-11-08
Hello Edge: Keyword Spotting on Microcontrollers
Yundong Zhang, Naveen Suda, Liangzhen Lai, Vikas Chandra
2017-11-20

Tasks

Components

COMPONENT | TYPE
Batch Normalization | Normalization (optional)
ReLU | Activation Functions (optional)

Categories