Softmax

The Softmax output function transforms a previous layer's output into a vector of probabilities. It is commonly used for multiclass classification. Given an input vector $x$ and a weighting vector $w$ we have:

$$ P(y=j \mid{x}) = \frac{e^{x^{T}w_{j}}}{\sum^{K}_{k=1}e^{x^{T}wk}} $$

Latest Papers

PAPER DATE
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
| Nandan ThakurNils ReimersJohannes DaxenbergerIryna Gurevych
2020-10-16
AI-based BMI Inference from Facial Images: An Application to Weight Monitoring
Hera SiddiquiAjita RattaniDakshina Ranjan KiskuTanner Dean
2020-10-15
Multi-Task Learning for Cross-Lingual Abstractive Summarization
Sho TakaseNaoaki Okazaki
2020-10-15
Context-Guided BERT for Targeted Aspect-Based Sentiment Analysis
Zhengxuan WuDesmond C. Ong
2020-10-15
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
| Ana MarasovićChandra BhagavatulaJae Sung ParkRonan Le BrasNoah A. SmithYejin Choi
2020-10-15
DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation
Yuzhao MaoQi SunGuang LiuXiaojie WangWeiguo GaoXuan LiJianping Shen
2020-10-15
Does Chinese BERT Encode Word Structure?
| Yile WangLeyang CuiYue Zhang
2020-10-15
A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation
Mingshuo DingYinghao Ma
2020-10-15
Unsupervised Bitext Mining and Translation via Self-trained Contextual Embeddings
Phillip KeungJulian SalazarYichao LuNoah A. Smith
2020-10-15
[email protected]: Sentiment Analysis of Code-Mixed Dravidian text using XLNet
Shubhanker BanerjeeArun JayapalSajeetha Thavareesan
2020-10-15
Response Selection for Multi-Party Conversations withDynamic Topic Tracking
Weishi Wang§Shafiq Joty§Steven C. H. Hoi
2020-10-15
Compressive Summarization with Plausibility and Salience Modeling
| Shrey DesaiJiacheng XuGreg Durrett
2020-10-15
Understanding Neural Abstractive Summarization Models via Uncertainty
| Jiacheng XuShrey DesaiGreg Durrett
2020-10-15
Masked Contrastive Representation Learning for Reinforcement Learning
| Jinhua ZhuYingce XiaLijun WuJiajun DengWengang ZhouTao QinHouqiang Li
2020-10-15
Memformer: The Memory-Augmented Transformer
Qingyang WuZhenzhong LanJing GuZhou Yu
2020-10-14
DA-Transformer: Distance-aware Transformer
Chuhan WuFangzhao WuYongfeng Huang
2020-10-14
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search
| Gyuwan KimKyunghyun Cho
2020-10-14
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models
Zihan ZhaoYuncong LiuLu ChenQi LiuRao MaKai Yu
2020-10-14
Geometry matters: Exploring language examples at the decision boundary
Debajyoti DattaShashwat KumarLaura BarnesTom Fletcher
2020-10-14
Do End-to-end Stereo Algorithms Under-utilize Information?
| Changjiang CaiPhilippos Mordohai
2020-10-14
Decoding Methods for Neural Narrative Generation
| Alexandra DeLuciaAaron MuellerXiang Lisa LiJoão Sedoc
2020-10-14
No Rumours Please! A Multi-Indic-Lingual Approach for COVID Fake-Tweet Detection
| Debanjana KarMohit BhardwajSuranjana SamantaAmar Prakash Azad
2020-10-14
Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit
Ben AdlamJaehoon LeeLechao XiaoJeffrey PenningtonJasper Snoek
2020-10-14
Temperature check: theory and practice for training models with softmax-cross-entropy losses
Atish AgarwalaJeffrey PenningtonYann DauphinSam Schoenholz
2020-10-14
Probing for Multilingual Numerical Understanding in Transformer-Based Language Models
| Devin JohnsonDenise MakDrew BarkerLexi Loessberg-Zahl
2020-10-13
Making Every Label Count: Handling Semantic Imprecision by Integrating Domain Knowledge
Clemens-Alexander BrustBjörn BarzJoachim Denzler
2020-10-13
A Generalized Zero-Shot Framework for Emotion Recognition from Body Gestures
Jinting WuYujia ZhangXiaoguang Zhao
2020-10-13
Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy LinRodrigo NogueiraAndrew Yates
2020-10-13
Multilingual Argument Mining: Datasets and Analysis
Orith Toledo-RonenMatan OrbachYonatan BiluArtem SpectorNoam Slonim
2020-10-13
Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension
Ekta SoodSimon TannertDiego FrassinelliAndreas BullingNgoc Thang Vu
2020-10-13
Aspect-based Document Similarity for Research Papers
| Malte OstendorffTerry RuasTill BlumeBela GippGeorg Rehm
2020-10-13
CAPT: Contrastive Pre-Training for LearningDenoised Sequence Representations
Fuli LuoPengcheng YangShicheng LiXuancheng RenXu sun
2020-10-13
Context-Aware Drive-thru Recommendation Service at Fast Food Restaurants
Luyang WangKai HuangJiao WangShengsheng HuangJason DaiYue Zhuang
2020-10-13
The workweek is the best time to start a family -- A Study of GPT-2 Based Claim Generation
Shai GretzYonatan BiluEdo Cohen-KarlikNoam Slonim
2020-10-13
Improving Text Generation Evaluation with Batch Centering and Tempered Word Mover Distance
Xi ChenNan DingTomer LevinboimRadu Soricut
2020-10-13
Incorporating BERT into Parallel Sequence Decoding with Adapters
| Junliang GuoZhirui ZhangLinli XuHao-Ran WeiBoxing ChenEnhong Chen
2020-10-13
BERT-EMD: Many-to-Many Layer Mapping for BERT Compression with Earth Mover's Distance
| Jianquan LiXiaokang LiuHonghong ZhaoRuifeng XuMin YangYaohong Jin
2020-10-13
Load What You Need: Smaller Versions of Multilingual BERT
| Amine AbdaouiCamille PradelGrégoire Sigel
2020-10-12
From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks
| Steffen EgerYannik Benz
2020-10-12
Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations
Nikolaos ManginasIlias ChalkidisProdromos Malakasiotis
2020-10-12
Dynamic Memory Enhanced Transformer for End-to-end Task-Oriented Dialogue System
Yanjie GouYinjie LeiLingqiao Liu
2020-10-12
EFSG: Evolutionary Fooling Sentences Generator
Marco Di GiovanniMarco Brambilla
2020-10-12
Probing Pretrained Language Models for Lexical Semantics
Ivan VulićEdoardo Maria PontiRobert LitschkoGoran GlavašAnna Korhonen
2020-10-12
HUJI-KU at MRP~2020: Two Transition-based Neural Parsers
Ofir ArvivRuixiang CuiDaniel Hershcovich
2020-10-12
Improving Compositional Generalization in Semantic Parsing
| Inbar OrenJonathan HerzigNitish GuptaMatt GardnerJonathan Berant
2020-10-12
Counterfactual Variable Control for Robust and Interpretable Question Answering
| Sicheng YuYulei NiuShuohang WangJing JiangQianru Sun
2020-10-12
Meta-Context Transformers for Domain-Specific Response Generation
Debanjana KarSuranjana SamantaAmar Prakash Azad
2020-10-12
Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers
Christoph KamannBurkhard GüssefeldRobin HutmacherJan Hendrik MetzenCarsten Rother
2020-10-12
Conditioning Trick for Training Stable GANs
Mohammad EsmaeilpourRaymel Alfonso SalloOlivier St-GeorgesPatrick CardinalAlessandro Lameiras Koerich
2020-10-12
Zero-shot Entity Linking with Efficient Long Range Sequence Modeling
| Zonghai YaoLiangliang CaoHuapu Pan
2020-10-12
Chatbot Interaction with Artificial Intelligence: Human Data Augmentation with T5 and Language Transformer Ensemble for Text Classification
Jordan J. BirdAnikó EkártDiego R. Faria
2020-10-12
COMET-ATOMIC 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D. HwangChandra BhagavatulaRonan Le BrasJeff DaKeisuke SakaguchiAntoine BosselutYejin Choi
2020-10-12
Partial FC: Training 10 Million Identities on a Single Machine
| Xiang AnXuhan ZhuYang XiaoLan WuMing ZhangYuan GaoBin QinDebing ZhangYing Fu
2020-10-11
Weakly Supervised Medication Regimen Extraction from Medical Conversations
Dhruvesh PatelSandeep KonamSai P. Selvaraj
2020-10-11
SMYRF: Efficient Attention using Asymmetric Clustering
| Giannis DarasNikita KitaevAugustus OdenaAlexandros G. Dimakis
2020-10-11
Data Agnostic RoBERTa-based Natural Language to SQL Query Generation
| Debaditya PalHarsh SharmaKaustubh Chaudhari
2020-10-11
Incremental Processing in the Age of Non-Incremental Encoders: An Empirical Assessment of Bidirectional Models for Incremental NLU
| Brielen MadureiraDavid Schlangen
2020-10-11
Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
| Shauli RavfogelYanai ElazarJacob GoldbergerYoav Goldberg
2020-10-11
Machine Translation of Mathematical Text
Aditya OhriTanya Schmah
2020-10-11
Connecting the Dots Between Fact Verification and Fake News Detection
Qifei LiWangchunshu Zhou
2020-10-11
Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only
Ziyi LiuGiannis KaramanolakisDaniel HsuLuis Gravano
2020-10-11
SJTU-NICT's Supervised and Unsupervised Neural Machine Translation Systems for the WMT20 News Translation Task
Zuchao LiHai ZhaoRui WangKehai ChenMasao UtiyamaEiichiro Sumita
2020-10-11
Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)
| Alex WarstadtYian ZhangHaau-Sing LiHaokun LiuSamuel R. Bowman
2020-10-11
Second-Order Neural Dependency Parsing with Message Passing and End-to-End Training
Xinyu WangKewei Tu
2020-10-10
Automated Concatenation of Embeddings for Structured Prediction
| Xinyu WangYong JiangNguyen BachTao WangZhongqiang HuangFei HuangKewei Tu
2020-10-10
Compressing Transformer-Based Semantic Parsing Models using Compositional Code Embeddings
Prafull PrakashSaurabh Kumar ShashidharWenlong ZhaoSubendhu RongaliHaidar KhanMichael Kayser
2020-10-10
Tag Recommendation for Online Q&A Communities based on BERT Pre-Training Technique
Navid KhezrianJafar HabibiIssa Annamoradnejad
2020-10-10
An Empirical Study on Detecting COVID-19 in Chest X-ray Images Using Deep Learning Based Methods
Ramtin BabaeipourElham AziziHassan Khotanlou
2020-10-10
Structured Self-Attention Weights Encode Semantics in Sentiment Analysis
| Zhengxuan WuThanh-Son NguyenDesmond C. Ong
2020-10-10
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougue BiyongBo wangTerry LyonsAlejo J Nevado-Holgado
2020-10-10
NutCracker at WNUT-2020 Task 2: Robustly Identifying Informative COVID-19 Tweets using Ensembling and Adversarial Training
| Priyanshu KumarAadarsh Singh
2020-10-09
Grid Tagging Scheme for Aspect-oriented Fine-grained Opinion Extraction
Zhen WuChengcan YingFei ZhaoZhifang FanXinyu DaiRui Xia
2020-10-09
Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis
| João A. LeiteDiego F. SilvaKalina BontchevaCarolina Scarton
2020-10-09
What Have We Achieved on Text Summarization?
Dandan HuangLeyang CuiSen yangGuangsheng BaoKun WangJun XieYue Zhang
2020-10-09
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding
Jin CaoJun WangWael HamzaKelly VaneeShang-Wen Li
2020-10-09
Online Back-Parsing for AMR-to-Text Generation
Xuefeng BaiLinfeng SongYue Zhang
2020-10-09
Long-distance tiny face detection based on enhanced YOLOv3 for unmanned system
Jia-Yi ChangYan-Feng LuYa-Jun LiuBo ZhouHong Qiao
2020-10-09
The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders
Wen-Chin HuangPatrick Lumban TobingYi-Chiao WuKazuhiro KobayashiTomoki Toda
2020-10-09
TurboTransformers: An Efficient GPU Serving System For Transformer Models
Jiarui FangYang YuChengduo ZhaoJie zhou
2020-10-09
Attaining Real-Time Super-Resolution for Microscopic Images Using GAN
| Vibhu BhatiaYatender Kumar
2020-10-09
On Task-Level Dialogue Composition of Generative Transformer Model
| Prasanna ParthasarathiArvind NeelakantanSharan Narang
2020-10-09
Recurrent convolutional neural network for the surrogate modeling of subsurface flow simulation
Hyung Jun YangTimothy YeoJaewoo An
2020-10-08
Automatic generation of reviews of scientific papers
| Anna NikiforovskayaNikolai KapralovAnna VlasovaOleg ShpynovAleksei Shpilman
2020-10-08
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding
Dechuang TengLibo QinWanxiang CheSendong ZhaoTing Liu
2020-10-08
Improving Attention Mechanism with Query-Value Interaction
Chuhan WuFangzhao WuTao QiYongfeng Huang
2020-10-08
Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference
| Xiaoan DingTianyu LiuBaobao ChangZhifang SuiKevin Gimpel
2020-10-08
Energy-based Out-of-distribution Detection
| Weitang LiuXiaoYun WangJohn D. OwensYixuan Li
2020-10-08
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition
Yun HeZiwei ZhuYin ZhangQin ChenJames Caverlee
2020-10-08
PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge
Yun HeZhuoer WangYin ZhangRuihong HuangJames Caverlee
2020-10-08
Deformable DETR: Deformable Transformers for End-to-End Object Detection
| Xizhou ZhuWeijie SuLewei LuBin LiXiaogang WangJifeng Dai
2020-10-08
Interlocking Backpropagation: Improving depthwise model-parallelism
Aidan N. GomezOscar KeyStephen GouNick FrosstJeff DeanYarin Gal
2020-10-08
IRX-1D: A Simple Deep Learning Architecture for Remote Sensing Classifications
Mahesh PalAkshayB. Charan Teja
2020-10-08
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection
| Libo QinTailu LiuWanxiang CheBingbing KangSendong ZhaoTing Liu
2020-10-08
TextSETTR: Label-Free Text Style Extraction and Tunable Targeted Restyling
Parker RileyNoah ConstantMandy GuoGirish KumarDavid UthusZarana Parekh
2020-10-08
Shallow-to-Deep Training for Neural Machine Translation
| Bei LiZiyang WangHui LiuYufan JiangQuan DuTong XiaoHuizhen WangJingbo Zhu
2020-10-08
Query-Key Normalization for Transformers
| Alex HenryPrudhvi Raj DachapallyShubham PawarYuxuan Chen
2020-10-08
PoinT-5: Pointer Network and T-5 based Financial NarrativeSummarisation
Abhishek Singh
2020-10-08
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Yinghui HuangHong-Kwang KuoSamuel ThomasZvi KonsKartik AudhkhasiBrian KingsburyRon HooryMichael Picheny
2020-10-08
Masked ELMo: An evolution of ELMo towards fully contextual RNN language models
Gregory SenayEmmanuelle Salin
2020-10-08
Deep Learning Meets Projective Clustering
Alaa MaaloufHarry LangDaniela RusDan Feldman
2020-10-08
Low-Resource Domain Adaptation for Compositional Task-Oriented Semantic Parsing
Xilun ChenAsish GhoshalYashar MehdadLuke ZettlemoyerSonal Gupta
2020-10-07
ELMo and BERT in semantic change detection for Russian
Julia RodinaYuliya TrofimovaAndrey KutuzovEkaterina Artemova
2020-10-07
Super-Human Performance in Online Low-latency Recognition of Conversational Speech
Thai-Son NguyenSebastian StuekerAlex Waibel
2020-10-07
Why do you think that? Exploring Faithful Sentence-Level Rationales Without Supervision
Max GlocknerIvan HabernalIryna Gurevych
2020-10-07
Transformer-GCRF: Recovering Chinese Dropped Pronouns with General Conditional Random Fields
| Jingxuan YangKerui XuJun XuSi LiSheng GaoJun GuoJi-Rong WenNianwen Xue
2020-10-07
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling
Jiecao ChenLiu YangKarthik RamanMichael BenderskyJung-Jung YehYun ZhouMarc NajorkDanyang CaiEhsan Emadzadeh
2020-10-07
Deep Learning in Diabetic Foot Ulcers Detection: A Comprehensive Evaluation
Moi Hoon YapRyo HachiumaAzadeh AlaviRaphael BrungelManu GoyalHongtao ZhuBill CassidyJohannes RuckertMoshe OlshanskyXiao HuangHideo SaitoSaeed HassanpourChristoph M. FriedrichDavid AscherAnping SongHiroki KajitaDavid GillespieNeil D. ReevesJoseph PappachanClaire O'SheaEibe Frank
2020-10-07
YOdar: Uncertainty-based Sensor Fusion for Vehicle Detection with Camera and Radar Sensors
Kamil KowolMatthias RottmannStefan BrackeHanno Gottschalk
2020-10-07
Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank
| Eleftheria BriakouMarine Carpuat
2020-10-07
Combining Deep Learning and String Kernels for the Localization of Swiss German Tweets
Mihaela GamanRadu Tudor Ionescu
2020-10-07
Optimizing Transformers with Approximate Computing for Faster, Smaller and more Accurate NLP Models
Amrit NagarajanSanchari SenJacob R. StevensAnand Raghunathan
2020-10-07
Efficient Inference For Neural Machine Translation
Yi-Te HsuSarthak GargYi-Hsiu LiaoIlya Chatsviorkin
2020-10-06
Beyond [CLS] through Ranking by Generation
Cicero Nogueira dos santosXiaofei MaRamesh NallapatiZhiheng HuangBing Xiang
2020-10-06
Resource-Enhanced Neural Model for Event Argument Extraction
Jie MaShuai WangRishita AnubhaiMiguel BallesterosYaser Al-Onaizan
2020-10-06
Exploring BERT's Sensitivity to Lexical Cues using Tests from Semantic Priming
Kanishka MisraAllyson EttingerJulia Taylor Rayz
2020-10-06
Intrinsic Probing through Dimension Selection
| Lucas Torroba HennigenAdina WilliamsRyan Cotterell
2020-10-06
Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation
| Minki KangMoonsu HanSung Ju Hwang
2020-10-06
Analyzing Individual Neurons in Pre-trained Language Models
Nadir DurraniHassan SajjadFahim DalviYonatan Belinkov
2020-10-06
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations
| Aina Garí SolerMarianna Apidianaki
2020-10-06
Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder
| Alvin ChanYi TayYew-Soon OngAston Zhang
2020-10-06
Incorporating Behavioral Hypotheses for Query Generation
Ruey-Cheng ChenChia-Jung Lee
2020-10-06
Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation
| Sebastian HofstätterSophia AlthammerMichael SchröderMete SertkanAllan Hanbury
2020-10-06
On the Sub-Layer Functionalities of Transformer Decoder
Yilin YangLongyue WangShuming ShiPrasad TadepalliStefan LeeZhaopeng Tu
2020-10-06
On the Interplay Between Fine-tuning and Sentence-level Probing for Linguistic Knowledge in Pre-trained Transformers
Marius MosbachAnna KhokhlovaMichael A. HedderichDietrich Klakow
2020-10-06
Scene Graph Modification Based on Natural Language Commands
| Xuanli HeQuan Hung TranGholamreza HaffariWalter ChangTrung BuiZhe LinFranck DernoncourtNhan Dam
2020-10-06
The Multilingual Amazon Reviews Corpus
Phillip KeungYichao LuGyörgy SzarvasNoah A. Smith
2020-10-06
Cross-Lingual Text Classification with Minimal Resources by Transferring a Sparse Teacher
| Giannis KaramanolakisDaniel HsuLuis Gravano
2020-10-06
LEGAL-BERT: The Muppets straight out of Law School
Ilias ChalkidisManos FergadiotisProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-06
Do Explicit Alignments Robustly Improve Multilingual Encoders?
| Shijie WuMark Dredze
2020-10-06
Pretrained Language Model Embryology: The Birth of ALBERT
| David C. ChiangSung-Feng HuangHung-Yi Lee
2020-10-06
Adversarial Grammatical Error Correction
Vipul RahejaDimitrios Alikaniotis
2020-10-06
Parallax Motion Effect Generation Through Instance Segmentation And Depth Estimation
Allan PintoManuel A. CórdovaLuis G. L. DeckerJose L. Flores-CampanaMarcos R. SouzaAndreza A. dos SantosJhonatas S. ConceiçãoHenrique F. GagliardiDiogo C. LuvizonRicardo da S. TorresHelio Pedrini
2020-10-06
Converting the Point of View of Messages Spoken to Virtual Assistants
| Isabelle G. LeeVera ZuSai Srujana BuddiDennis LiangJack G. M. FitzGerald
2020-10-06
Investigating African-American Vernacular English in Transformer-Based Text Generation
Sophie GroenwoldLily OuAesha ParekhSamhita HonnavalliSharon LevyDiba MirzaWilliam Yang Wang
2020-10-06
LOGAN: Local Group Bias Detection by Clustering
Jieyu ZhaoKai-Wei Chang
2020-10-06
Deep Reinforcement Learning for Electric Vehicle Routing Problem with Time Windows
Bo LinBissan GhaddarJatin Nathwani
2020-10-05
X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset
| Angel DazaAnette Frank
2020-10-05
PUM at SemEval-2020 Task 12: Aggregation of Transformer-based models' features for offensive language recognition
Piotr JaniszewskiMateusz SkibaUrszula Walińska
2020-10-05
Linguistic Profiling of a Neural Language Model
Alessio MiaschiDominique BrunatoFelice Dell'OrlettaGiulia Venturi
2020-10-05
PMI-Masking: Principled masking of correlated spans
Yoav LevineBarak LenzOpher LieberOmri AbendKevin Leyton-BrownMoshe TennenholtzYoav Shoham
2020-10-05
Pruning Redundant Mappings in Transformer Models via Spectral-Normalized Identity Prior
| Zi LinJeremiah Zhe LiuZi YangNan HuaDan Roth
2020-10-05
Unsupervised Reference-Free Summary Quality Evaluation via Contrastive Learning
| Hanlu WuTengfei MaLingfei WuTariro ManyumwaShouling Ji
2020-10-05
Improving AMR Parsing with Sequence-to-Sequence Pre-training
| Dongqin XuJunhui LiMuhua ZhuMin ZhangGuodong Zhou
2020-10-05
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
Shayne LongpreYu WangChristopher DuBois
2020-10-05
Transformer-Based Neural Text Generation with Syntactic Guidance
| Yinghao LiRui FengIsaac RehgChao Zhang
2020-10-05
Self-training Improves Pre-training for Natural Language Understanding
| Jingfei DuEdouard GraveBeliz GunelVishrav ChaudharyOnur CelebiMichael AuliVes StoyanovAlexis Conneau
2020-10-05
DCT-SNN: Using DCT to Distribute Spatial Information over Time for Learning Low-Latency Spiking Neural Networks
| Isha GargSayeed Shafayet ChowdhuryKaushik Roy
2020-10-05
GenAug: Data Augmentation for Finetuning Text Generators
Steven Y. FengVarun GangalDongyeop KangTeruko MitamuraEduard Hovy
2020-10-05
D3Net: Densely connected multidilated DenseNet for music source separation
Naoya TakahashiYuki Mitsufuji
2020-10-05
Mixup-Transfomer: Dynamic Data Augmentation for NLP Tasks
Lichao SunCongying XiaWenpeng YinTingTing LiangPhilip S. YuLifang He
2020-10-05
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
| Boxin WangShuohang WangYu ChengZhe GanRuoxi JiaBo LiJingjing Liu
2020-10-05
PAIR: Planning and Iterative Refinement in Pre-trained Transformers for Long Text Generation
Xinyu HuaLu Wang
2020-10-05
Adaptive Automotive Radar data Acquisition
Madhumitha SakthiAhmed Tewfik
2020-10-05
On Losses for Modern Language Models
| Stephane Aroca-OuelletteFrank Rudzicz
2020-10-04
An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
Ilias ChalkidisManos FergadiotisSotiris KotitsasProdromos MalakasiotisNikolaos AletrasIon Androutsopoulos
2020-10-04
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space
| Dayiheng LiuYeyun GongJie FuYu YanJiusheng ChenJiancheng LvNan DuanMing Zhou
2020-10-04
MetaDetect: Uncertainty Quantification and Prediction Quality Estimates for Object Detection
Marius SchubertKarsten KahlMatthias Rottmann
2020-10-04
Inquisitive Question Generation for High Level Text Comprehension
| Wei-Jen KoTe-Yuan ChenYiyan HuangGreg DurrettJunyi Jessy Li
2020-10-04
A New Mask R-CNN Based Method for Improved Landslide Detection
Silvia Liberata UlloAmrita MohanAlessandro SebastianelliShaik Ejaz AhamedBasant KumarRamji DwivediG. R. Sinha
2020-10-04
Personality Trait Detection Using Bagged SVM over BERT Word Embedding Ensembles
Amirmohammad KazameiniSamin FatehiYash MehtaSauleh EetemadiErik Cambria
2020-10-03
Mining Knowledge for Natural Language Inference from Wikipedia Categories
| Mingda ChenZewei ChuKarl StratosKevin Gimpel
2020-10-03
End-to-End Training of CNN Ensembles for Person Re-Identification
Ayse SerbetciYusuf Sinan Akgul
2020-10-03
Nonconvex Regularization for Network Slimming:Compressing CNNs Even More
| Kevin BuiFredrick ParkShuai ZhangYingyong QiJack Xin
2020-10-03
Polyphonic Piano Transcription Using Autoregressive Multi-State Note Model
Taegyun KwonDasaem JeongJuhan Nam
2020-10-02
Beyond Chemical 1D knowledge using Transformers
Ruud Van DeursenIgor V. TetkoGuillaume Godin
2020-10-02
Autoregressive Entity Retrieval
Nicola De CaoGautier IzacardSebastian RiedelFabio Petroni
2020-10-02
Data Transfer Approaches to Improve Seq-to-Seq Retrosynthesis
katsuhiko IshiguroKazuya UjiharaRyohto SawadaHirotaka AkitaMasaaki Kotera
2020-10-02
STIL -- Simultaneous Slot Filling, Translation, Intent Classification, and Language Identification: Initial Results using mBART on MultiATIS++
Jack G. M. FitzGerald
2020-10-02
LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention
| Ikuya YamadaAkari AsaiHiroyuki ShindoHideaki TakedaYuji Matsumoto
2020-10-02
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
Andreas RückléJonas PfeifferIryna Gurevych
2020-10-02
Cost-effective Selection of Pretraining Data: A Case Study of Pretraining BERT on Social Media
Xiang DaiSarvnaz KarimiBen HacheyCecile Paris
2020-10-02
Background Adaptive Faster R-CNN for Semi-Supervised Convolutional Object Detection of Threats in X-Ray Images
John B. SigmanGregory P. SpellKevin J LiangLawrence Carin
2020-10-02
WeChat Neural Machine Translation Systems for WMT20
Fandong MengJianhao YanYijin LiuYuan GaoXianfeng ZengQinsong ZengPeng LiMing ChenJie zhouSifan LiuHao Zhou
2020-10-01
CoLAKE: Contextualized Language and Knowledge Embedding
| Tianxiang SunYunfan ShaoXipeng QiuQipeng GuoYaru HuXuanjing HuangZheng Zhang
2020-10-01
Examining the rhetorical capacities of neural language models
Zining ZhuChuer PanMohamed AbdallaFrank Rudzicz
2020-10-01
A Compare Aggregate Transformer for Understanding Document-grounded Dialogue
Longxuan MaWei-Nan ZhangRunxin SunTing Liu
2020-10-01
Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID Twitter BERT and Bagging Ensemble Technique based on Plurality Voting
| Anshul Wadhawan
2020-10-01
Detecting White Supremacist Hate Speech using Domain Specific Word Embedding with Deep Learning and BERT
Hind Saleh AlatawiAreej Maatog AlhothaliKawthar Mustafa Moria
2020-10-01
Understanding tables with intermediate pre-training
| Julian Martin EisenschlosSyrine KricheneThomas Müller
2020-10-01
RRF102: Meeting the TREC-COVID Challenge with a 100+ Runs Ensemble
Michael BenderskyHonglei ZhuangJi MaShuguang HanKeith HallRyan Mcdonald
2020-10-01
Evaluating Multilingual BERT for Estonian
Claudia KittaskKirill MilintsevichKairit Sirts
2020-10-01
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling
Yan ShvartzshnaiderAnanth BalashankarVikas PatidarThomas WiesLakshminarayanan Subramanian
2020-10-01
MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Carson EisenachYagna PatelDhruv Madeka
2020-09-30
Rethinking Attention with Performers
| Krzysztof ChoromanskiValerii LikhosherstovDavid DohanXingyou SongAndreea GaneTamas SarlosPeter HawkinsJared DavisAfroz MohiuddinLukasz KaiserDavid BelangerLucy ColwellAdrian Weller
2020-09-30
Measuring Systematic Generalization in Neural Proof Generation with Transformers
| Nicolas GontierKoustuv SinhaSiva ReddyChristopher Pal
2020-09-30
Learning Hard Retrieval Cross Attention for Transformer
Hongfei XuQiuhui Liu
2020-09-30
A Tale of Two Linkings: Dynamically Gating between Schema Linking and Structural Linking for Text-to-SQL Parsing
Sanxing ChenAidan SanXiaodong LiuYangfeng Ji
2020-09-30
BERT for Monolingual and Cross-Lingual Reverse Dictionary
| Hang YanXiaonan LiXipeng Qiu
2020-09-30
AUBER: Automated BERT Regularization
Hyun Dong LeeSeongmin LeeU Kang
2020-09-30
Sequence-to-Sequence Learning for Indonesian Automatic Question Generator
| Ferdiant Joshua MuisAyu Purwarianti
2020-09-29
A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation
Dinghan ShenMingzhi ZhengYelong ShenYanru QuWeizhu Chen
2020-09-29
Contrastive Distillation on Intermediate Representations for Language Model Compression
| Siqi SunZhe GanYu ChengYuwei FangShuohang WangJingjing Liu
2020-09-29
The design and implementation of Language Learning Chatbot with XAI using Ontology and Transfer Learning
Nuobei ShiQin ZengRaymond Lee
2020-09-29
HINT3: Raising the bar for Intent Detection in the Wild
| Gaurav AroraChirag JainManas ChaturvediKrupal Modi
2020-09-29
Neural Retrieval for Question Answering with Cross-Attention Supervised Data Augmentation
Yinfei YangNing JinKuo LinMandy GuoDaniel Cer
2020-09-29
Self-grouping Convolutional Neural Networks
| Qingbei GuoXiao-Jun WuJosef KittlerZhiquan Feng
2020-09-29
Attention that does not Explain Away
Nan DingXinjie FanZhenzhong LanDale SchuurmansRadu Soricut
2020-09-29
TinyGAN: Distilling BigGAN for Conditional Image Generation
| Ting-Yun ChangChi-Jen Lu
2020-09-29
MaP: A Matrix-based Prediction Approach to Improve Span Extraction in Machine Reading Comprehension
Huaishao LuoYu ShiMing GongLinjun ShouTianrui Li
2020-09-29
Attention-Driven Body Pose Encoding for Human Activity Recognition
B DebnathM O'brienS. KumarA Behera
2020-09-29
Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study
Saurabh KulshreshthaJosé Luis Redondo-GarcíaChing-Yun Chang
2020-09-29
TEST_POSITIVE at W-NUT 2020 Shared Task-3: Joint Event Multi-task Learning for Slot Filling in Noisy Text
Chacha ChenChieh-Yang HuangYaqi HouYang ShiEnyan DaiJiaqi Wang
2020-09-29
Visually-Grounded Planning without Vision: Language Models Infer Detailed Plans from High-level Instructions
| Peter A. Jansen
2020-09-29
Gender prediction using limited Twitter Data
Maaike BurghoornMaaike H. T. de BoerStephan Raaijmakers
2020-09-29
Deep Transformers with Latent Depth
Xi-An LiAsa Cooper SticklandYuqing TangXiang Kong
2020-09-28
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Joo Seong JeongSoojeong KimGyeong-In YuYunseong LeeByung-Gon Chun
2020-09-28
EIS -- a family of activation functions combining Exponential, ISRU, and Softplus
Koushik BiswasSandeep KumarShilpak BanerjeeAshish Kumar Pandey
2020-09-28
PIN: A Novel Parallel Interactive Network for Spoken Language Understanding
Peilin ZhouZhiqi HuangFenglin LiuYuexian Zou
2020-09-28
Knowledge-Aware Procedural Text Understanding with Multi-Stage Training
Zhihan ZhangXiubo GengTao QinYunfang WuDaxin Jiang
2020-09-28
A Simple and Efficient Ensemble Classifier Combining Multiple Neural Network Models on Social Media Datasets in Vietnamese
Huy Duc HuynhHang Thi-Thuy DoKiet Van NguyenNgan Luu-Thuy Nguyen
2020-09-28
Fancy Man Lauches Zippo at WNUT 2020 Shared Task-1: A Bert Case Model for Wet Lab Entity Extraction
Haoding MengQingcheng ZengXiaoyang FangZhexin Liang
2020-09-28
VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training
Xiaowei HuXi YinKevin LinLijuan WangLei ZhangJianfeng GaoZicheng Liu
2020-09-28
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
| Shikib MehriMihail EricDilek Hakkani-Tur
2020-09-28
Detecting soccer balls with reduced neural networks: a comparison of multiple architectures under constrained hardware scenarios
| Douglas De Rizzo MeneghettiThiago Pedro Donadon HomemJonas Henrique Renolfi de OliveiraIsaac Jesus da SilvaDanilo Hernani PericoReinaldo Augusto da Costa Bianchi
2020-09-28
STAN: Synthetic Network Traffic Generation using Autoregressive Neural Models
| Shengzhe XuManish MarwahNaren Ramakrishnan
2020-09-27
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei ZhangLu HouYichun YinLifeng ShangXiao ChenXin JiangQun Liu
2020-09-27
What does it mean to be language-agnostic? Probing multilingual sentence encoders for typological properties
Rochelle ChoenniEkaterina Shutova
2020-09-27
KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning
| Ye LiuYao WanLifang HeHao PengPhilip S. Yu
2020-09-26
Techniques to Improve Q&A Accuracy with Transformer-based models on Large Complex Documents
Chejui LiaoTabish ManiarSravanajyothi NAnantha Sharma
2020-09-26
Metaphor Detection using Deep Contextualized Word Embeddings
Shashwat AggarwalRamesh Singh
2020-09-26
A little goes a long way: Improving toxic language classification despite data scarcity
Mika JuutiTommi GröndahlAdrian FlanaganN. Asokan
2020-09-25
Weird AI Yankovic: Generating Parody Lyrics
Mark Riedl
2020-09-25
MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems
| Zhaojiang LinAndrea MadottoGenta Indra WinataPascale Fung
2020-09-25
An Unsupervised Sentence Embedding Method byMutual Information Maximization
Yan ZhangRuidan HeZuozhu LiuKwan Hui LimLidong Bing
2020-09-25
DPN: Detail-Preserving Network with High Resolution Representation for Efficient Segmentation of Retinal Vessels
Song Guo
2020-09-25
BET: A Backtranslation Approach for Easy Data Augmentation in Transformer-based Paraphrase Identification Context
| Jean-Philippe CorbeilHadi Abdi Ghadivel
2020-09-25
HetSeq: Distributed GPU Training on Heterogeneous Infrastructure
| Yifan DingNicholas BotzerTim Weninger
2020-09-25
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences
| Boon Peng YapAndrew KohEng Siong Chng
2020-09-24
AnchiBERT: A Pre-Trained Model for Ancient ChineseLanguage Understanding and Generation
Huishuang TianKexin YangDayiheng LiuJiancheng Lv
2020-09-24
Toward a Thermodynamics of Meaning
Jonathan Scott Enderle
2020-09-24
A Comparative Study of Feature Types for Age-Based Text Classification
| Anna GlazkovaYury EgorovMaksim Glazkov
2020-09-24
Probabilistic Label Trees for Extreme Multi-label Classification
Kalina Jasinska-KobusMarek WydmuchKrzysztof DembczynskiMikhail KuznetsovRobert Busa-Fekete
2020-09-23
Robustification of Segmentation Models Against Adversarial Perturbations In Medical Imaging
Hanwool ParkAmirhossein BayatMohammad SabokrouJan S. KirschkeBjoern H. Menze
2020-09-23
Revisiting Design Choices in Proximal Policy Optimization
| Chloe Ching-Yun HsuCelestine Mendler-DünnerMoritz Hardt
2020-09-23
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition
| Bingcong LiXin TangXianbiao QiYihao ChenRong Xiao
2020-09-23
A Token-wise CNN-based Method for Sentence Compression
Weiwei HouHanna SuominenPiotr KoniuszSabrina CaldwellTom Gedeon
2020-09-23
Automatic Breast Lesion Classification by Joint Neural Analysis of Mammography and Ultrasound
Gavriel HabibNahum KiryatiMiri Sklair-LevyAnat ShalmonOsnat Halshtok NeimanRenata Faermann WeidenfeldYael YagilEli KonenArnaldo Mayer
2020-09-23
Multi-Pass Transformer for Machine Translation
Peng GaoChiori HoriShijie GengTakaaki HoriJonathan Le Roux
2020-09-23
Schizophrenia-mimicking layers outperform conventional neural network layers
| Ryuta MizutaniSenta NoguchiRino SaigaMitsuhiro MiyashitaMakoto AraiMasanari Itokawa
2020-09-23
Design of Efficient Deep Learning models for Determining Road Surface Condition from Roadside Camera Images and Weather Data
Juan CarrilloMark CrowleyGuangyuan PanLiping Fu
2020-09-22
Constructing interval variables via faceted Rasch measurement and multitask deep learning: a hate speech application
| Chris J. KennedyGeoff BaconAlexander SahnClaudia von Vacano
2020-09-22
GRACE: Gradient Harmonized and Cascaded Labeling for Aspect-based Sentiment Analysis
| Huaishao LuoLei JiTianrui LiNan DuanDaxin Jiang
2020-09-22
AutoRC: Improving BERT Based Relation Classification Models via Architecture Search
Wei ZhuXipeng QiuYuan NiGuotong Xie
2020-09-22
On Data Augmentation for Extreme Multi-label Classification
Danqing ZhangTao LiHaiyang ZhangBing Yin
2020-09-22
Empathetic Dialogue Generation via Knowledge Enhancing and Emotion Dependency Modeling
Qintong LiPiji LiZhumin ChenZhaochun Ren
2020-09-21
Alleviating the Inequality of Attention Heads for Neural Machine Translation
Zewei SunShu-Jian HuangXin-yu DaiJia-Jun Chen
2020-09-21
Latin BERT: A Contextual Language Model for Classical Philology
| David BammanPatrick J. Burns
2020-09-21
UCD-CS at W-NUT 2020 Shared Task-3: A Text to Text Approach for COVID-19 Event Extraction on Social Media
| Congcong WangDavid Lillis
2020-09-21
Impact of lung segmentation on the diagnosis and explanation of COVID-19 in chest X-ray images
Lucas O. TeixeiraRodolfo M. PereiraDiego BertoliniLuiz S. OliveiraLoris NanniYandre M. G. Costa
2020-09-21
Multitask Pointer Network for Multi-Representational Parsing
| Daniel Fernández-GonzálezCarlos Gómez-Rodríguez
2020-09-21
Profile Consistency Identification for Open-domain Dialogue Agents
| Haoyu SongYan WangWei-Nan ZhangZhengyu ZhaoTing LiuXiaojiang Liu
2020-09-21
CCBlock: An Effective Use of Deep Learning for Automatic Diagnosis of COVID-19 Using X-Ray Images
Ali Al-BawiKarrar Ali Al-KaabiMohammed JeryoAhmad Al-Fatlawi
2020-09-21
"When they say weed causes depression, but it's your fav antidepressant": Knowledge-aware Attention Framework for Relationship Extraction
Shweta YadavUsha LokalaRaminta DaniulaityteKrishnaprasad ThirunarayanFrancois LamyAmit Sheth
2020-09-21
Towards Fast, Accurate and Stable 3D Dense Face Alignment
| Jianzhu GuoXiangyu ZhuYang YangFan YangZhen LeiStan Z. Li
2020-09-21
Softmax Tempering for Training Neural Machine Translation Models
Raj DabreAtsushi Fujita
2020-09-20
VirtualFlow: Decoupling Deep Learning Model Execution from Underlying Hardware
Andrew OrHaoyu ZhangMichael J. Freedman
2020-09-20
Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging
| Ehsan DoostmohammadiMinoo NassajianAdel Rahimi
2020-09-20
Longformer for MS MARCO Document Re-ranking Task
| Ivan SekulićAmir SoleimaniMohammad AliannejadiFabio Crestani
2020-09-20
Dual-path CNN with Max Gated block for Text-Based Person Re-identification
Tinghuai MaMingming YangHuan RongYurong QianYuan TianNajlaAl-Nabhan
2020-09-20
Towards Computational Linguistics in Minangkabau Language: Studies on Sentiment Analysis and Machine Translation
| Fajri KotoIkhwan Koto
2020-09-19
BioALBERT: A Simple and Effective Pre-trained Language Model for Biomedical Named Entity Recognition
Usman NaseemMatloob KhushiVinay ReddySakthivel RajendranImran RazzakJinman Kim
2020-09-19
Nominal Compound Chain Extraction: A New Task for Semantic-enriched Lexical Chain
Bobo LiHao FeiYafeng RenDonghong Ji
2020-09-19
Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data
Jonathan PilaultAmine ElhattamiChristopher Pal
2020-09-19
Prior Art Search and Reranking for Generated Patent Text
Jieh-Sheng LeeJieh Hsiang
2020-09-19
NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative
Kumud Chauhan
2020-09-18
fastHan: A BERT-based Joint Many-Task Toolkit for Chinese NLP
| Zhichao GengHang YanXipeng QiuXuanjing Huang
2020-09-18
Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models
Jihyeon RohHuiseong GimSoo-Young Lee
2020-09-18
The birth of Romanian BERT
| Stefan Daniel DumitrescuAndrei-Marius AvramSampo Pyysalo
2020-09-18
Will it Unblend?
| Yuval PinterCassandra L. JacobsJacob Eisenstein
2020-09-18
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
| Zhiqiang ShenMarios Savvides
2020-09-17
GraphCodeBERT: Pre-training Code Representations with Data Flow
Daya GuoShuo RenShuai LuZhangyin FengDuyu TangShujie LiuLong ZhouNan DuanAlexey SvyatkovskiyShengyu FuMichele TufanoShao Kun DengColin ClementDawn DrainNeel SundaresanJian YinDaxin JiangMing Zhou
2020-09-17
Multi$^2$OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT
| Youngbin RoYukyung LeePilsung Kang
2020-09-17
Towards Fully 8-bit Integer Inference for the Transformer Model
Ye LinYanyang LiTengbo LiuTong XiaoTongran LiuJingbo Zhu
2020-09-17
A Multimodal Memes Classification: A Survey and Open Research Issues
Tariq Habib AfridiAftab AlamMuhammad Numan KhanJawad KhanYoung-Koo Lee
2020-09-17
Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA
Ieva StaliūnaitėIgnacio Iacobacci
2020-09-17
DSC IIT-ISM at SemEval-2020 Task 6: Boosting BERT with Dependencies for Definition Extraction
| Aadarsh SinghPriyanshu KumarAman Sinha
2020-09-17
Efficient Transformer-based Large Scale Language Representations using Hardware-friendly Block Structured Pruning
Bingbing LiZhenglun KongTianyun ZhangJi LiZhengang LiHang LiuCaiwen Ding
2020-09-17
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation
Po LiLei LIYan FuJun RongYu Zhang
2020-09-17
Automated Source Code Generation and Auto-completion Using Deep Learning: Comparing and Discussing Current Language-Model-Related Approaches
| Juan Cruz-BenitoSanjay VishwakarmaFrancisco Martin-FernandezIsmael Faro
2020-09-16
NABU $\mathrm{-}$ Multilingual Graph-based Neural RDF Verbalizer
Diego MoussallemDwaraknath GnaneshwarThiago castro FerreiraAxel-Cyrille Ngonga Ngomo
2020-09-16
Graph-to-Sequence Neural Machine Translation
Sufeng DuanHai ZhaoRui Wang
2020-09-16
Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation
Insoo ChungByeongwook KimYoonjung ChoiSe Jung KwonYongkweon JeonBaeseong ParkSangha KimDongsoo Lee
2020-09-16
Retrofitting Structure-aware Transformer Language Model for End Tasks
Hao FeiYafeng RenDonghong Ji
2020-09-16
Deep Learning Approaches for Extracting Adverse Events and Indications of Dietary Supplements from Clinical Text
Yadan FanSicheng ZhouYi-Fan LiRui Zhang
2020-09-16
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
| Jian GuanMinlie Huang
2020-09-16
Simplified TinyBERT: Knowledge Distillation for Document Retrieval
Xuanang ChenBen HeKai HuiLe SunYingfei Sun
2020-09-16
EfficientNet-eLite: Extremely Lightweight and Efficient CNN Models for Edge Devices by Network Candidate Search
| Ching-Chen WangChing-Te ChiuJheng-Yi Chang
2020-09-16
Solomon at SemEval-2020 Task 11: Ensemble Architecture for Fine-Tuned Propaganda Detection in News Articles
Mayank RajAjay JaiswalRohit R. RAnkita GuptaSudeep Kumar SahooVertika SrivastavaYeon Hyang Kim
2020-09-16
Document-level Neural Machine Translation with Document Embeddings
Shu JiangHai ZhaoZuchao LiBao-liang Lu
2020-09-16
Event Presence Prediction Helps Trigger Detection Across Languages
Parul AwasthyTahira NaseemJian NiTaesun MoonRadu Florian
2020-09-15
Attention-Aware Inference for Neural Abstractive Summarization
Ye MaLu Zong
2020-09-15
BERT-QE: Contextualized Query Expansion for Document Re-ranking
Zhi ZhengKai HuiBen HeXianpei HanLe SunAndrew Yates
2020-09-15
Lessons Learned from Applying off-the-shelf BERT: There is no Silver Bullet
Victor MakarenkovLior Rokach
2020-09-15
Critical Thinking for Language Models
| Gregor Betz
2020-09-15
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
| Timo SchickHinrich Schütze
2020-09-15
MLMLM: Link Prediction with Mean Likelihood Masked Language Model
Louis ClouatrePhilippe TrempeAmal ZouaqSarath Chandar
2020-09-15
Dialogue Response Ranking Training with Large-Scale Human Feedback Data
| Xiang GaoYizhe ZhangMichel GalleyChris BrockettBill Dolan
2020-09-15
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization
Wei NiuZhenglun KongGeng YuanWeiwen JiangJiexiong GuanCaiwen DingPu ZhaoSijia LiuBin RenYanzhi Wang
2020-09-15
The Radicalization Risks of GPT-3 and Advanced Neural Language Models
Kris McGuffieAlex Newhouse
2020-09-15
A Mobile App for Wound Localization using Deep Learning
| D. M. AnisuzzamanYash PatelJeffrey NiezgodaSandeep GopalakrishnanZeyun Yu
2020-09-15
Augmented Natural Language for Generative Sequence Labeling
Ben AthiwaratkunCicero Nogueira dos santosJason KroneBing Xiang
2020-09-15
DeNERT-KG: Named Entity and Relation Extraction Model Using DQN, Knowledge Graph, and BERT
SungMin YangSoYeop YooOkRan Jeong
2020-09-15
GeDi: Generative Discriminator Guided Sequence Generation
| Ben KrauseAkhilesh Deepak GotmareBryan McCannNitish Shirish KeskarShafiq JotyRichard SocherNazneen Fatema Rajani
2020-09-14
Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue
Longxiang LiuZhuosheng ZhangHai ZhaoXi ZhouXiang Zhou
2020-09-14
Beyond Accuracy: ROI-driven Data Analytics of Empirical Data
Gouri DeshpandeGuenther Ruhe
2020-09-14
Can Fine-tuning Pre-trained Models Lead to Perfect NLP? A Study of the Generalizability of Relation Extraction
| Ningyu ZhangLuoqiu LiShumin DengHaiyang YuXu ChengWei zhangHuajun Chen
2020-09-14
RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning
| Hao TanRan ChengShihua HuangCheng HeChangxiao QiuFan YangPing Luo
2020-09-14
Efficient Transformers: A Survey
Yi TayMostafa DehghaniDara BahriDonald Metzler
2020-09-14
BoostingBERT:Integrating Multi-Class Boosting into BERT for NLP Tasks
Tongwen HuangQingyun SheJunlin Zhang
2020-09-13
Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding
Shuohang WangLuowei ZhouZhe GanYen-Chun ChenYuwei FangSiqi SunYu ChengJingjing Liu
2020-09-13
Fine-tuning Pre-trained Contextual Embeddings for Citation Content Analysis in Scholarly Publication
Haihua ChenHuyen Nguyen
2020-09-12
Country Image in COVID-19 Pandemic: A Case Study of China
| Huimin ChenZeyu ZhuFanchao QiYining YeZhiyuan LiuMaosong SunJianbin Jin
2020-09-12
CIA_NITT at WNUT-2020 Task 2: Classification of COVID-19 Tweets Using Pre-trained Language Models
Yandrapati Prakash BabuRajagopal Eswari
2020-09-12
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
| Yuxuan CaiHongjia LiGeng YuanWei NiuYanyu LiXulong TangBin RenYanzhi Wang
2020-09-12
GTEA: Representation Learning for Temporal Interaction Graphs via Edge Aggregation
Yiming LiDa Sun Handason TamSiyue XieXiaxin LiuQiu Fang YingWing Cheong LauDah Ming ChiuShou Zhi Chen
2020-09-11
A Comparison of LSTM and BERT for Small Corpus
Aysu Ezen-Can
2020-09-11
SoFAr: Shortcut-based Fractal Architectures for Binary Convolutional Neural Networks
Zhu BaozhouPeter HofsteeJinho LeeZaid Al-Ars
2020-09-11
UPB at SemEval-2020 Task 11: Propaganda Detection with Domain-Specific Trained BERT
Andrei ParaschivDumitru-Clementin CercelMihai Dascalu
2020-09-11
UPB at SemEval-2020 Task 6: Pretrained Language Models for Definition Extraction
| Andrei-Marius AvramDumitru-Clementin CercelCostin-Gabriel Chiru
2020-09-11
Investigating Bi-LSTM and CRF with POS Tag Embedding for Indonesian Named Entity Tagger
Devin HoesenAyu Purwarianti
2020-09-11
Compressed Deep Networks: Goodbye SVD, Hello Robust Low-Rank Approximation
Murad TukanAlaa MaaloufMatan WekslerDan Feldman
2020-09-11
Unit Test Case Generation with Transformers
| Michele TufanoDawn DrainAlexey SvyatkovskiyShao Kun DengNeel Sundaresan
2020-09-11
Investigating Gender Bias in BERT
Rishabh BhardwajNavonil MajumderSoujanya Poria
2020-09-10
Modern Methods for Text Generation
| Dimas Munoz Montesinos
2020-09-10
Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection
| Taesun WhangDongyub LeeDongsuk OhChanhee LeeKijong HanDong-hun LeeSaebyeok Lee
2020-09-10
Learning Universal Representations from Word to Sentence
Yian LiHai Zhao
2020-09-10
Multi-modal embeddings using multi-task learning for emotion recognition
Aparna KhareSrinivas ParthasarathyShiva Sundaram
2020-09-10
Globally-scalable Automated Target Recognition (GATR)
Gary ChernAusten GroenerMichael HarnerTyler KuhnsAndy LamStephen O'NeillMark Pritt
2020-09-10
Brain2Word: Decoding Brain Activity for Language Generation
| Nicolas AffolterBeni EgressyDamian PascualRoger Wattenhofer
2020-09-10
Sparsifying Transformer Models with Differentiable Representation Pooling
Michał PietruszkaŁukasz BorchmannFilip Graliński
2020-09-10
Rank over Class: The Untapped Potential of Ranking in Natural Language Processing
| Amir Atapour-AbarghoueiStephen BonnerAndrew Stephen McGough
2020-09-10
FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding
Yuwei FangShuohang WangZhe GanSiqi SunJingjing Liu
2020-09-10
Comprehensive Comparison of Deep Learning Models for Lung and COVID-19 Lesion Segmentation in CT scans
| Paschalis BizopoulosNicholas VretosPetros Daras
2020-09-10
Comparative Study of Language Models on Cross-Domain Data with Model Agnostic Explainability
Mayank ChhipaHrushikesh Mahesh VazurkarAbhijeet KumarMridul Mishra
2020-09-09
Deep learning for gravitational-wave data analysis: A resampling white-box approach
Manuel D. MoralesJavier M. AntelisClaudia MorenoAlexander I. Nesterov
2020-09-09
Pay Attention when Required
| Swetha MandavaSzymon MigaczAlex Fit Florea
2020-09-09
not-so-BigGAN: Generating High-Fidelity Images on a Small Compute Budget
Seungwook HanAkash SrivastavaCole HurwitzPrasanna SattigeriDavid D. Cox
2020-09-09
Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification
| Yunsheng ShiZhengjie HuangWenjin WangHui ZhongShikun FengYu Sun
2020-09-08
TanhSoft -- a family of activation functions combining Tanh and Softplus
Koushik BiswasSandeep KumarShilpak BanerjeeAshish Kumar Pandey
2020-09-08
ERNIE at SemEval-2020 Task 10: Learning Word Emphasis Selection by Pre-trained Language Model
Zhengjie HuangShikun FengWeiyue SuXuyi ChenShuohuan WangJiaxiang LiuXuan OuyangYu Sun
2020-09-08
A Deep Neural Network Tool for Automatic Segmentation of Human Body Parts in Natural Scenes
| Patrick McClureGabrielle ReimannMichal RamotFrancisco Pereira
2020-09-08
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
Sahar AbdelnabiMario Fritz
2020-09-07
TransModality: An End2End Fusion Method with Transformer for Multimodal Sentiment Analysis
Zilong WangZhaohong WanXiaojun Wan
2020-09-07
E-BERT: A Phrase and Product Knowledge Enhanced Language Model for E-commerce
Denghui ZhangZixuan YuanYanchi LiuZuohui FuFuzhen ZhuangPengyang WangHaifeng ChenHui Xiong
2020-09-07
Measuring Massive Multitask Language Understanding
| Dan HendrycksCollin BurnsSteven BasartAndy ZouMantas MazeikaDawn SongJacob Steinhardt
2020-09-07
Active deep learning method for the discovery of objects of interest in large spectroscopic surveys
Petr ŠkodaOndřej PodsztavekPavel Tvrdík
2020-09-07
Deepfake detection: humans vs. machines
Pavel KorshunovSébastien Marcel
2020-09-07
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing
Josh KalinMatthew CiolinoDavid NoeverGerry Dozier
2020-09-07
Robust Conversational AI with Grounded Text Generation
Jianfeng GaoBaolin PengChunyuan LiJinchao LiShahin ShayandehLars LidenHeung-Yeung Shum
2020-09-07
Stochastic-YOLO: Efficient Probabilistic Object Detection under Dataset Shifts
Tiago AzevedoRené de JongPartha Maji
2020-09-07
Improving Language Generation with Sentence Coherence Objective
| Ruixiao SunJie YangMehrdad Yousefzadeh
2020-09-07
UPB at SemEval-2020 Task 8: Joint Textual and Visual Modeling in a Multi-Task Learning Architecture for Memotion Analysis
George-Alexandru VladGeorge-Eduard ZahariaDumitru-Clementin CercelCostin-Gabriel ChiruStefan Trausan-Matu
2020-09-06
QiaoNing at SemEval-2020 Task 4: Commonsense Validation and Explanation system based on ensemble of language model
Pai Liu
2020-09-06
EdinburghNLP at WNUT-2020 Task 2: Leveraging Transformers with Generalized Augmentation for Identifying Informativeness in COVID-19 Tweets
Nickil Maveli
2020-09-06
Accenture at CheckThat! 2020: If you say so: Post-hoc fact-checking of claims using transformer-based models
Evan WilliamsPaul RodriguesValerie Novak
2020-09-05
Imbalanced Image Classification with Complement Cross Entropy
| Yechan KimYounkwan LeeMoongu Jeon
2020-09-04
S3NAS: Fast NPU-aware Neural Architecture Search Methodology
| Jaeseong LeeDuseok KangSoonhoi Ha
2020-09-04
Tasks Integrated Networks: Joint Detection and Retrieval for Image Search
Lei ZhangZhenwei HeYi YangLiang WangXinbo Gao
2020-09-03
VddNet: Vine Disease Detection Network Based on Multispectral Images and Depth Map
Mohamed KerkechAdel HafianeRaphael Canals
2020-09-03
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption
| Charles LehmanDogancan TemelGhassan AlRegib
2020-09-02
Transform Quantization for CNN Compression
Sean I. YoungWang ZheDavid TaubmanBernd Girod
2020-09-02
Lifelong Object Detection
Wang ZhouShiyu ChangNorma SosaHendrik HamannDavid Cox
2020-09-02
Neural Crossbreed: Neural Based Image Metamorphosis
Sanghun ParkKwanggyoon SeoJunyong Noh
2020-09-02
Comparative Evaluation of Pretrained Transfer Learning Models on Automatic Short Answer Grading
Sasi Kiran GaddipatiDeebul NairPaul G. Plöger
2020-09-02
Multi-domain semantic segmentation with pyramidal fusion
Marin OršićPetra BevandićIvan GrubišićJosip ŠarićSiniša Šegvić
2020-09-02
WaveGrad: Estimating Gradients for Waveform Generation
| Nanxin ChenYu ZhangHeiga ZenRon J. WeissMohammad NorouziWilliam Chan
2020-09-02
Automatic Assignment of Radiology Examination Protocols Using Pre-trained Language Models with Knowledge Distillation
Wilson LauLaura AaltonenMartin GunnMeliha Yetisgen
2020-09-01
LiftFormer: 3D Human Pose Estimation using attention models
Adrian Llopart
2020-09-01
Sentimental LIAR: Extended Corpus and Deep Learning Models for Fake Claim Classification
Bibek UpadhayayVahid Behzadan
2020-09-01
Dynamic Scheduling for Stochastic Edge-Cloud Computing Environments using A3C learning and Residual Recurrent Neural Networks
| Shreshth TuliShashikant IlagerKotagiri RamamohanaraoRajkumar Buyya
2020-09-01
BiTT: Bidirectional Tree Tagging for Joint Extraction of Overlapping Entities and Relations
Xukun LuoWeijie LiuMeng MaPing Wang
2020-08-31
PNEL: Pointer Network based End-To-End Entity Linking over Knowledge Graphs
Debayan BanerjeeDebanjan ChaudhuriMohnish DubeyJens Lehmann
2020-08-31
A Topological Framework for Deep Learning
Mustafa HajijKyle Istvan
2020-08-31
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei LiJames QinChung-Cheng ChiuRuoming PangYanzhang He
2020-08-30
SocCogCom at SemEval-2020 Task 11: Characterizing and Detecting Propaganda using Sentence-Level Emotional Salience Features
| Gangeshwar KrishnamurthyRaj Kumar GuptaYinping Yang
2020-08-29
Knowledge Efficient Deep Learning for Natural Language Processing
Hai Wang
2020-08-28
TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets
Anh Tuan Nguyen
2020-08-28
HittER: Hierarchical Transformers for Knowledge Graph Embeddings
Sanxing ChenXiaodong LiuJianfeng GaoJian JiaoRuofei ZhangYangfeng Ji
2020-08-28
Rethinking the objectives of extractive question answering
Martin FajcikJosef JonSantosh KesirajuPavel Smrz
2020-08-28
Entity and Evidence Guided Relation Extraction for DocRED
Kevin HuangGuangtao WangTengyu MaJing Huang
2020-08-27
GREEK-BERT: The Greeks visiting Sesame Street
| John KoutsikakisIlias ChalkidisProdromos MalakasiotisIon Androutsopoulos
2020-08-27
Query Focused Multi-document Summarisation of Biomedical Texts
| Diego MollaChristopher JonesVincent Nguyen
2020-08-27
Improvement of a dedicated model for open domain persona-aware dialogue generation
| Qiang Han
2020-08-27
MultiGBS: A multi-layer graph approach to biomedical summarization
Ensieh DavoodijamNasser GhadiriMaryam Lotfi ShahrezaFabio Rinaldi
2020-08-27
AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization
Xinsong ZhangHang Li
2020-08-27
DAVE: Deriving Automatically Verilog from English
Hammond PearceBenjamin TanRamesh Karri
2020-08-27
A free web service for fast COVID-19 classification of chest X-Ray images
| Jose David Bermudez CastroRicardo ReiJose E. RuizPedro Achanccaray DiazSmith Arauco CanchumuniCristian Muñoz VillalobosFelipe Borges CoelhoLeonardo Forero MendozaMarco Aurelio C. Pacheco
2020-08-27
A Multitask Deep Learning Approach for User Depression Detection on Sina Weibo
Yiding WangZhenyi WangChenghao LiYilin ZhangHaizhou Wang
2020-08-26
5G Utility Pole Planner Using Google Street View and Mask R-CNN
Yanyu ZhangOsama Alshaykh
2020-08-26
Discrete Word Embedding for Logical Natural Language Understanding
Masataro AsaiZilu Tang
2020-08-26
Language Models and Word Sense Disambiguation: An Overview and Analysis
| Daniel LoureiroKiamehr RezaeeMohammad Taher PilehvarJose Camacho-Collados
2020-08-26
APMSqueeze: A Communication Efficient Adam-Preconditioned Momentum SGD Algorithm
Hanlin TangShaoduo GanSamyam RajbhandariXiangru LianJi LiuYuxiong HeCe Zhang
2020-08-26
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing
Wei ShenXiaonan HeChuheng ZhangQiang NiWanchun DouYan Wang
2020-08-25
PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization
Zhize LiHongyan BaoXiangliang ZhangPeter Richtárik
2020-08-25
CDeC-Net: Composite Deformable Cascade Network for Table Detection in Document Images
| Madhav AgarwalAjoy MondalC. V. Jawahar
2020-08-25
Conceptualized Representation Learning for Chinese Biomedical Text Mining
| Ningyu ZhangQianghuai JiaKangping YinLiang DongFeng GaoNengwei Hua
2020-08-25
Balanced Activation for Long-tailed Visual Recognition
Jiawei RenCunjun YuZhongang CaiHaiyu Zhao
2020-08-24
YNU-HPCC at SemEval-2020 Task 11: LSTM Network for Detection of Propaganda Techniques in News Articles
| Jiaxu DaoJin WangXue-jie Zhang
2020-08-24
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III
Brent BisedaGaurav DesaiHaifeng LinAnish Philip
2020-08-24
Two Stages Approach for Tweet Engagement Prediction
Amine DadounIsmail HarrandoPasquale LisenaAlison ReboudRaphael Troncy
2020-08-24
End to End Dialogue Transformer
Ondřej MěkotaMemduh GökırmakPetr Laitoch
2020-08-24
Knowledge-Empowered Representation Learning for Chinese Medical Reading Comprehension: Task, Model and Resources
Taolin ZhangChengyu WangMinghui QiuBite YangXiaofeng HeJun Huang
2020-08-24
syrapropa at SemEval-2020 Task 11: BERT-based Models Design For Propagandistic Technique and Span Detection
Jinfen LiLu Xiao
2020-08-24
Robust Vision Challenge 2020 -- 1st Place Report for Panoptic Segmentation
Rohit MohanAbhinav Valada
2020-08-23
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural Networks
| Salman MaqboolAqsa RiazHasan SajidOsman Hasan
2020-08-23
Towards Improved Human Action Recognition Using Convolutional Neural Networks and Multimodal Fusion of Depth and Inertial Sensor Data
Zeeshan AhmadNaimul Khan
2020-08-22
DUTH at SemEval-2020 Task 11: BERT with Entity Mapping for Propaganda Classification
Anastasios BairaktarisSymeon SymeonidisAvi Arampatzis
2020-08-22
CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection
| Verena BlaschkeMaxim KorniyenkoSam Tureski
2020-08-22
HinglishNLP: Fine-tuned Language Models for Hinglish Sentiment Detection
| Meghana BhangeNirant Kasliwal
2020-08-22
Identity-Aware Multi-Sentence Video Description
| Jae Sung ParkTrevor DarrellAnna Rohrbach
2020-08-22
Applications of BERT Based Sequence Tagging Models on Chinese Medical Text Attributes Extraction
Gang ZhaoTeng ZhangChenxiao WangPing LvJi Wu
2020-08-22
FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT
| Omar MossadAmgad AhmedAnandharaju RajuHari KarthikeyanZayed Ahmed
2020-08-22
Abstractive Summarization of Spoken andWritten Instructions with BERT
| Alexandra SavelievaBryan Au-YeungVasanth Ramani
2020-08-21
Adapting Event Extractors to Medical Data: Bridging the Covariate Shift
Aakanksha NaikJill LehmanCarolyn Rose
2020-08-21
Lite Training Strategies for Portuguese-English and English-Portuguese Translation
| Alexandre LopesRodrigo NogueiraRoberto LotufoHelio Pedrini
2020-08-20
An Experimental Study of Deep Neural Network Models for Vietnamese Multiple-Choice Reading Comprehension
Son T. LuuKiet Van NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-08-20
AWNet: Attentive Wavelet Network for Image ISP
| Linhui DaiXiaohong LiuChengqi LiJun Chen
2020-08-20
PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data
| Diedre CarmoMarcos PiauIsrael CampiottiRodrigo NogueiraRoberto Lotufo
2020-08-20
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization
Junyan WangYang BaiYang LongBingZhang HuZhenhua ChaiYu GuanXiaolin Wei
2020-08-19
Anchor-free Small-scale Multispectral Pedestrian Detection
| Alexander WolpertMichael TeutschM. Saquib SarfrazRainer Stiefelhagen
2020-08-19
UoB at SemEval-2020 Task 12: Boosting BERT with Corpus Level Information
Wah Meng LimHarish Tayyar Madabushi
2020-08-19
Glancing Transformer for Non-Autoregressive Neural Machine Translation
Lihua QianHao ZhouYu BaoMingxuan WangLin QiuWei-Nan ZhangYong YuLei LI
2020-08-18
Very Deep Transformers for Neural Machine Translation
Xiaodong LiuKevin DuhLiyuan LiuJianfeng Gao
2020-08-18
Ranking Clarification Questions via Natural Language Inference
Vaibhav KumarVikas RaunakJamie Callan
2020-08-18
Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes
Liangyuan HuChenyang Gu
2020-08-18
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study
Karthik GopalakrishnanBehnam HedayatniaLongshaokan WangYang LiuDilek Hakkani-Tur
2020-08-18
Multilanguage Number Plate Detection using Convolutional Neural Networks
Jatin GuptaVandana SainiKamaldeep Garg
2020-08-18
Deep Learning Based Open Set Acoustic Scene Classification
Zuzanna KwiatkowskaBeniamin KalinowskiMichał KośmiderKrzysztof Rykaczewski
2020-08-17
Narrative Interpolation for Generating and Understanding Stories
Su WangGreg DurrettKatrin Erk
2020-08-17
Spatial Temporal Transformer Network for Skeleton-based Action Recognition
| Chiara PlizzariMarco CanniciMatteo Matteucci
2020-08-17
Stock Index Prediction with Multi-task Learning and Word Polarity Over Time
Yue ZhouKerstin Voigt
2020-08-17
Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study
Dara BahriYi TayChe ZhengDonald MetzlerCliff BrunkAndrew Tomkins
2020-08-17
Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Davis YoshidaAllyson EttingerKevin Gimpel
2020-08-16
DCR-Net: A Deep Co-Interactive Relation Network for Joint Dialog Act Recognition and Sentiment Classification
Libo QinWanxiang CheYangming LiMinheng NiTing Liu
2020-08-16
DeVLBert: Learning Deconfounded Visio-Linguistic Representations
| Shengyu ZhangTan JiangTan WangKun KuangZhou ZhaoJianke ZhuJin YuHongxia YangFei Wu
2020-08-16
TopicBERT: A Transformer transfer learning based memory-graph approach for multimodal streaming social media topic detection
Meysam Asgari-ChenaghluMohammad-Reza Feizi-DerakhshiLeili farzinvashMohammad-Ali BalafarCina Motamed
2020-08-16
A Deep Convolutional Neural Network for the Detection of Polyps in Colonoscopy Images
Tariq RahimSyed Ali HassanSoo Young Shin
2020-08-15
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry TsaiJayden OoiChun-Sung FerngHyung Won ChungJason Riesa
2020-08-15
Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition
| Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Jointly Fine-Tuning “BERT-like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition
| Shamane SiriwardhanaAndrew ReisRivindu WeerasekeraSuranga Nanayakkara
2020-08-15
Hate Speech Detection and Racial Bias Mitigation in Social Media based on BERT model
Marzieh MozafariReza FarahbakhshNoel Crespi
2020-08-14
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
Andrea MadottoZihan LiuZhaojiang LinPascale Fung
2020-08-14
Adaptable Multi-Domain Language Model for Transformer ASR
Taewoo LeeMin-Joong LeeTae Gyoon KangSeokyeoung JungMinseok KwonYeona HongJungin LeeKyoung-Gu WooHo-Gyeong KimJiseung JeongJi-Hyun LeeHosik LeeYoung Sang Choi
2020-08-14
A Hybrid BERT and LightGBM based Model for Predicting Emotion GIF Categories on Twitter
Ye BiShuo WangZhongrui Fan
2020-08-14
End-to-end Contextual Perception and Prediction with Interaction Transformer
Lingyun Luke LiBin YangMing LiangWenyuan ZengMengye RenSean SegalRaquel Urtasun
2020-08-13
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion
| Dipjyoti PaulMuhammed PV ShifasYannis PantazisYannis Stylianou
2020-08-13
MICE: Mining Idioms with Contextual Embeddings
| Tadej ŠkvorcPolona GantarMarko Robnik-Šikonja
2020-08-13
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong HuangWenchao HuYu Ting YeungXiao Chen
2020-08-13
What leads to generalization of object proposals?
Rui WangDhruv MahajanVignesh Ramanathan
2020-08-13
Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
| Jialian WuLiangchen SongTiancai WangQian ZhangJunsong Yuan
2020-08-13
Large-scale Transfer Learning for Low-resource Spoken Language Understanding
Xueli JiaJianzong WangZhiyong ZhangNing ChengJing Xiao
2020-08-13
ANDES at SemEval-2020 Task 12: A jointly-trained BERT multilingual model for offensive language detection
| Juan Manuel PérezAymé ArangoFranco Luque
2020-08-13
MMM : Exploring Conditional Multi-Track Music Generation with the Transformer
Jeff EnsPhilippe Pasquier
2020-08-13
Variance-reduced Language Pretraining via a Mask Proposal Network
Liang Chen
2020-08-12
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders
| Nicola MessinaGiuseppe AmatoAndrea EsuliFabrizio FalchiClaudio GennaroStéphane Marchand-Maillet
2020-08-12
Compression of Deep Learning Models for Text: A Survey
Manish GuptaPuneet Agrawal
2020-08-12
Evaluating the Impact of Knowledge Graph Context on Entity Disambiguation Models
| Isaiah Onando Mulang'Kuldeep SinghChaitali PrabhuAbhishek NadgeriJohannes HoffartJens Lehmann
2020-08-12
Leveraging Automated Mixed-Low-Precision Quantization for tiny edge microcontrollers
Manuele RusciMarco FariselliAlessandro CapotondiLuca Benini
2020-08-12
LogoDet-3K: A Large-Scale Image Dataset for Logo Detection
| Jing WangWeiqing MinSujuan HouShengnan MaYuanjie ZhengShuqiang Jiang
2020-08-12
Facial Expression Recognition Under Partial Occlusion from Virtual Reality Headsets based on Transfer Learning
Bita HoushmandNaimul Khan
2020-08-12
Reinforced Wasserstein Training for Severity-Aware Semantic Segmentation in Autonomous Driving
Xiaofeng LiuYimeng ZhangXiongchang LiuSong BaiSite LiJane You
2020-08-11
PneumoXttention: A CNN compensating for Human Fallibility when Detecting Pneumonia through CXR images with Attention
Sanskriti Singh
2020-08-11
KR-BERT: A Small-Scale Korean-Specific Language Model
| Sangah LeeHansol JangYunmee BaikSuzi ParkHyopil Shin
2020-08-10
FireBERT: Hardening BERT-based classifiers against adversarial attack
| Gunnar MeinKevin HartmanAndrew Morris
2020-08-10
Navigating Human Language Models with Synthetic Agents
Philip FeldmanAntonio Bucchiarone
2020-08-10
Bilevel Learning Model Towards Industrial Scheduling
Longkang LiHui-Ling ZhenMingxuan YuanJiawen LuXialiangTongJia ZengJun WangDirk Schnieders
2020-08-10
Does BERT Solve Commonsense Task via Commonsense Knowledge?
Leyang CuiSijie ChengYu WuYue Zhang
2020-08-10
Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine
Kuan FangLong ZhaoZhan ShenRuiXing WangRiKang ZhourLiWen Fan
2020-08-10
GANBERT: Generative Adversarial Networks with Bidirectional Encoder Representations from Transformers for MRI to PET synthesis
Hoo-chang ShinAlvin IhsaniSwetha MandavaSharath Turuvekere SreenivasChristopher ForsterJiook ChaAlzheimer's Disease Neuroimaging Initiative
2020-08-10
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
| Hayato FutamiHirofumi InagumaSei UenoMasato MimuraShinsuke SakaiTatsuya Kawahara
2020-08-09
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-08-09
DIET-SNN: Direct Input Encoding With Leakage and Threshold Optimization in Deep Spiking Neural Networks
Nitin RathiKaushik Roy
2020-08-09
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions
| Dipjyoti PaulYannis PantazisYannis Stylianou
2020-08-09
HASeparator: Hyperplane-Assisted Softmax
Ioannis KansizoglouNicholas SantavasLoukas BampisAntonios Gasteratos
2020-08-08
Forming Local Intersections of Projections for Classifying and Searching Histopathology Images
Aditya SriramShivam KalraMorteza BabaieBrady KiefferWaddah Al DrobiShahryar RahnamayanHany KashaniHamid. R. Tizhoosh
2020-08-08
Towards Lossless Binary Convolutional Neural Networks Using Piecewise Approximation
Baozhou ZhuZaid Al-ArsWei Pan
2020-08-08
Assessing Demographic Bias in Named Entity Recognition
Shubhanshu MishraSijun HeLuca Belli
2020-08-08
Deep Robust Clustering by Contrastive Learning
Huasong ZhongChong ChenZhongming JinXian-Sheng Hua
2020-08-07
SemEval-2020 Task 10: Emphasis Selection for Written Text in Visual Media
Amirreza ShiraniFranck DernoncourtNedim LipkaPaul AsenteJose EchevarriaThamar Solorio
2020-08-07
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Wen-Chin HuangTomoki HayashiYi-Chiao WuHirokazu KameokaTomoki Toda
2020-08-07
Classifying sleep-wake stages through recurrent neural networks using pulse oximetry signals
Ramiro CasalLeandro E. Di PersiaGastón Schlotthauer
2020-08-07
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
| Patrick LewisPontus StenetorpSebastian Riedel
2020-08-06
IIIT-AR-13K: A New Dataset for Graphical Object Detection in Documents
Ajoy MondalPeter LippsC. V. Jawahar
2020-08-06
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zi-Hang JiangWeihao YuDaquan ZhouYunpeng ChenJiashi FengShuicheng Yan
2020-08-06
DeText: A Deep Text Ranking Framework with BERT
| Weiwei GuoXiao-Wei LiuSida WangHuiji GaoAnanth SankarZimeng YangQi GuoLiang ZhangBo LongBee-Chung ChenDeepak Agarwal
2020-08-06
aschern at SemEval-2020 Task 11: It Takes Three to Tango: RoBERTa, CRF, and Transfer Learning
| Anton ChernyavskiyDmitry IlvovskyPreslav Nakov
2020-08-06
6VecLM: Language Modeling in Vector Space for IPv6 Target Generation
Tianyu CuiGang XiongGaopeng GouJunzheng ShiWei Xia
2020-08-05
Land Use and Land Cover Classification using a Human Group based Particle Swarm Optimization Algorithm with a LSTM classifier on hybrid-pre-processing Remote Sensing Images
T. KowsalyaS. L. UlloC. ZarroK. L. HemalathaB. D. Parameshachari
2020-08-04
Taking Notes on the Fly Helps BERT Pre-training
Qiyu WuChen XingYatao LiGuolin KeDi HeTie-Yan Liu
2020-08-04
Learning from a Complementary-label Source Domain: Theory and Algorithms
| Yiyang ZhangFeng LiuZhen FangBo YuanGuangquan ZhangJie Lu
2020-08-04
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer
| Hwijeen AhnJimin SunChan Young ParkJungyun Seo
2020-08-04
Hierarchical Context Embedding for Region-based Object Detection
Zhao-Min ChenXin JinBorui ZhaoXiu-Shen WeiYanwen Guo
2020-08-04
The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-composed Music through Quantitative Measures
| Shih-Lun WuYi-Hsuan Yang
2020-08-04
Automatic Composition of Guitar Tabs by Transformers and Groove Modeling
Yu-Hua ChenYu-Hsiang HuangWen-Yi HsiaoYi-Hsuan Yang
2020-08-04
I-AID: Identifying Actionable Information from Disaster-related Tweets
Hamada M. ZaheraRricha JalotaMohamed A. SherifAxel N. Ngomo
2020-08-04
Improving One-stage Visual Grounding by Recursive Sub-query Construction
| Zhengyuan YangTianlang ChenLi-Wei WangJiebo Luo
2020-08-03
Rethinking Image Deraining via Rain Streaks and Vapors
Yinglong WangYibing SongChao MaBing Zeng
2020-08-03
[email protected] at SemEval-2020 Task 12: Multilingual or language-specific BERT?
Marc PàmiesEmily ÖhmanKaisla KajavaJörg Tiedemann
2020-08-03
Audiovisual Speech Synthesis using Tacotron2
Ahmed Hussen AbdelazizAnushree Prasanna KumarChloe SeivwrightGabriele FanelliJustin BinderYannis StylianouSachin Kajarekar
2020-08-03
Self-attention encoding and pooling for speaker recognition
Pooyan SafariMiquel IndiaJavier Hernando
2020-08-03
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space
Liu YangFanqi MengMing-Kuang Daniel WuVicent YingXianchao Xu
2020-08-02
The Chess Transformer: Mastering Play using Generative Language Models
David NoeverMatt CiolinoJosh Kalin
2020-08-02
Trojaning Language Models for Fun and Profit
Xinyang ZhangZheng ZhangTing Wang
2020-08-01
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang LinXin LiGennady Pekhimenko
2020-08-01
On Learning Universal Representations Across Languages
Xiangpeng WeiYue HuRongxiang WengLuxi XingHeng YuWeihua Luo
2020-07-31
A Novel Global Spatial Attention Mechanism in Convolutional Neural Network for Medical Image Classification
Linchuan XuJun HuangAtsushi NitandaRyo AsaokaKenji Yamanishi
2020-07-31
Resist : Reconstruction of irises from templates
Sohaib AhmadBenjamin Fuller
2020-07-31
Language Modelling for Source Code with Transformer-XL
| Thomas DowdellHongyu Zhang
2020-07-31
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu GuRobert TinnHao ChengMichael LucasNaoto UsuyamaXiaodong LiuTristan NaumannJianfeng GaoHoifung Poon
2020-07-31
TweepFake: about Detecting Deepfake Tweets
Tiziano FagniFabrizio FalchiMargherita GambiniAntonio MartellaMaurizio Tesconi
2020-07-31
Object Detection and Tracking Algorithms for Vehicle Counting: A Comparative Analysis
Vishal MandalYaw Adu-Gyamfi
2020-07-31
Model Reduction of Shallow CNN Model for Reliable Deployment of Information Extraction from Medical Reports
Abhishek K DubeyAlina PelusoJacob HinkleDevanshu AgarawalZilong Tan
2020-07-31
LevelSet R-CNN: A Deep Variational Method for Instance Segmentation
Namdar HomayounfarYuwen XiongJustin LiangWei-Chiu MaRaquel Urtasun
2020-07-30
Rethinking Recurrent Neural Networks and other Improvements for Image Classification
| Nguyen Huu PhongBernardete Ribeiro
2020-07-30
Deep Multi-View Spatiotemporal Virtual Graph Neural Network for Significant Citywide Ride-hailing Demand Prediction
Guangyin JinZhexu XiHengyu ShaYanghe FengJincai Huang
2020-07-30
Improving Sample Efficiency with Normalized RBF Kernels
| Sebastian Pineda-ArangoDavid Obando-PaniaguaAlperen DedeogluPhilip KurzendörferFriedemann SchestagRandolf Scholz
2020-07-30
Instance Selection for GANs
Terrance DeVriesMichal DrozdzalGraham W. Taylor
2020-07-30
What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation
| Gustavo PenhaClaudia Hauff
2020-07-30
Interpretable Contextual Team-aware Item Recommendation: Application in Multiplayer Online Battle Arena Games
| Andrés VillaVladimir AraujoFrancisca CattanDenis Parra
2020-07-30
Depressive, Drug Abusive, or Informative: Knowledge-aware Study of News Exposure during COVID-19 Outbreak
Amanuel AlamboManas GaurKrishnaprasad Thirunarayan
2020-07-30
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
| Shayne LongpreYi LuJoachim Daiber
2020-07-30
Reliable Tuberculosis Detection using Chest X-ray with Deep Learning, Segmentation and Visualization
Tawsifur RahmanAmith KhandakarMuhammad Abdul KadirKhandaker R. IslamKhandaker F. IslamRashid MazharTahir HamidMohammad T. IslamZaid B. MahbubMohamed Arselene AyariMuhammad E. H. Chowdhury
2020-07-29
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
| TJ TsaiKevin Ji
2020-07-29
Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation
| Yiyang ZhangFeng LiuZhen FangBo YuanGuangquan ZhangJie Lu
2020-07-29
Improving Results on Russian Sentiment Datasets
| Anton GolubevNatalia Loukachevitch
2020-07-28
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models
| Martin FajcikJosef JonMartin DocekalPavel Smrz
2020-07-28
Variants of BERT, Random Forests and SVM approach for Multimodal Emotion-Target Sub-challenge
Hoang Manh HungHyung-Jeong YangSoo-Hyung KimGuee-Sang Lee
2020-07-28
GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection
Sajad SotudehTong XiangHao-Ren YaoSean MacAvaneyEugene YangNazli GoharianOphir Frieder
2020-07-28
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets
Manoel Veríssimo dos Santos NetoAyrton Denner da Silva AmaralNádia Félix Felipe da SilvaAnderson da Silva Soares
2020-07-28
TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling
Shuai ZhangPeng ZhangXindian MaJunqiu WeiNingning WangQun Liu
2020-07-28
From Sound Representation to Model Robustness
Mohammad EsmaeilpourPatrick CardinalAlessandro Lameiras Koerich
2020-07-27
Two-Level Residual Distillation based Triple Network for Incremental Object Detection
Dongbao YangYu ZhouDayan WuCan MaFei YangWeiping Wang
2020-07-27
Contraction Mapping of Feature Norms for Classifier Learning on the Data with Different Quality
Weihua LiuXiabi LiuMurong WangLing Ma
2020-07-27
Receptive-Field Regularized CNNs for Music Classification and Tagging
| Khaled KoutiniHamid Eghbal-zadehVerena HaunschmidPaul PrimusShreyan ChowdhuryGerhard Widmer
2020-07-27
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media
| Ali SafayaMoutasem AbdullatifDeniz Yuret
2020-07-26
Detection and Annotation of Plant Organs from Digitized Herbarium Scans using Deep Learning
Sohaib YounisMarco SchmidtClaus WeilandStefan DresslerBernhard SeegerThomas Hickler
2020-07-26
MACU-Net Semantic Segmentation from High-Resolution Remote Sensing Images
| Rui LiChenxi DuanShunyi Zheng
2020-07-26
Reed at SemEval-2020 Task 9: Fine-Tuning and Bag-of-Words Approaches to Code-Mixed Sentiment Analysis
Vinay GopalanMark Hopkins
2020-07-26
To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection
Aparna BalagopalanBenjamin EyreFrank RudziczJekaterina Novikova
2020-07-26
FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings
| Bertelt BraaksmaRichard ScholtensStan van SuijlekomRemy WangAhmet Üstün
2020-07-24
MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning
| Aina Garí SolerMarianna Apidianaki
2020-07-24
Counting Fish and Dolphins in Sonar Images Using Deep Learning
Stefan SchneiderAlex Zhuang
2020-07-24
A Study on Evaluation Standard for Automatic Crack Detection Regard the Random Fractal
Hongyu LiJihe WangYu ZhangZi-Rui WangTiejun Wang
2020-07-23
The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation
| Tao WangYu LiBingyi KangJunnan LiJunhao LiewSheng TangSteven HoiJiashi Feng
2020-07-23
Regularization of Building Boundaries in Satellite Images using Adversarial and Regularized Losses
Stefano ZorziFriedrich Fraundorfer
2020-07-23
WeightNet: Revisiting the Design Space of Weight Networks
| Ningning MaXiangyu ZhangJiawei HuangJian Sun
2020-07-23
Product Title Generation for Conversational Systems using BERT
Mansi Ranjit ManeShashank KediaAditya ManthaStephen GuoKannan Achan
2020-07-23
PP-YOLO: An Effective and Efficient Implementation of Object Detector
| Xiang LongKaipeng DengGuanzhong WangYang ZhangQingqing DangYuan GaoHui ShenJianguo RenShumin HanErrui DingShilei Wen
2020-07-23
The Lottery Ticket Hypothesis for Pre-trained BERT Networks
| Tianlong ChenJonathan FrankleShiyu ChangSijia LiuYang ZhangZhangyang WangMichael Carbin
2020-07-23
Exploring Swedish & English fastText Embeddings with the Transformer
| Tosin P. AdewumiFoteini LiwickiMarcus Liwicki
2020-07-23
Deep Variational Instance Segmentation
Jialin YuanChao ChenLi Fuxin
2020-07-22
CrossTransformers: spatially-aware few-shot transfer
Carl DoerschAnkush GuptaAndrew Zisserman
2020-07-22
DEAL: Deep Evidential Active Learning for Image Classification
| Patrick HemmerNiklas KühlJakob Schöffer
2020-07-22
IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings
Vishal KeswaniSakshi SinghAshutosh Modi
2020-07-22
Rethinking CNN Models for Audio Classification
| Kamalesh PalanisamyDipika SinghaniaAngela Yao
2020-07-22
Analogical Reasoning for Visually Grounded Language Acquisition
Bo WuHaoyu QinAlireza ZareianCarl VondrickShih-Fu Chang
2020-07-22
Multi-task learning for natural language processing in the 2020s: where are we going?
Joseph WorshamJugal Kalita
2020-07-22
SliceOut: Training Transformers and CNNs faster while using less memory
Pascal NotinAidan N. GomezJoanna YooYarin Gal
2020-07-21
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches
| Karishma LaudJagriti SinghRandeep Kumar SahuAshutosh Modi
2020-07-21
newsSweeper at SemEval-2020 Task 11: Context-Aware Rich Feature Representations For Propaganda Classification
| Paramansh SinghSiraj SandhuSubham KumarAshutosh Modi
2020-07-21
Balanced Meta-Softmax for Long-Tailed Visual Recognition
Jiawei RenCunjun YuShunan ShengXiao MaHaiyu ZhaoShuai YiHongsheng Li
2020-07-21
Neural Machine Translation with Error Correction
| Kaitao SongXu TanJianfeng Lu
2020-07-21
Word Representation for Rhythms
| Tongyu LuLyucheng YanGus Xia
2020-07-21
Understanding BERT Rankers Under Distillation
Luyu GaoZhuyun DaiJamie Callan
2020-07-21
Self-supervised Feature Learning via Exploiting Multi-modal Data for Retinal Disease Diagnosis
| Xiaomeng LiMengyu JiaMd Tauhidul IslamLequan YuLei Xing
2020-07-21
Interpolating GANs to Scaffold Autotelic Creativity
Ziv EpsteinOcéane BoulaisSkylar GordonMatt Groh
2020-07-21
A Short Note on Soft-max and Policy Gradients in Bandits Problems
Neil Walton
2020-07-20
A Comparison of Supervised Learning to Match Methods for Product Search
| Fatemeh SarviNikos VoskaridesLois MooimanSebastian SchelterMaarten de Rijke
2020-07-20
Learning Joint Spatial-Temporal Transformations for Video Inpainting
| Yanhong ZengJianlong FuHongyang Chao
2020-07-20
NPCFace: A Negative-Positive Cooperation Supervision for Training Large-scale Face Recognition
Dan ZengHailin ShiHang DuJun WangZhen LeiTao Mei
2020-07-20
Learning Sparse Filters in Deep Convolutional Neural Networks with a l1/l2 Pseudo-Norm
Anthony BerthelierYongzhe YanThierry ChateauChristophe BlancStefan DuffnerChristophe Garcia
2020-07-20
Lagrangian Duality in Reinforcement Learning
Pranay Pasula
2020-07-20
Effects of Approximate Multiplication on Convolutional Neural Networks
Min Soo KimAlberto A. Del BarrioHyunJin KimNader Bagherzadeh
2020-07-20
Conformer-Kernel with Query Term Independence for Document Retrieval
| Bhaskar MitraSebastian HofstatterHamed ZamaniNick Craswell
2020-07-20
Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes
| Jake SnellRichard Zemel
2020-07-20
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks
Hassan DboukHetul SanghviMahesh MehendaleNaresh Shanbhag
2020-07-19
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks
| Diego de Vargas FeijoViviane Pereira Moreira
2020-07-19
Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit
| Emma RocheteauPietro LiòStephanie Hyland
2020-07-18
Feature Pyramid Transformer
| Dong ZhangHanwang ZhangJinhui TangMeng WangXiansheng HuaQianru Sun
2020-07-18
Multi-Scale Positive Sample Refinement for Few-Shot Object Detection
| Jiaxi WuSongtao LiuDi HuangYunhong Wang
2020-07-18
Generative Pretraining from Pixels
| Mark ChenAlec RadfordRewon ChildJeff WuHeewoo JunPrafulla DhariwalDavid LuanIlya Sutskever
2020-07-17
Boundary-preserving Mask R-CNN
| Tianheng ChengXinggang WangLichao HuangWenyu Liu
2020-07-17
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection
K. V. KadambariVishnu Vardhan Nimmalapudi
2020-07-17
CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition
Ludwig KürzingerDominik WinkelbauerLujun LiTobias WatzelGerhard Rigoll
2020-07-17
Multi-Perspective Semantic Information Retrieval in the Biomedical Domain
Samarth Rawal