Dense Connections

Dense Connections, or Fully Connected Connections, are a type of layer in a deep neural network that use a linear operation where every input is connected to every output by a weight. This means there are $n_{\text{inputs}}*n_{\text{outputs}}$ parameters, which can lead to a lot of parameters for a sizeable network.

$$h_{l} = g\left(\textbf{W}^{T}h_{l-1}\right)$$

where $g$ is an activation function.

Image Source: Deep Learning by Goodfellow, Bengio and Courville

Latest Papers

PAPER DATE
Structured Convolutions for Efficient Neural Network Design
Yash BhalgatYizhe ZhangJamie LinFatih Porikli
2020-08-06
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
| Patrick LewisPontus StenetorpSebastian Riedel
2020-08-06
6VecLM: Language Modeling in Vector Space for IPv6 Target Generation
Tianyu CuiGang XiongGaopeng GouJunzheng ShiWei Xia
2020-08-05
Stabilizing Deep Tomographic Reconstruction Networks
Weiwen WuDianlin HuShaoyu WangHengyong YuVarut VardhanabhutiGe Wang
2020-08-04
NLPDove at SemEval-2020 Task 12: Improving Offensive Language Detection with Cross-lingual Transfer
Hwijeen AhnJimin SunChan Young ParkJungyun Seo
2020-08-04
Taking Notes on the Fly Helps BERT Pre-training
Qiyu WuChen XingYatao LiGuolin KeDi HeTie-Yan Liu
2020-08-04
Automatic Composition of Guitar Tabs by Transformers and Groove Modeling
Yu-Hua ChenYu-Hsiang HuangWen-Yi HsiaoYi-Hsuan Yang
2020-08-04
The Jazz Transformer on the Front Line: Exploring the Shortcomings of AI-composed Music through Quantitative Measures
| Shih-Lun WuYi-Hsuan Yang
2020-08-04
Learning from a Complementary-label Source Domain: Theory and Algorithms
Yiyang ZhangFeng LiuZhen FangBo YuanGuangquan ZhangJie Lu
2020-08-04
Land Use and Land Cover Classification using a Human Group based Particle Swarm Optimization Algorithm with a LSTM classifier on hybrid-pre-processing Remote Sensing Images
T. KowsalyaS. L. UlloC. ZarroK. L. HemalathaB. D. Parameshachari
2020-08-04
A Spectral Energy Distance for Parallel Speech Synthesis
Alexey A. GritsenkoTim SalimansRianne van den BergJasper SnoekNal Kalchbrenner
2020-08-03
Self-attention encoding and pooling for speaker recognition
Pooyan SafariMiquel IndiaJavier Hernando
2020-08-03
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
Elad RichardsonYuval AlalufOr PatashnikYotam NitzanYaniv AzarStav ShapiroDaniel Cohen-Or
2020-08-03
Improving One-stage Visual Grounding by Recursive Sub-query Construction
| Zhengyuan YangTianlang ChenLiwei WangJiebo Luo
2020-08-03
QPLEX: Duplex Dueling Multi-Agent Q-Learning
Jianhao WangZhizhou RenTerry LiuYang YuChongjie Zhang
2020-08-03
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech
Tomáš NekvindaOndřej Dušek
2020-08-03
[email protected] at SemEval-2020 Task 12: Multilingual or language-specific BERT?
Marc PàmiesEmily ÖhmanKaisla KajavaJörg Tiedemann
2020-08-03
Rethinking Image Deraining via Rain Streaks and Vapors
Yinglong WangYibing SongChao MaBing Zeng
2020-08-03
SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space
Liu YangFanqi MengMing-Kuang Daniel WuVicent YingXianchao Xu
2020-08-02
Multi-node Bert-pretraining: Cost-efficient Approach
Jiahuang LinXin LiGennady Pekhimenko
2020-08-01
Trojaning Language Models for Fun and Profit
Xinyang ZhangZheng ZhangTing Wang
2020-08-01
Model Reduction of Shallow CNN Model for Reliable Deployment of Information Extraction from Medical Reports
Abhishek K DubeyAlina PelusoJacob HinkleDevanshu AgarawalZilong Tan
2020-07-31
TweepFake: about Detecting Deepfake Tweets
Tiziano FagniFabrizio FalchiMargherita GambiniAntonio MartellaMaurizio Tesconi
2020-07-31
Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing
Yu GuRobert TinnHao ChengMichael LucasNaoto UsuyamaXiaodong LiuTristan NaumannJianfeng GaoHoifung Poon
2020-07-31
Resist : Reconstruction of irises from templates
Sohaib AhmadBenjamin Fuller
2020-07-31
Language Modelling for Source Code with Transformer-XL
| Thomas DowdellHongyu Zhang
2020-07-31
A Novel Global Spatial Attention Mechanism in Convolutional Neural Network for Medical Image Classification
Linchuan XuJun HuangAtsushi NitandaRyo AsaokaKenji Yamanishi
2020-07-31
On Learning Universal Representations Across Languages
Xiangpeng WeiYue HuRongxiang WengLuxi XingHeng YuWeihua Luo
2020-07-31
Deep Multi-View Spatiotemporal Virtual Graph Neural Network for Significant Citywide Ride-hailing Demand Prediction
Guangyin JinZhexu XiHengyu ShaYanghe FengJincai Huang
2020-07-30
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
Jinhyeok YangJunmo LeeYoungik KimHoonyoung ChoInjung Kim
2020-07-30
Interpretable Contextual Team-aware Item Recommendation: Application in Multiplayer Online Battle Arena Games
Andrés VillaVladimir AraujoFrancisca CattanDenis Parra
2020-07-30
Instance Selection for GANs
Terrance DeVriesMichal DrozdzalGraham W. Taylor
2020-07-30
What does BERT know about books, movies and music? Probing BERT for Conversational Recommendation
| Gustavo PenhaClaudia Hauff
2020-07-30
Depressive, Drug Abusive, or Informative: Knowledge-aware Study of News Exposure during COVID-19 Outbreak
Amanuel AlamboManas GaurKrishnaprasad Thirunarayan
2020-07-30
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering
Shayne LongpreYi LuJoachim Daiber
2020-07-30
Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation
| Yiyang ZhangFeng LiuZhen FangBo YuanGuangquan ZhangJie Lu
2020-07-29
Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining
TJ TsaiKevin Ji
2020-07-29
TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling
Shuai ZhangPeng ZhangXindian MaJunqiu Weiningning WangQun Liu
2020-07-28
Deep Learning Brasil -- NLP at SemEval-2020 Task 9: Overview of Sentiment Analysis of Code-Mixed Tweets
Manoel Veríssimo dos Santos NetoAyrton Denner da Silva AmaralNádia Félix Felipe da SilvaAnderson da Silva Soares
2020-07-28
Variants of BERT, Random Forests and SVM approach for Multimodal Emotion-Target Sub-challenge
Hoang Manh HungHyung-Jeong YangSoo-Hyung KimGuee-Sang Lee
2020-07-28
Improving Results on Russian Sentiment Datasets
| Anton GolubevNatalia Loukachevitch
2020-07-28
BUT-FIT at SemEval-2020 Task 5: Automatic detection of counterfactual statements with deep pre-trained language representation models
Martin FajcikJosef JonMartin DocekalPavel Smrz
2020-07-28
GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection
Sajad SotudehTong XiangHao-Ren YaoSean MacAvaneyEugene YangNazli GoharianOphir Frieder
2020-07-28
From Sound Representation to Model Robustness
Mohammad EsmaeilpourPatrick CardinalAlessandro Lameiras Koerich
2020-07-27
YOLOpeds: Efficient Real-Time Single-Shot Pedestrian Detection for Smart Camera Applications
Christos Kyrkou
2020-07-27
Receptive-Field Regularized CNNs for Music Classification and Tagging
Khaled KoutiniHamid Eghbal-ZadehVerena HaunschmidPaul PrimusShreyan ChowdhuryGerhard Widmer
2020-07-27
To BERT or Not To BERT: Comparing Speech and Language-based Approaches for Alzheimer's Disease Detection
Aparna BalagopalanBenjamin EyreFrank RudziczJekaterina Novikova
2020-07-26
KUISAIL at SemEval-2020 Task 12: BERT-CNN for Offensive Speech Identification in Social Media
| Ali SafayaMoutasem AbdullatifDeniz Yuret
2020-07-26
MACU-Net Semantic Segmentation from High-Resolution Remote Sensing Images
Rui LiChenxi DuanShunyi Zheng
2020-07-26
Reed at SemEval-2020 Task 9: Sentiment Analysis on Code-Mixed Tweets
Vinay GopalanMark Hopkins
2020-07-26
FiSSA at SemEval-2020 Task 9: Fine-tuned For Feelings
| Bertelt BraaksmaRichard ScholtensStan van SuijlekomRemy WangAhmet Üstün
2020-07-24
MULTISEM at SemEval-2020 Task 3: Fine-tuning BERT for Lexical Meaning
Aina Garí SolerMarianna Apidianaki
2020-07-24
Counting Fish and Dolphins in Sonar Images Using Deep Learning
Stefan SchneiderAlex Zhuang
2020-07-24
Exploring Swedish & English fastText Embeddings with the Transformer
| Tosin P. AdewumiFoteini LiwickiMarcus Liwicki
2020-07-23
WeightNet: Revisiting the Design Space of Weight Networks
| Ningning MaXiangyu ZhangJiawei HuangJian Sun
2020-07-23
Product Title Generation for Conversational Systems using BERT
Mansi Ranjit ManeShashank KediaAditya ManthaStephen GuoKannan Achan
2020-07-23
PareCO: Pareto-aware Channel Optimization for Slimmable Neural Networks
Ting-Wu ChinAri S. MorcosDiana Marculescu
2020-07-23
The Lottery Ticket Hypothesis for Pre-trained BERT Networks
| Tianlong ChenJonathan FrankleShiyu ChangSijia LiuYang ZhangZhangyang WangMichael Carbin
2020-07-23
Multi-task learning for natural language processing in the 2020s: where are we going?
Joseph WorshamJugal Kalita
2020-07-22
CrossTransformers: spatially-aware few-shot transfer
Carl DoerschAnkush GuptaAndrew Zisserman
2020-07-22
IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings
Vishal KeswaniSakshi SinghAshutosh Modi
2020-07-22
Rethinking CNN Models for Audio Classification
Kamalesh PalanisamyDipika SinghaniaAngela Yao
2020-07-22
Analogical Reasoning for Visually Grounded Language Acquisition
Bo WuHaoyu QinAlireza ZareianCarl VondrickShih-Fu Chang
2020-07-22
SliceOut: Training Transformers and CNNs faster while using less memory
Pascal NotinAidan N. GomezJoanna YooYarin Gal
2020-07-21
Neural Machine Translation with Error Correction
Kaitao SongXu TanJianfeng Lu
2020-07-21
problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches
Karishma LaudJagriti SinghRandeep Kumar SahuAshutosh Modi
2020-07-21
newsSweeper at SemEval-2020 Task 11: Context-Aware Rich Feature Representations For Propaganda Classification
| Paramansh SinghSiraj SandhuSubham KumarAshutosh Modi
2020-07-21
Word Representation for Rhythms
Tongyu LuLyucheng YanGus Xia
2020-07-21
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed GhasemipourDale SchuurmansShixiang Shane Gu
2020-07-21
Understanding BERT Rankers Under Distillation
Luyu GaoZhuyun DaiJamie Callan
2020-07-21
Interpolating GANs to Scaffold Autotelic Creativity
Ziv EpsteinOcéane BoulaisSkylar GordonMatt Groh
2020-07-21
Learning Joint Spatial-Temporal Transformations for Video Inpainting
| Yanhong ZengJianlong FuHongyang Chao
2020-07-20
A Comparison of Supervised Learning to Match Methods for Product Search
| Fatemeh SarviNikos VoskaridesLois MooimanSebastian SchelterMaarten de Rijke
2020-07-20
Learning Sparse Filters in Deep Convolutional Neural Networks with a l1/l2 Pseudo-Norm
Anthony BerthelierYongzhe YanThierry ChateauChristophe BlancStefan DuffnerChristophe Garcia
2020-07-20
Lagrangian Duality in Reinforcement Learning
Pranay Pasula
2020-07-20
Conformer-Kernel with Query Term Independence for Document Retrieval
| Bhaskar MitraSebastian HofstatterHamed ZamaniNick Craswell
2020-07-20
Detection, Attribution and Localization of GAN Generated Images
Michael GoebelLakshmanan NatarajTejaswi NanjundaswamyTajuddin Manhar MohammedShivkumar ChandrasekaranB. S. Manjunath
2020-07-20
Generative Hierarchical Features from Synthesizing Images
| Yinghao XuYujun ShenJiapeng ZhuCeyuan YangBolei Zhou
2020-07-20
Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks
Diego de Vargas FeijoViviane Pereira Moreira
2020-07-19
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks
Hassan DboukHetul SanghviMahesh MehendaleNaresh Shanbhag
2020-07-19
Temporal Pointwise Convolutional Networks for Length of Stay Prediction in the Intensive Care Unit
| Emma RocheteauPietro LiòStephanie Hyland
2020-07-18
Feature Pyramid Transformer
Dong ZhangHanwang ZhangJinhui TangMeng WangXiansheng HuaQianru Sun
2020-07-18
Multi-Perspective Semantic Information Retrieval in the Biomedical Domain
Samarth Rawal
2020-07-17
Generative Pretraining from Pixels
| Mark ChenAlec RadfordRewon ChildJeff WuHeewoo JunPrafulla DhariwalDavid LuanIlya Sutskever
2020-07-17
Region-based Non-local Operation for Video Classification
Guoxi HuangAdrian G. Bors
2020-07-17
Deep Learning Based Traffic Surveillance System For Missing and Suspicious Car Detection
K. V. KadambariVishnu Vardhan Nimmalapudi
2020-07-17
Wavelet Channel Attention Module with a Fusion Network for Single Image Deraining
Hao-Hsiang YangChao-Han Huck YangYu-Chiang Frank Wang
2020-07-17
CTC-Segmentation of Large Corpora for German End-to-end Speech Recognition
Ludwig KürzingerDominik WinkelbauerLujun LiTobias WatzelGerhard Rigoll
2020-07-17
Hopfield Networks is All You Need
| Hubert RamsauerBernhard SchäflJohannes LehnerPhilipp SeidlMichael WidrichLukas GruberMarkus HolzleitnerMilena PavlovićGeir Kjetil SandveVictor GreiffDavid KreilMichael KoppGünter KlambauerJohannes BrandstetterSepp Hochreiter
2020-07-16
Mixture of Step Returns in Bootstrapped DQN
Po-Han ChiangHsuan-Kung YangZhang-Wei HongChun-Yi Lee
2020-07-16
Investigating Pretrained Language Models for Graph-to-Text Generation
Leonardo F. R. RibeiroMartin SchmittHinrich SchützeIryna Gurevych
2020-07-16
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-16
EfficientHRNet: Efficient Scaling for Lightweight High-Resolution Multi-Person Pose Estimation
Christopher NeffAneri ShethSteven FurgursonHamed Tabkhi
2020-07-16
Translate Reverberated Speech to Anechoic Ones: Speech Dereverberation with BERT
Yang Jiao
2020-07-16
Real-time Surface Deformation Recovery from Stereo Videos
Haoyin ZhouJagadeesan Jayender
2020-07-16
Collision Avoidance Robotics Via Meta-Learning (CARML)
Abhiram IyerAravind Mahadevan
2020-07-16
Human-like Energy Management Based on Deep Reinforcement Learning and Historical Driving Experiences
Teng LiuXiaolin TangXiaosong HuWenhao TanJinwei Zhang
2020-07-16
Attention as Activation
| Yimian DaiStefan OehmckeYiquan WuKobus Barnard
2020-07-15
Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent
Bowen WengHuaqing XiongYingbin LiangWei Zhang
2020-07-15
AdapterHub: A Framework for Adapting Transformers
| Jonas PfeifferAndreas RückléClifton PothAishwarya KamathIvan VulićSebastian RuderKyunghyun ChoIryna Gurevych
2020-07-15
Multimodal Word Sense Disambiguation in Creative Practice
Manuel Ladron de GuevaraChristopher GeorgeAkshat GuptaDaragh ByrneRamesh Krishnamurti
2020-07-15
Finding Non-Uniform Quantization Schemes using Multi-Task Gaussian Processes
Marcelo Gennari do NascimentoTheo W. CostainVictor Adrian Prisacariu
2020-07-15
Logic Constrained Pointer Networks for Interpretable Textual Similarity
| Subhadeep MajiRohan KumarManish BansalKalyani RoyPawan Goyal
2020-07-15
Predicting Clinical Diagnosis from Patients Electronic Health Records Using BERT-based Neural Networks
Pavel BlinovManvel AvetisianVladimir KokhDmitry UmerenkovAlexander Tuzhilin
2020-07-15
Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media
Alberto Barron-CedenoTamer ElsayedPreslav NakovGiovanni Da San MartinoMaram HasanainReem SuwailehFatima HaouariNikolay BabulkovBayan HamdanAlex NikolovShaden ShaarZien Sheikh Ali
2020-07-15
Deep Reinforced Query Reformulation for Information Retrieval
Xiao WangCraig MacdonaldIadh Ounis
2020-07-15
The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction
Alice MartinCharles OllionFlorian StrubSylvain Le CorffOlivier Pietquin
2020-07-15
Fast and Accurate Neural CRF Constituency Parsing
| Yu ZhangHouquan ZhouZhenghua Li
2020-07-14
MFRNet: A New CNN Architecture for Post-Processing and In-loop Filtering
Di MaFan ZhangDavid R. Bull
2020-07-14
Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR
Balázs TarjánGyörgy SzaszákTibor FegyóPéter Mihajlik
2020-07-14
Contextualized Code Representation Learning for Commit Message Generation
Lun Yiu NieCuiyun GaoZhicong ZhongWai LamYang LiuZenglin Xu
2020-07-14
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-14
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu TuGarima LalwaniSpandana GellaHe He
2020-07-14
Can neural networks acquire a structural bias from raw linguistic data?
Alex WarstadtSamuel R. Bowman
2020-07-14
Emoji Prediction: Extensions and Benchmarking
Weicheng MaRuibo LiuLili WangSoroush Vosoughi
2020-07-14
Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Shauharda KhadkaEstelle AflaloMattias MarderAvrech Ben-DavidSantiago MiretHanlin TangShie MannorTamir HazanSomdeb Majumdar
2020-07-14
Add a SideNet to your MainNet
Adrien Morisot
2020-07-14
Paranoid Transformer: Reading Narrative of Madness as Computational Approach to Creativity
Yana AgafonovaAlexey TikhonovIvan P. Yamshchikov
2020-07-13
Transformer with Depth-Wise LSTM
Hongfei XuQiuhui LiuDeyi XiongJosef van Genabith
2020-07-13
An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets
Aarzoo DhimanDurga Toshniwal
2020-07-13
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
Aditya MogadalaMarius MosbachDietrich Klakow
2020-07-12
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech
| Andy T. LiuShang-Wen LiHung-yi Lee
2020-07-12
Locality Guided Neural Networks for Explainable Artificial Intelligence
Randy TanNaimul KhanLing Guan
2020-07-12
HyperGrid: Efficient Multi-Task Transformers with Grid-wise Decomposable Hyper Projections
Yi TayZhe ZhaoDara BahriDonald MetzlerDa-Cheng Juan
2020-07-12
BERT Learns (and Teaches) Chemistry
Josh PayneMario SroujiDian Ang YapVineet Kosaraju
2020-07-11
Simulating multi-exit evacuation using deep reinforcement learning
Dong XuXiao HuangJoseph MangoXiang LiZhenlong Li
2020-07-11
To filter prune, or to layer prune, that is the question
Sara ElkerdawyMostafa ElhoushiAbhineet SinghHong ZhangNilanjan Ray
2020-07-11
Sequence Generation with Mixed Representations
Lijun Wu Shufang Xie Yingce Xia Fan Yang Tao Qin Jianhuang Lai Tie-Yan Liu
2020-07-11
Generative Graph Perturbations for Scene Graph Prediction
Boris KnyazevHarm de VriesCătălina CangeaGraham W. TaylorAaron CourvilleEugene Belilovsky
2020-07-11
BISON:BM25-weighted Self-Attention Framework for Multi-Fields Document Search
Xuan ShanChuanjie LiuYiqian XiaQi ChenYusi ZhangAngen LuoYuxiang Luo
2020-07-10
To BAN or not to BAN: Bayesian Attention Networks for Reliable Hate Speech Detection
Kristian MiokBlaz SkrljDaniela ZaharieMarko Robnik-Sikonja
2020-07-10
Biological credit assignment through dynamic inversion of feedforward networks
William F. PodlaskiChristian K. Machens
2020-07-10
Multi-Dialect Arabic BERT for Country-Level Dialect Identification
| Bashar TalafhaMohammad AliMuhy Eddin Za'terHaitham SeelawiIbraheem TuffahaMostafa SamirWael FarhanHussein T. Al-Natsheh
2020-07-10
Blockchain-Federated-Learning and Deep Learning Models for COVID-19 detection using CT Imaging
Rajesh KumarAbdullah Aman KhanSinmin ZhangWenYong WangYousif AbuidrisWaqas AminJay Kumar
2020-07-10
Contrastive Code Representation Learning
| Paras JainAjay JainTianjun ZhangPieter AbbeelJoseph E. GonzalezIon Stoica
2020-07-09
Fast Transformers with Clustered Attention
| Apoorv VyasAngelos KatharopoulosFrançois Fleuret
2020-07-09
DeepSinger: Singing Voice Synthesis with Data Mined From the Web
Yi RenXu TanTao QinJian LuanZhou ZhaoTie-Yan Liu
2020-07-09
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
| Kimin LeeMichael LaskinAravind SrinivasPieter Abbeel
2020-07-09
Single architecture and multiple task deep neural network for altered fingerprint analysis
Oliver GiudiceMattia LitricoSebastiano Battiato
2020-07-09
Advances of Transformer-Based Models for News Headline Generation
| Alexey BukhtiyarovIlya Gusev
2020-07-09
Self-Supervised Policy Adaptation during Deployment
| Nicklas HansenYu SunPieter AbbeelAlexei A. EfrosLerrel PintoXiaolong Wang
2020-07-08
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads
Siyu WangYi RongShiqing FanZhen ZhengLanSong DiaoGuoping LongJun YangXiaoyong LiuWei Lin
2020-07-08
NVAE: A Deep Hierarchical Variational Autoencoder
Arash VahdatJan Kautz
2020-07-08
Journey Towards Tiny Perceptual Super-Resolution
Royson LeeŁukasz DudziakMohamed AbdelfattahStylianos I. VenierisHyeji KimHongkai WenNicholas D. Lane
2020-07-08
The Go Transformer: Natural Language Modeling for Game Play
Matthew CiolinoDavid NoeverJosh Kalin
2020-07-07
Continual BERT: Continual Learning for Adaptive Extractive Summarization of COVID-19 Literature
Jong Won Park
2020-07-07
Do Transformers Need Deep Long-Range Memory
Jack W. RaeAli Razavi
2020-07-07
Exploring Heterogeneous Information Networks via Pre-Training
Yang FangXiang ZhaoWeidong Xiao
2020-07-07
Can GAN Generated Morphs Threaten Face Recognition Systems Equally as Landmark Based Morphs? -- Vulnerability and Detection
Sushma VenkateshHaoyu ZhangRaghavendra RamachandraKiran RajaNaser DamerChristoph Busch
2020-07-07
Cognitive Radio Network Throughput Maximization with Deep Reinforcement Learning
Kevin Shen Hoong OngYang ZhangDusit Niyato
2020-07-07
A Free Viewpoint Portrait Generator with Dynamic Styling
| Anpei ChenRuiyang LiuLing XieJingyi Yu
2020-07-07
Relevance Transformer: Generating Concise Code Snippets with Relevance Feedback
Carlos GemmellFederico RossettoJeffrey Dalton
2020-07-06
LMVE at SemEval-2020 Task 4: Commonsense Validation and Explanation using Pretraining Language Model
Shilei LiuYu GuoBochao LiFeiliang Ren
2020-07-06
Learning to Segment Anatomical Structures Accurately from One Exemplar
Yuhang LuWeijian LiKang ZhengYirui WangAdam P. HarrisonChihung LinSong WangJing XiaoLe LuChang-Fu KuoShun Miao
2020-07-06
Deep Contextual Embeddings for Address Classification in E-commerce
Shreyas MangalgiLakshya KumarRavindra Babu Tallamraju
2020-07-06
EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning
| Bailin LiBowen WuJiang SuGuangrun WangLiang Lin
2020-07-06
You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion
Roei SchusterCongzheng SongEran TromerVitaly Shmatikov
2020-07-05
Text Data Augmentation: Towards better detection of spear-phishing emails
Mehdi ReginaMaxime MeyerSébastien Goutal
2020-07-04
Robust Prediction of Punctuation and Truecasing for Medical ASR
Monica SunkaraSrikanth RonankiKalpit DixitSravan BodapatiKatrin Kirchhoff
2020-07-04
Language-agnostic BERT Sentence Embedding
| Fangxiaoyu FengYinfei YangDaniel CerNaveen ArivazhaganWei Wang
2020-07-03
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Pavel DenisovNgoc Thang Vu
2020-07-03
Reading Comprehension in Czech via Machine Translation and Cross-lingual Transfer
Kateřina MackováMilan Straka
2020-07-03
Playing with Words at the National Library of Sweden -- Making a Swedish BERT
| Martin MalmstenLove BörjesonChris Haffenden
2020-07-03
MIRA: Leveraging Multi-Intention Co-click Information in Web-scale Document Retrieval using Deep Neural Networks
Yusi ZhangChuanjie LiuAngen LuoHui XueXuan ShanYuxiang LuoYiqian XiaYuanchi YanHaidong Wang
2020-07-03
Abstractive and mixed summarization for long-single documents
Roger BarrullJugal Kalita
2020-07-03
Collaborative Learning for Faster StyleGAN Embedding
Shanyan GuanYing TaiBingbing NiFeida ZhuFeiyue HuangXiaokang Yang
2020-07-03
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-03
Bidirectional Encoder Representations from Transformers (BERT): A sentiment analysis odyssey
Shivaji AlaparthiManit Mishra
2020-07-02
Are there any 'object detectors' in the hidden layers of CNNs trained to identify objects or scenes?
| Ella M. GaleNicholas MartinRyan BlythingAnh NguyenJeffrey S. Bowers
2020-07-02
The Impact of Explanations on AI Competency Prediction in VQA
Kamran AlipourArijit RayXiao LinJurgen P. SchulzeYi YaoGiedrius T. Burachas
2020-07-02
D-NetPAD: An Explainable and Interpretable Iris Presentation Attack Detector
| Renu SharmaArun Ross
2020-07-02
Improving Event Detection using Contextual Word and Sentence Embeddings
Mariano MaisonnaveFernando DelbiancoFernando TohméAna MaguitmanEvangelos Milios
2020-07-02
Decentralized Deep Reinforcement Learning for Network Level Traffic Signal Control
Jin Guo
2020-07-02
Self-Attention Guided Copy Mechanism for Abstractive Summarization
Song XuHaoran LiPeng YuanYouzheng WuXiaodong HeBowen Zhou
2020-07-01
Integrating Multimodal Information in Large Pretrained Transformers
Wasifur RahmanMd Kamrul HasanSangwu LeeAmirAli Bagher ZadehChengfeng MaoLouis-Philippe MorencyEhsan Hoque
2020-07-01
Roles and Utilization of Attention Heads in Transformer-based Neural Language Models
Jae-young JoSung-Hyon Myaeng
2020-07-01
Multimodal Transformer for Multimodal Machine Translation
Shaowei YaoXiaojun Wan
2020-07-01
Paraphrase Generation by Learning How to Edit from Samples
Amirhossein KazemnejadMohammadreza SalehiMahdieh Soleymani Baghshah
2020-07-01
Dependency Graph Enhanced Dual-transformer Structure for Aspect-based Sentiment Classification
Hao TangDonghong JiChenliang LiQiji Zhou
2020-07-01
Do Transformers Need Deep Long-Range Memory?
Jack RaeAli Razavi
2020-07-01
In Neural Machine Translation, What Does Transfer Learning Transfer?
Alham Fikri AjiNikolay BogoychevKenneth HeafieldRico Sennrich
2020-07-01
Feature Projection for Improved Text Classification
Qi QinWenpeng HuBing Liu
2020-07-01
Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
Arya D. McCarthyXian LiJiatao GuNing Dong
2020-07-01
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response Generation
| Yizhe ZhangSiqi SunMichel GalleyYen-Chun ChenChris BrockettXiang GaoJianfeng GaoJingjing LiuBill Dolan
2020-07-01
Combining Subword Representations into Word-level Representations in the Transformer Architecture
Noe CasasMarta R. Costa-juss{\`a}Jos{\'e} A. R. Fonollosa
2020-07-01
Robust Neural Machine Translation with ASR Errors
Haiyang XueYang FengShuhao GuWei Chen
2020-07-01
An empirical investigation of neural methods for content scoring of science explanations
Brian RiordanSarah BichlerAllison BradfordJennifer King ChenKorah WileyLibby GerardMarcia C. Linn
2020-07-01
Neural Transduction of Letter Position Dyslexia using an Anagram Matrix Representation
Avi Bleiweiss
2020-07-01
Detecting Sarcasm in Conversation Context Using Transformer-Based Models
Adithya AvvaruSanath VobilisettyRadhika Mamidi
2020-07-01
Character aware models with similarity learning for metaphor detection
Tarun KumarYashvardhan Sharma
2020-07-01
Metaphor Detection Using Contextual Word Embeddings From Transformers
Jerry LiuNathan O{'}HaraAlex RubinerRachel DraelosCynthia Rudin
2020-07-01
A Transformer Approach to Contextual Sarcasm Detection in Twitter
Hunter GregorySteven LiPouya MohammadiNatalie TarnRachel DraelosCynthia Rudin
2020-07-01
Adaptation of Multilingual Transformer Encoder for Robust Enhanced Universal Dependency Parsing
Han HeJinho D. Choi
2020-07-01
KIT's IWSLT 2020 SLT Translation System
Ngoc-Quan PhamFelix SchneiderTuan-Nam NguyenThanh-Le HaThai Son NguyenMaximilian AwiszusSebastian St{\"u}kerAlex Waibeler
2020-07-01
End-to-End Simultaneous Translation System for IWSLT2020 Using Modality Agnostic Meta-Learning
Hou Jeung HanMohd Abbas ZaidiSathish Reddy IndurthiNikhil Kumar LakumarapuBeomseok LeeSangha Kim
2020-07-01
End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning
Nikhil Kumar LakumarapuBeomseok LeeSathish Reddy IndurthiHou Jeung HanMohd Abbas ZaidiSangha Kim
2020-07-01
SRPOL's System for the IWSLT 2020 End-to-End Speech Translation Task
Tomasz PotapczykPawel Przybysz
2020-07-01
The AFRL IWSLT 2020 Systems: Work-From-Home Edition
Brian OreEric HansenTim AndersonJeremy Gwinnup
2020-07-01
OPPO's Machine Translation System for the IWSLT 2020 Open Domain Translation Task
Qian ZhangXiaopu LiDawei DangTingxun ShiDi AiZhengshan XueJie Hao
2020-07-01
CASIA's System for IWSLT 2020 Open Domain Translation
Qian WangYuchen LiuCong MaYu LuYining WangLong ZhouYang ZhaoJiajun ZhangChengqing Zong
2020-07-01
Deep Blue Sonics' Submission to IWSLT 2020 Open Domain Translation Task
Enmin SuYi Ren
2020-07-01
University of Tsukuba's Machine Translation System for IWSLT20 Open Domain Translation Task
Hongyi CuiYizhen WeiShohei IidaTakehito UtsuroMasaaki Nagata
2020-07-01
Xiaomi's Submissions for IWSLT 2020 Open Domain Translation Task
Yuhui SunMengxue GuoXiang LiJianwei CuiBin Wang
2020-07-01
The HW-TSC Video Speech Translation System at IWSLT 2020
Minghan WangHao YangYao DengYing QinLizhi LeiDaimeng WeiHengchao ShangNing XieXiaochun LiJiaxian Guo
2020-07-01
Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation
Felix SchneiderAlex Waibeler
2020-07-01
Compressing Neural Machine Translation Models with 4-bit Precision
Alham Fikri AjiKenneth Heafield
2020-07-01
Training and Inference Methods for High-Coverage Neural Machine Translation
Michael YangYixin LiuRahul Mayuranath
2020-07-01
Expand and Filter: CUNI and LMU Systems for the WNGT 2020 Duolingo Shared Task
Jind{\v{r}}ich Libovick{\'y}Zden{\v{e}}k KasnerJind{\v{r}}ich HelclOnd{\v{r}}ej Du{\v{s}}ek
2020-07-01
The NiuTrans System for WNGT 2020 Efficiency Task
Chi HuBei LiYinqiao LiYe LinYanyang LiChenglong WangTong XiaoJingbo Zhu
2020-07-01
Efficient and High-Quality Neural Machine Translation with OpenNMT
Guillaume KleinDakun ZhangCl{\'e}ment ChouteauJosep CregoJean Senellart
2020-07-01
Improving Document-Level Neural Machine Translation with Domain Adaptation
Sami Ul HaqSadaf Abdul RaufArslan ShoukatNoor-e- Hira
2020-07-01
CopyBERT: A Unified Approach to Question Generation with Self-Attention
Stalin VaranasiSaadullah AminGuenter Neumann
2020-07-01
How to Tame Your Data: Data Augmentation for Dialog State Tracking
Adam SummervilleJordan HashemiJames Ryanwilliam ferguson
2020-07-01
Methods for Extracting Information from Messages from Primary Care Providers to Specialists
Xiyu DingMichael BarnettAteev MehrotraTimothy Miller
2020-07-01
Generating Medical Reports from Patient-Doctor Conversations Using Sequence-to-Sequence Models
Seppo EnarviMarilisa AmoiaMiguel Del-Agua TebaBrian DelaneyFrank DiehlStefan HahnKristina HarrisLiam McGrathYue PanJoel PintoLuca RubiniMiguel RuizGag SingheepFabian StemmerWeiyi SunPaul VozilaThomas LinRanjani Ramamurthy
2020-07-01
Enhancing Transformer with Sememe Knowledge
Yuhui ZhangChenghao YangZhengping ZhouZhiyuan Liu
2020-07-01
Grapheme-to-Phoneme Conversion with a Multilingual Transformer Model
Omnia ElSaadanyBenjamin Suter
2020-07-01
On-The-Fly Information Retrieval Augmentation for Language Models
Hai WangDavid McAllester
2020-07-01
Investigating the effect of auxiliary objectives for the automated grading of learner English speech transcriptions
Hannah CraigheadAndrew CainesPaula ButteryHelen Yannakoudakis
2020-07-01
Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
Nikhil PrabhuKatharina Kann
2020-07-01
Leveraging Principal Parts for Morphological Inflection
Ling LiuMans Hulden
2020-07-01
Data Augmentation for Transformer-based G2P
Zach RyanMans Hulden
2020-07-01
HausaMT v1.0: Towards English--Hausa Neural Machine Translation
Adewale Akinfaderin
2020-07-01
Unsupervised FAQ Retrieval with Question Generation and BERT
Yosi MassBoaz CarmeliHaggai RoitmanDavid Konopnicki
2020-07-01
GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples
Danilo CroceGiuseppe CastellucciRoberto Basili
2020-07-01
Modelling Context and Syntactical Features for Aspect-based Sentiment Analysis
Minh Hieu PhanPhilip O. Ogunbona
2020-07-01
Towards Holistic and Automatic Evaluation of Open-Domain Dialogue Generation
Bo PangErik NijkampWenjuan HanLinqi ZhouYixian LiuKewei Tu
2020-07-01
Adversarial and Domain-Aware BERT for Cross-Domain Sentiment Analysis
Chunning DuHaifeng SunJingyu WangQi QiJianxin Liao
2020-07-01
How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope
Yiyun ZhaoSteven Bethard
2020-07-01
Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work?
Yada PruksachatkunJason PhangHaokun LiuPhu Mon HtutXiaoyi ZhangRichard Yuanzhe PangClara VaniaKatharina KannSamuel R. Bowman
2020-07-01
Would you Rather? A New Benchmark for Learning Machine Alignment with Cultural Values and Social Preferences
Yi TayDonovan OngJie FuAlvin ChanNancy ChenAnh Tuan LuuChris Pal
2020-07-01
Towards Debiasing Sentence Representations
Paul Pu LiangIrene Mengze LiEmily ZhengYao Chong LimRuslan SalakhutdinovLouis-Philippe Morency
2020-07-01
Automatic Generation of Citation Texts in Scholarly Papers: A Pilot Study
Xinyu XingXiaosheng FanXiaojun Wan
2020-07-01
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fern{\'a}ndez-Gonz{\'a}lezCarlos G{\'o}mez-Rodr{\'\i}guez
2020-07-01
tBERT: Topic Models and BERT Joining Forces for Semantic Similarity Detection
Nicole PeineltDong NguyenMaria Liakata
2020-07-01
Understanding Advertisements with BERT
Kanika KalraBhargav KurmaSilpa Vadakkeeveetil SreelathaManasi PatwardhanKarShirish e
2020-07-01
A Generate-and-Rank Framework with Semantic Type Regularization for Biomedical Concept Normalization
Dongfang XuZeyu ZhangSteven Bethard
2020-07-01
Revisiting Higher-Order Dependency Parsers
Erick FonsecaAndr{\'e} F. T. Martins
2020-07-01
SUPP.AI: finding evidence for supplement-drug interactions
Lucy WangOyvind TafjordArman CohanSarthak JainSam SkjonsbergCarissa SchoenickNick BotnerWaleed Ammar
2020-07-01
Why is penguin more similar to polar bear than to sea gull? Analyzing conceptual knowledge in distributional models
Pia Sommerauer
2020-07-01
A Simple and Effective Dependency Parser for Telugu
Sneha NallaniManish ShrivastavaDipti Sharma
2020-07-01
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining
Ivana Kvapil{\'\i}kov{\'a}Mikel ArtetxeGorka LabakaEneko AgirreOnd{\v{r}}ej Bojar
2020-07-01
Cross-Lingual Disaster-related Multi-label Tweet Classification with Manifold Mixup
Jishnu Ray ChowdhuryCornelia CarageaDoina Caragea
2020-07-01
Should You Fine-Tune BERT for Automated Essay Scoring?
Elijah MayfieldAlan W Black
2020-07-01
A BERT-based One-Pass Multi-Task Model for Clinical Temporal Relation Extraction
Chen LinTimothy MillerDmitriy DligachFarig SadequeSteven BethardGuergana Savova
2020-07-01
Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity
Yuxia WangFei LiuKarin VerspoorTimothy Baldwin
2020-07-01
Item-based Collaborative Filtering with BERT
Tian WangYuyangzi Fu
2020-07-01
Sarcasm Identification and Detection in Conversion Context using BERT
Kalaivani A.Thenmozhi D.
2020-07-01
Neural Sarcasm Detection using Conversation Context
Nikhil Jaiswal
2020-07-01
Context-Aware Sarcasm Detection Using BERT
Arup BaruahKaushik DasFerdous BarbhuiyaKuntal Dey
2020-07-01
IlliniMet: Illinois System for Metaphor Detection with Contextual and Linguistic Information
Hongyu GongKshitij GuptaAkriti JainSuma Bhat
2020-07-01
Go Figure! Multi-task transformer-based architecture for metaphor detection using idioms: ETS team in 2020 metaphor shared task
Xianyang ChenChee Wee (Ben) LeongMichael FlorBeata Beigman Klebanov
2020-07-01
Turku Enhanced Parser Pipeline: From Raw Text to Enhanced Graphs in the IWPT 2020 Shared Task
Jenna KanervaFilip GinterSampo Pyysalo
2020-07-01
K\opsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
Daniel HershcovichMiryam de LhoneuxArtur KulmizevElham PejhanJoakim Nivre
2020-07-01
RobertNLP at the IWPT 2020 Shared Task: Surprisingly Simple Enhanced UD Parsing for English
Stefan Gr{\"u}newaldAnnemarie Friedrich
2020-07-01
POSTECH Submission on Duolingo Shared Task
Junsu ParkHongseok KwonJong-Hyeok Lee
2020-07-01
Robust Prediction of Punctuation and Truecasing for Medical ASR
Monica SunkaraSrikanth RonankiKalpit DixitSravan BodapatiKatrin Kirchhoff
2020-07-01
Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT
Ashutosh AdhikariAchyudh RamRaphael TangWilliam L. HamiltonJimmy Lin
2020-07-01
Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference
Cemil CengizDeniz Yuret
2020-07-01
A Metric Learning Approach to Misogyny Categorization
Juan Manuel CoriaSahar GhannaySophie RossetHerv{\'e} Bredin
2020-07-01
Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation
Alessio MiaschiFelice Dell{'}Orletta
2020-07-01
What's in a Name? Are BERT Named Entity Representations just as Good for any other Name?
Sriram BalasubramanianNaman JainGaurav JindalAbhijeet AwasthiSunita Sarawagi
2020-07-01
Getting the \#\#life out of living: How Adequate Are Word-Pieces for Modelling Complex Morphology?
Stav KleinReut Tsarfaty
2020-07-01
SentiTel: TABSA for Twitter reviews on Uganda Telecoms
David KabiitoJoyce Nakatumba Nabende
2020-07-01
Adversarial Evaluation of BERT for Biomedical Named Entity Recognition
Vladimir AraujoAndr{\'e}s CarvalloDenis Parra
2020-07-01
Improving Multimodal Named Entity Recognition via Entity Span Detection with Unified Multimodal Transformer
Jianfei YuJing JiangLi YangRui Xia
2020-07-01
Probing for Referential Information in Language Models
Ionut-Teodor SorodocKristina GulordavaGemma Boleda
2020-07-01
Regularly Updated Deterministic Policy Gradient Algorithm
Shuai HanWenbo ZhouShuai LüJiayu Yu
2020-07-01
LSTM and GPT-2 Synthetic Speech Transfer Learning for Speaker Recognition to Overcome Data Scarcity
Jordan J. BirdDiego R. FariaAnikó EkártCristiano PremebidaPedro P. S. Ayrosa
2020-07-01
Developing cooperative policies for multi-stage tasks
Jordan ErskineChris Lehnert
2020-07-01
The Summary Loop: Learning to Write Abstractive Summaries Without Examples
| Philippe LabanAndrew HsiJohn CannyMarti A. Hearst
2020-07-01
Image-level Harmonization of Multi-Site Data using Image-and-Spatial Transformer Networks
| R. RobinsonQ. DouD. C. CastroK. KamnitsasM. de GrootR. M. SummersD. RueckertB. Glocker
2020-06-30
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
| Dmitry LepikhinHyoukJoong LeeYuanzhong XuDehao ChenOrhan FiratYanping HuangMaxim KrikunNoam ShazeerZhifeng Chen
2020-06-30
Correction of Faulty Background Knowledge based on Condition Aware and Revise Transformer for Question Answering
Xinyan ZhaoXiao FengHaoming ZhongJun YaoHuanhuan Chen
2020-06-30
SE3M: A Model for Software Effort Estimation Using Pre-trained Embedding Models
Eliane M. De Bortoli FáveroDalcimar CasanovaAndrey Ricardo Pimentel
2020-06-30
PriorGAN: Real Data Prior for Generative Adversarial Nets
Shuyang GuJianmin BaoDong ChenFang Wen
2020-06-30
Data Movement Is All You Need: A Case Study on Optimizing Transformers
Andrei IvanovNikoli DrydenTal Ben-NunShigang LiTorsten Hoefler
2020-06-30
Segmentation Approach for Coreference Resolution Task
Aref JafariAli Ghodsi
2020-06-30
BERTERS: Multimodal Representation Learning for Expert Recommendation System with Transformer
N. Nikzad-KhasmakhiM. A. BalafarM. Reza Feizi-DerakhshiCina Motamed
2020-06-30
Simplifying Models with Unlabeled Output Data
Sang Michael XieTengyu MaPercy Liang
2020-06-29
Predicting Length of Stay in the Intensive Care Unit with Temporal Pointwise Convolutional Networks
| Emma RocheteauPietro LiòStephanie Hyland
2020-06-29
A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
| Jean-Benoit DelbrouckNoé TitsMathilde BrousmicheStéphane Dupont
2020-06-29
Multi-Head Attention: Collaborate Instead of Concatenate
| Jean-Baptiste CordonnierAndreas LoukasMartin Jaggi
2020-06-29
Want to Identify, Extract and Normalize Adverse Drug Reactions in Tweets? Use RoBERTa
Katikapalli Subramanyam KalyanS. Sangeetha
2020-06-29
Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models
Viet Bui TheOanh Tran ThiPhuong Le-Hong
2020-06-29
Knowledge-Aware Language Model Pretraining
Corby RossetChenyan XiongMinh PhanXia SongPaul BennettSaurabh Tiwary
2020-06-29
Interpreting Hierarchical Linguistic Interactions in DNNs
Die ZhangHuilin ZhouXiaoyi BaoDa HuoRuizhao ChenXu ChengHao ZhangMengyue WuQuanshi Zhang
2020-06-29
Rethinking Positional Encoding in Language Pre-training
| Guolin KeDi HeTie-Yan Liu
2020-06-28
Self-Attention Networks for Intent Detection
Sevinj YolchuyevaGéza NémethBálint Gyires-Tóth
2020-06-28
Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates
| Ke SunZigang GengDepu MengBin XiaoDong LiuZhaoxiang ZhangJingdong Wang
2020-06-28
Causal Explanations of Image Misclassifications
Yan MinMiles Bennett
2020-06-28
Progressive Generation of Long Text
| Bowen TanZichao YangMaruan AI-ShedivatEric P. XingZhiting Hu
2020-06-28
BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
| Chen LiangYue YuHaoming JiangSiawpeng ErRuijia WangTuo ZhaoChao Zhang
2020-06-28
Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
Beliz GunelChenguang ZhuMichael ZengXuedong Huang
2020-06-27
Video-Grounded Dialogues with Pretrained Generation Language Models
Hung LeSteven C. H. Hoi
2020-06-27
Normalizador Neural de Datas e Endereços
Gustavo PlensackPaulo Finardi
2020-06-27
What they do when in doubt: a study of inductive biases in seq2seq learners
Eugene KharitonovRahma Chaabouni
2020-06-26
TURL: Table Understanding through Representation Learning
| Xiang DengHuan SunAlyssa LeesYou WuCong Yu
2020-06-26
BERTology Meets Biology: Interpreting Attention in Protein Language Models
| Jesse VigAli MadaniLav R. VarshneyCaiming XiongRichard SocherNazneen Fatema Rajani
2020-06-26
Conditional Set Generation with Transformers
Adam R KosiorekHyunjik KimDanilo J Rezende
2020-06-26
Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning
Firas FredjYasser Al-EryaniSetareh MaghsudiMohamed AkroutEkram Hossain
2020-06-26
DRACO: Co-Optimizing Hardware Utilization, and Performance of DNNs on Systolic Accelerator
Nandan Kumar JhaShreyas RavishankarSparsh MittalArvind KaushikDipan MandalMahesh Chandra
2020-06-26
COVID-19 detection using Residual Attention Network an Artificial Intelligence approach
Vishal SharmaCurtis Dyreson
2020-06-26
Learning Source Phrase Representations for Neural Machine Translation
Hongfei XuJosef van GenabithDeyi XiongQiuhui LiuJingyi Zhang
2020-06-25
Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering
Chiranjib Sur
2020-06-25
SACT: Self-Aware Multi-Space Feature Composition Transformer for Multinomial Attention for Video Captioning
Chiranjib Sur
2020-06-25
Noise, overestimation and exploration in Deep Reinforcement Learning
Rafael Stekolshchik
2020-06-25
FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings
| M. Caner TolKoray YurtsevenBerk GulmezogluBerk Sunar
2020-06-25
Normalizing Text using Language Modelling based on Phonetics and String Similarity
Fenil DoshiJimit GandhiDeep GosaliaSudhir Bagul
2020-06-25
LSBert: A Simple Framework for Lexical Simplification
| Jipeng QiangYun LiYi ZhuYunhao YuanXindong Wu
2020-06-25
Parametric Instance Classification for Unsupervised Visual Feature Learning
Yue CaoZhenda XieBin LiuYutong LinZheng ZhangHan Hu
2020-06-25
Differentiable Window for Dynamic Local Attention
Thanh-Tung NguyenXuan-Phi NguyenShafiq JotyXiaoli Li
2020-06-24
Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes
Shuai ZhengHaibin LinSheng ZhaMu Li
2020-06-24
Reducing Overestimation Bias by Increasing Representation Dissimilarity in Ensemble Based Deep Q-Learning
Hassam Ullah SheikhLadislau Bölöni
2020-06-24
Efficient Constituency Parsing by Pointing
Thanh-Tung NguyenXuan-Phi NguyenShafiq JotyXiaoli Li
2020-06-24
On the Difficulty of Designing Processor Arrays for Deep Neural Networks
| Kevin StehleGünther SchindlerHolger Fröning
2020-06-24
Hyperparameter Ensembles for Robustness and Uncertainty Quantification
Florian WenzelJasper SnoekDustin TranRodolphe Jenatton
2020-06-24
A Novel and Reliable Deep Learning Web-Based Tool to Detect COVID-19 Infection from Chest CT-Scan
| Abdolkarim SaeediMaryam SaeediArash Maghsoudi
2020-06-24
Hybrid Spatio-Temporal Graph Convolutional Network: Improving Traffic Prediction with Navigation Data
| Rui DaiShenkun XuQian GuChenguang JiKaikui Liu
2020-06-23
Bach or Mock? A Grading Function for Chorales in the Style of J.S. Bach
| Alexander FangAlisa LiuPrem SeetharamanBryan Pardo
2020-06-23
The Effect of Multi-step Methods on Overestimation in Deep Reinforcement Learning
Lingheng MengRob GorbetDana Kulić
2020-06-23
Experience Replay with Likelihood-free Importance Weights
Samarth SinhaJiaming SongAnimesh GargStefano Ermon
2020-06-23
NeuralScale: Efficient Scaling of Neurons for Resource-Constrained Deep Neural Networks
| Eugene LeeChen-Yi Lee
2020-06-23
Self-supervised edge features for improved Graph Neural Network training
| Arijit SehanobishNeal G. RavindraDavid van Dijk
2020-06-23
A Self-Attention Network based Node Embedding Model
Dai Quoc NguyenTu Dinh NguyenDinh Phung
2020-06-22
Exploring Software Naturalness through Neural Language Models
Luca BurattiSaurabh PujarMihaela BorneaScott McCarleyYunhui ZhengGaetano RossielloAlessandro MorariJim LaredoVeronika ThostYufan ZhuangGiacomo Domeniconi
2020-06-22
ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion
| BingningWangTing YaoQi ZhangJingfang XuXiaochuan Wang
2020-06-22
Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage
Shijing SiRui WangJedrek WosikHao ZhangDavid DovGuoyin WangRicardo HenaoLawrence Carin
2020-06-22
AdvAug: Robust Adversarial Augmentation for Neural Machine Translation
Yong ChengLu JiangWolfgang MachereyJacob Eisenstein
2020-06-21
The NYU-CUBoulder Systems for SIGMORPHON 2020 Task 0 and Task 2
Assaf SingerKatharina Kann
2020-06-21
Off-Policy Self-Critical Training for Transformer in Visual Paragraph Generation
Shiyang YanYang HuaNeil M. Robertson
2020-06-21
A Universal Representation Transformer Layer for Few-Shot Image Classification
| Lu LiuWilliam HamiltonGuodong LongJing JiangHugo Larochelle
2020-06-21
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
| Alexei BaevskiHenry ZhouAbdelrahman MohamedMichael Auli
2020-06-20
Memory Transformer
Mikhail S. BurtsevGrigory V. Sapunov
2020-06-20
Sarcasm Detection in Tweets with BERT and GloVe Embeddings
Akshay KhatriPranav PDr. Anand Kumar M
2020-06-20
End-to-end deep metamodeling to calibrate and optimize energy loads
Max CohenMaurice CharbitSylvain Le CorffMarius PredaGilles Nozière
2020-06-19
New Vietnamese Corpus for Machine ReadingComprehension of Health News Articles
Kiet Van NguyenDuc-Vu NguyenAnh Gia-Tuan NguyenNgan Luu-Thuy Nguyen
2020-06-19
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19
| David OnianiYanshan Wang
2020-06-19
SqueezeBERT: What can computer vision teach NLP about efficient neural networks?
Forrest N. IandolaAlbert E. ShawRavi KrishnaKurt W. Keutzer
2020-06-19
Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing
Szu-Wei FuChien-Feng LiaoTsun-An HsiehKuo-Hsuan HungSyu-Siang WangCheng YuHeng-Cheng KuoRyandhimas E. ZezarioYou-Jin LiShang-Yi ChuangYen-Ju LuYu Tsao
2020-06-18
Multi-branch Attentive Transformer
| Yang FanShufang XieYingce XiaLijun WuTao QinXiang-Yang LiTie-Yan Liu
2020-06-18
I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths
| Hyoungwook NamSeung Byum SeoVikram Sharma MailthodyNoor MichaelLan Li
2020-06-18
SEAL: Segment-wise Extractive-Abstractive Long-form Text Summarization
Yao ZhaoMohammad SalehPeter J. Liu
2020-06-18
Sparse GPU Kernels for Deep Learning
Trevor GaleMatei ZahariaCliff YoungErich Elsen
2020-06-18
SenWave: Monitoring the Global Sentiments under the COVID-19 Pandemic
Qiang YangHind AlamroSomayah AlbaradeiAdil SalhiXiaoting LvChangsheng MaManal AlshehriInji JaberFaroug TifrateneWei WangTakashi GojoboriCarlos M. DuarteXin GaoXiangliang Zhang
2020-06-18
Reducing Estimation Bias via Weighted Delayed Deep Deterministic Policy Gradient
Qiang HeXinwen Hou
2020-06-18
Efficient Ridesharing Dispatch Using Multi-Agent Reinforcement Learning
| Oscar de LimaHansal ShahTing-Sheng ChuBrian Fogelson
2020-06-18
Neural Graphics Pipeline for Controllable Image Generation
Xuelin ChenDaniel Cohen-OrBaoquan ChenNiloy J. Mitra
2020-06-18
Differentiable Augmentation for Data-Efficient GAN Training
| Shengyu ZhaoZhijian LiuJi LinJun-Yan ZhuSong Han
2020-06-18
Unsupervised Learning of Visual Features by Contrasting Cluster Assignments
| Mathilde CaronIshan MisraJulien MairalPriya GoyalPiotr BojanowskiArmand Joulin
2020-06-17
Intelligent Protection & Classification of Transients in Two-Core Symmetric Phase Angle Regulating Transformers
Pallav Kumar BeraCan Isik
2020-06-17
Automatically Ranked Russian Paraphrase Corpus for Text Generation
Vadim GudkovOlga MitrofanovaElizaveta Filippskikh
2020-06-17
Learning Visual Commonsense for Robust Scene Graph Generation
Alireza ZareianZhecan WangHaoxuan YouShih-Fu Chang
2020-06-17
Dynamic Tensor Rematerialization
| Marisa KirisameSteven LyubomirskyAltan HaanJennifer BrennanMike HeJared RoeschTianqi ChenZachary Tatlock
2020-06-17
Fine-Grained Stochastic Architecture Search
Shraman Ray ChaudhuriElad EbanHanhan LiMax MorozYair Movshovitz-Attias
2020-06-17
Exploring the BERT Cross-Lingual Transferability: a Case Study in Reading Comprehension
Konovalov V. P.Gulyaev P. A.Sorokin A. A.Kuratov Y. M.Burtsev M. S.
2020-06-17
Tagging and parsing of multidomain collections
| Alexey SorokinIvan SmurovDenis Kirianov
2020-06-17
Modeling Graph Structure via Relative Position for Better Text Generation from Knowledge Graphs
Martin SchmittLeonardo F. R. RibeiroPhilipp DufterIryna GurevychHinrich Schütze
2020-06-16
Improving accuracy and speeding up Document Image Classification through parallel systems
Javier FerrandoJuan Luis DominguezJordi TorresRaul GarciaDavid GarciaDaniel GarridoJordi CortadaMateo Valero
2020-06-16
PERL: Pivot-based Domain Adaptation for Pre-trained Deep Contextualized Embedding Models
| Eyal Ben-DavidCarmel RabinovitzRoi Reichart
2020-06-16
Multi-Precision Policy Enforced Training (MuPPET): A precision-switching strategy for quantised fixed-point training of CNNs
Aditya RajagopalDiederik Adriaan VinkStylianos I. VenierisChristos-Savvas Bouganis
2020-06-16
The SPPD System for Schema Guided Dialogue State Tracking Challenge
Miao LiHaoqi XiongYunbo Cao
2020-06-16
Real-time Universal Style Transfer on High-resolution Images via Zero-channel Pruning
Jie AnTao LiHaozhi HuangLi ShenXuan WangYongyi TangJinwen MaWei LiuJiebo Luo
2020-06-16
Scalable Cross Lingual Pivots to Model Pronoun Gender for Translation
Kellie WebsterEmily Pitler
2020-06-16
End-to-End Code Switching Language Models for Automatic Speech Recognition
Ahan M. R.Shreyas Sunil Kulkarni
2020-06-16
Intriguing generalization and simplicity of adversarially trained neural networks
Chirag AgarwalPeijie ChenAnh Nguyen
2020-06-16
COVID-CXNet: Detecting COVID-19 in Frontal Chest X-ray Images using Deep Learning
| Arman HaghanifarMahdiyar Molahasani MajdabadiYounhee ChoiS. DeivalakshmiSeokbum Ko
2020-06-16
Fine-grained Human Evaluation of Transformer and Recurrent Approaches to Neural Machine Translation for English-to-Chinese
| Yuying YeAntonio Toral
2020-06-15
On the Multi-Property Extraction and Beyond
Tomasz DwojakMichał PietruszkaŁukasz BorchmannFilip GralińskiJakub Chłędowski
2020-06-15
Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset
Andrei AndrusenkoAleksandr LaptevIvan Medennikov
2020-06-15
Differentiable Neural Architecture Transformation for Reproducible Architecture Improvement
Do-Guk KimHeung-Chang Lee
2020-06-15
Multi-Image Summarization: Textual Summary from a Set of Cohesive Images
Nicholas TrieuSebastian GoodmanPradyumna NarayanaKazoo SoneRadu Soricut
2020-06-15
Interaction Networks: Using a Reinforcement Learner to train other Machine Learning algorithms
Florian Dietz
2020-06-15
FinBERT: A Pretrained Language Model for Financial Communications
| Yi YangMark Christopher Siy UYAllen Huang
2020-06-15
An online evolving framework for advancing reinforcement-learning based automated vehicle control
Teawon HanSubramanya NageshraoDimitar P. FilevUmit Ozguner
2020-06-15
Document Classification for COVID-19 Literature
Bernal Jiménez GutiérrezJuncheng ZengDongdong ZhangPing ZhangYu Su
2020-06-15
Generating Master Faces for Use in Performing Wolf Attacks on Face Recognition Systems
Huy H. NguyenJunichi YamagishiIsao EchizenSébastien Marcel
2020-06-15
Cooking Is All About People: Comment Classification On Cookery Channels Using BERT and Classification Models (Malayalam-English Mix-Code)
Subramaniam KazhuparambilAbhishek Kaushik
2020-06-15
FinEst BERT and CroSloEngual BERT: less is more in multilingual models
Matej UlčarMarko Robnik-Šikonja
2020-06-14
Transferring Monolingual Model to Low-Resource Language: The Case of Tigrinya
Abrhalei TelaAbraham WoubieVille Hautamaki
2020-06-13
Guided Transformer: Leveraging Multiple External Sources for Representation Learning in Conversational Search
Helia HashemiHamed ZamaniW. Bruce Croft
2020-06-13
Temporal Fusion Network for Temporal Action Localization:Submission to ActivityNet Challenge 2020 (Task E)
Zhiwu QingXiang WangYongpeng SangChangxin GaoShiwei ZhangNong Sang
2020-06-13
Modelling High-Level Mathematical Reasoning in Mechanised Declarative Proofs
Wenda LiLei YuYuhuai WuLawrence C. Paulson
2020-06-13
Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural Networks
Lili WangRuibo LiuSoroush Vosoughi
2020-06-13
Comparing Natural Language Processing Techniques for Alzheimer's Dementia Prediction in Spontaneous Speech
Thomas SearleZina IbrahimRichard Dobson
2020-06-12
Unmasking the Inductive Biases of Unsupervised Object Representations for Video Sequences
Marissa A. WeisKashyap ChittaYash SharmaWieland BrendelMatthias BethgeAndreas GeigerAlexander S. Ecker
2020-06-12
Human and Multi-Agent collaboration in a human-MARL teaming framework
Neda NavidiFrancois ChabotSagar KurandwadIrv LustigmanVincent RobertGregory SzriftgiserAndrea Schuch
2020-06-12
Online Sequential Extreme Learning Machines: Features Combined From Hundreds of Midlayers
Chandra Swarathesh Addanki
2020-06-12
Dance Revolution: Long Sequence Dance Generation with Music via Curriculum Learning
| Ruozi HuangHuang HuWei WuKei SawadaMi Zhang
2020-06-11
FastPitch: Parallel Text-to-speech with Pitch Prediction
Adrian Łańcucki
2020-06-11
Privacy-Aware Activity Classification from First Person Office Videos
Partho GhoshMd. Abrar IstiakNayeeb RashidAhsan Habib AkashRidwan AbrarAnkan Ghosh DastiderAsif Shahriyar SushmitTaufiq Hasan
2020-06-11
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
Pedro Javier Ortiz SuárezLaurent RomaryBenoît Sagot
2020-06-11
Training Generative Adversarial Networks with Limited Data
| Tero KarrasMiika AittalaJanne HellstenSamuli LaineJaakko LehtinenTimo Aila
2020-06-11
Extrapolation for Large-batch Training in Deep Learning
Tao LinLingjing KongSebastian U. StichMartin Jaggi
2020-06-10
Revisiting Few-sample BERT Fine-tuning
| Tianyi ZhangFelix WuArzoo KatiyarKilian Q. WeinbergerYoav Artzi
2020-06-10
MC-BERT: Efficient Language Pre-Training via a Meta Controller
| Zhenhui XuLinyuan GongGuolin KeDi HeShuxin ZhengLiwei WangJiang BianTie-Yan Liu
2020-06-10
Graph-Aware Transformer: Is Attention All Graphs Need?
Sanghyun YooYoung-Seok KimKang Hyun LeeKuhwan JeongJunhwi ChoiHoshik LeeYoung Sang Choi
2020-06-09
HausaMT v1.0: Towards English-Hausa Neural Machine Translation
Adewale Akinfaderin
2020-06-09
Unsupervised Paraphrase Generation using Pre-trained Language Models
Chaitra HegdeShrikumar Patil
2020-06-09
Few-Shot Generative Conversational Query Rewriting
| Shi YuJiahua LiuJingqin YangChenyan XiongPaul BennettJianfeng GaoZhiyuan Liu
2020-06-09
Bombus Species Image Classification
Venkat MargapuriGeorge LavezziRobert StewartDan Wagner
2020-06-09
Linformer: Self-Attention with Linear Complexity
| Sinong WangBelinda Z. LiMadian KhabsaHan FangHao Ma
2020-06-08
Modeling Discourse Structure for Document-level Neural Machine Translation
Junxuan ChenXiang LiJiarui ZhangChulun ZhouJianwei CuiBin WangJinsong Su
2020-06-08
MultiSpeech: Multi-Speaker Text to Speech with Transformer
Mingjian ChenXu TanYi RenJin XuHao SunSheng ZhaoTao Qin
2020-06-08
Learning to Count Words in Fluent Speech enables Online Speech Recognition
| George SterpuChristian SaamNaomi Harte
2020-06-08
Wat zei je? Detecting Out-of-Distribution Translations with Variational Transformers
Tim Z. XiaoAidan N. GomezYarin Gal
2020-06-08
Passive Batch Injection Training Technique: Boosting Network Performance by Injecting Mini-Batches from a different Data Distribution
Pravendra SinghPratik MazumderVinay P. Namboodiri
2020-06-08
Balancing a CartPole System with Reinforcement Learning -- A Tutorial
Swagat Kumar
2020-06-08
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
| Marius MosbachMaksym AndriushchenkoDietrich Klakow
2020-06-08
Learning disconnected manifolds: a no GANs land
Ugo TanielianThibaut IssenhuthElvis DohmatobJeremie Mary
2020-06-08
Big GANs Are Watching You: Towards Unsupervised Object Segmentation with Off-the-Shelf Generative Models
| Andrey VoynovStanislav MorozovArtem Babenko
2020-06-08
Conservative Q-Learning for Offline Reinforcement Learning
Aviral KumarAurick ZhouGeorge TuckerSergey Levine
2020-06-08
Learning Texture Transformer Network for Image Super-Resolution
| Fuzhi YangHuan YangJianlong FuHongtao LuBaining Guo
2020-06-07
Pre-training Polish Transformer-based Language Models at Scale
| Sławomir DadasMichał PerełkiewiczRafał Poświata
2020-06-07
BERT Loses Patience: Fast and Robust Inference with Early Exit
| Wangchunshu ZhouCanwen XuTao GeJulian McAuleyKe XuFuru Wei
2020-06-07
Medical Concept Normalization in User Generated Texts by Learning Target Concept Embeddings
Katikapalli Subramanyam KalyanS. Sangeetha
2020-06-07
EDropout: Energy-Based Dropout and Pruning of Deep Neural Networks
| Hojjat SalehinejadShahrokh Valaee
2020-06-07
A Comparative Study on Early Detection of COVID-19 from Chest X-Ray Images
Mete AhishaliAysen DegerliMehmet YamacSerkan KiranyazMuhammad E. H. ChowdhuryKhalid HameedTahir HamidRashid MazharMoncef Gabbouj
2020-06-07
Challenges and Thrills of Legal Arguments
Anurag PallaproluRadha VaidyaAditya Swaroop Attawar
2020-06-06
A Robust Attentional Framework for License Plate Recognition in the Wild
Linjiang ZhangPeng WangHui LiZhen LiChunhua ShenYanning Zhang
2020-06-06
Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers
Krzysztof ChoromanskiValerii LikhosherstovDavid DohanXingyou SongJared DavisTamas SarlosDavid BelangerLucy ColwellAdrian Weller
2020-06-05
GMAT: Global Memory Augmentation for Transformers
| Ankit GuptaJonathan Berant
2020-06-05
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
| Zihang DaiGuokun LaiYiming YangQuoc V. Le
2020-06-05
An Overview of Neural Network Compression
James O' Neill
2020-06-05
Accelerating Natural Language Understanding in Task-Oriented Dialog
Ojas AhujaShrey Desai
2020-06-05
UDPipe at EvaLatin 2020: Contextualized Embeddings and Treebank Embeddings
Milan StrakaJana Straková
2020-06-05
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
| Pengcheng HeXiaodong LiuJianfeng GaoWeizhu Chen
2020-06-05
End-to-End Speech-Translation with Knowledge Distillation: [email protected]
Marco GaidoMattia Antonino Di GangiMatteo NegriMarco Turchi
2020-06-04
The SOFC-Exp Corpus and Neural Approaches to Information Extraction in the Materials Science Domain
Annemarie FriedrichHeike AdelFederico TomazicJohannes HingerlRenou BenteauAnika MaruscykLukas Lange
2020-06-04
Neural Network for Low-Memory IoT Devices and MNIST Image Recognition Using Kernels Based on Logistic Map
Andrei Velichko
2020-06-04
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks
Gašper Beguš
2020-06-04
Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2
| Virapat KieuvongngamBowen TanYiming Niu
2020-06-03
Rényi Generative Adversarial Networks
Himesh BhatiaWilliam PaulFady AlajajiBahman GharesifardPhilippe Burlina
2020-06-03
On the Predictive Power of Neural Language Models for Human Real-Time Comprehension Behavior
Ethan Gotlieb WilcoxJon GauthierJennifer HuPeng QianRoger Levy
2020-06-02
Subjective Question Answering: Deciphering the inner workings of Transformers in the realm of subjectivity
Lukas Muttenthaler
2020-06-02
WikiBERT models: deep transfer learning for many languages
Sampo PyysaloJenna KanervaAntti VirtanenFilip Ginter
2020-06-02
Question Answering on Scholarly Knowledge Graphs
Mohamad Yaser JaradehMarkus StockerSören Auer
2020-06-02
A Pairwise Probe for Understanding BERT Fine-Tuning on Machine Reading Comprehension
Jie CaiZhengzhou ZhuPing NieQian Liu
2020-06-02
A heterogeneous branch and multi-level classification network for person re-identification
Jiabao WangYang LiYangshuo ZhangZhuang MiaoRui Zhang
2020-06-02
BERT Based Multilingual Machine Comprehension in English and Hindi
| Somil GuptaNilesh Khade
2020-06-02
Exploring Cross-sentence Contexts for Named Entity Recognition with BERT
Jouni LuomaSampo Pyysalo
2020-06-02
Position Masking for Language Models
Andy WagnerTiyasa MitraMrinal IyerGodfrey Da CostaMarc Tremblay
2020-06-02
Online Versus Offline NMT Quality: An In-depth Analysis on English-German and German-English
Maha ElbayadMichael UstaszewskiEmmanuelle Esperança-RodierFrancis Brunet ManquatLaurent Besacier
2020-06-01
Context-based Transformer Models for Answer Sentence Selection
Ivano LauriolaAlessandro Moschitti
2020-06-01
Unsupervised Sparse-view Backprojection via Convolutional and Spatial Transformer Networks
Xueqing LiuPaul Sajda
2020-06-01
Image Search With Text Feedback by Visiolinguistic Attention Learning
| Yanbei Chen Shaogang Gong Loris Bazzani
2020-06-01
Few-Shot Learning of Part-Specific Probability Space for 3D Shape Segmentation
Lingjing Wang Xiang Li Yi Fang
2020-06-01
RDCFace: Radial Distortion Correction for Face Recognition
He Zhao Xianghua Ying Yongjie Shi Xin Tong Jingsi Wen Hongbin Zha
2020-06-01
An End-to-End Edge Aggregation Network for Moving Object Segmentation
Prashant W. Patil Kuldeep M. Biradar Akshay Dudhane Subrahmanyam Murala
2020-06-01
ActBERT: Learning Global-Local Video-Text Representations
Linchao Zhu Yi Yang
2020-06-01
Residual Squeeze-and-Excitation Network for Fast Image Deraining
Jun FuJianfeng XuKazuyuki TasakaZhibo Chen
2020-06-01
Emergence of Separable Manifolds in Deep Language Representations
Jonathan MamouHang LeMiguel Del RioCory StephensonHanlin TangYoon KimSueYeon Chung
2020-06-01
Conversational Machine Comprehension: a Literature Review
Somil GuptaBhanu Pratap Singh Rawat
2020-06-01
When Bert Forgets How To POS: Amnesic Probing of Linguistic Properties and MLM Predictions
Yanai ElazarShauli RavfogelAlon JacoviYoav Goldberg
2020-06-01
A2-LINK: Recognizing Disguised Faces via Active Learning and Adversarial Noise based Inter-Domain Knowledge
| Anshuman SuriMayank VatsaRicha Singh
2020-06-01
RNNs on Monitoring Physical Activity Energy Expenditure in Older People
Stylianos ParaschiakosCláudio Rebelo de SáJeremiah OkaiEline P. SlagboomMarian BeekmanArno Knobbe
2020-06-01
BWCNN: Blink to Word, a Real-Time Convolutional Neural Network Approach
Albara Ah RamliRex LiuRahul KrishnamoorthyVishal I BXiaoxiao WangIlias TagkopoulosXin Liu
2020-06-01
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features
Shi-Yan WengTien-Hong LoBerlin Chen
2020-06-01
BERT-based Ensembles for Modeling Disclosure and Support in Conversational Social Media Text
Tanvi DaduKartikey PantRadhika Mamidi
2020-06-01
Correlation-Guided Attention for Corner Detection Based Visual Tracking
Fei Du Peng Liu Wei Zhao Xianglong Tang
2020-06-01
Low-Rank Compression of Neural Nets: Learning the Rank of Each Layer
Yerlan Idelbayev Miguel A. Carreira-Perpinan
2020-06-01
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
| Zhewei YaoAmir GholamiSheng ShenKurt KeutzerMichael W. Mahoney
2020-06-01
A U-Net Based Discriminator for Generative Adversarial Networks
Edgar Schonfeld Bernt Schiele Anna Khoreva
2020-06-01
Severity-Aware Semantic Segmentation With Reinforced Wasserstein Training
Xiaofeng Liu Wenxuan Ji Jane You Georges El Fakhri Jonghye Woo
2020-06-01
BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble Learning
Rajaswa PatilSomesh SinghSwati Agarwal
2020-05-31
CNRL at SemEval-2020 Task 5: Modelling Causal Reasoning in Language with Multi-Head Self-Attention Weights based Counterfactual Detection
Rajaswa PatilVeeky Baths
2020-05-31
Neural Entity Linking: A Survey of Models based on Deep Learning
| Ozge SevgiliArtem ShelmanovMikhail ArkhipovAlexander PanchenkoChris Biemann
2020-05-31
"Judge me by my size (noun), do you?'' YodaLib: A Demographic-Aware Humor Generation Framework
Aparna GarimellaCarmen BaneaNabil HossainRada Mihalcea
2020-05-31
LRG at SemEval-2020 Task 7: Assessing the Ability of BERT and Derivative Models to Perform Short-Edits based Humor Grading
Siddhant MahurkarRajaswa Patil
2020-05-31
Blended Multi-Modal Deep ConvNet Features for Diabetic Retinopathy Severity Prediction
J. D. BodapatiN. VeeranjaneyuluS. N. ShareefS. HakakM. BilalP. K. R. MaddikuntaO. Jo
2020-05-30
Detecting Problem Statements in Peer Assessments
Yunkai XiaoGabriel ZingleQinjin JiaHarsh R. ShahYi ZhangTianyi LiMohsin KarovaliyaWeixiang ZhaoYang SongJie JiAshwin BalasubramaniamHarshit PatelPriyankha BhalasubbramanianVikram PatelEdward F. Gehringer
2020-05-30
CLARINET: A RISC-V Based Framework for Posit Arithmetic Empiricism
Riya JainNiraj SharmaFarhad MerchantSachin PatkarRainer Leupers
2020-05-30
First Neural Conjecturing Datasets and Experiments
Josef UrbanJan Jakubův
2020-05-29
Using Large Pretrained Language Models for Answering User Queries from Product Specifications
Kalyani RoySmit ShahNithish PaiJaidam RamtejPrajit Prashant NadkarnJyotirmoy BanerjeePawan GoyalSurender Kumar
2020-05-29
A Comparative Study of Lexical Substitution Approaches based on Neural Language Models
Nikolay ArefyevBoris SheludkoAlexander PodolskiyAlexander Panchenko
2020-05-29
SAFER: A Structure-free Approach for Certified Robustness to Adversarial Word Substitutions
Mao YeChengyue GongQiang Liu
2020-05-29
Glaucoma Detection From Raw Circumapillary OCT Images Using Fully Convolutional Neural Networks
Gabriel GarcíaRocío del AmorAdrián ColomerValery Naranjo
2020-05-29
Stance Prediction for Contemporary Issues: Data and Experiments
| Marjan HosseiniaEduard DragutArjun Mukherjee
2020-05-29
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
| Hanrui WangZhanghao WuZhijian LiuHan CaiLigeng ZhuChuang GanSong Han
2020-05-28
Variational Neural Machine Translation with Normalizing Flows
Hendra SetiawanMatthias SperberUdhay NallasamyMatthias Paulik
2020-05-28
Empirical Evaluation of Pretraining Strategies for Supervised Entity Linking
Thibault FévryNicholas FitzGeraldLivio Baldini SoaresTom Kwiatkowski
2020-05-28
Knowledge-Driven Learning via Experts Consult for Thyroid Nodule Classification
Danilo AvolaLuigi CinqueAlessio FagioliSebastiano FilettiGiorgio GraniEmanuele Rodolà
2020-05-28
Brief Announcement: On the Limits of Parallelizing Convolutional Neural Networks on GPUs
Behnam PourghassemiChenghao ZhangJoo Hwan LeeAparna Chandramowlishwaran
2020-05-28
On Incorporating Structural Information to improve Dialogue Response Generation
| Nikita MoghePriyesh VijayanBalaraman RavindranMitesh M. Khapra
2020-05-28
Language Models are Few-Shot Learners
| Tom B. BrownBenjamin MannNick RyderMelanie SubbiahJared KaplanPrafulla DhariwalArvind NeelakantanPranav ShyamGirish SastryAmanda AskellSandhini AgarwalAriel Herbert-VossGretchen KruegerTom HenighanRewon ChildAditya RameshDaniel M. ZieglerJeffrey WuClemens WinterChristopher HesseMark ChenEric SiglerMateusz LitwinScott GrayBenjamin ChessJack ClarkChristopher BernerSam McCandlishAlec RadfordIlya SutskeverDario Amodei
2020-05-28
General-Purpose User Embeddings based on Mobile App Usage
| Junqi ZhangBing BaiYe LinJian LiangKun BaiFei Wang
2020-05-27
Permutation Matters: Anisotropic Convolutional Layer for Learning on Point Clouds
| Zhongpai GaoGuangtao ZhaiJunchi YanXiaokang Yang
2020-05-27
Syntactic Structure Distillation Pretraining For Bidirectional Encoders
Adhiguna KuncoroLingpeng KongDaniel FriedDani YogatamaLaura RimellChris DyerPhil Blunsom
2020-05-27
CausaLM: Causal Model Explanation Through Counterfactual Language Models
Amir FederNadav OvedUri ShalitRoi Reichart
2020-05-27
Transition-based Semantic Dependency Parsing with Pointer Networks
Daniel Fernández-GonzálezCarlos Gómez-Rodríguez
2020-05-27
SSM-Net for Plants Disease Identification in Low Data Regime
Shruti Jadon
2020-05-27
Towards the Infeasibility of Membership Inference on Deep Models
Shahbaz RezaeiXin Liu
2020-05-27
Language Representation Models for Fine-Grained Sentiment Classification
Brian CheangBailey WeiDavid KoganHowey QiuMasud Ahmed
2020-05-27
Network Fusion for Content Creation with Conditional INNs
Robin RombachPatrick EsserBjörn Ommer
2020-05-27
CLOCS: Contrastive Learning of Cardiac Signals
Dani KiyassehTingting ZhuDavid A. Clifton
2020-05-27
Sparse Identification of Nonlinear Dynamical Systems via Reweighted $\ell_1$-regularized Least Squares
Alexandre CortiellaKwang-Chun ParkAlireza Doostan
2020-05-27
SafeML: Safety Monitoring of Machine Learning Classifiers through Statistical Difference Measure
| Koorosh AslansefatIoannis SorokosDeclan WhitingRamin Tavakoli KolagariYiannis Papadopoulos
2020-05-27
Insertion-Based Modeling for End-to-End Automatic Speech Recognition
Yuya FujitaShinji WatanabeMotoi OmachiXuankai Chan
2020-05-27
End-to-End Object Detection with Transformers
| Nicolas CarionFrancisco MassaGabriel SynnaeveNicolas UsunierAlexander KirillovSergey Zagoruyko
2020-05-26
GECToR -- Grammatical Error Correction: Tag, Not Rewrite
| Kostiantyn OmelianchukVitaliy AtrasevychArtem ChernodubOleksandr Skurzhanskyi
2020-05-26
Guiding Symbolic Natural Language Grammar Induction via Transformer-Based Sequence Probabilities
Ben GoertzelAndres Suarez MadrigalGino Yu
2020-05-26
Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition
Lei KangPau RibaMarçal RusiñolAlicia FornésMauricio Villegas
2020-05-26
A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction
Saadullah AminKatherine Ann DunfieldAnna VechkaevaGünter Neumann
2020-05-26
What Are People Asking About COVID-19? A Question Classification Dataset
| Jerry WeiChengyu HuangSoroush VosoughiJason Wei
2020-05-26
ParsBERT: Transformer-based Model for Persian Language Understanding
| Mehrdad FarahaniMohammad GharachorlooMarzieh FarahaniMohammad Manthouri
2020-05-26
BEEP! Korean Corpus of Online News Comments for Toxic Speech Detection
| Jihyung MoonWon Ik ChoJunbum Lee
2020-05-26
Identification of Crystal Symmetry from Noisy Diffraction Patterns by A Shape Analysis and Deep Learning
| Leslie Ching Ow TiongJeongrae KimSang Soo HanDonghun Kim
2020-05-26
Comparing BERT against traditional machine learning text classification
Santiago González-CarvajalEduardo C. Garrido-Merchán
2020-05-26
BERT-XML: Large Scale Automated ICD Coding Using BERT Pretraining
Zachariah ZhangJingshu LiuNarges Razavian
2020-05-26
Perceptual Extreme Super Resolution Network with Receptive Field Block
Taizhang ShangQiuju DaiShengchen ZhuTong YangYandong Guo
2020-05-26
Deep Learning Models for Automatic Summarization
Pirmin Lemberger
2020-05-25
The Unreasonable Volatility of Neural Machine Translation Models
| Marzieh FadaeeChristof Monz
2020-05-25
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering
Chia-Chih KuoShang-Bao LuoKuan-Yu Chen
2020-05-25
Køpsala: Transition-Based Graph Parsing via Efficient Training and Effective Encoding
| Daniel HershcovichMiryam de LhoneuxArtur KulmizevElham PejhanJoakim Nivre
2020-05-25
Learning to Charge RF-Energy Harvesting Devices in WiFi Networks
Yizhou LuoKwan-Wu Chin
2020-05-25
Pointwise Paraphrase Appraisal is Potentially Problematic
Hannah ChenYangfeng JiDavid Evans
2020-05-25
Optimization-driven Deep Reinforcement Learning for Robust Beamforming in IRS-assisted Wireless Communications
Jiaye LinYuze ZouXiaoru DongShimin GongDinh Thai HoangDusit Niyato
2020-05-25
Egocentric Human Segmentation for Mixed Reality
Andrija GajicEster Gonzalez-SosaDiego Gonzalez-MorinMarcos Escudero-ViñoloAlvaro Villegas
2020-05-25
Identity-Preserving Realistic Talking Face Generation
Sanjana SinhaSandika BiswasBrojeshwar Bhowmick
2020-05-25
Adversarial NLI for Factual Correctness in Text Summarisation Models
Mario BarrantesBenedikt HerudekRichard Wang
2020-05-24
Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model
Satoru KatsumataMamoru Komachi
2020-05-24
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding
Chen LiuSu ZhuZijian ZhaoRuisheng CaoLu ChenKai Yu
2020-05-24
Devising Malware Characterstics using Transformers
Simra ShahidTanmay SinghYash SharmaKapil Sharma
2020-05-23
Coronavirus: Comparing COVID-19, SARS and MERS in the eyes of AI
Anas TahirYazan QiblaweyAmith KhandakarTawsifur RahmanUzair KhurshidFarayi MusharavatiM. T. IslamSerkan KiranyazMuhammad E. H. Chowdhury
2020-05-23
Character-level Transformer-based Neural Machine Translation
Nikolay BanarWalter DaelemansMike Kestemont
2020-05-22
A Generative Approach to Titling and Clustering Wikipedia Sections
Anjalie FieldSascha RotheSimon BaumgartnerCong YuAbe Ittycheriah
2020-05-22
Low-Latency Sequence-to-Sequence Speech Recognition and Translation by Partial Hypothesis Selection
Danni LiuGerasimos SpanakisJan Niehues
2020-05-22
Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media
Xiangjue DongChangmao LiJinho D. Choi
2020-05-22
Comparative Study of Machine Learning Models and BERT on SQuAD
Devshree PatelParam RavalRatnam ParikhYesha Shastri
2020-05-22
L2R2: Leveraging Ranking for Abductive Reasoning
| Yunchang ZhuLiang PangYanyan LanXueqi Cheng
2020-05-22
Living Machines: A study of atypical animacy
Mariona Coll ArdanuyFederico NanniKaspar BeelenKasra HosseiniRuth AhnertJon LawrenceKatherine McDonoughGiorgia TolfoDaniel CS WilsonBarbara McGillivray
2020-05-22
Robust Layout-aware IE for Visually Rich Documents with Pre-trained Language Models
Mengxi WeiYifan HeQiong Zhang
2020-05-22
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
Laila RasmyYang XiangZiqian XieCui TaoDegui Zhi
2020-05-22
Simplified Self-Attention for Transformer-based End-to-End Speech Recognition
Haoneng LuoShiliang ZhangMing LeiLei Xie
2020-05-21
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning
Zhiping ZengVan Tung PhamHaihua XuYerbolat KhassanovEng Siong ChngChongjia NiBin Ma
2020-05-21
TASO: Time and Space Optimization for Memory-Constrained DNN Inference
Yuan WenAndrew AndersonValentin RaduMichael F. P. O'BoyleDavid Gregg
2020-05-21
Text-to-Text Pre-Training for Data-to-Text Tasks
| Mihir Kale
2020-05-21
Evaluation of deep convolutional neural networks in classifying human embryo images based on their morphological quality
Prudhvi ThirumalarajuManoj Kumar KanakasabapathyCharles L BormannRaghav GuptaRohan PooniwalaHemanth KandulaIrene SouterIrene DimitriadisHadi Shafiee
2020-05-21
Applying the Transformer to Character-level Transduction
Shijie WuRyan CotterellMans Hulden
2020-05-20
Relative Positional Encoding for Speech Recognition and Direct Translation
Ngoc-Quan PhamThanh-Le HaTuan-Nam NguyenThai-Son NguyenElizabeth SaleskySebastian StuekerJan NiehuesAlexander Waibel
2020-05-20
A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
Dongwei JiangWubo LiRuixiong ZhangMiao CaoNe LuoYang HanWei ZouXiangang Li
2020-05-20
Reducing Overlearning through Disentangled Representations by Suppressing Unknown Tasks
Naveen PanwarTarun TaterAnush SankaranSenthil Mani
2020-05-20
BERTweet: A pre-trained language model for English Tweets
| Dat Quoc NguyenThanh VuAnh Tuan Nguyen
2020-05-20
Classification of Industrial Control Systems screenshots using Transfer Learning
Pablo Blanco MedinaEduardo Fidalgo FernandezEnrique AlegreFrancisco Jáñez MartinoRoberto A. Vasco-CarofilisVíctor Fidalgo Villar
2020-05-20
Creative Artificial Intelligence -- Algorithms vs. humans in an incentivized writing competition
Nils KöbisLuca Mossink
2020-05-20
Attention-based network for low-light image enhancement
Cheng ZhangQingsen YanYu zhuXianjun LiJinqiu SunYanning Zhang
2020-05-20
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval
Dehong GaoLinbo JinBen ChenMinghui QiuPeng LiYi WeiYi HuHao Wang
2020-05-20
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis
Yusuke YasudaXin WangJunichi Yamagishi
2020-05-20
A modified deep convolutional neural network for detecting COVID-19 and pneumonia from chest X-ray images based on the concatenation of Xception and ResNet50V2
| RahimzadehMohammad; AttarAbolfazl
2020-05-20
Deep Learning based Diagnosis of COVID-19 usingChest CT-scan Images
| Talha AnwarSeemab Zakir
2020-05-20
Comparing Transformers and RNNs on predicting human sentence processing data
Danny MerkxStefan L. Frank
2020-05-19
Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition
| George SterpuChristian SaamNaomi Harte
2020-05-19
Sketch-BERT: Learning Sketch Bidirectional Encoder Representation from Transformers by Self-supervised Learning of Sketch Gestalt
Hangyu LinYanwei FuYu-Gang JiangXiangyang Xue
2020-05-19
Exploring Transformers for Large-Scale Speech Recognition
Liang LuChangliang LiuJinyu LiYifan Gong
2020-05-19
Cross-lingual Transfer Learning for Dialogue Act Recognition
Jiří MartínekChristophe CerisaraPavel KrálLadislav Lenc
2020-05-19
Table Search Using a Deep Contextualized Language Model
| Zhiyu ChenMohamed TrabelsiJeff HeflinYinan XuBrian D. Davison
2020-05-19
Synthesizing Unrestricted False Positive Adversarial Objects Using Generative Models
Martin KotuliakSandro E. SchoenbornAndrei Dan
2020-05-19
Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning
Zhenhui YeYining ChenGuanghua SongBowei YangShen Fan
2020-05-19
Self-supervised Transfer Learning for Instance Segmentation through Physical Interaction
| Andreas EitelNico HauffWolfram Burgard
2020-05-19
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption
Hongyin LuoShang-Wen LiJames Glass
2020-05-19
A Transformer-based Embedding Model for Personalized Product Search
Keping BiQingyao AiW. Bruce Croft
2020-05-18
Efficient Wait-k Models for Simultaneous Machine Translation
Maha ElbayadLaurent BesacierJakob Verbeek
2020-05-18
Spatio-Temporal Graph Transformer Networks for Pedestrian Trajectory Prediction
Cunjun YuXiao MaJiawei RenHaiyu ZhaoShuai Yi
2020-05-18
Many-to-Many Voice Transformer Network
Hirokazu KameokaWen-Chin HuangKou TanakaTakuhiro KanekoNobukatsu HojoTomoki Toda
2020-05-18
GPT-too: A language-model-first approach for AMR-to-text generation
| Manuel MagerRamon Fernandez AstudilloTahira NaseemMd Arafat SultanYoung-Suk LeeRadu FlorianSalim Roukos
2020-05-18
Weak-Attention Suppression For Transformer Based Speech Recognition
Yangyang ShiYongqiang WangChunyang WuChristian FuegenFrank ZhangDuc LeChing-Feng YehMichael L. Seltzer
2020-05-18
Mask CTC: Non-Autoregressive End-to-End ASR with CTC and Mask Predict
Yosuke HiguchiShinji WatanabeNanxin ChenTetsuji OgawaTetsunori Kobayashi
2020-05-18
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han ChiPei-Hung ChungTsung-Han WuChun-Cheng HsiehShang-Wen LiHung-yi Lee
2020-05-18
Are All Languages Created Equal in Multilingual BERT?
Shijie WuMark Dredze
2020-05-18
Cross-filter compression for CNN inference acceleration
Fuyuan LyuShien ZhuWeichen Liu
2020-05-18
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-autoregressive Pitch-dependent Dilated Convolution Model for Parametric Speech Generation
Yi-Chiao WuTomoki HayashiTakuma OkamotoHisashi KawaiTomoki Toda
2020-05-18
A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer
| Vladimir IashinEsa Rahtu
2020-05-17
Building a Hebrew Semantic Role Labeling Lexical Resource from Parallel Movie Subtitles
Ben EyalMichael Elhadad
2020-05-17
Context-Based Quotation Recommendation
Ansel MacLaughlinTao ChenBurcu Karagol AyanDan Roth
2020-05-17
Support-BERT: Predicting Quality of Question-Answer Pairs in MSDN using Deep Bidirectional Transformer
Bhaskar SenNikhil GopalXinwei Xue
2020-05-17
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
| Juntao LiChang LiuJian WangLidong BingHongsong LiXiaozhong LiuDongyan ZhaoRui Yan
2020-05-17
Adversarial Training for Commonsense Inference
Lis PereiraXiaodong LiuFei ChengMasayuki AsaharaIchiro Kobayashi
2020-05-17
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
| Pengcheng YinGraham NeubigWen-tau YihSebastian Riedel
2020-05-17
Conformer: Convolution-augmented Transformer for Speech Recognition
| Anmol GulatiJames QinChung-Cheng ChiuNiki ParmarYu ZhangJiahui YuWei HanShibo WangZhengdong ZhangYonghui WuRuoming Pang
2020-05-16
Recurrent Chunking Mechanisms for Long-Text Machine Reading Comprehension
Hongyu GongYelong ShenDian YuJianshu ChenDong Yu
2020-05-16
Streaming Transformer-based Acoustic Models Using Self-attention with Augmented Memory
Chunyang WuYongqiang WangYangyang ShiChing-Feng YehFrank Zhang
2020-05-16
IntelliCode Compose: Code Generation Using Transformer
Alexey SvyatkovskiyShao Kun DengShengyu FuNeel Sundaresan
2020-05-16
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
Zhengkun TianJiangyan YiJianhua TaoYe BaiShuai ZhangZhengqi Wen
2020-05-16
CERT: Contrastive Self-supervised Learning for Language Understanding
Hongchao FangSicheng WangMeng ZhouJiayuan DingPengtao Xie
2020-05-16
Leveraging Affective Bidirectional Transformers for Offensive Language Detection
AbdelRahim ElmadanyChiyu ZhangMuhammad Abdul-MageedAzadeh Hashemi
2020-05-16
COVID-Twitter-BERT: A Natural Language Processing Model to Analyse COVID-19 Content on Twitter
| Martin MüllerMarcel SalathéPer E Kummervold
2020-05-15
Neural Entity Linking on Technical Service Tickets
Nadja KurzFelix HamannAdrian Ulges
2020-05-15
Finding Experts in Transformer Models
Xavier SuauLuca ZappellaNicholas Apostoloff
2020-05-15
JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment
Dan LimWon JangGyeonghwan OHyeyeong ParkBongwan KimJesam Yoon
2020-05-15
Spelling Error Correction with Soft-Masked BERT
| Shaohua ZhangHaoran HuangJicong LiuHang Li
2020-05-15
Challenges in Emotion Style Transfer: An Exploration with a Lexical Substitution Pipeline
David HelbigEnrica TroianoRoman Klinger
2020-05-15
ResMoNet: A Residual Mobile-based Network for Facial Emotion Recognition in Resource-Limited Systems
Rodolfo Ferro-PérezHugo Mitre-Hernandez
2020-05-15
[email protected] at SemEval-2020 Task 12: Identifying Multilingual Offensive Tweets Using Weighted Ensemble and Fine-Tuned BERT
Saja Khaled TawalbehMahmoud HammadMohammad AL-Smadi
2020-05-15
Disentangling in Latent Space by Harnessing a Pretrained Generator
Yotam NitzanAmit BermanoYangyan LiDaniel Cohen-Or
2020-05-15
NIT-Agartala-NLP-Team at SemEval-2020 Task 8: Building Multimodal Classifiers to tackle Internet Humor
Steve Durairaj SwamyShubham LaddhaBasil AbdussalamDebayan DattaAnupam Jamatia
2020-05-14
A pre-training technique to localize medical BERT and enhance BioBERT
| Shoya WadaToshihiro TakedaShiro ManabeShozo KonishiJun KamoharaYasushi Matsumura
2020-05-14
The Unstoppable Rise of Computational Linguistics in Deep Learning
James Henderson
2020-05-13
Parallel Corpus Filtering via Pre-trained Language Models
Boliang ZhangAjay NageshKevin Knight
2020-05-13
Large Scale Multi-Actor Generative Dialog Modeling
Alex BoydRaul PuriMohammad ShoeybiMostofa PatwaryBryan Catanzaro
2020-05-13
Entity-Enriched Neural Models for Clinical Question Answering
| Bhanu Pratap Singh RawatWei-Hung WengPreethi RaghavanPeter Szolovits
2020-05-13
Context Learning for Bone Shadow Exclusion in CheXNet Accuracy Improvement
Minh-Chuong HuynhTrung-Hieu NguyenMinh-Triet Tran
2020-05-13
On the Robustness of Language Encoders against Grammatical Errors
Fan YinQuanyu LongTao MengKai-Wei Chang
2020-05-12
Discriminative Multi-modality Speech Recognition
| Bo XuCheng LuYandong GuoJacob Wang
2020-05-12
Simultaneous paraphrasing and translation by fine-tuning Transformer models
Rakesh Chada
2020-05-12
Very High Resolution Land Cover Mapping of Urban Areas at Global Scale with Convolutional Neural Networks
Thomas TilakArnaud BraunDavid ChandlerNicolas DavidSylvain GalopinAmélie LombardMichaël MichaudCamille PariselMatthieu PorteMarjorie Robert
2020-05-12
Train and Deploy an Image Classifier for Disaster Response
Jianyu MaoKiana HarrisNae-Rong ChangCaleb PennellYiming Ren
2020-05-12
Generalized State-Dependent Exploration for Deep Reinforcement Learning in Robotics
| Antonin RaffinFreek Stulp
2020-05-12
SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model
Baolin PengChunyuan LiJinchao LiShahin ShayandehLars LidenJianfeng Gao
2020-05-11
Hierarchical Attention Transformer Architecture For Syntactic Spell Correction
Abhishek NiranjanM Ali Basha ShaikKushal Verma
2020-05-11
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Ye BaiJiangyan YiJianhua TaoZhengkun TianZhengqi WenShuai Zhang
2020-05-11
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
| Jie LeiLiwei WangYelong ShenDong YuTamara L. BergMohit Bansal
2020-05-11
On the Generation of Medical Dialogues for COVID-19
| Wenmian YangGuangtao ZengBowen TanZeqian JuSubrato ChakravortyXuehai HeShu ChenXingyi YangQingyang WuZhou YuEric XingPengtao Xie
2020-05-11
End-To-End Speech Synthesis Applied to Brazilian Portuguese
| Edresson CasanovaArnaldo Candido JuniorChristopher ShulbyFrederico Santos de OliveiraJoão Paulo TeixeiraMoacir Antonelli PontiSandra Maria Aluisio
2020-05-11
Detecting Adverse Drug Reactions from Twitter through Domain-Specific Preprocessing and BERT Ensembling
Amy BredenLee Moore
2020-05-11
Compact Neural Representation Using Attentive Network Pruning
Mahdi BiparvaJohn Tsotsos
2020-05-10
Epipolar Transformers
| Yihui HeRui YanKaterina FragkiadakiShoou-I Yu
2020-05-10
How Context Affects Language Models' Factual Predictions
Fabio PetroniPatrick LewisAleksandra PiktusTim RocktäschelYuxiang WuAlexander H. MillerSebastian Riedel
2020-05-10
Transformer Based Language Models for Similar Text Retrieval and Ranking
Javed Qadrud-DinAshraf Bah RabiouRyan WalkerRavi SoniMartin GajekGabriel PackAkhil Rangaraj
2020-05-10
Finding Universal Grammatical Relations in Multilingual BERT
Ethan A. ChiJohn HewittChristopher D. Manning
2020-05-09
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
| Samson TanShafiq JotyMin-Yen KanRichard Socher
2020-05-09
SocialTrans: A Deep Sequential Model with Social Information for Web-Scale Recommendation Systems
Qiaoan ChenHao GuLingling YiYishi LinPeng HeChuan ChenYangqiu Song
2020-05-09
LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation
Gustavo AguilarSudipta KarThamar Solorio
2020-05-09
schuBERT: Optimizing Elements of BERT
Ashish KhetanZohar Karnin
2020-05-09
Character Matters: Video Story Understanding with Character-Aware Relations
Shijie GengJi ZhangZuohui FuPeng GaoHang ZhangGerard de Melo
2020-05-09
Learning to Detect 3D Objects from Point Clouds in Real Time
Abhinav Sagar
2020-05-09
SentiBERT: A Transferable Transformer-Based Architecture for Compositional Sentiment Semantics
| Da YinTao MengKai-Wei Chang
2020-05-08
Distilling Knowledge from Pre-trained Language Models via Text Smoothing
Xing WuYibing LiuXiangyang ZhouDianhai Yu
2020-05-08
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference
Ali Hadi ZadehAndreas Moshovos
2020-05-08
Temporal Common Sense Acquisition with Minimal Supervision
Ben ZhouQiang NingDaniel KhashabiDan Roth
2020-05-08
Comparative Analysis of Text Classification Approaches in Electronic Health Records
Aurelie MascioZeljko KraljevicDaniel BeanRichard DobsonRobert StewartRebecca BendayanAngus Roberts
2020-05-08
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
| Marco Tulio RibeiroTongshuang WuCarlos GuestrinSameer Singh
2020-05-08
Mapping Natural Language Instructions to Mobile UI Action Sequences
| Yang LiJiacong HeXin ZhouYuan ZhangJason Baldridge
2020-05-07
LIIR at SemEval-2020 Task 12: A Cross-Lingual Augmentation Approach for Multilingual Offensive Language Identification
Erfan GhaderyMarie-Francine Moens
2020-05-07
A Systematic Assessment of Syntactic Generalization in Neural Language Models
Jennifer HuJon GauthierPeng Qi